Using word2vec embeddings in


Hello, I was just wondering whether using word2vec embeddings is exactly the same process as with GloVe.

My word2vec text file starts with one line that states the vocabulary size and the vector size. It is followed by the lines of vectors (of size 100 in this case). Also, I have only created embeddings for the target language.
So far so good. However, do I need to do anything extra to this file, or can I just do this:

./tools/ -emb_file "myWord2vec_emb.txt" -dict_file "data/" -output_file "data/embeddings"

Then do I just add these options to the command:

-word_vec_size 100
-pre_word_vecs_enc "data/"
-pre_word_vecs_dec "data/"

Best wishes

(Pltrdy) #2


From what I've seen, the only difference between the textual word2vec format and GloVe is that first line (vocabulary size and vector size), which we can just ignore.
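If you would rather convert the file by hand, here is a minimal sketch of dropping that header line so the file looks like a GloVe text file. It is only an illustration, not part of the tools: the function name `strip_word2vec_header` and the sample file names are hypothetical, and it assumes the header is exactly two integers separated by whitespace.

```python
import os
import tempfile


def strip_word2vec_header(src_path, dst_path):
    """Copy a textual word2vec file to dst_path without its header line.

    Returns (vocab_size, vector_size) read from the header, or None if the
    first line did not look like a header (in which case it is kept).
    """
    header = None
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        first = src.readline()
        fields = first.split()
        # Assumption: a word2vec header is exactly two integers.
        if len(fields) == 2 and all(f.isdigit() for f in fields):
            header = (int(fields[0]), int(fields[1]))
        else:
            dst.write(first)
        # Every remaining line is "word v1 v2 ... vN" and is copied unchanged.
        for line in src:
            dst.write(line)
    return header


# Tiny made-up example: 2 words, 3 dimensions.
sample = "2 3\nhello 0.1 0.2 0.3\nworld 0.4 0.5 0.6\n"
tmp_dir = tempfile.mkdtemp()
src_file = os.path.join(tmp_dir, "w2v.txt")
dst_file = os.path.join(tmp_dir, "glove_style.txt")
with open(src_file, "w", encoding="utf-8") as f:
    f.write(sample)

header = strip_word2vec_header(src_file, dst_file)
with open(dst_file, encoding="utf-8") as f:
    converted = f.read()
print(header)
```

The returned vector size is also a handy check that the value you pass to -word_vec_size matches the file.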

I've submitted a pull request so that you can use:

./tools/ -emb_file myWord2vec_emb.txt -dict_file data/ -output_file data/embeddings -type word2vec

You can pull this code by running:

git remote add pltrdy
git pull pltrdy word2vec_to_torch


Nice one. Thank you very much. I will give it a whirl.