In embeddings page there is a guide to map pretrained word2vec vectors.
th tools/embeddings.lua -embed_type word2vec -embed_file data/GoogleNews-vectors-negative300.bin -dict_file data/demo.src.dict\ -save_data data/demo-src-emb
I think that -embed_file data/GoogleNews-vectors-negative300.bin means the file generated from word2vec.
But what is the -dict_file data/demo.src.dict means?
Is that the *.dict file I generate from preprocessing?
I can’t understand well what is the relation of preprocessing and embeddings.
Please tell me any advice about embeddings.