Pre-trained embedding in Transformer training

mazida · October 31, 2022, 11:47am

The OpenNMT-py documentation states about using pre-trained embedding (word2vec, GLOVE). My question is: ‘Can we train a transformer with pre-trained embedding as the model already has an embedding layer. Can FastText be used?’

vince62s · November 1, 2022, 12:47pm

in theory it should work, OpenNMT-py/FAQ.md at v3.0 · OpenNMT/OpenNMT-py · GitHub

you just point to your embeddings and it will load them.
If you’re willing to test the v3.0 branch, then I’ll help debug if it doesn’t work.

Cheers.

larsbun · May 12, 2023, 9:49am

Should this work also for OpenNMT-tf?