LSTM or Transformer?

I am building a machine translator. My dataset has 90,000 sentence pairs. Should I use a Transformer to train the machine translator with such a small dataset? Will it be better than using an LSTM?
Thank you.

If this is research work, I assume you can try both. Also consider fine-tuning a (pretrained) multilingual model, especially if your language pair is related to some high-resource languages.
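Whichever architecture you try, with only 90,000 pairs it is worth carving out proper validation and test sets before training. A minimal sketch in pure Python (the split sizes and pair format are hypothetical, not from your setup):

```python
import random

def split_pairs(pairs, valid_size=2000, test_size=2000, seed=13):
    """Shuffle parallel sentence pairs and carve out validation/test sets.

    `pairs` is a list of (source, target) tuples; the default sizes are
    a hypothetical choice for a corpus of roughly 90k pairs.
    """
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    pairs = list(pairs)
    rng.shuffle(pairs)
    valid = pairs[:valid_size]
    test = pairs[valid_size:valid_size + test_size]
    train = pairs[valid_size + test_size:]
    return train, valid, test

# Toy usage with dummy pairs standing in for a real corpus:
corpus = [(f"src {i}", f"tgt {i}") for i in range(90000)]
train, valid, test = split_pairs(corpus)
print(len(train), len(valid), len(test))  # 86000 2000 2000
```

The same splits should then be reused across all the models you compare, so the LSTM/Transformer numbers are measured on identical data.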

Also check some posts on the forum about low-resource languages, including this one.

Kind regards,
