I have been working on the Finnish to English Translation project and have used OpenNMT-tf in Linux.
Initally with small dataset the accuracy of the model was pretty good, like around 60-70 %.
But with increase in data we found the accuracy to fall drastically like to 16-17% and also many words and numbers were missing in the target file.
The dataset used had a sentence per line and was tokenized. And we used the same code from the Github available for the German to English translation. No features were modified.
Can you suggest us why the model failed, and also whether the code from Github could be directly used to translate finning to english text or any language pair it be?
Begineer in Machine Learning Field