I have trained OpenNMT to translate text from English to Icelandic. Once in a while I get strange translations where the same English word is repeated. Here is an example:
src : “When, therefore, the signal for the polonaise resounded through the saloons, I went home”
tgt: “Þegar, signal signal signal signal i i i i i i”
If I skip the three las words (“I went home”), the translation will be:
“Þegar, gefur merki um polonaise …”
This is far from being a perfect translation, but at least somewhere close (but I wonder why I get three dots?)
Does anyone know what could possible be the reason for this behavior?
This is the content of rest_config.yml:
- model: ‘/NMTModels/en-fr/model-enfr_epoch600_3.03.t7’
replace_unk: true
mode: aggressive
joiner_annotate: true
case_feature: true
segment_case: true
beam_size: 5