Weight table:set weight of some vocabulary

netxiao · May 4, 2017, 9:09am

In some specific domain, the some vocabulary are prone to translation errors, even if the domain corpus is trained, how to set weight for specify vocabulary to improve quality of translation?

Etienne38 · May 11, 2017, 12:47pm

Suggestion : duplicate N copies of the sentences with the specific vocab in the training set.

tel34 · May 11, 2017, 5:57pm

How many copies of those sentences would be needed in your view to override the translations contained in a baseline?

Etienne38 · May 11, 2017, 6:06pm

The ideal would be to make some tests, and see how it evolves. To get more weight, in front of other sentences, tries something like N between 5 and 10. Then, make a BLEU evaluation on both the whole sentences, and the specific vocab ones. Compare it with the previous values.

Etienne38 · May 12, 2017, 2:53pm

Suggestion 2:

netxiao · May 15, 2017, 3:27am

thank you very much, it is very useful for me.