Issues when running the English-German WMT15 training

vince62s · August 31, 2017, 2:54pm

personally, I do:
learn_bpe.lua -lc …
tokenize -case_feature -joiner_annotate …
similar to your second option.

tel34 · September 1, 2017, 8:28am

I’ve just trained a model with exactly that configuration: 5M segments with a 4x1000 network. Although the improvement in BLEU was not dramatic I have noticed that a lot of small “annoying issues”, particularly regarding number entities, have now been solved. Yes, it’s all about experimenting.

cemicel · November 22, 2018, 8:16pm

Can be interesting for you.
https://arxiv.org/abs/1703.03906