Hi I have built a translation model on a largish set of wmt data and it was ok.
I now want to recreate an experiment I did on 42k sentences of Gale data, except the training size is too small to get good results. (MY SMT results got 19 in BLEU, NMT is getting around 13)
Is it possible to use the wmt model and train it some more using the gale data to try and get better results on the gale data?
I did try the train_from option but the final model was much worse, so I have a feeling that my method wasn’t good.