I am testing OpenNMT and CTranslate with a trained model posted on the OpenNMT website (the English summary one trained on CPU). Curiously, I got different outputs from OpenNMT and CTranslate, and I am wondering why that is and whether there are model parameters that I need to set explicitly.
I ran OpenNMT as follows:
th translate.lua -model model/textsum_epoch7_14.69_release.t7 -src data/short-src-test.txt -output short-sum.txt
I ran CTranslate as follows:
./build/cli/translate --model …/model/textsum_epoch7_14.69_release.t7 --src …/data/short-src-test.txt --tgt …/short-sum.txt
short-src-test.txt contains the first 115 lines of the test inputs from data/src-test.txt, which comes with OpenNMT. The short-sum.txt outputs produced by the two systems are quite different.
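In case it helps to quantify how different the outputs are, here is a minimal sketch of a line-by-line comparison. It assumes the two outputs have been saved under separate names; opennmt-sum.txt and ctranslate-sum.txt are hypothetical file names, and the helpers diff_lines and compare_files are my own, not part of OpenNMT or CTranslate.

```python
from itertools import zip_longest


def diff_lines(lines_a, lines_b):
    """Return (line_number, a, b) for every position where the outputs differ.

    Leading/trailing whitespace is ignored; a missing line counts as "".
    """
    diffs = []
    pairs = zip_longest(lines_a, lines_b, fillvalue="")
    for number, (a, b) in enumerate(pairs, start=1):
        a, b = a.strip(), b.strip()
        if a != b:
            diffs.append((number, a, b))
    return diffs


def compare_files(path_a, path_b):
    """Print each differing line pair and a summary count."""
    with open(path_a, encoding="utf-8") as f:
        lines_a = f.read().splitlines()
    with open(path_b, encoding="utf-8") as f:
        lines_b = f.read().splitlines()

    diffs = diff_lines(lines_a, lines_b)
    for number, a, b in diffs:
        print(f"line {number}:")
        print(f"  A: {a}")
        print(f"  B: {b}")
    total = max(len(lines_a), len(lines_b))
    print(f"{len(diffs)} of {total} lines differ")


# Hypothetical usage (file names are assumptions, not from the original runs):
# compare_files("opennmt-sum.txt", "ctranslate-sum.txt")
```

Seeing whether the differences are small (a word or two per line) or wholesale (entirely different summaries) might help narrow down whether this is a decoding-option mismatch or something in how the model is loaded.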
Based on my reading of the documentation and code, I believe that the beam search width used by both systems is 6 by default. Are there other configuration parameters that I might be missing? Or is there another explanation for why I am seeing different results?
Any help / pointers are greatly appreciated!