- To fully reproduce the training of the pre-trained model, which SentencePiece parameters were used? I have the SentencePiece model, but I'd like to know how to train it myself and get the same result.
- When reproducing the BLEU scores (26 on news14, 28 on news17 for the pre-trained model), I presume test.de and pred.de must be detokenized (i.e., underscores removed)?
- The SentencePiece model was generated with this script: https://github.com/OpenNMT/OpenNMT-tf/blob/r1/scripts/wmt/prepare_data.sh. Look for “spm_train” to find the SentencePiece training parameters.
- Yes, BLEU is reported on detokenized output.
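For reference, here is a rough sketch of what that `spm_train` call typically looks like for an English-German WMT setup. The file names and parameter values below are illustrative only; the authoritative parameters are in the `prepare_data.sh` script linked above (search for "spm_train"):

```shell
# Illustrative spm_train invocation -- check prepare_data.sh for the
# exact corpus files, vocabulary size, and other options actually used.
spm_train \
  --input=train.en,train.de \
  --model_prefix=wmtende \
  --vocab_size=32000 \
  --character_coverage=1.0
```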
I did this more than 2 years ago.
If you’re doing this for an academic purpose it’s fine.
If you want a higher BLEU (> 32-33), you'll need to use back-translation.
Thanks @guillaumekln, but why can't I get the same BLEU score on the pre-trained ONMT model even after detokenizing?
BLEU = 23.16, 51.6/29.0/17.5/11.1 (BP=0.998, ratio=0.998, hyp_len=52721, ref_len=52833)
not a BLEU of 26.
I’m comparing to test.de from wmt14.
Is that the same as news14??
Isn’t news14 in the training??
news14 is not in the training data, of course.
Post your command line for computing BLEU.
I'm computing BLEU with the perl script provided with OpenNMT-py:
Well, I'm not sure what your files above are, but the workflow is the following:
detokenized data => tokenize with SentencePiece => translate => tokenized output => detokenize output
Preferably, use multi-bleu-detok.perl on detokenized data to compare with papers.
If your test.de is detokenized, then you need to detokenize your .pred file and use that detokenized-BLEU perl script, not the plain multi-bleu.perl.
hope this helps.
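The detokenize step in the workflow above can be illustrated at the subword-piece level. This is a minimal sketch of my own (the helper name is mine, not from any OpenNMT tool): SentencePiece marks word boundaries with "▁" (U+2581), so detokenizing amounts to joining the pieces and turning that marker back into spaces:

```python
# Minimal sketch of SentencePiece-style detokenization: subword pieces
# use "\u2581" to mark the start of each original word, so restoring
# plain text is just a join plus a marker-to-space substitution.

def detokenize(pieces):
    """Join subword pieces and restore word boundaries."""
    return "".join(pieces).replace("\u2581", " ").strip()

pred_pieces = ["\u2581Das", "\u2581ist", "\u2581ein", "\u2581Bei", "spiel", "."]
print(detokenize(pred_pieces))  # -> Das ist ein Beispiel.
```

Note that real detokenization should go through `spm_decode` (or the `sentencepiece` Python API), which also handles edge cases; this sketch only shows why the underscores disappear.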
That's what I'm doing, but it looks like I'm using the wrong BLEU script.
That was it.
I was using the wrong perl script.
Q. What would the non-detok BLEU perl script have been computing, then?