Has anyone trained a Transformer on the Multi30K dataset?

PengboLiu · November 6, 2020, 4:42am

The BLEU score reported in papers is generally around 37, but I cannot reproduce it. Can someone share the Transformer hyperparameter settings they used on the Multi30K dataset?

francoishernandez · November 6, 2020, 3:43pm

This task is a multimodal translation task, where the source text is supplemented with an image. Is that what you’re doing?
Or maybe you’re referring to some papers using this dataset as a traditional “text to text” machine translation task?
In that case, it would be useful to cite the papers in question.

PengboLiu · November 7, 2020, 2:27am

I am just trying to implement text-to-text translation, without the images.

francoishernandez · November 9, 2020, 12:40pm

This dataset is probably way too small for such a task. But, again, I don’t know which paper(s) you’re referring to. Are there really any that use this dataset for text2text?