OpenNMT

Fine-tuning mBART

Hello!

Is it possible to use OpenNMT-py or OpenNMT-tf to fine-tune mBART for machine translation?

Thanks!
Yasmin


IIRC mBART is fairseq-based, so you would have to write some custom code to load and convert the weights, and maybe adapt some of the layers. I think they also added a few additional layers on top of the standard Transformer encoder/decoder.

None of this should be very difficult to add, but it is not implemented as of now. We would gladly welcome contributions around this, though.
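
For anyone who wants to try, the weight-conversion step could look roughly like the sketch below. It is not tested against a real mBART checkpoint. The source-side names are what I would expect in a fairseq Transformer state dict, and the target-side names (`transformer`, `linear_query`, and so on) are purely illustrative assumptions, so compare both against the actual `state_dict()` keys of your models before relying on it.

```python
import re

# Rename rules: (regex on the fairseq-style key, replacement).
# The target names on the right are illustrative assumptions, not a
# confirmed OpenNMT naming scheme. Adjust them to the model you load into.
RENAME_RULES = [
    (r"\.layers\.(\d+)\.", r".transformer.\1."),
    (r"\.self_attn\.q_proj\.", ".self_attn.linear_query."),
    (r"\.self_attn\.k_proj\.", ".self_attn.linear_keys."),
    (r"\.self_attn\.v_proj\.", ".self_attn.linear_values."),
    (r"\.self_attn\.out_proj\.", ".self_attn.final_linear."),
]


def convert_key(key):
    """Apply every rename rule, in order, to one parameter name."""
    for pattern, replacement in RENAME_RULES:
        key = re.sub(pattern, replacement, key)
    return key


def convert_state_dict(state_dict):
    """Return a new dict with renamed keys; the values are left untouched."""
    converted = {}
    for key, value in state_dict.items():
        new_key = convert_key(key)
        if new_key in converted:
            # Two source parameters mapping to one target name is a rule bug.
            raise ValueError(f"duplicate target key: {new_key}")
        converted[new_key] = value
    return converted


# Toy stand-in for a checkpoint: plain values instead of tensors.
example = {
    "encoder.layers.0.self_attn.q_proj.weight": "W_q",
    "encoder.layers.0.self_attn.out_proj.bias": "b_o",
    "encoder.layer_norm.weight": "extra_ln",  # no rule matches, passes through
}
converted = convert_state_dict(example)
```

With a real checkpoint you would presumably get the source dict from something like `torch.load(path, map_location="cpu")["model"]` (assuming the weights sit under the `"model"` entry) and then call `load_state_dict` on the target model. Keys that pass through unchanged, such as the extra layers mentioned above, are the ones that would need matching modules added on the target side.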
