I have an image captioning dataset on which I want to train an OpenNMT-py model for the image captioning task. I can’t find any guidelines in the documentation for this task.
Some explanation and code for this is given at the following link, but only for the legacy version.
Can I perform image-to-text in the latest version of OpenNMT-py, or do I need to install the legacy version from source to use this functionality?
Thank you for your help!