Two-to-one translation - combined or seperate models?

ledaat · October 6, 2024, 3:13pm

Hi there,

I’m in the process of creating translators from English and Hebrew to Yiddish. Would it be better to create two separate models (EN-YI, HE-YI) or one combined model?

Yiddish uses the Hebrew alphabet, and up to 20% of Yiddish words have their roots in Hebrew. On the other hand, Yiddish is fundamentally a Germanic language, and its sentence structure and most of its vocabulary are much closer to English than to Hebrew. That’s why I thought that combining the two would have a “whole is greater than its parts” effect. Does that make sense?

Assuming I go the combined model route, is there anything special I need to do in the corpus? Can I just combine the parallel corpus for both languages into one, given that the source languages use different alphabets (so no room for confusion)?

Thank you very much!

alexeir · April 23, 2025, 8:52am

We tried different approaches and decided to stay as 1 model for each language pair. For example we made separate English to Hebrew and English to Yiddish models. With that approach it has better quality than combined model