Hi all.
I’m wondering if there is any implementation of wait-k decoding in OpenNMT-py. It is widely used in simultaneous MT research because it is the easiest way to choose an explicit trade-off between quality and latency.
If not, I would like to implement it and would like to know if you are interested into merging it into the main branch.
I also tag this post with CTranslate2, because if it works with OpenNMT-py, it would be great to have it even faster.
Thanks!