How to use copy-mechanism in OpenNMT-tf?

WeiKangLee · March 27, 2018, 6:46am

When I using OpenNMT-tf do Chinese to Chinese task, it generated lots of lable unk , the performance is poor, how to deal with it?, is there unk_replace option?

guillaumekln · March 27, 2018, 7:48am

What type of model are you training?

If you are specifically looking for copy mechanism, it is implemented in OpenNMT-py under the -copy_attn flag.

WeiKangLee · March 27, 2018, 7:54am

Hello ,
Thanks first
The model i used is nmt_medium , and my project is based on tensorflow

guillaumekln · March 27, 2018, 7:55am

Oh, before your edit it was not clear that the issue concerns unknown words. Do you use some kind of subword tokenization, like SentencePiece or BPE?

WeiKangLee · March 27, 2018, 7:59am

The tokenization was original, i only add a word2vec pre-trained embedding (Chinese)

Iloveopennmt · March 28, 2018, 4:08pm

Hi, could you tell me when the copy mechanism will be introduced into opennmt-tf? Thank you very much~

guillaumekln · March 28, 2018, 4:27pm

Hello,

I don’t know anyone working on this or willing to work on this so I can’t give any ETA for now.