Hello, I am really confused by the translation results from my model.
I am trying to reproduce the results from the paper Neural Responding Machine for Short-Text Conversation.
I tried the Weibo data referred to in the paper, starting with a two-layer model, with a config like this:
rnn_size = 1000
word_vec_size = 620
rnn_type = GRU
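For reference, here is a minimal PyTorch sketch of a model with this shape; it is only an illustration of the config above (the class and variable names are my own, not the toolkit's actual implementation):

```python
import torch.nn as nn

# Minimal GRU encoder-decoder matching the config above:
# two layers, rnn_size = 1000, word_vec_size = 620.
class Seq2Seq(nn.Module):
    def __init__(self, vocab_size, word_vec_size=620, rnn_size=1000, layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, word_vec_size)
        self.encoder = nn.GRU(word_vec_size, rnn_size, num_layers=layers, batch_first=True)
        self.decoder = nn.GRU(word_vec_size, rnn_size, num_layers=layers, batch_first=True)
        self.out = nn.Linear(rnn_size, vocab_size)

    def forward(self, src, tgt):
        _, hidden = self.encoder(self.embed(src))           # encode the post
        dec_out, _ = self.decoder(self.embed(tgt), hidden)  # decode conditioned on it
        return self.out(dec_out)                            # per-step vocabulary logits
```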
It seems that all the translation predictions are the same, or very similar, and of course this is not what I want.
It seems that this kind of thing happens a lot when training sequence-to-sequence models.
Has anyone seen this kind of result before? Please help! Thanks!
There are 4,430,000 sentences in the dataset, and the final perplexity is 483.14. I am at epoch 11, and the learning rate is down to 0.0039 now; more training does not seem to help reduce the perplexity.
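For context on that number: perplexity is just the exponential of the average per-token cross-entropy, so 483.14 corresponds to a loss that is still quite high:

```python
import math

# ppl = exp(avg cross-entropy per token), so the loss behind ppl = 483.14 is:
print(math.log(483.14))  # ≈ 6.18 nats per token
```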
My config is here:
rnn_size = 1000
word_vec_size = 620
rnn_type = GRU
and the other options are left at their defaults.
I wonder if the model is just not well trained, but this is already the 11th epoch, and the learning rate is very small now.
Sure. What I am trying to do is build a Chinese short-text conversation model (from post to response), so my examples are Chinese, tokenized, and separated by blank spaces.
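For example, roughly like this (a minimal sketch assuming the jieba segmenter; the dataset may have been segmented with a different tool):

```python
import jieba  # a common Chinese word segmenter; an assumption, not necessarily what the authors used

post = "今天天气怎么样"            # an illustrative Weibo-style post
print(" ".join(jieba.cut(post)))  # e.g. "今天 天气 怎么样"
```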
Of course, since I don't read Chinese, I used a translator to get an idea of them. Is there really some kind of logic connecting the source and target sentences? With the translated ones, I find it really hard to see one…
In this paper, they do build such a model and achieve state-of-the-art results. What I am doing now is trying to reproduce the results from this paper, and the dataset is provided by the authors on GitHub.