I am using the default training configure, 2 layers for en/dec, 500 hidden nodes, 13 epchos,
the final model is as following:
Epoch 13, 30950/30963; acc: 34.54; ppl: 42.56; 3122 src tok/s; 2786 tgt tok/s; 11166 s elapsed Train perplexity: 43.2077 Train accuracy: 34.3917 Validation perplexity: 144.799 Validation accuracy: 25.7105 Decaying learning rate to 0.00390625
I tried to translate the sentences in training source, lots of words are unknown. Any suggestions? Thank you very much!