Learning rate not decaying when perplexity stops decreasing on validation set

davidstap · December 11, 2018, 3:19am

I am using the following relevant settings:
-learning_rate 1.0
-learning_rate_decay 0.5
-start_decay_steps 10000

Now, according to the documentation, the decay learning rate will be decayed if (i) perplexity does not decrease on the validation set or (ii) steps have gone past start_decay_steps. Indeed, option (ii) seems to work. I print the validation perplexity every 1000 steps and notice a (sharp) increase, but the learning rate is not decreasing. How do I fix this?

vince62s · December 11, 2018, 7:39am

This is not implemented in OpenNMT-py. The doc refers to OpenNMT-Lua

davidstap · December 11, 2018, 8:36am

Thanks for your reply Vincent. That explains why I was not able to find anything in the OpenNMT-py code. Confusingly, the options are listed in the OpenNMT-py documentation (see http://opennmt.net/OpenNMT-py/options/train.html).

I will use the OpenNMT-lua, consider my problem solved.

vince62s · December 11, 2018, 8:53am

You are correct, I wil fix the documentation.
thanks.

Jeff · February 3, 2019, 9:59pm

They are still there in OpenNMT-py documentation. Are they now supported by pytorch?

vince62s · February 4, 2019, 7:27am

right, thanks for reporting, fixed