What's the difference between model.zero_grad() and optim.zero_grad()

xjtu-zeng · June 21, 2017, 12:17pm

What’s the difference between model.zero_grad() and optim.zero_grad()?
It seems that we oftern use the second one.

guillaumekln · June 21, 2017, 12:24pm

That’s not specific to OpenNMT. See:

tl;dr: they are the same.