Is there a model conversion tool?

netxiao · April 7, 2017, 7:53am

I have a trained model (layers = 2, rnn_size = 512). Is there a tool to convert it to another network topology (layers = 4, rnn_size = 1024)?

Thanks!

guillaumekln · April 11, 2017, 8:47am

There is no such tool currently.

However, @Dakun has experimented with this. Maybe he could share his findings and how he proceeded.

Dakun · April 11, 2017, 9:05am

It’s not difficult to expand/convert from e.g. layers = 2, rnn_size = 512 to layers = 4, rnn_size = 1024.

1. Print all the parameters of each encoder/decoder for the different configurations, so you can see how the shapes differ.
2. Load the previous model.
3. Create a new encoder/decoder with the target configuration.
4. Copy or expand the corresponding parameters for each matrix.
5. Train with the newly generated encoder/decoder.
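
To make the copy/expand step more concrete, here is a rough sketch in plain Python. It is not OpenNMT code: `random_matrix`, `expand_matrix` and `expand_stack` are hypothetical helpers, and each layer is reduced to a single square matrix stored as nested lists. It assumes the old weights go into the top-left block of the larger matrix and everything else (extra units, extra layers) starts as small random values. Real LSTM weights usually pack several gates into one matrix, so the copy would probably have to be done per gate block.

```python
import random


def random_matrix(rows, cols, scale, rng):
    """Matrix of small uniform random values in [-scale, scale]."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def expand_matrix(old, rows, cols, scale, rng):
    """Copy `old` into the top-left block of a larger random matrix."""
    if rows < len(old) or cols < len(old[0]):
        raise ValueError("new shape must not be smaller than the old one")
    new = random_matrix(rows, cols, scale, rng)
    for r, old_row in enumerate(old):
        # Overwrite the first len(old_row) cells of row r with the trained values.
        new[r][:len(old_row)] = list(old_row)
    return new


def expand_stack(old_layers, num_layers, size, scale=0.1, seed=0):
    """Build a deeper/wider stack, reusing trained weights where they exist."""
    rng = random.Random(seed)
    new_layers = []
    for i in range(num_layers):
        if i < len(old_layers):
            # Existing layer: keep the trained block, pad with random values.
            new_layers.append(expand_matrix(old_layers[i], size, size, scale, rng))
        else:
            # Layer that did not exist before: fresh random initialization.
            new_layers.append(random_matrix(size, size, scale, rng))
    return new_layers


# Toy usage: grow a 2-layer, size-2 stack into a 4-layer, size-4 stack.
old = [[[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0], [7.0, 8.0]]]
new = expand_stack(old, num_layers=4, size=4)
```

The expanded model will not behave exactly like the old one, since the new random entries change the outputs, which is why the last step is to continue training.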

BTW, personally, I don’t think it is a good idea to jump directly from 2*512 to 4*1024.
A 4*1024 network is really big. Maybe you can start by going from 2*512 to 4*512, for instance.
That will be much easier.
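
To give a rough idea of the size difference, here is a back-of-the-envelope parameter count. It is only a sketch under assumptions that are not from this thread: a single unidirectional LSTM stack, one bias vector per gate, and an input (word embedding) size of 500. `lstm_stack_params` is a hypothetical helper, not part of any toolkit.

```python
def lstm_stack_params(num_layers, rnn_size, input_size):
    """Approximate weight count of a stacked LSTM (4 gates per layer)."""
    total = 0
    for layer in range(num_layers):
        # The first layer reads the embeddings, deeper layers read the layer below.
        in_size = input_size if layer == 0 else rnn_size
        # Per gate: input weights + recurrent weights + one bias vector.
        total += 4 * (in_size * rnn_size + rnn_size * rnn_size + rnn_size)
    return total


# Assumed embedding size of 500 for all three configurations.
small = lstm_stack_params(num_layers=2, rnn_size=512, input_size=500)
deeper = lstm_stack_params(num_layers=4, rnn_size=512, input_size=500)
large = lstm_stack_params(num_layers=4, rnn_size=1024, input_size=500)
print(small, deeper, large)
```

Under these assumptions one stack goes from about 4.2M parameters (2 layers of 512) to about 8.4M (4 layers of 512) or about 31.4M (4 layers of 1024), so only adding layers roughly doubles the size while also widening to 1024 multiplies it by about 7.5.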

netxiao · April 11, 2017, 10:11am

Thanks for your great answer!

tel34 · April 11, 2017, 6:42pm

Great post - I was thinking about this problem this morning