How to merge two trained models?

netxiao · February 13, 2017, 2:12am

Is there a way to merge two trained models?

I would like to train separately on 5 computers and then merge the resulting models.

guillaumekln · February 13, 2017, 9:09am

Ensembling models is not supported yet.

You could take a look at this paper that claims to successfully simulate model ensembling by averaging model weights. However, results vary depending on how different the models are.
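
To make the weight-averaging idea concrete, here is a minimal sketch. It assumes each checkpoint is a plain mapping from parameter name to a flat list of floats, and that all models share the same architecture. The names `Checkpoint` and `average_checkpoints` are hypothetical and not part of OpenNMT; real checkpoints hold tensors, but the arithmetic is the same element-wise mean.

```python
from typing import Dict, List

# Hypothetical representation: parameter name -> flat list of weights.
Checkpoint = Dict[str, List[float]]


def average_checkpoints(checkpoints: List[Checkpoint]) -> Checkpoint:
    """Return the element-wise mean of several same-shaped checkpoints."""
    if not checkpoints:
        raise ValueError("need at least one checkpoint")

    # All checkpoints must describe the same set of parameters.
    names = set(checkpoints[0])
    for ckpt in checkpoints[1:]:
        if set(ckpt) != names:
            raise ValueError("checkpoints must share the same parameter names")

    averaged: Checkpoint = {}
    for name in checkpoints[0]:
        columns = [ckpt[name] for ckpt in checkpoints]
        # Each parameter must have the same number of values in every model.
        if len({len(col) for col in columns}) != 1:
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Mean of the i-th value across all models.
        averaged[name] = [sum(vals) / len(vals) for vals in zip(*columns)]
    return averaged


model_a = {"w": [1.0, 2.0], "b": [0.0]}
model_b = {"w": [3.0, 4.0], "b": [1.0]}
merged = average_checkpoints([model_a, model_b])
```

Averaging is more likely to be meaningful when the models start from the same initialization or are checkpoints of the same training run; independently trained models can end up with weights that are not comparable position by position.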

netxiao · February 13, 2017, 9:15am

Many thanks!

liluhao1982 · February 13, 2017, 2:58pm

Thanks for sharing this.

Does this mean the current version of NMT doesn’t support incremental learning for the time being?

I found another article on the forum:

Please correct me if I am wrong.

Thanks again.

srush · February 13, 2017, 3:25pm

Incremental learning of a single model is supported. We just don’t have ensembling yet.

netxiao · February 14, 2017, 1:45am

Please see this:
How to process Large Train Data out of memory? #72
https://github.com/OpenNMT/OpenNMT/issues/72

Use this method to train a model incrementally.

liluhao1982 · February 14, 2017, 2:06pm

Thanks for sharing, this helps.

Nart · August 11, 2020, 6:36pm

I’m currently training separate models for different language pairs. The languages are all from the same family. Would ensembling these models into a multilingual model be beneficial?
Is there any plan to support ensembling models?

guillaumekln · August 12, 2020, 6:51am

You can check this experiment:

The tutorial is for OpenNMT-lua, but you can use similar techniques with any framework.

Nart · August 13, 2020, 5:33pm

Thank you.
You answered part of the question, but are you going to support ensembling in the near future?

guillaumekln · August 17, 2020, 11:34am

Ensembling is implemented in OpenNMT-py.

We currently have no plan to add it to OpenNMT-tf, but it should not be too complex to integrate, so it could happen in the future.
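
As a rough illustration of what ensembling does at decoding time, as opposed to averaging weights, here is a minimal sketch. It assumes each model returns a probability distribution over the same vocabulary for the next token, and that the ensemble simply takes the mean of those distributions. The names `ensemble_step` and `pick_token` are hypothetical; this is not the OpenNMT-py implementation, only the general idea.

```python
from typing import Dict, List

# Hypothetical representation: token -> probability of being the next token.
Distribution = Dict[str, float]


def ensemble_step(distributions: List[Distribution]) -> Distribution:
    """Average the next-token distributions predicted by several models."""
    if not distributions:
        raise ValueError("need at least one distribution")

    # Every model must score the same vocabulary.
    vocab = set(distributions[0])
    if any(set(dist) != vocab for dist in distributions[1:]):
        raise ValueError("models must share the same vocabulary")

    count = len(distributions)
    return {
        token: sum(dist[token] for dist in distributions) / count
        for token in distributions[0]
    }


def pick_token(distributions: List[Distribution]) -> str:
    """Greedy choice: the token with the highest averaged probability."""
    averaged = ensemble_step(distributions)
    return max(averaged, key=averaged.get)


model_a = {"cat": 0.6, "dog": 0.4}
model_b = {"cat": 0.2, "dog": 0.8}
averaged = ensemble_step([model_a, model_b])
best = pick_token([model_a, model_b])
```

Because only the output distributions are combined, the models need a shared target vocabulary but not identical weights or architectures. The cost is that every model has to be run at each decoding step.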