How to load the checkpoints ckpt-1.data-00001-of-00002

yashugupta786 · July 6, 2020, 11:30am

I am using opennmt-tf . After training i am geting checkpoints as ckpt-1.data-00001-of-00002
how to load the checkpoints for infer as its introducing the error while loading command for training and infer are

training !onmt-main --model_type NMTMediumV1 --auto_config --config /content/data.yml train --num_gpus 1

infer
!onmt-main
–config /content/toy-ende/data.yml --auto_config
–checkpoint_path /content/toy-ende/run/ ckpt-1.data-00001-of-00002
infer --features_file toy-ende/src-test.txt --predictions_file toy-ende/pred.txt

training on google colab

guillaumekln · July 6, 2020, 11:42am

To load the latest checkpoint:

onmt-main --config /content/toy-ende/data.yml --auto_config infer --features_file toy-ende/src-test.txt --predictions_file toy-ende/pred.txt

To load ckpt-1:

onmt-main --config /content/toy-ende/data.yml --auto_config --checkpoint_path /content/toy-ende/run/ckpt-1 infer --features_file toy-ende/src-test.txt --predictions_file toy-ende/pred.txt

yashugupta786 · July 6, 2020, 11:51am

In my run folder there is no file like ckpt-1 exist .
the file exist in the run folder is in the nomenclature of ckpt-1.data-00000-of-00002
so when i am loading its throwing the error

guillaumekln · July 6, 2020, 11:55am

In my run folder there is no file like ckpt-1 exist

A checkpoint consists of multiple files, so you should pass the common prefix which should be /content/toy-ende/run/ckpt-1.

What is the error when you set --checkpoint_path /content/toy-ende/run/ckpt-1?

yashugupta786 · July 6, 2020, 12:05pm

Thanks a lot it worked

I have few things to ask if you can help me
We are building es-en nmt production ready engine -
1 Is there any pretrained model available .
2 in opennmt-tf how to handle unk token to original source token while doing translation
foreg if some word not exist while translating it get replaced with unk token and after that its should not assign a random translation to that token and make it as a source token in the translated text
3 can you help me in defining the optimum configuration for es-en for training the model and which model you suggest with how many steps

4 how to text inference for single query instead of passing file .

Hope you will help me in making a good nmt engine as i have already tried different architecture but still not get production ready environment

guillaumekln · July 6, 2020, 12:10pm

No.
Usually the unk problem can be resolved (or largely mitigated) by using subword tokenization, such as SentencePiece
Use --model_type Transformer --auto_config
Look at the serving documentation and examples:

yashugupta786 · July 6, 2020, 12:18pm

can we use BPE for handling the rare words ? . but suppose if some numeric number is passed to engine is there a way to mask that so that in the target translation it should remain same and for that word not translation should happen
as in this below link its describe can it can be achieve in opennmt-tf

guillaumekln · July 6, 2020, 2:37pm

Yes.

This kind of custom preprocessing and postprocessing should be applied outside of OpenNMT-tf.

yashugupta786 · July 6, 2020, 4:26pm

1- Add -replace_unk to the translation command, and it will replace the tag with the original word, i.e. it will keep it untranslated.

2- Add -phrase_table to the translation command followed by a dictionary file path to replace the tag with a translation from the file. So the -replace_unk option should be there as well.

The phrase table file should include a single translated word (token) per line in the format:
source|||target
Is this above functionality available in Tensorflow version of opennmt-tf. If so how to achieve this in inference

guillaumekln · July 6, 2020, 4:33pm

Phrase table is not implemented in OpenNMT-tf.

yashugupta786 · July 6, 2020, 4:35pm

for the point 1 is it there to convert the unk tag to the original word i.e. it will keep it untranslated.?

guillaumekln · July 7, 2020, 7:37am

Look for replace_unknown_target in https://opennmt.net/OpenNMT-tf/configuration.html. However, this was mainly useful for word-based translation with RNNs, which has been superseded by subword tokenization and Transformer where this option is no longer relevant.

yashugupta786 · July 7, 2020, 8:32am

So what i have done is and please correct me if i gave done some thing new as i am new to this

After loading the data my cod is learning the bpe and then applying the bpe . based on that i am creating the vocab and perform the training but in the training i haven’t used any subword and i am using the below configuration . so please correct me for handling the unk with source word in target

params:
decoding_noise:
- dropout: 0.1
- replacement: [0.1, ｟unk｠]
- permutation: 3
decoding_subword_token: ￭
replace_unknown_target: false

what i have to change in the parameter configuration and what needs to be add as i have already applied the bpe before training so in the parameter what needs to be add
Hope you will answer and thanks for the support in advance