How to get multiple translations for a sentence (beam search?)

After training, I would like to get multiple translations for a sentence.
I am using opennmt-tf, and the config file is given at the end of this post.

The command I used to get the translations is:
onmt-main infer --config config.yml data.yml --features_file test.txt --predictions_file translations.out

In the documentation: http://opennmt.net/OpenNMT/translation/beam_search/,
the -beam_size and -n_best options are mentioned, but I am not sure where to put these two arguments in the config file. There is already beam_width: 5 in the params section.

config.yml

# Default OpenNMT parameters from the Lua version.

params:
  optimizer: AdamOptimizer
  optimizer_params:
    beta1: 0.9
    beta2: 0.998
  learning_rate: 0.0001
  param_init: 0.1
  clip_gradients: 5.0
  decay_type: exponential_decay
  decay_rate: 0.85
  decay_steps: 10000
  start_decay_steps: 10000
  beam_width: 5
  maximum_iterations: 2500

train:
  batch_size: 100
  bucket_width: 1
  save_checkpoints_steps: 5000
  save_summary_steps: 100
  train_steps: 1000000
  maximum_features_length: 30
  maximum_labels_length: 30
  # Consider setting this to -1 to match the number of training examples.
  sample_buffer_size: -1

eval:
  eval_delay: 3600  # Every 1 hour.

infer:
  batch_size: 30

You can add the n_best option under the infer section of the configuration file.
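As a minimal sketch (the values are only illustrative, reusing the config.yml above), the relevant parts would look like:

params:
  beam_width: 5

infer:
  batch_size: 30
  n_best: 5  # write the 5 best hypotheses for each input sentence

With n_best: 5, the predictions file should then contain 5 consecutive lines (hypotheses) for each input sentence.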


Thank you. By the way, is beam_width: 5 under the params section related to n_best under the infer section? For example, if beam_width = 1, is n_best = 10 meaningful?

An error is thrown if n_best is greater than beam_width.
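To make the constraint concrete (values are only an example):

params:
  beam_width: 5

infer:
  n_best: 10  # invalid: n_best cannot exceed beam_width
  # n_best: 5 # valid: at most beam_width hypotheses exist to return

With beam_width = 1 the beam search reduces to a single hypothesis, so n_best = 10 would not be meaningful and would trigger this error.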


I set beam_width: 50 under the params section.
During training, everything is fine.
However, during inference, I get the error shown at the end.
I then reduced test.txt (passed via --features_file test.txt) to a single line,
but the error was still there. When I changed beam_width to 13, it worked.
However, the maximal beam_width that works during inference differs from sentence to sentence. This causes me problems because I want test.txt to contain several sentences and to launch inference only once.

What is the right way to get several (a fixed number of) translations for each sentence with a single inference command? (If fewer translations than requested can be found for a particular sentence, just output something like None.)

Error message:

INFO:tensorflow:Done running local_init_op.
2018-06-08 05:00:49.064659: W tensorflow/core/framework/op_kernel.cc:1318] OP_REQUIRES failed at gather_nd_op.cc:50 : Invalid argument: flat indices[233, :] = [12, 0, 19] does not index into param (shape: [14,1,18,7]).
Traceback (most recent call last):
  File "/home/shieh/Applications/miniconda3/envs/deep-learning-base/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1322, in _do_call
    return fn(*args)
  File "/home/shieh/Applications/miniconda3/envs/deep-learning-base/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1307, in _run_fn
    options, feed_dict, fetch_list, target_list, run_metadata)
  File "/home/shieh/Applications/miniconda3/envs/deep-learning-base/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1409, in _call_tf_sessionrun
    run_metadata)
tensorflow.python.framework.errors_impl.InvalidArgumentError: flat indices[233, :] = [12, 0, 19] does not index into param (shape: [14,1,18,7]).
[[Node: seq2seq/decoder_1/decoder_1/GatherNd = GatherNd[Tindices=DT_INT32, Tparams=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"](seq2seq/decoder_1/decoder_1/Reshape_2, seq2seq/decoder_1/decoder_1/stack)]]

During handling of the above exception, another exception occurred:

See this issue: