CTranslate2's target prefix

panosk · May 26, 2020, 10:43am

Hello,

My understanding is that this feature requires all the tokens from the beginning of the output up to the token we wish to change. What if we want to change a token at or near the end of the output? Then effectively we are providing the correct output and we are just asking the engine to reproduce it. Is there another mechanism or strategy so we can update a single or multiple tokens in any position?

Thanks!

guillaumekln · May 26, 2020, 11:27am

Hi,

What do you have in mind?

The target prefix is a simple and effective way to know what tokens should be force decoded and where to start the unconstrained decoding.

Also I don’t see how you can change multiple positions in one request, because the first unconstrained token may completely change the rest of the translation.

panosk · May 26, 2020, 1:07pm

Hi

On-the-fly translation improvement using the same model :).

Yes, of course, this makes perfect sense --I’ve just started looking into this feature so I’m still digesting the concept.

Basically, the idea is to implement some translation memory features to continuously improve the output.

guillaumekln · May 26, 2020, 1:15pm

I’m pretty sure DeepL is using the exact same approach to provide their autocompletion and alternative words features in the translation box.

ajitesh3 · March 3, 2021, 12:16pm

Hi @guillaumekln
I wish to see how this target_prefix auto complete thing is implemented in OpenNMT latest version. Can you direct me to it

guillaumekln · March 3, 2021, 1:07pm

You can search for “prefix” in the decoding code:

github.com

OpenNMT/CTranslate2/blob/master/src/decoding.cc

#include "ctranslate2/decoding.h"

#include <cmath>
#include <map>

#include "ctranslate2/ops/ops.h"
#include "device_dispatch.h"
#include "type_dispatch.h"

namespace ctranslate2 {

  static const ops::Gather gather;

  static void split_batch_beam(StorageView& input, dim_t beam_size) {
    Shape shape = input.shape();
    shape.insert(shape.begin() + 1, beam_size);
    shape[0] /= beam_size;
    input.reshape(shape);
  }

This file has been truncated. show original

ajitesh3 · March 3, 2021, 1:56pm

@guillaumekln Arent we having this feature in OpenNMT-py itself

guillaumekln · March 3, 2021, 1:57pm

For OpenNMT-py see this PR:

ajitesh3 · March 3, 2021, 2:01pm

ok
thanks @guillaumekln

ymoslem · March 3, 2021, 10:34pm

Here is an implementation from MS India. I believe it is OpenNMT-py 1.x though, but I hope it can give you an idea.

Kind regards,
Yasmin

ajitesh3 · March 4, 2021, 1:34pm

thanks @ymoslem
Looking into this