|
The translation results consist entirely of the special character `<unk>`.
|
|
0
|
692
|
August 23, 2024
|
|
How to use pre-trained BPEmb subword embeddings with latest versions of OpenNMT and OpenNMT-py?
|
|
5
|
4203
|
June 16, 2024
|
|
Input to reshape is a tensor with 528066 values, but the requested shape has 352022
|
|
0
|
938
|
November 20, 2023
|
|
Weighted datasets and tokenization
|
|
1
|
927
|
September 19, 2023
|
|
Error converting model to ctranslate2
|
|
5
|
2188
|
April 7, 2023
|
|
Translate_batch(): incompatible function arguments. for ctranslate2
|
|
1
|
3701
|
January 18, 2023
|
|
Low bleu score with Sentencepiece comparing to othoner tokenizers
|
|
2
|
2231
|
August 12, 2022
|
|
Getting no output when usig SentencePiece
|
|
1
|
1407
|
April 23, 2022
|
|
Using Sentencepiece/Byte Pair Encoding on Model
|
|
42
|
22848
|
March 16, 2022
|
|
Overfitting Model
|
|
2
|
1396
|
February 28, 2022
|
|
Question about English to Chinese
|
|
7
|
4250
|
February 17, 2022
|
|
Translation Example in OpenNMT 2.0 Docs
|
|
12
|
4689
|
October 2, 2021
|
|
Single character tokenization?
|
|
10
|
6887
|
August 26, 2021
|
|
Hard spaces lost when tokenizing
|
|
5
|
2986
|
July 19, 2021
|
|
How Much Does Tokenization Affect Neural Machine Translation?
|
|
1
|
2271
|
June 21, 2021
|
|
Different subword tokenization in same word pattern
|
|
3
|
1381
|
November 20, 2020
|
|
Tokenizer v1.20.0 with SentencePiece v0.1.92 potentially problematic?
|
|
5
|
2176
|
October 3, 2020
|
|
Tokenizer (sp_model, vocabulary_threshold) with unexpected results
|
|
6
|
1377
|
September 29, 2020
|
|
How to define the tokenization technique in opennmt-tf
|
|
1
|
1230
|
July 8, 2020
|
|
Korean - English Model
|
|
14
|
5396
|
May 25, 2020
|
|
Tensor conversion/ValueError when training with online tokenizer
|
|
7
|
2352
|
May 11, 2020
|
|
Data not being Tokenized properly
|
|
4
|
1273
|
May 10, 2020
|
|
Character tokenizer with TF2 version
|
|
2
|
1442
|
March 17, 2020
|
|
Problems with pyonmttok
|
|
4
|
2991
|
October 2, 2019
|
|
Core dump while loading the tokenizer
|
|
3
|
1810
|
September 16, 2019
|
|
Issue with special character U+FF5F
|
|
18
|
3086
|
August 14, 2019
|