First OpenNMT Workshop in Paris - Registration is open!
|
|
13
|
4319
|
March 1, 2018
|
Cannot scale well with multiple GPUs
|
|
32
|
4900
|
November 8, 2022
|
OpenNMT-py BERT Tutorial
|
|
13
|
7479
|
October 13, 2021
|
Pre-processing corpora
|
|
23
|
5680
|
December 12, 2017
|
Translation server crashing after initialization
|
|
31
|
4524
|
March 7, 2017
|
Bpe, vocab size
|
|
20
|
5578
|
August 3, 2021
|
Load a savedmodel in tensorFlow
|
|
18
|
5835
|
April 12, 2022
|
Linguistic features surprisingly decrease the performance!
|
|
19
|
5645
|
May 14, 2019
|
Best way to handle emojis during translation
|
|
37
|
4078
|
June 3, 2020
|
Tokenize.lua results in
|
|
16
|
6053
|
June 23, 2017
|
Are you interested in training Russian-Abkhazian parallel corpus?
|
|
29
|
4483
|
June 9, 2020
|
Multi-GPUs is slower than single GPU
|
|
12
|
6744
|
April 16, 2019
|
Epochs Determination
|
|
21
|
5138
|
April 2, 2024
|
Issues when running the English-German WMT15 training
|
|
22
|
5023
|
November 22, 2018
|
Sentence length & translation quality
|
|
13
|
6396
|
June 11, 2018
|
Training stuck (multi GPU, transformer)
|
|
12
|
6494
|
June 1, 2024
|
Ensemble decoding
|
|
16
|
5520
|
October 19, 2017
|
Improvement of performance by data normalization
|
|
23
|
4626
|
August 16, 2019
|
CTranslate2 on OpenNMT-py Server
|
|
13
|
3364
|
February 1, 2021
|
In-domain training
|
|
14
|
3183
|
March 19, 2017
|
Convert M2M model to CTranslate2
|
|
12
|
5888
|
June 20, 2022
|
Some experience when training with large datasets
|
|
17
|
4932
|
November 1, 2017
|
Supporting Tokenize/Detokenize automatically by Translation Server
|
|
17
|
4900
|
September 11, 2017
|
Pretrained Model English<->German
|
|
13
|
3091
|
February 14, 2017
|
OpenNMT-py error when training with large amount of data
|
|
21
|
4294
|
November 6, 2022
|
Terminology handling
|
|
16
|
4861
|
September 8, 2023
|
Leave Unknown Words Untranslated
|
|
17
|
4664
|
December 19, 2019
|
How to resume training from last interupted state
|
|
11
|
3179
|
July 21, 2019
|
Translate with CTranslate using cuda
|
|
16
|
4738
|
November 25, 2020
|
Bleu score falling when detokenizing, detruecasing and de subword BPE
|
|
16
|
4644
|
March 23, 2023
|
Where can I find sentencepiece_model of my model?
|
|
12
|
5171
|
January 29, 2020
|
Single character tokenization?
|
|
10
|
5442
|
August 26, 2021
|
Issue in using distributed training in openNMT-TF
|
|
31
|
3186
|
November 21, 2018
|
DataLossError : Checksum does not match
|
|
10
|
5327
|
May 18, 2019
|
Installed but can't get pre-processing working
|
|
16
|
4237
|
November 25, 2019
|
TensorFlow REST API + SentencePiece
|
|
19
|
3865
|
August 30, 2021
|
Korean - English Model
|
|
14
|
4420
|
May 25, 2020
|
Preprocessing corpus for case_feature and POS tags
|
|
12
|
4673
|
March 22, 2018
|
XML tags handling
|
|
18
|
3825
|
October 23, 2018
|
Teacher forcing
|
|
11
|
4758
|
July 25, 2017
|
RuntimeError: Model diverged with loss = NaN
|
|
13
|
4333
|
February 21, 2020
|
Size of feature embeddings (and some digression about casing methods)
|
|
17
|
3698
|
July 29, 2020
|
Detokenization clarification
|
|
10
|
4535
|
January 11, 2017
|
Fine tuning nllb-200-distilled-600M model
|
|
16
|
3575
|
April 15, 2024
|
Feature Request List
|
|
14
|
3705
|
August 7, 2018
|
Convert to Keras Model for CoreMlTools
|
|
16
|
3396
|
June 4, 2021
|
Training Won't resume from latest checkpoint
|
|
14
|
3587
|
September 4, 2022
|
Custom attention mask
|
|
20
|
3029
|
August 18, 2020
|
Incorporating translation dictionary during decoding
|
|
15
|
3440
|
August 29, 2019
|
Using EmbeddingsSharingLevel in a dual source transformer
|
|
22
|
2740
|
June 19, 2020
|