OpenNMT-tf fine-tune an existing model
|
|
1
|
34
|
September 27, 2023
|
cuDNN launch failure After updating tensorflow to 2.13.0
|
|
0
|
28
|
September 26, 2023
|
Export saved_model with tflite mode
|
|
0
|
40
|
September 22, 2023
|
H100 perfomance
|
|
0
|
39
|
September 20, 2023
|
How quality of validation dataset affects final results of a model with a fixed number of steps?
|
|
2
|
58
|
September 19, 2023
|
Learning rate start to decay before start_decay_steps
|
|
3
|
79
|
September 8, 2023
|
BLEU decreases so much after averaging checkpoints
|
|
5
|
357
|
March 27, 2022
|
Any negative effects of using the parameter "Replace unknowns = True" in inference?
|
|
3
|
77
|
August 31, 2023
|
OpenNMT-TF checkpoint conversion
|
|
1
|
70
|
August 26, 2023
|
Curriculum Learning in the Age of Transformers - Parts I-II
|
|
8
|
2223
|
July 30, 2023
|
Single words incorrect translation
|
|
3
|
167
|
July 28, 2023
|
Influence of the parameter "number of heads" on the size of the model
|
|
1
|
104
|
July 26, 2023
|
Increasing effective batch size
|
|
4
|
221
|
July 21, 2023
|
Domain Adaptation Techniques
|
|
16
|
2079
|
July 11, 2023
|
What are the best techniques to add noise
|
|
1
|
195
|
June 23, 2023
|
What are the best techniques to add noise to set clear and unambiguous translation of single word
|
|
2
|
172
|
June 22, 2023
|
Worse performance with different CTranslate2 quantization types
|
|
2
|
182
|
June 14, 2023
|
Incorporating Linguistic Features in Training Data?
|
|
4
|
98
|
June 13, 2023
|
ValueError: shuffle_buffer_size < 0 is not compatible with weighted datasets
|
|
1
|
101
|
June 13, 2023
|
Convert ArgosTranslate model to OpenNMT model
|
|
2
|
153
|
May 29, 2023
|
Problems encountered during word segmentation,
|
|
4
|
115
|
June 5, 2023
|
Sorted VS unsorted corpus for training model
|
|
2
|
101
|
May 31, 2023
|
Model export by averaged checkpoints
|
|
3
|
156
|
May 15, 2023
|
Configuring the LstmCnnCrfTagger with yaml
|
|
3
|
117
|
May 12, 2023
|
Parameter Sharing across Layers in Transformers
|
|
0
|
107
|
April 27, 2023
|
Implementing multi features and multi source together in a transformer
|
|
5
|
684
|
April 25, 2023
|
GPU recommendations
|
|
6
|
583
|
April 22, 2023
|
Error converting model to ctranslate2
|
|
5
|
380
|
April 7, 2023
|
Target tokens out of vocabulary
|
|
1
|
133
|
April 3, 2023
|
Weighted dataset
|
|
3
|
271
|
March 22, 2023
|