Learning rate start to decay before start_decay_steps | 3 | 252 | September 8, 2023
BLEU decreases so much after averaging checkpoints | 5 | 567 | March 27, 2022
Fine-Tuning Llama-2 quantized with CT2 | 5 | 1540 | September 2, 2023
English Persian translator | 5 | 392 | September 1, 2023
Any negative effects of using the parameter "Replace unknowns = True" in inference? | 3 | 229 | August 31, 2023
OpenNMT-TF checkpoint conversion | 1 | 219 | August 26, 2023
Create Knime Workflow with OpenNMT on AWS GPU instance | 0 | 206 | August 18, 2023
Ctranslate2 gives KeyError: 'vocab' when translating HF Llama2 model | 2 | 315 | August 10, 2023
How to define the train steps when finetune nllb-200 | 0 | 272 | August 3, 2023
Issues running the OpenNMT-py REST server | 61 | 5577 | August 3, 2023
Failing conversion of Small100 (SMALL100Tokenizer does not exist or is not currently imported) | 6 | 425 | August 1, 2023
Extra token produced | 6 | 312 | July 26, 2023
Inference Llama-2 with CTranslate2 | 1 | 1284 | July 26, 2023
Influence of the parameter "number of heads" on the size of the model | 1 | 221 | July 26, 2023
Increasing effective batch size | 4 | 499 | July 21, 2023
Is Data Checkpointing possible? | 1 | 256 | July 7, 2023
[Outdated] -report_align not working | 3 | 245 | July 6, 2023
What are the best techniques to add noise | 1 | 311 | June 23, 2023
Ctranslate2 Support for DeltaLM | 1 | 283 | June 22, 2023
Correct settings when source word features | 0 | 287 | June 20, 2023
cTranslated Falcon-7B on OpenNMT-py server | 1 | 364 | June 17, 2023
Wav2vec2 support in CTranslate2 | 0 | 313 | June 16, 2023
I want to run in gpu but actually it runs in cpu | 1 | 298 | June 16, 2023
Worse performance with different CTranslate2 quantization types | 2 | 447 | June 14, 2023
ValueError: not enough values to unpack (expected 2, got 1) | 4 | 3147 | June 14, 2023
Incorporating Linguistic Features in Training Data? | 4 | 237 | June 13, 2023
ValueError: shuffle_buffer_size < 0 is not compatible with weighted datasets | 1 | 211 | June 13, 2023
List index out of range while training a model using Opennmt-py | 1 | 312 | June 9, 2023
Error with lora_weights.py | 1 | 376 | June 6, 2023
Problems encountered during word segmentation, | 4 | 261 | June 5, 2023