Learning rate start to decay before start_decay_steps | 3 | 250 | September 8, 2023
BLEU decreases so much after averaging checkpoints | 5 | 564 | March 27, 2022
Fine-Tuning Llama-2 quantized with CT2 | 5 | 1525 | September 2, 2023
English Persian translator | 5 | 388 | September 1, 2023
Any negative effects of using the parameter "Replace unknowns = True" in inference? | 3 | 227 | August 31, 2023
OpenNMT-TF checkpoint conversion | 1 | 215 | August 26, 2023
Create Knime Workflow with OpenNMT on AWS GPU instance | 0 | 204 | August 18, 2023
Ctranslate2 gives KeyError: 'vocab' when translating HF Llama2 model | 2 | 311 | August 10, 2023
How to define the train steps when finetune nllb-200 | 0 | 271 | August 3, 2023
Issues running the OpenNMT-py REST server | 61 | 5564 | August 3, 2023
Failing conversion of Small100 (SMALL100Tokenizer does not exist or is not currently imported) | 6 | 417 | August 1, 2023
Extra token produced | 6 | 310 | July 26, 2023
Inference Llama-2 with CTranslate2 | 1 | 1265 | July 26, 2023
Influence of the parameter "number of heads" on the size of the model | 1 | 221 | July 26, 2023
Increasing effective batch size | 4 | 490 | July 21, 2023
Is Data Checkpointing possible? | 1 | 254 | July 7, 2023
[Outdated] -report_align not working | 3 | 243 | July 6, 2023
What are the best techniques to add noise | 1 | 310 | June 23, 2023
Ctranslate2 Support for DeltaLM | 1 | 280 | June 22, 2023
Correct settings when source word features | 0 | 285 | June 20, 2023
cTranslated Falcon-7B on OpenNMT-py server | 1 | 360 | June 17, 2023
Wav2vec2 support in CTranslate2 | 0 | 310 | June 16, 2023
I want to run in gpu but actually it runs in cpu | 1 | 297 | June 16, 2023
Worse performance with different CTranslate2 quantization types | 2 | 444 | June 14, 2023
ValueError: not enough values to unpack (expected 2, got 1) | 4 | 3138 | June 14, 2023
Incorporating Linguistic Features in Training Data? | 4 | 236 | June 13, 2023
ValueError: shuffle_buffer_size < 0 is not compatible with weighted datasets | 1 | 211 | June 13, 2023
List index out of range while training a model using Opennmt-py | 1 | 306 | June 9, 2023
Error with lora_weights.py | 1 | 371 | June 6, 2023
Problems encountered during word segmentation, | 4 | 259 | June 5, 2023