Topic | Replies | Views | Activity
Fine tuning nllb-200-distilled-600M model | 16 | 1481 | April 15, 2024
Adding language Geez Ethiopian to NLLB | 19 | 616 | May 31, 2023
Error: Unable to convert model from OpenNMT-py to CTranslate2 | 10 | 264 | March 24, 2024
OpenNMT-py v3.3 released - following 3.2 with plenty of new features | 5 | 1029 | November 2, 2023
Support for Mistral-7B from Mistral AI | 0 | 660 | September 29, 2023
New Python package for exploring LLMs using CTranslate2 | 0 | 290 | June 7, 2023
OpenNMT-py v3.4.3 released - blazing fast beam search inference | 3 | 440 | November 2, 2023
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset + Model | 1 | 678 | November 3, 2023
Ctranslate2 Supports MADLAD-400 | 2 | 702 | January 14, 2024
AWQ Quantization support - New generic converter for all HF llama-like models | 2 | 676 | December 29, 2023
Memory leak in Argos Translate | 5 | 710 | February 13, 2024
Independent CTranslate2 benchmarking | 1 | 410 | June 11, 2023
Inference Llama-2 with CTranslate2 | 1 | 1301 | July 26, 2023
English Persian translator | 5 | 398 | September 1, 2023
Convert ArgosTranslate model to OpenNMT model | 3 | 406 | February 5, 2024
Support for Phi-2 from Microsoft | 2 | 208 | January 24, 2024
Fine-Tuning Llama-2 quantized with CT2 | 5 | 1565 | September 2, 2023
cTranslated Falcon-7B on OpenNMT-py server | 1 | 371 | June 17, 2023
LLMs as NMT: comparison between ALMA-7/13B-R and TowerInstruct | 5 | 552 | February 27, 2024
Is it normal to see "Weighted corpora loaded so far" in a loop during the finetuning phase on a very small dataset? | 6 | 584 | February 15, 2024
OpenNMT-py Docker images | 0 | 231 | December 4, 2023
Failing conversion of Small100 (SMALL100Tokenizer does not exist or is not currently imported) | 6 | 440 | August 1, 2023
OpenNMT with "mps" instead of "cuda" on Mac os 12.6 | 0 | 1095 | May 16, 2023
Increasing effective batch size | 4 | 505 | July 21, 2023
Support madlad400 on ctranslate2 | 4 | 469 | November 24, 2023
Getting encoder embeddings for words from the model | 4 | 211 | November 30, 2023
Input_sentence_size parameter into the spm.SentencePieceTrainer.Train | 1 | 524 | May 31, 2023
Extra token produced | 6 | 314 | July 26, 2023
Worse performance with different CTranslate2 quantization types | 2 | 462 | June 14, 2023
Extracting word alignment from translation models | 3 | 386 | January 23, 2024
Device side assert triggered on AWQ Mistral converted model | 5 | 270 | February 16, 2024
Using a learned BPE Model for Transformer | 0 | 106 | March 9, 2024
Any tutorial on how to finetune using OpenNMT | 1 | 541 | September 16, 2023
Vocab not recognized during translation, producing <unk> all over | 2 | 409 | November 23, 2023
Difference between GPU and CPU translation | 4 | 305 | June 1, 2023
Training speed with alignment significantly drops down | 0 | 107 | January 11, 2024
Using SharedEmbeddings Transformer model with Pretrained Embeddings | 0 | 103 | February 8, 2024
Single words incorrect translation | 3 | 363 | July 28, 2023
How to assign class weights in the loss function in BCEloss | 1 | 443 | December 2, 2023
About the issues with openNMT-py in machine translation models. Really looking forward to the expert's response and assistance! Many thanks! | 5 | 215 | January 25, 2024
Ctranslate2 gives KeyError: 'vocab' when translating HF Llama2 model | 2 | 322 | August 10, 2023
How to set YAML file to train on multi-gpus | 2 | 314 | May 14, 2023
Error with lora_weights.py | 1 | 381 | June 6, 2023
Compile Opennmt-Tf models with AWS neuron sdk | 2 | 348 | November 26, 2023
Incorporating Linguistic Features in Training Data? | 4 | 244 | June 13, 2023
Model export by averaged checkpoints | 3 | 284 | May 15, 2023
Installing OpenNMT-tf on Tesla T4 | 4 | 238 | November 19, 2023
Problems encountered during word segmentation, | 4 | 264 | June 5, 2023
Unload Whisper Model from GPU in Python | 1 | 343 | December 22, 2023
OpenNMT to Huggingface Transformers | 2 | 280 | December 18, 2023