Topic | Replies | Views | Activity
Fine tuning nllb-200-distilled-600M model | 16 | 1564 | April 15, 2024
Adding language Geez Ethiopian to NLLB | 19 | 640 | May 31, 2023
Error: Unable to convert model from OpenNMT-py to CTranslate2 | 10 | 308 | March 24, 2024
OpenNMT-py v3.3 released - following 3.2 with plenty of new features | 5 | 1053 | November 2, 2023
Support for Mistral-7B from Mistral AI | 0 | 677 | September 29, 2023
New Python package for exploring LLMs using CTranslate2 | 0 | 306 | June 7, 2023
OpenNMT-py v3.4.3 released - blazing fast beam search inference | 3 | 456 | November 2, 2023
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset + Model | 1 | 716 | November 3, 2023
Ctranslate2 Supports MADLAD-400 | 2 | 773 | January 14, 2024
AWQ Quantization support - New generic converter for all HF llama-like models | 2 | 735 | December 29, 2023
Memory leak in Argos Translate | 5 | 742 | February 13, 2024
Independent CTranslate2 benchmarking | 1 | 426 | June 11, 2023
Inference Llama-2 with CTranslate2 | 1 | 1332 | July 26, 2023
English Persian translator | 5 | 415 | September 1, 2023
Convert ArgosTranslate model to OpenNMT model | 3 | 423 | February 5, 2024
Support for Phi-2 from Microsoft | 2 | 224 | January 24, 2024
Fine-Tuning Llama-2 quantized with CT2 | 5 | 1632 | September 2, 2023
cTranslated Falcon-7B on OpenNMT-py server | 1 | 380 | June 17, 2023
LLMs as NMT: comparison between ALMA-7/13B-R and TowerInstruct | 5 | 600 | February 27, 2024
Is it normal to see "Weighted corpora loaded so far" in a loop during the finetuning phase on a very small dataset? | 6 | 626 | February 15, 2024
OpenNMT-py Docker images | 0 | 250 | December 4, 2023
Failing conversion of Small100 (SMALL100Tokenizer does not exist or is not currently imported) | 6 | 458 | August 1, 2023
OpenNMT with "mps" instead of "cuda" on Mac os 12.6 | 0 | 1123 | May 16, 2023
Support madlad400 on ctranslate2 | 4 | 521 | November 24, 2023
Increasing effective batch size | 4 | 519 | July 21, 2023
Using a learned BPE Model for Transformer | 0 | 141 | March 9, 2024
Getting encoder embeddings for words from the model | 4 | 230 | November 30, 2023
Device side assert triggered on AWQ Mistral converted model | 5 | 322 | February 16, 2024
Extracting word alignment from translation models | 3 | 410 | January 23, 2024
Extra token produced | 6 | 326 | July 26, 2023
Input_sentence_size parameter into the spm.SentencePieceTrainer.Train | 1 | 546 | May 31, 2023
Worse performance with different CTranslate2 quantization types | 2 | 475 | June 14, 2023
Building CTranslate2 From Source | 3 | 190 | April 1, 2024
Any tutorial on how to finetune using OpenNMT | 1 | 592 | September 16, 2023
Using SharedEmbeddings Transformer model with Pretrained Embeddings | 0 | 117 | February 8, 2024
Training speed with alignment significantly drops down | 0 | 123 | January 11, 2024
Vocab not recognized during translation, producing <unk> all over | 2 | 434 | November 23, 2023
Single words incorrect translation | 3 | 378 | July 28, 2023
Difference between GPU and CPU translation | 4 | 310 | June 1, 2023
About the issues with openNMT-py in machine translation models. Really looking forward to the expert's response and assistance! Many thanks! | 5 | 243 | January 25, 2024
How to assign class weights in the loss function in BCEloss | 1 | 456 | December 2, 2023
Unload Whisper Model from GPU in Python | 1 | 405 | December 22, 2023
Ctranslate2 gives KeyError: 'vocab' when translating HF Llama2 model | 2 | 330 | August 10, 2023
Error with lora_weights.py | 1 | 401 | June 6, 2023
How to set YAML file to train on multi-gpus | 2 | 327 | May 14, 2023
Compile Opennmt-Tf models with AWS neuron sdk | 2 | 364 | November 26, 2023
Problems encountered during word segmentation, | 4 | 276 | June 5, 2023
Installing OpenNMT-tf on Tesla T4 | 4 | 251 | November 19, 2023
Incorporating Linguistic Features in Training Data? | 4 | 252 | June 13, 2023
OpenNMT to Huggingface Transformers | 2 | 300 | December 18, 2023