Sentence Boundary Detection for Non-European languages | 5 | 730 | February 13, 2022
Build a Glossary | 9 | 492 | January 31, 2022
Multilingual model | 9 | 473 | January 28, 2022
Train/Infer on paragraphs | 15 | 850 | January 13, 2022
Retraining model with new datasets | 2 | 342 | January 6, 2022
Can better desubwording be done? | 7 | 626 | December 18, 2021
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts | 0 | 276 | December 14, 2021
Compare BLEU score between models | 2 | 317 | December 13, 2021
End and Start Tokens | 11 | 1175 | November 29, 2021
Handling single word 1 to many translation | 2 | 410 | November 6, 2021
Efficiently Modeling Long Sequences with Structured State Spaces | 0 | 397 | October 6, 2021
YouTokenToMe - Up to 90x faster subword encoding | 0 | 417 | October 3, 2021
How an optimal parallel corpus should look like? | 8 | 492 | September 29, 2021
How to choose a best voc size? | 14 | 3266 | September 18, 2021
Fastformer: Additive Attention is All You Need | 2 | 579 | August 31, 2021
Single character tokenization? | 10 | 3541 | August 26, 2021
Using custom tags for domain adaptation | 4 | 829 | August 25, 2021
Deep transformer with more layers | 2 | 478 | August 24, 2021
Suggestions for translating XML | 9 | 1867 | August 1, 2021
Implementing Boosting Techniques | 9 | 642 | July 28, 2021
H-Transformer-1D: Fast One-Dimensional Hierarchical Attention for Sequences | 0 | 373 | July 27, 2021
Multi-source vocabularies | 2 | 447 | July 9, 2021
Best practices for multiple corpora | 0 | 286 | July 9, 2021
Size of subwords vocabulary? | 3 | 2338 | July 7, 2021
Linguee Open Source? | 2 | 342 | June 30, 2021
How Much Does Tokenization Affect Neural Machine Translation? | 1 | 623 | June 21, 2021
TLIte Converter | 4 | 391 | June 8, 2021
Training convergence and beam size impact | 5 | 833 | June 2, 2021
Minimum Sentences for Languages Model | 1 | 412 | May 18, 2021
How to use Bert embedding into OpenNMT seq2seq model | 4 | 1044 | May 11, 2021