Website
GitHub
OpenNMT
Quantized model shards
ctranslate2
vrojkova1
(Viktoria)
June 15, 2024, 2:22am
1
How to shard quantized model to load to different GPUs?