Website
GitHub

Quantized model shards

ctranslate2

vrojkova1 (Viktoria) June 15, 2024, 2:22am 1

How to shard quantized model to load to different GPUs?

Home
Categories
FAQ/Guidelines
Terms of Service
Privacy Policy

Powered by Discourse, best viewed with JavaScript enabled