Convert T5 for ctranslate2

Guoli · April 9, 2021, 12:25pm

Would it be possible to add the conversion for google T5 and Facebook Bart in huggingface/transformers to ctranslate2? It seems like cTranslate2 performance is outstanding and it is very few framework that can support int8 quantization in GPU. (Onnx, TF, Pytorch don’t support it)

guillaumekln · April 9, 2021, 3:51pm

Yes, I think we should look into adding these models. I would first need to review the exact architecture they use to estimate the amount of work.

guillaumekln · April 11, 2021, 10:58am

2 posts were split to a new topic: Question about Intel MKL

guillaumekln · June 24, 2021, 7:40am

A post was split to a new topic: Port OpenNMT-py models to HuggingFace

guillaumekln · January 2, 2023, 12:54pm

Both T5 and BART can now be converted to CTranslate2. See the examples here:

https://opennmt.net/CTranslate2/guides/transformers.html