CTranslate2 gives KeyError: 'vocab' when translating HF Llama2 model

avesuni · August 9, 2023, 7:55am

Hi,

I have fine-tuned a Llama2 model from Meta with QLoRA. After merging and saving the weights (model.merge_and_unload()), I am trying to convert it with ct2-transformers-converter --model merged/pytorch_model.bin --output_dir ct2_model. This gives me KeyError: 'vocab'. Why is CTranslate2 expecting the model binary to include a vocab?

guillaumekln · August 10, 2023, 7:03am

See the description of the option --model:

Name of the pretrained model to download, or path to a directory containing the pretrained model.

So you should try passing merged/ to this option, not the checkpoint file.
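
For reference, here is a rough sketch of the same conversion done through the Python API instead of the command line. It assumes the merged model was saved to merged/ and that ctranslate2 and transformers are installed. The helper names model_dir_for and convert_merged_model are made up for this illustration and are not part of CTranslate2.

```python
from pathlib import Path

# File extensions treated as "a checkpoint file" in this sketch (an assumption).
CHECKPOINT_SUFFIXES = {".bin", ".safetensors"}


def model_dir_for(path):
    """Return the model directory for a path (hypothetical helper).

    If the path points at a checkpoint file such as
    merged/pytorch_model.bin, return its parent directory (merged).
    Otherwise return the path unchanged.
    """
    p = Path(path)
    return p.parent if p.suffix in CHECKPOINT_SUFFIXES else p


def convert_merged_model(model_path, output_dir):
    """Convert a Transformers model directory to the CTranslate2 format."""
    # Imported here so the helper above can be used without ctranslate2.
    from ctranslate2.converters import TransformersConverter

    # The converter takes a model name or a directory, not a weights file.
    converter = TransformersConverter(str(model_dir_for(model_path)))
    return converter.convert(output_dir)


if __name__ == "__main__":
    # Same paths as in the question; adjust them to your setup.
    convert_merged_model("merged/pytorch_model.bin", "ct2_model")
```

The only point of the helper is to make sure the converter is handed the directory rather than the weights file, which is what the --model description asks for.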

avesuni · August 10, 2023, 8:01am

I tried that. It gives an error along the lines of "merged/ is a directory".