I successfully converted a Marian model using ct2-marian-converter. It runs fine on GPU. However, I get the following error message when trying to run it on CPU only:
terminate called after throwing an instance of 'std::runtime_error'
what(): No SGEMM backend on CPU
Aborted (core dumped)
Is it possible to run converted Marian models on CPU? If so, are there specific parameters that must be set, either when building the model or when converting it?
Thanks!