Please bear with me for theses questions…
I’m slowly making my way through all the learning from tokenization/modeling/serving.
I succeed all the first steps, but now I’m at the point where I have a solid web page which can use my model through ctranslate2, but I’m not able to support more than 1 model as there are size restriction on web page and the model have to be preloaded. So in someway I need to create an API somewhere that will handle my multiple models and have my web interface call that API.
- Is my understanding correct to believe that nmt-wizard-docker can do just exactly that?
- Could I put this into Google Cloud and make an API?
I have 0 experience with API/docker… I actually learn what was a docker today… I’m not going to flood the forum with questions, but I just want to validate that I’m not already searching in the wrong direction.