Questions about contributing codes to CTranslate2

lost-libra · April 13, 2023, 9:54am

Hello there,

We’d like to contribute our codes about a sentence-level MoE structure to CTranslate2.
Compared with original Transformer, the structure introduces a gating network and several experts in the decoder layers, where the gating network will route data to the most suitable expert and the experts are used to fine-tune decoder features (e.g. fine-tune the decoder features from generalization to domain specialization).
The experts can also be seen as the adapters mentioned in this issue.
After testing our codes with the processes in CONTRIBUTING.md, we have some questions:

Is there a requirement for coding style (such as naming rules for variables)?
Is it welcomed to separate the updated code from the original code as much as possible?
Since our model is trained with Fairseq, is it necessary to contribute the training code to Fairseq?

Thank you.

lost-libra · April 19, 2023, 6:32am

@ guillaumekln hi, sorry to bother you, could you please answer our questions? Thanks in advance.

guillaumekln · April 19, 2023, 7:58am

Hello,

Sorry, I saw your post but forget to come back to it after the weekend.

So you trained a custom Fairseq model, updated the CTranslate2 code to support it, and now you want to contribute the changes to the official repository.

In general I don’t accept this type of contributions because I can’t spend time to maintain some code that can only be used by a single organization or individual. There could be exceptions for small code changes, but here it seems to me that the code change is quite large.

Contributing the code to Fairseq would indeed be a first step towards integrating the changes in CTranslate2.

Alternatively, I recently worked to make the core library more extensible. Now you can define your own model specification and then register the related C++ model instance at runtime:

github.com

OpenNMT/CTranslate2/blob/v3.12.0/include/ctranslate2/models/model_factory.h

#pragma once

#include "model.h"

namespace ctranslate2 {
  namespace models {

    class ModelFactory {
    public:
      static ModelFactory& get_instance() {
        static ModelFactory factory;
        return factory;
      }

      template <typename Model, typename... Args>
      bool register_model(const std::string& name, Args&&... args) {
        Builder builder = [args...]() { return std::make_shared<Model>(args...); };
        return _registry.emplace(name, std::move(builder)).second;
      }

This file has been truncated. show original

Is this something that can work for you?