What would be required to extend the word features (see: OpenNMT Labeled word features)
to support labeling blocks of text? For example, let’s say that I have statements like:
S: "What is the color of the blue block? The color of the blue block is blue!"
T: "Are you sure?"
S: "What is the color of the blue block? Yes I’m sure!"
T: "How do you know?"
S: "What is the color of the blue block? I know because you told me that it’s blue!"
T: "That is correct!"
I want to split them with labels like this:
S: "CONTEXT What is the color of the blue block? ||| MESSAGE The color of the blue block is blue!"
T: "MESSAGE Are you sure?"
S: "CONTEXT What is the color of the blue block? ||| MESSAGE Yes I’m sure!"
T: "MESSAGE How do you know?"
S: "CONTEXT What is the color of the blue block? ||| MESSAGE I know because you told me that it’s blue!"
T: "MESSAGE That is correct!"
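As a stopgap that needs no changes to the toolkit, the block labels could probably be expanded into ordinary per-token word features in a preprocessing step. The sketch below is only that: `expand_block_labels` is a hypothetical helper, tokenization is plain whitespace splitting, and the separator `￨` (U+FFE8) is my assumption about what the word features format expects, so it should be checked against the documentation.

```python
# Assumption: the word features format joins a token and its feature with
# U+FFE8; verify this against the OpenNMT documentation before relying on it.
SEPARATOR = "\uffe8"
BLOCK_DELIMITER = "|||"
LABELS = {"CONTEXT", "MESSAGE"}


def expand_block_labels(line, separator=SEPARATOR):
    """Copy each block's leading label onto every token in that block.

    'CONTEXT a b ||| MESSAGE c' becomes 'a<sep>CONTEXT b<sep>CONTEXT c<sep>MESSAGE'.
    """
    annotated = []
    for block in line.split(BLOCK_DELIMITER):
        tokens = block.split()  # naive whitespace tokenization
        if not tokens:
            continue  # skip empty blocks, e.g. a trailing delimiter
        label, words = tokens[0], tokens[1:]
        if label not in LABELS:
            raise ValueError(f"block does not start with a known label: {block!r}")
        annotated.extend(f"{word}{separator}{label}" for word in words)
    return " ".join(annotated)


if __name__ == "__main__":
    print(expand_block_labels("CONTEXT What is the color of the blue block? ||| MESSAGE Yes I’m sure!"))
```

This repeats the label on every token, which is wasteful and is exactly why first-class support for block labels would be nicer.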
I’d be happy to fork the code and submit a pull request if I can get some direction. What changes would need to be made to implement this feature?
(Either in OpenNMT or OpenNMT-py)