Character-Aware Models

Add an extension to support character aware models. Main trickiness is abstracting the data class a bit more so features like this can be cleanly added.

Has there been any update on the character aware models ? I couldn’t find any on the last release.


We only support this in OpenNMT-tf. It requires some time to get used to its concepts but you can build complex models with it. For example, the provided sequence tagging model combines word embeddings and character-level embeddings in a single input:

1 Like