Does word embedding size change when we use word features?

lengockyquang · May 28, 2019, 12:42am

Hi everyone, as I mentioned in the title: does the word embedding size change when we add word features?
For example, I use word features such as lemma, POS tag, word cluster, and dependency parsing, with embedding sizes of 128, 64, 64, and 64 respectively. If the default embedding dim is 500, does the total embedding dim become 500 + 128 + 64 * 3 = 820 when we use word features?

guillaumekln · May 28, 2019, 6:06am

Hi,

Yes, they are concatenated.
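
To make the concatenation concrete, here is a minimal sketch in plain Python. It is not OpenNMT code: the feature names and the all-zero vectors are placeholders, and the sizes are the ones from the question. It only shows how the dimensions add up, 500 + 128 + 64 * 3 = 820.

```python
# Illustrative sizes taken from the question; this is not OpenNMT code.
word_dim = 500
feature_dims = {"lemma": 128, "pos": 64, "cluster": 64, "dep": 64}  # hypothetical names

# Stand-in embedding vectors (all zeros), one per input stream.
word_vec = [0.0] * word_dim
feature_vecs = [[0.0] * dim for dim in feature_dims.values()]

# Concatenation: append each feature vector to the end of the word vector.
combined = list(word_vec)
for vec in feature_vecs:
    combined.extend(vec)

total_dim = len(combined)
print(total_dim)  # 820
```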

lengockyquang · May 29, 2019, 12:11am

I’ve read this paper: Linguistic Input Features Improve Neural Machine Translation. On page 86, in the bottom-right paragraph, it says:

To ensure that performance improvements are not simply due to an increase in the number of model parameters, we keep the total size of the embedding layer fixed to 500

So I’m a bit confused about whether, when OpenNMT uses linguistic features, the embedding dim increases with the feature dim sizes.

guillaumekln · May 29, 2019, 7:27am

As I said, the embeddings are concatenated. It’s left to the user to control the embedding sizes with -word_vec_size and -feat_vec_size.
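
If you want to keep the total fixed at 500 as in the paper, one way is to shrink the word embedding so that word plus features sum to the budget. A rough sketch of that arithmetic follows; `word_size_for_budget` is a hypothetical helper written for this post, not an OpenNMT function, and the feature sizes are the ones from the question rather than the paper's own split. How per-feature sizes are passed to -feat_vec_size depends on your OpenNMT version, so check the option documentation.

```python
def word_size_for_budget(total_dim, feature_dims):
    """Return the word embedding size that keeps the concatenated size at total_dim."""
    remaining = total_dim - sum(feature_dims)
    if remaining <= 0:
        raise ValueError("feature embeddings already use the whole budget")
    return remaining


# Feature sizes from the question: lemma, POS tag, word cluster, dependency.
feature_dims = [128, 64, 64, 64]
word_dim = word_size_for_budget(500, feature_dims)
print(word_dim)  # 180
```

With these sizes the word embedding would be 180, so the concatenated vector is 180 + 128 + 64 * 3 = 500.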