Has anyone tested an RNN with larger recurrence?

Etienne38 · June 23, 2017, 5:19am

Has anyone tested a kind of convolutional RNN, i.e. an RNN that takes K of its previous states as input?
It might handle local patterns better.
Smaller layer sizes would certainly be needed to avoid an explosion in the number of weights.
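
To make the idea concrete, here is a minimal pure-Python sketch of one possible reading: each new state is computed from the current input plus the K most recent states, with one recurrent matrix per lag. The class name `KStateRNN`, the update rule `h_t = tanh(W_x x_t + sum_k U_k h_{t-k})`, the missing bias terms, and the random initialisation are all assumptions made for illustration, not an existing implementation.

```python
import math
import random
from collections import deque


def make_matrix(rows, cols, rng, scale=0.1):
    """Small random matrix stored as a list of rows."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(m, v):
    """Matrix-vector product for list-of-rows matrices."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


class KStateRNN:
    """Hypothetical cell: h_t = tanh(W_x x_t + sum_{k=1..K} U_k h_{t-k})."""

    def __init__(self, input_size, hidden_size, k, seed=0):
        rng = random.Random(seed)
        self.input_size = input_size
        self.hidden_size = hidden_size
        self.k = k
        self.w_x = make_matrix(hidden_size, input_size, rng)
        # One recurrent matrix per lag: this is where the weight count grows with K.
        self.u = [make_matrix(hidden_size, hidden_size, rng) for _ in range(k)]

    def num_weights(self):
        return self.hidden_size * self.input_size + self.k * self.hidden_size ** 2

    def run(self, inputs):
        # The K most recent states, newest first, initialised to zeros.
        history = deque(
            [[0.0] * self.hidden_size for _ in range(self.k)], maxlen=self.k
        )
        outputs = []
        for x in inputs:
            pre = matvec(self.w_x, x)
            for u_k, h_prev in zip(self.u, history):
                pre = [p + r for p, r in zip(pre, matvec(u_k, h_prev))]
            h = [math.tanh(p) for p in pre]
            history.appendleft(h)  # the oldest state is dropped automatically
            outputs.append(h)
        return outputs
```

With K = 1 this reduces to a plain Elman-style RNN without biases. The recurrent part costs K × hidden_size² weights, which is why the hidden size would have to shrink as K grows.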

guillaumekln · June 23, 2017, 7:54am

Yes, someone is experimenting with a CNN-based encoder similar to:

https://arxiv.org/abs/1611.02344

A work in progress.

Etienne38 · June 23, 2017, 8:00am

Isn’t fairseq a pure CNN, without recurrence?

guillaumekln · June 23, 2017, 8:10am

Correct, that’s not exactly what you are referring to, but it is a first take on convolutions.

Is the idea from a paper?

Etienne38 · June 23, 2017, 8:23am

Just a personal guess… with a few time steps in the recurrent part of an RNN, it might handle local patterns better. On the other hand, convolution-only networks would handle patterns but might miss a time-sequence effect.

dbl · June 23, 2017, 9:26am

From a theoretical standpoint, a very deep CNN with character-based rather than word-based encoding would be really interesting, particularly for translation to/from morphologically rich languages. I’m tempted to believe that with the right setup, the system could learn things like compounding/de-compounding, agglutination, and morphological affixes quite nicely.