Seq2seq pretraining


(srush) #1

This paper looks relatively simple to implement, claims strong MT results

UNSUPERVISED PRETRAINING FOR SEQUENCE TO SEQUENCE LEARNING