Anybody have tried to use transformer to on wmt 17 Chinese<->English dataset yet? How is that Compared to SogouNMT?