Looking for a master's thesis topic



I am a Computer Science (Mathematical Information Technology) student at University of Jyväskylä. I am currently looking for the topic, study method, and aims for my master’s thesis focusing on neural machine translation.

Currently, I am still trying to figure out whether to make my thesis just a literature review or something a little more hands-on. I find that if I want to something hands-on, OpenNMT is a great project to work with. I have been reading about OpenNMT, including the technical report by Klein et al. My question is: what is there to do with OpenNMT? Are there some features, theories, or different techniques to apply or at least test for OpenNMT (within the scope of a master’s thesis)?

I realise that if I familiarised myself with the code and read more about the topic, I could probably figure something out by myself, but at this point, it would help me tremendously to get some ideas from people who are already familiar with OpenNMT and the world of neural machine translation in general. Thank you!

(Vincent Nguyen) #2

It depends really on:

  • what you are willing to focus on: theory, coding, testing, …
  • the time you can devote to this.

There is something that we never found the time to do:
revisit the bench mark of each kind of model vs the related paper.

Lately we focussed on the Transformer but we never released updated results for the WMT tasks for the LSTM (even though I did some with the Lua framework for the conference) , CNN, SRU.

Then from a more theoritical, there is plenty of testing to accomplish by mixing some configurations.
For this, some coding is required, and would be welcome as contributions.



Hi Vincent,

thank you for the reply. I realise now that I didn’t explain what I mean by ‘hands-on’, sorry about that. I mean doing some kind of application of theory to OpenNMT, so in this case, probably coding and/or testing.

As for the time I can devote, our university recommends (based on study points) 800-900 hrs of work for the master’s thesis, of which about 40% goes into theoretical background and 40% to data gathering/analysis, but I am ready to devote even more time (and probably will spend more time on it anyways). I am starting my thesis now and I need to graduate (at latest) before July 2020, so I have allocated about 1,5 years for the thesis.

I will take this information to my instructor, but I am also interested in details, for example, about testing related to mixing some configurations.

Thank you very much!