What is the best way to generate parallel corpus as per your past experience?


(Karimkhan) #1

I am working with Hindi to English translation and rare corpus available for this language.

I am thinking for creating my own corpus but could not finalize the approach. What is better way to go? Please suggest any better technique available

  1. Using some regional guys to build corpus.
  2. Using Google translation and converting text to target language and building the corpus