
English-Odia parallel corpus


  • 80,437 pairs, each an English text followed by its Odia translation, can be downloaded from our NMT model repo.
  • Parallel pairs have been collected from many sources by many volunteers.
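
As a rough illustration of working with such pairs, the sketch below reads English/Odia pairs from a tab-separated stream and skips malformed rows. The file layout (one pair per line, English and Odia separated by a tab) and the `read_pairs` helper are assumptions made for this example; check the NMT model repo for the actual format of the download.

```python
import csv
import io
from typing import Iterator, TextIO, Tuple


def read_pairs(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (english, odia) pairs from a tab-separated stream.

    Assumed layout (hypothetical): one pair per line, English first,
    then a tab, then the Odia translation.
    """
    # QUOTE_NONE keeps quotation marks inside sentences as literal text.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 2:
            continue  # skip lines that do not contain exactly two fields
        english, odia = row[0].strip(), row[1].strip()
        if english and odia:
            yield english, odia


# Small in-memory sample standing in for the downloaded file.
sample = io.StringIO(
    "Hello\tନମସ୍କାର\n"
    "broken line without a tab\n"
    "Thank you\tଧନ୍ୟବାଦ\n"
)
pairs = list(read_pairs(sample))
```

With a real file, the same function could be called on `open(path, encoding="utf-8", newline="")`, where `path` points at wherever the downloaded pairs were saved.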


Odia monolingual corpus

  • Monolingual Odia data has been extracted from Wikipedia.
  • You can use this repo to fetch the latest dataset.
  • A ready-made monolingual corpus (~17,000 Wikipedia articles), created by Gaurav, is available on Kaggle.

Odia dictionary

  • The dictionary data has been extracted from Odia Purnachandra Bhashakosha.
  • The source code for the dataset is in the OdiaNLP/dictionary repository.

Last update: 2023-03-27