Amharic-English Parallel Corpus
This corpus consists of 145,820 Amharic-English parallel sentences (segments) from various sources. This corpus is larger in size than previously compiled corpora. It is released for research purposes and can be used to train or support Amharic-English machine translation systems.
All the documents in the corpus are documents which have been made publicly available in the Web. The corpus has been obtained by crawling the Web. In this distribution, for copyright reasons, the sentences are randomized. By downloading this corpus you agree that the corpus should only be used for research purposes.
When using this data, please cite the original publication:
Gezmu, Andargachew Mekonnen, Andreas Nürnberger, and Tesfaye Bayu Bati. "A Parallel Corpus for Amharic–English Machine Translation." Technical Report, FIN-004-2018, Data and Knowledge Engineering Group, Otto-von-Guericke-Universität Magdeburg, ISSN 1869-5078. Available at: http://www.inf.ovgu.de/Forschung/Technical+Reports/2018/_/Technical_report.pdf
For more details about the corpus, refer to the original publication.