Amharic-English Parallel Corpus

DOI

10.24352/UB.OVGU-2018-145

Abstract

This corpus consists of 145,820 Amharic-English parallel sentences (segments) from various sources. This corpus is larger in size than previously compiled corpora. It is released for research purposes and can be used to train or support Amharic-English machine translation systems.

License

All the documents in the corpus are documents which have been made publicly available in the Web. The corpus has been obtained by crawling the Web. In this distribution, for copyright reasons, the sentences are randomized. By downloading this corpus you agree that the corpus should only be used for research purposes.

Citation

When using this data, please cite the original publication:

Gezmu, Andargachew Mekonnen, Andreas Nürnberger, and Tesfaye Bayu Bati.  "A Parallel Corpus for Amharic–English Machine Translation." Technical Report, FIN-004-2018, Data and Knowledge Engineering Group, Otto-von-Guericke-Universität Magdeburg, ISSN 1869-5078. Available at: http://www.inf.ovgu.de/Forschung/Technical+Reports/2018/_/Technical_report.pdf

Download

Amharic-English Parallel Corpus

Description

For more details about the corpus, refer to the original publication.

Last Modification: 06.11.2018 - Contact Person:

Sie können eine Nachricht versenden an: Prof. Dr.-Ing. Andreas Nürnberger
Sicherheitsabfrage:
Captcha
 
Lösung: