Data & Knowledge Engineering Group
Home > Research > Topics > Cross lingual and Multilingual Information Retrieval

Research

The widespread use of the Internet has increased the multilingual information available online. Furthermore, the non-native English speakers have increased. Initially online documents were used predominantly by English speakers. Now more than half (50.4%) of web users speak a native language other than English. It has become more important that documents of different languages and cultures are retrieved in response to the user's request.

Our research in this area focuses on supporting multilingual information retrieval by interactive retrieval tools with a focus on european languages. However in addition special attention is given to the Arabic language. We focus on different approaches for multilingual information retrieval: One is using machine-readable multilingual dictionaries; the other is automatic extraction of possible correct translation equivalents sensed by statistical analysis of parallel corpora. For the second approach we use a statistical/probabilistic method on parallel text written in multiple languages in order to identify the correct sense of the word translation using bilingual parallel text as training data.

multilingual search

Selected Publications

  • Farag Ahmed and Andreas Nürnberger, Corpora based Approach for Arabic/English Word Translation Disambiguation. Journal of Speech and Language Technology, Volume 11, pp. 195-213, 2009.
  • Farag Ahmed and Andreas Nürnberger, Arabic/English Word Translations Disambiguation using Parallel Corpora and Matching Schemes, In: Proceedings of the 12th European Machine Translation Conference (EAMT08) 22-23 September 2008 at University of Hamburg, Germany. pp. 6-11
  • Farag Ahmed, Ernesto William De Luca and Andreas Nürnberger. MultiSpell: an N-Gram Based Language-Independent Spell Checker. In: Poster Postproc of Eighth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007). Mexico City, Mexico, IEEE CS Press, 2008
  • Ernesto William De Luca und Andreas Nürnberger,  Adaptive Support for Cross-language Text Retrieval, in: Barry Smyth, Helen Ashman und Vincent Wade (Hrsg.),  Proc. of the Int. Conf. on Adaptive Hypermedia and Adaptive Web-Based Systems (AH 2006), LNCS 4018, S.: 425-429, Springer Verlag, Berlin, 2006.
  • Ernesto William De Luca, Martin Eul and Andreas Nürnberger.  Multilingual Query-Reformulation using an RDF-OWL EuroWordNet Representation. In: Proceedings of the Workshop on Improving Web retrieval for non-English queries (iNEWS07). In conjunction with the SIGIR 2007 Konferenz, Amsterdam, 2007 (to appear).
  • Ernesto William De Luca, Martin Eul and Andreas Nürnberger.  Converting EuroWordNet in OWL and Extending It with Domain Ontologies. In: Proceedings of the Workshop on Lexical-Semantic and Ontological Resources. In conjunction with the GLDV-Frühjahrstagung (GLDV 2007). Tübingen, 2007.
  • Ernesto William De Luca and Andreas Nürnberger,  A Word Sense-Oriented User Interface for Interactive Multilingual Text Retrieval In: Proceedings of the Workshop Information Retrieval In conjunction with the LWA 2006, GI joint workshop event "Learning, Knowledge and Adaptivity", Hildesheim, 2006.
  • Ernesto William De Luca and Andreas Nürnberger,  LexiRes: A Tool for Exploring and Restructuring EuroWordNet for Information Retrieval In: Proceedings of the Workshop on Text-based Information Retrieval (TIR-06). In conjunction with the 17th European Conference on Artificial Intelligence (ECAI'06). Riva del Garda, Italy, 2006.
  • Ernesto William De Luca, Stefan Hauke, Andreas Nürnberger and Stefan Schlechtweg,  MultiLexExplorer: Combining Multilingual Web Search with Multilingual Lexical Resources In: Proceedings of the combined Workshop on Language-Enabled Educational Technology and Development and Evaluation of Robust Spoken Dialogue Systems. In conjunction with the 17th European Conference on Artificial Intelligence (ECAI'06). Riva del Garda, Italy, pp. 17-21, 2006.
  • Ernesto William De Luca, Stefan Hauke, Andreas Nürnberger and Stefan Schlechtweg,  Using Multilingual Ontologies for Adaptive Web-based Language Exploration. In: Proceedings of the International Workshop on Applications of Semantic Web Technologies for E-Learning (SW-EL06). In conjunction with the International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems (AH2006), pp. 35-44, 2006.
this page: Seite drucken | Seite vorlesen lassen
last modified: 25.02.2010 - contact person: E-Mail  Webmaster
 
Impressum // © OvGU