Cross lingual and Multilingual Text Retrieval
General Course Information
On this web page, information (slides, etc.) for the seminar 'Cross lingual and Multilingual Text Retrieval', which is will be given during Winter term 2009/2010 by Farag Ahmed. Prerequisites for attending this course, is basic knowledge of computer science (Vordiplom). Therefore, this course is open to all students of computer science having Vordiplom.
Description of topics
The increasing diversity of the Internet web sites has created millions of multilingual resources in the world wide web. At first glance, it seems that increasingly more information can be retrieved by non-English speaking people. However, in order to retrieve information written by these languages, some issues need to be tackled. In this seminar we will deal with different problems related to multilingual text retrieval, analyzing different approaches that help to overcome some issues regarding these languages such as:
- Word sense disambiguation (Word translation disambiguation)
- Query expansion
- Comparable corpora construction
- Corpora alignment (word or sentence level)
- Text segmentation (e.g Chinese text segmentation)
- POS Tagging (Part of speech tagging)
- Cross-language information retrieval evaluation
- Language identification etc.
Course Schedule and Room Assignments
|Monday 11:00 - 13:00
If you have any questions concerning the lectures or assignments please contact (if possible by email)
Requirements for the 'Schein'
The following criteria has to be fullfilled for the 'Schein':
- 30 minutes presentation about an elected topic (topics can be elected in the first lesson)
delivery of the slides 1 week before the presentation date
- active participation and reasonable discussion about the presentations
- write a scientific paper (hints will be given at the beginning of the seminar), 7-10 pages
delivery of the paper 1 week before the presentation date
- participation in the other presentations
If you need a 'benoteten Schein' you have to announce it in the first lesson!
Hints on the delivery
The delivery of the paper, the slides and the reports can be done by email to firstname.lastname@example.org . Please send also the source file (word or latex document). The paper and the slides have to be send one week before the presentation date by wednesday at 8:00 am. The reports have to be send one day before the presentation date of the examined talk at 8:00 am. These deadlines must be strictly adhered.
We will provide the lecture slides and further material during the course.