This is an overview of the course Text Processing (in Swedish: Dokumenthantering) that will be taught in the Winter term of 1998 at the Department of Linguistics at the University of Uppsala.
The goal of the course is to give the students an introduction in document processing from the perspective of someone that has to manage a large collection of documents from different sources. An additional goal is to give the students an introduction to the programming language Perl.
You can take a look at a summary of the results of the evaluation forms of the course: the midcourse evaluation. The final evaluation will be held in the second week of March.
date time room subject 1. ti 2001 10-12 B163 Lecture (Perl) 2. on 2101 10-12 A204 Lecture (Character Encoding) 3. fr 2301 10-12 H327 Lab (deadline 980204:8/8) 4. ti 2701 11-13 F318 Lecture (Text Formats) 5. fr 3001 10-12 H327 Lab (deadline 980218:8/8) 6. må 0202 14-16 A214 Lecture (Simple Text Processing) 7. ti 0302 10-12 B163 Lecture (Advanced Text Processing) 8. on 0402 10-12 K334 Lecture (Text Compression) 9. fr 0602 10-12 H327 Lab (deadline 980218:8/8) 10. ti 1002 10-12 B163 Lecture (Text Compression) 11. on 1102 10-12 A122 Lecture (Indexing, midcourse evaluation) 12. fr 1302 10-12 H327 Lab (deadline 980225:8/8) 13. ti 1702 10-12 B163 Lecture (Indexing) 14. on 1802 10-12 B119 Lecture (Querying) 15. fr 2002 10-12 H327 Lab (deadline 980304:5/5) 16. ti 2402 10-12 A204 Lecture (Querying) 17. on 2502 10-12 K334 Lecture (Index construction) 18. fr 2702 10-12 H327 Lab (deadline 980329:4/5) 19. ti 0303 10-12 H327 Lab 20. on 0403 10-12 H327 Lab 21. fr 0603 10-12 H327 Lab 22. ti 1003 10-12 H327 Lab 23. on 1103 10-12 H327 Lab (final evaluation) 24. fr 1303 10-12 H327 Lab
The students will make different small practical exercises and one large practical exercise. For each of the exercises they will receive a grade between 0 and 10. In order to pass the course the student will have to obtain an average grade of at least 6.0 for the small exercises and at least 6.0 for the large exercise. In order to get VG the student will have to obtain the required grades for passing the course and obtain a mark for the large exercise of 9.0 or higher.
Literature for this course will be taken from different sources. The most important are:
Neither Wall 92 nor Witten 94 is compulsory. Note: there are many other good Perl books beside Wall 92.
Here are some useful links to Perl material: