Dokumenthantering VT97:04
These are the exercises and references for the fourth class of
the course Dokumenthantering.
Exercises
The results of the exercises marked with * have to be handed in.
- * Write a Perl program that divides a text into sentences.
- * Write a Perl program that divides a text into words.
Note: these programs can be as complex as you like.
When you decide what your program should do,
take into account the time you have available for this
exercise.
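As a possible starting point, the two tasks can be sketched as follows. This is only a minimal sketch under stated assumptions, not a model solution: the function names split_sentences and split_words are made up for illustration, a sentence is assumed to end at ".", "!" or "?" followed by a space and an uppercase ASCII letter, and a word is assumed to be a run of ASCII letters or digits with optional internal apostrophes or hyphens. Abbreviations such as "Dr." and non-ASCII letters such as the Swedish å, ä and ö are deliberately not handled.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Split a text into sentences.
# Assumption: a sentence boundary is a space that follows '.', '!' or '?'
# and precedes an uppercase ASCII letter. Abbreviations are not detected.
sub split_sentences {
    my ($text) = @_;
    $text =~ s/\s+/ /g;     # collapse line breaks and runs of whitespace
    $text =~ s/^ | $//g;    # trim a leading or trailing space
    return grep { length } split /(?<=[.!?]) (?=[A-Z])/, $text;
}

# Split a text into words.
# Assumption: a word is a run of ASCII letters or digits, optionally
# joined by single internal apostrophes or hyphens (don't, well-known).
sub split_words {
    my ($text) = @_;
    return $text =~ /[A-Za-z0-9]+(?:['-][A-Za-z0-9]+)*/g;
}

# Small demonstration on a fixed sample text.
my $sample    = "This is a test. Does it work? It seems so!";
my @sentences = split_sentences($sample);
my @words     = split_words($sample);
print "$_\n" for @sentences;
print scalar(@words), " words\n";
```

Much of the remaining work lies in refining the two definitions, for example by adding a list of known abbreviations or by treating numbers such as "3.14" as single tokens.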
References
- [WMB94] Ian H. Witten, Alistair Moffat and Timothy C. Bell. "Managing
Gigabytes: Compressing and Indexing Documents and Images", Van
Nostrand Reinhold, 1994.
- [BD92] Bengt Dahlqvist.
"TSSA 2.0, A PC Program for Text Segmentation and Sorting",
Department of Linguistics, Uppsala University, 1994.
- http://spectra.eng.hawaii.edu/Courses/EE150/Book/chap10/chap10.html
The chapter "Searching and Sorting" in "Computer Programming Methods"
by Tep Dobry.
Last update: March 17, 1997.
erikt@stp.ling.uu.se