Unix exercise 1

This is an exercise on using UNIX commands for processing text corpora. The exercise was originally designed for the students of the Computational Lexicography II course which was taught taught in the Winter term of 1996 at the department of Linguistics at the University of Uppsala by Hong Liang Qiao and Erik Tjong Kim Sang.

In all these exercises we use the same press text of the tagged LOB corpus. The text can be found in the file:

/corpora/ICAME/lobtagh/lobth_a.txt

The text contains about 102,000 words. A description of the contents of the text can be found in the file loblst.txt in the same directory.


Exercise 1

Find out how many words in the text are tagged with the tag NN


If you do not know what commands you can use for making this exercise you may want to read the hints for this exercise.

Your answer for this exercise will determine the location of the next exercise. If the answer is nnnnn then the next exercise will be in the file exnnnnn.html in this web directory.


Last update: April 14, 1996. erikt@stp.ling.uu.se