Statistical NLP: Exercise 3

This is the third of a series of exercises on statistical natural language processing. In this exercise you examine a statistical model for spelling correction.

This exercise has been created by Erik Tjong Kim Sang, University of Antwerp, Campus Drie Eiken, room J0.07, phone 03-8202793, e-mail erikt@uia.ua.ac.be

Assignments

Use the available spelling corrector to complete the following assignments:

Make a multi-word sentence with exactly one spelling error that the spelling corrector corrects in the right way. You may choose whether to work with Dutch or English. What sentence did you present to the corrector? What output did it generate?
Make a multi-word sentence with exactly one word that the corrector does not know and for which it cannot generate an alternative. What sentence did you present to the corrector? What is the unknown word?
Make a multi-word sentence with exactly one spelling error that the spelling corrector corrects in the wrong way, although the correct word is in its dictionary. What sentence did you present to the corrector? What should the response of the program have been? And what output did it generate instead?
Explain the error made by the spelling corrector in the previous assignment by computing the scores it assigns to the correct word and to its incorrect suggestion.
Hint: for this assignment you need to estimate the priors and likelihoods, using the search facilities in the background information of the spelling corrector.
The spelling corrector does not perform perfectly. Give a non-technical suggestion for how its performance can be improved.
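
The score comparison asked for above can be sketched as follows. This is only a minimal illustration, assuming the corrector ranks candidates by a noisy-channel score, that is, the prior of the candidate word multiplied by the likelihood of the typed error given that word; the exercise itself does not spell out the formula. All counts and the names `intended` and `suggested` are hypothetical placeholders, not values from the actual corrector. Replace them with the counts you find in its background information.

```python
def relative_frequency(count, total):
    """Estimate a probability as count / total (0.0 when total is zero)."""
    return count / total if total else 0.0


def score(prior, likelihood):
    """Assumed noisy-channel score: P(candidate) * P(typed | candidate)."""
    return prior * likelihood


def rank(candidates, corpus_size):
    """Return (word, score) pairs sorted from highest to lowest score."""
    scored = {}
    for word, (word_count, error_count, opportunities) in candidates.items():
        prior = relative_frequency(word_count, corpus_size)
        likelihood = relative_frequency(error_count, opportunities)
        scored[word] = score(prior, likelihood)
    return sorted(scored.items(), key=lambda pair: pair[1], reverse=True)


# Hypothetical placeholder counts, NOT taken from the real corrector.
corpus_size = 1_000_000
candidates = {
    # word: (count of the word in the corpus,
    #        count of the observed error type,
    #        count of opportunities for that error type)
    "intended": (50, 20, 10_000),
    "suggested": (5_000, 2, 10_000),
}

ranking = rank(candidates, corpus_size)
for word, value in ranking:
    print(f"{word}: {value:.3e}")
```

With these placeholder counts the frequent word wins even though its error is ten times less likely, because its prior is a hundred times larger. A wrong correction can arise in the same way when the intended word is rare.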

Last update: January 22, 2004. erikt@uia.ua.ac.be