Week 8

Learning Objectives

(color key: Python/Programming, NLP/CL, Software Engineering)

Reading

Please see the slides from my 2019 workshop on regular expressions for a quick introduction:

For applications of regular expressions to NLP, please read these sections from the NLTK book:

We have already discussed tokenization and basic normalization; now see the following for a better understanding of stemming, lemmatization, and segmentation.

Additional Reading

These links may be helpful but are not assigned reading:

Testing Your Knowledge

Questions

Practical Work