This is an old revision of the document!
PHP's gd library is missing or unable to create PNG images
Weekly readings are specified below. A “+” indicates additional readings will be made available via a link from this page.
Lecture slides in PDF are obtained by clicking on the link to the topic.
Homeworks (prefixed with “HW”) are exercises that you should complete before the class labeling the row in which they appear. These will not be handed in or graded, but provide background to the lecture and it will be assumed you have completed them. The due date for each of the four large (graded) assignments (prefixed with “A”) will be specified in the assignment description.
Please note that this schedule is tentative and may change as the semester progresses. Please visit this page often!
Date | Topic | Reading | Assignments | Supplemental materials |
1/24 | Introduction (PPT) | J&M Ch. 1 | | |
1/29 | Morphology, tokenization (PPT) | NLTK Ch. 1, J&M Ch. 2, 3.1 | | |
1/31 | Introduction to NLTK and Python (PPT) | NLTK Ch. 3, 4 | HW1: NLTK Ch. 1, Ex. 4-7 | Python3.5 tutorial |
2/5 | Probability, Collocations, Ngrams | J&M Ch. 4, NLTK Ch. 2 | A1 S1 HW2 | |
2/12 | Machine Learning | J&M, Ch. 6 (3rd ed.), NLTK Ch. 6 | | |
2/19 | Weka overview | Data Mining, Ch. 11.1-4, 11.6-8 | A2 S2 | Weka Tutorial Weka Manual |
2/21 | Logistic Regression (Maximum Entropy) | J&M ch.7 (3rd ed.) | | |
2/26 | Sentiment Analysis | J&M ch. 6 (3rd ed.), J&M ch. 18 (3rd ed.) R1 | | |
3/5 | Information extraction, named entity recognition | J&M Ch. 21, 3rd edition | A3 S3 | | |
| | | | |
3/26 | Part of Speech Tagging, HMMs | J&M Ch. 9 (3rd ed.),J&M Ch. 10 (3rd ed.), NLTK Ch. 5 | HW3 | Viterbi handout Project proposal guidelines |
4/2 | Lexical Semantics, word sense disambiguation | J&M ch. 15 (3rd ed.), J&M ch. 16 (3rd ed.), J&M ch. 17 (3rd ed.) | | |
4/9 | Vector semantics Neural nets | J&M Ch. 8 (3rd ed.) | | |
4/16 | Parsing | J&M Ch. 12 (3rd ed.), J&M Ch. 13 (3rd ed.), J&M Ch. 14 (3rd ed.); NLTK Ch. 8 | In-class exercisePython code Similarity data | |
4/23 | Discourse | J&M Ch. 21 | In-class exercise | |
4/30 | Project presentations: Shashaty, Bae, Schnarr, Ewing, Caletti | Presentation outline | | |
5/2 | Project presentations: Witteman, Jin, Housen, Chenmei, Berlstein/Schwartz, Horowitz, Yang | | | |
5/15 | Poster session, 3-5PM Olmsted Atrium (tentative) | Poster template (PPT) Poster contents | | |