|Olga Zamaraeva||Ajda Gokcen|
|Office Hours:||W 2-3pm,
|Office Location:||Guggenheim 407||Guggenheim 416A (Treehouse Lab)|
|Email:||olzama at uw||ajdag at uw|
Goals: By the end of this course, you will:
Computational linguistics is a broad field incorporating research and techniques for processing language with computers at all levels of linguistic structure. In this class, we will survey various topics and tasks in computational linguistics focusing on linguistic structure. While we will cover some of the basics of Natural Language Processing (which we will consider a separate subfield), this class will not focus on one specific approach (such as deep learning). Students in this class are expected to have a background in either computer science or linguistics, but not necessarily both. Expect this class to be difficult at times and easy at others. We hope to offer something new and interesting for everyone.
Note: To request academic accommodations due to a disability, please contact Disabled Student Services, 448 Schmitz, 206-543-8924 (V/TTY). If you have a letter from Disabled Student Services indicating that you have a disability which requires academic accommodations, please present the letter to the instructor so we can discuss the accommodations you might need in this class.
Students are expected to complete the assigned readings before each lecture. Lecture and Lab/Section will connect with the readings, but not everything in the readings will be covered in lecture. Homework assignments and exams may nonetheless cover material in the readings not gone over in class.
All homework assignments and the final project will include a significant writing component, weight at or near 1/2 of the assignment grade. Be sure to save time to do a careful job on your write up.
We expect all write ups to be turned in as pdf files, even if they started as plain text files that we gave you.
Collaboration policy: Students are encouraged to work with each other on the homework, both in small groups and by posting & answering questions on Canvas. However, each student must turn in their own answers (both code and write up). No copying or sharing code or prose is allowed. Also, students who have collaborated must acknowledge the collaboration in their write ups (e.g. "I discussed this problem with Kim Smith/with classmates on Canvas as we were working on it.").
Late homework policy: Unless prior arrangements are made, homework turned in late but within 24 hours of the deadline will be graded at 80% credit, homework turned between 24 and 48 hours will be graded at 70% credit, and homework turned in later than that will not be graded. No late final projects will be accepted.
Grades will be based on:
|3/27||Introduction & overview
|J&M Ch 1; L&C Ch 1||Introductory Questionnaire (on Canvas)|
|3/29||Regular expressions; Formal languages; FSA||J&M Ch 2 (through 2.2.4); 16.1; 12.2; L&C 4.4|
|4/3||FSA, contd.; Morphology and Phonology; FST||carmel tutorial (through section 3)
J&M Ch.2.3-2.4; Ch 3.0-3.1.2; 3.2-3.4.0; 3.5; 11.1
|4/5||Phonetics; ASR and Text-to-Speech (Bird's eye view)||Optional:J&M Ch 7,8,9 (skim or read any parts you find approachable); L&C Ch 1.4|
|4/10||Machine Learning. Bird's eye view.
Guest lecture: NLP in cancer research.
|T. Mitchel. Key ideas in ML. (2017)|
Evaluation and Error analysis
|Resnik & Lin, 2010
Kummerfeld et al. (2012)
|4/17||N-grams||J&M Ch 4 (through 4.4); L&C pp 26-28|
|4/19||N-Grams: Smoothing and Discounting||J&M Ch 4.5-4.9, 4.12|
|4/20||Lab/Section||Project Milestone 1|
|4/24||Grammars, Parsing, and CFG||J&M Ch 12 L&C pp 50-58|
|5/3||Parsing algorithms||J&M 13|
|5/4||Lab/Section||Project Milestone 2|
|5/8||Probabilistic parsing||J&M Ch 14 through 14.5; 14.10|
|5/10||Syntactic Theory and Feature Structures||J&M Ch 15 through 15.3;15.6|
|5/15||Parsing w/ Unification; Grammar Engineering||either: J&M Ch 15.4-15.5
or: Bender (2008) (through at least 2.3)
|5/17||Computational semantics||J&M Ch. 17-18|
Project Milestone 3 (revisions to Milestone 2)
Word vectors and embeddings
J&M ch 15 new ed.
J&M ch 16 new ed.: Intro and 16.2-16.3
(Start with J&M ch 8 new ed. if needed.)
|5/24||Ethics, design and NLP||Hovy & Spruit (2016)
Nathan et al 2007
Bolukbasi et al 2016
|5/25||Lab/Section||Assignment 5, Part 1|
|5/29||Deep Learning. Guest lecture.||Embed, encode, attend, and predict|
|6/1||Lab/Section: More presentations||Assignment 5, Parts 2 & 3|
|6/8||Final projects due 11:59 pm|