Ling 472/CSE 472: Introduction to Computational Linguistics
Spring 2018

Course Info

Instructor Info

  Olga Zamaraeva Ajda Gokcen
Office Hours: W 2-3pm,
Th 3:30-4:30pm
M 2-3pm,
T 3:30-4:30pm
Office Location: Guggenheim 407 Guggenheim 416A (Treehouse Lab)
Email: olzama at uw ajdag at uw

Syllabus

Description

Goals: By the end of this course, you will:

Computational linguistics is a broad field incorporating research and techniques for processing language with computers at all levels of linguistic structure. In this class, we will survey various topics and tasks in computational linguistics focusing on linguistic structure. While we will cover some of the basics of Natural Language Processing (which we will consider a separate subfield), this class will not focus on one specific approach (such as deep learning). Students in this class are expected to have a background in either computer science or linguistics, but not necessarily both. Expect this class to be difficult at times and easy at others. We hope to offer something new and interesting for everyone.

Note: To request academic accommodations due to a disability, please contact Disabled Student Services, 448 Schmitz, 206-543-8924 (V/TTY). If you have a letter from Disabled Student Services indicating that you have a disability which requires academic accommodations, please present the letter to the instructor so we can discuss the accommodations you might need in this class.

Requirements

Students are expected to complete the assigned readings before each lecture. Lecture and Lab/Section will connect with the readings, but not everything in the readings will be covered in lecture. Homework assignments and exams may nonetheless cover material in the readings not gone over in class.

All homework assignments and the final project will include a significant writing component, weight at or near 1/2 of the assignment grade. Be sure to save time to do a careful job on your write up.

We expect all write ups to be turned in as pdf files, even if they started as plain text files that we gave you.

Collaboration policy: Students are encouraged to work with each other on the homework, both in small groups and by posting & answering questions on Canvas. However, each student must turn in their own answers (both code and write up). No copying or sharing code or prose is allowed. Also, students who have collaborated must acknowledge the collaboration in their write ups (e.g. "I discussed this problem with Kim Smith/with classmates on Canvas as we were working on it.").

Late homework policy: Unless prior arrangements are made, homework turned in late but within 24 hours of the deadline will be graded at 80% credit, homework turned between 24 and 48 hours will be graded at 70% credit, and homework turned in later than that will not be graded. No late final projects will be accepted.

Grades will be based on:

Schedule of Topics and Assignments (May be updated)

DateTopicReadingDue
3/27 Introduction & overview
RegEx preview
J&M Ch 1; L&C Ch 1 Introductory Questionnaire (on Canvas)
3/29 Regular expressions; Formal languages; FSA J&M Ch 2 (through 2.2.4); 16.1; 12.2; L&C 4.4  
3/30 Lab/Section   Assignment 0
4/3 FSA, contd.; Morphology and Phonology; FST carmel tutorial (through section 3)
J&M Ch.2.3-2.4; Ch 3.0-3.1.2; 3.2-3.4.0; 3.5; 11.1
 
4/5 Phonetics; ASR and Text-to-Speech (Bird's eye view) Optional:J&M Ch 7,8,9 (skim or read any parts you find approachable); L&C Ch 1.4  
4/6 Lab/Section   Assignment 1
4/10 Machine Learning. Bird's eye view.
Guest lecture: NLP in cancer research.
T. Mitchel. Key ideas in ML. (2017)
4/12 ML (contd.)
Evaluation and Error analysis
Resnik & Lin, 2010
Kummerfeld et al. (2012)
 
4/13 Lab/Section   Assignment 2
4/17 N-grams J&M Ch 4 (through 4.4); L&C pp 26-28  
4/19 N-Grams: Smoothing and Discounting J&M Ch 4.5-4.9, 4.12  
4/20 Lab/Section   Project Milestone 1
4/24 Grammars, Parsing, and CFG J&M Ch 12 L&C pp 50-58  
4/26 Midterm review  
4/27 Lab/Section   Assignment 3
5/1 Midterm    
5/3 Parsing algorithms J&M 13  
5/4 Lab/Section   Project Milestone 2
5/8 Probabilistic parsing J&M Ch 14 through 14.5; 14.10  
5/10 Syntactic Theory and Feature Structures J&M Ch 15 through 15.3;15.6  
5/11 Lab/Section  
5/15 Parsing w/ Unification; Grammar Engineering either: J&M Ch 15.4-15.5
or: Bender (2008) (through at least 2.3)
 
5/17 Computational semantics J&M Ch. 17-18  
5/18 Lab/Section   Assignment 4
Project Milestone 3 (revisions to Milestone 2)
5/22 Semantics contd.
Word vectors and embeddings
J&M ch 15 new ed.
J&M ch 16 new ed.: Intro and 16.2-16.3
(Start with J&M ch 8 new ed. if needed.)
 
5/24 Ethics, design and NLP Hovy & Spruit (2016)
Nathan et al 2007
Bolukbasi et al 2016
 
5/25 Lab/Section   Assignment 5, Part 1
5/29 Deep Learning. Guest lecture. Embed, encode, attend, and predict  
5/31 Presentations  
6/1 Lab/Section: More presentations   Assignment 5, Parts 2 & 3
6/8     Final projects due 11:59 pm


Last modified: