CPSC 503 - Winter 2019 -
Computational Linguistics
Readings, Syllabus, Assignments,
Software&Data
|
Synthesis Lectures in Natural Language Processing webpage (especially Neural Network Methods for Natural Language Processing, Yoav Goldberg, 2017)
Natural Language Processing with Python: Bird, Steven; Klein, Ewan, Loper, Edward. n, O'Reilly, 2009. Free HTML version. You can order this book directly from O'Reilly
Introduction to Information Retrieval. by Manning, Raghavan, Schutze webpage
Graph-based Natural Language Processing and Information Retrieval. Rada Mihalcea (Author), Dragomir Radev (Author)
Foundations of Statistical Natural Language Processing by Christopher D. Manning, Hinrich Schutze. (M&S). In many cases the statistical approaches are covered in more detail in this book. However, it does not contain all the topics that we will cover in this course. This book also has a webpage.680 pages 1 edition (1999), M.I.T. Press/Triliteral, ISBN: 0262133601. This book will be useful in cases where you want a different presentation of the same material that is required reading from J&M
Contemporary Linguistics: An introduction by W. O'Grady, J. Archibald, M. Aronoff, J. Rees-Miller. 684 pages 5th Edition (2004). ISBN: 0312419368. This book will be useful in cases where you want a more detailed description of linguistic theories. It also contains lots of clear examples of linguistic phenomena. This book also has a webpage.
Syllabus, Assignments, Software& Data
1 |
Jan 7 Mon |
Intro
and Course Overview
We will communicate through Canvas: to log in use your CWL |
J&M
Chp. 1 |
Intro
-
ACL NLP toolkits: NLTK (Python), Stanford CoreNLP (java) |
2 |
Jan 9 Wed |
English
Morphology and Finite State Machines: FSA and FST |
J&M
Chp. 2&3 (2nd Edition) missing pages a b c |
Assignment1on Canvas (due Jan 21)
Dementia Material: instructions, data, lib, run.py Applications of FSTs in NLP Lauri Karttunen, CIAA, 2000.
|
3 |
Jan 14 Mon |
Finish FST + Stemming + Spelling |
J&M
Chp. 3&4 |
|
4 |
Jan 16 Wed | Minimum
Edit Distance +
Probabilistic
Models: N-grams - N-grams Evaluation -
|
J&M
Chp. 4 |
An empirical study of smoothing techniques for NLP S.F. Chen, J. Goodman - TR CS Harvard Univ - 1998 |
5 |
Jan 21 Mon |
Intro - Neural Networks and Neural Language Models
- Start Markov Models
|
J&M
3Ed Chp. 7-8 |
Neural Network Demos
|
6 |
Jan 23 Wed |
Markov Sequence Labelling Models - Part-of-speech Tagging
|
J&M 3Ed Some of Appendix A Chp. 8 |
- state of the art POS tagging why tagging can be challenging for humans: Penn tagging scheme Part-of-Speech Tagging from 97% to 100% C. Manning 2011
Assignment2 on Canvas (due Feb 11) |
7 |
Jan 28 Mon | Sequence processing with Recurrent Neural Networks (RNN) | J&M
3Ed Chp. 9 see also Goldberg Chps 14-15-16 |
|
8 |
Jan 30 Wed | Start English Syntax and Context-free Grammars -- Parsing Algorithms | J&M
3Ed Chp. 10-11 |
Interactive
tutorials on the English grammar
-
NLTK (demos) - look at *Getting
Started* |
9 |
Feb 4 Mon |
Chunking /
Dependency Grammars and Transition-based Dep. Parsing/ Treebanks -
|
J&M 3Ed Chp. 13 |
Stanford Parser - -Popular Stat Parser - MaltParser - State of the Art Dependency Parser |
10 |
Feb 6 Wed |
Probabilistic CFGs - PCFGs Parsing + Lexicalized PCFGs - Neural Constituency and Dependency Parsing |
J&M 3Ed Chp. 12 |
- Berkeley Parser with demo! |
11 |
Feb 11 Mon |
Representing Meaning and Semantic Analysis |
J&M Chp. |
book on Computational Semantics Semantic Parser (Cornell - Yoav Artzi) |
12 |
Feb 13 Wed | Lexical Semantics | J&M Chp. | -
Wordnet and
YAGO (Wikipedia
+ Wordnet + GeoNames). See also
Probase
and Freebase and
BabelNet - (Domain specific
thesaurus) Medical
Subject Headings (MeSH) Assignment3 on Canvas (due March 2nd) needed files |
Feb 18 - 22 | mid-term Break | |||
13 |
Feb 25 Mon | Computational Lexical Semantics (focus on Vector Semantics) | J&M Chp. 6 |
-
word2vec - A systematic comparison of context-counting vs. context-predicting semantic vectors ! (predicting is clearly better) - generalization of skip-grams to sentences (skip-thought vectors) 2015 - SENSEVAL(Evaluation for WSD) - WSD with Deep Belief Networks -Illinois Semantic Role Labeler
|
14 |
Feb 27 Wed | CNNs, Semantic Role labeling, Brief Intro Pragmatics: |
|
- DAMSL - RST annotation tool |
15 | Mar 4 Mon | Project Proposal Presentations - | ||
Natural Language Generation (NLG): sample system:
Generator Evaluative Arguments (GEA) |
handout |
- SIGGEN - NLG systems book, STOP system, SimpleNLG - NLG companies: data2text CoGenTex |
||
LIST updated for 2019 | ||||
READINGS (what to do?) | avg. year 2014.5 | |||
16 | Mar 6 Wed |
Generic Topic Modeling (background reading Comm. ACM) and Topic Modeling in Asynchronous Conversations
|
||
17 | Mar 11 Mon |
Visual Text Analytics and Interactive Topic Modeling
|
||
18 | Mar 13 Wed |
Natural Language Generation (data2text)
|
||
19 | Mar 18 Mon |
Distributed Representations for Sentence + Summarization (1)
|
||
20 | Mar 20 Wed |
Summarization (2)
|
||
21 |
Mar 25 Mon |
Sentiment
+ Graph Based WSD pre-reading for paper1: Chaper 18 of Y. Goldberg (only 5 pages)
|
||
22 |
Mar 27 Wed |
Neural Text Classification + health application |
||
23 |
Apr 1 Mon |
Discourse
Parsing
|
||
24 |
Apr 3 Wed |
|
||
25 | Apr 10 Wed 12:30-3 (room 304) | Project Update Presentations | ||
26 |
Apr 24 Wed 9am-1pm (room 146) | Project
Final Presentations
Final Project Report Hand in |