CPSC 503 - Winter 2020 -
Computational Linguistics
Readings, Syllabus, Assignments,
Software&Data
|
Synthesis Lectures in Natural Language Processing webpage (especially Neural Network Methods for Natural Language Processing, Yoav Goldberg, 2017)
Add new book on sem-Pragmatics
Natural Language Processing with Python: Bird, Steven; Klein, Ewan, Loper, Edward. n, O'Reilly, 2009. Free HTML version. You can order this book directly from O'Reilly
Introduction to Information Retrieval. by Manning, Raghavan, Schutze webpage
Graph-based Natural Language Processing and Information Retrieval. Rada Mihalcea (Author), Dragomir Radev (Author)
Foundations of Statistical Natural Language Processing by Christopher D. Manning, Hinrich Schutze. (M&S). In many cases the statistical approaches are covered in more detail in this book. However, it does not contain all the topics that we will cover in this course. This book also has a webpage.680 pages 1 edition (1999), M.I.T. Press/Triliteral, ISBN: 0262133601. This book will be useful in cases where you want a different presentation of the same material that is required reading from J&M
Contemporary Linguistics: An introduction by W. O'Grady, J. Archibald, M. Aronoff, J. Rees-Miller. 684 pages 5th Edition (2004). ISBN: 0312419368. This book will be useful in cases where you want a more detailed description of linguistic theories. It also contains lots of clear examples of linguistic phenomena. This book also has a webpage.
Syllabus, Assignments, Software& Data
1 |
Jan 8 Wed |
Intro
and Course Overview
We will communicate through Canvas: to log in use your CWL |
J&M
Chp. 1 |
Intro
-
ACL NLP toolkits: NLTK (Python), Stanford CoreNLP (java) |
2 |
Jan 13 Mon |
English
Morphology and Finite State Machines: FSA and FST |
J&M
Chp. 2&3 (2nd Edition) missing pages a b c |
Assignment1on Canvas (due Jan 22)
Dementia Material: instructions, data, lib, run.py Applications of FSTs in NLP Lauri Karttunen, CIAA, 2000.
|
Jan 15 Wed | CANCELED - SNOW | |||
3 |
Jan 20 Mon |
Finish FST + Stemming + Spelling |
J&M
Chp. 3&4 |
|
4 |
Jan 22 Wed | Minimum
Edit Distance +
Probabilistic
Models: N-grams - N-grams Evaluation -
|
J&M
Chp. 4 |
An empirical study of smoothing techniques for NLP S.F. Chen, J. Goodman - TR CS Harvard Univ - 1998 |
5 |
Jan 27 Mon |
Intro - Neural Networks and Neural Language Models
- Start Markov Models
|
J&M
3Ed Chp. 7-8 |
Neural Network Demos
|
6 |
Jan 29 Wed |
Markov Sequence Labelling Models - Part-of-speech Tagging
|
J&M 3Ed Some of Appendix A Chp. 8 |
- state of the art POS tagging why tagging can be challenging for humans: Penn tagging scheme Part-of-Speech Tagging from 97% to 100% C. Manning 2011
Assignment2 on Canvas (due Feb 12) |
7 |
Feb 3 Mon | Neural Sequence processing with Recurrent Neural Networks (RNN) (Attention and Transformers IN ADDITIONAL LECTURES / READINGS) | J&M
3Ed Chp. 9 see also Goldberg Chps 14-15-16 |
|
8 |
Feb 5 Wed | Start English Syntax and Context-free Grammars -- Parsing Algorithms | J&M
3Ed Chp. 10-11 |
Interactive
tutorials on the English grammar (not working 2020?) Another resource on grammar from UCL
-
NLTK (demos) - look at *Getting
Started* |
9 |
Feb 10 Mon |
Chunking /
Dependency Grammars and Transition-based Dep. Parsing/ Treebanks -
|
J&M 3Ed Chp. 13 |
Stanford Parser - -Popular Stat Parser - MaltParser - State of the Art Dependency Parser |
10 | Feb 12 Mon |
Probabilistic CFGs - PCFGs Parsing + Lexicalized PCFGs - Neural Constituency and Dependency Parsing |
J&M 3Ed Chp. 12 |
- Berkeley Parser with demo! |
Feb 17 - 21 | mid-term Break | |||
11 | Feb 24 Mon |
Representing Meaning and Semantic Analysis |
J&M Chp. |
book on Computational Semantics Semantic Parser (Cornell - Yoav Artzi) |
12 |
Feb 26 Wed | Lexical Semantics | J&M Chp. | -
Wordnet and
YAGO (Wikipedia
+ Wordnet + GeoNames). See also
Probase
and Freebase and
BabelNet - (Domain specific
thesaurus) Medical
Subject Headings (MeSH) Assignment3 on Canvas (due March 6) needed files |
Mar 2 Mon | Canceled | |||
13 |
Mar 4 Wed | Computational Lexical Semantics (focus on Vector Semantics) | J&M Chp. 6 |
-
word2vec - A systematic comparison of context-counting vs. context-predicting semantic vectors ! (predicting is clearly better) - generalization of skip-grams to sentences (skip-thought vectors) 2015 - SENSEVAL(Evaluation for WSD) - WSD with Deep Belief Networks -Illinois Semantic Role Labeler
|
14 |
Mar 9 Mon | CNNs, Semantic Role labeling, Brief Intro Pragmatics: |
|
- Appied CL Discourse
Research Lab - DAMSL - RST annotation tool |
15 | Mar 11 Wed | Encoder-Decoder, Attention and Transformers | Conditioned Neural Generation (Encoder-Decoder framework) pag. 195-211 - Y. Goldberg book 2017- Chp. 17 |
Assignment4 on Canvas (due March
23) - Transformers package |
16 | Mar 16 Mon | Project Proposal Presentations - | ||
READINGS (what to do?) avg. year 2015 | ||||
17 | Mar 18 Wed |
Generic Topic Modeling (background reading Comm. ACM) and Topic Modeling in Asynchronous Conversations
|
||
18 | Mar 23 Mon |
Visual Text Analytics and Interactive Topic Modeling
Intelligent User Interfaces (IUI), 2016 ] VIDEO [Joseph Wonsil]
|
||
19 | Mar 25 Wed |
Distributed Representations for Sentence + Summarization (1)
|
||
20 |
Mar 30 Mon |
Summarization (2)
|
||
21 |
Apr 1 Wed |
Sentiment
+ Graph Based WSD pre-reading for paper1: Chaper 18 of Y. Goldberg (only 5 pages)
|
||
22 |
Apr 6 Mon |
Neural: Text Classification
|
||
23 |
Apr 8 Wed |
Natural Language Generation (data2text) + Discourse
Parsing
|
||
24 |
(Apr 13 holiday - Good Mon) Apr 15 Wed |
Discourse
Parsing: Applications + Distant Supervision
|
||
25 | Cancelled | Project Update Presentations | ||
26 |
Apr
24 (time TBD) deadline for grade submission end of April |
Project
Final Presentations
Final Project Report Hand in |