CPSC 503 - Winter 2010 - Computational Linguistics
Readings, Syllabus, Assignments, Software & Data
1
Sep 9 Th Intro and Course Overview
Subscribe to the course mailing list: send a message to majordomo@cs.ubc.ca with body: subscribe cpsc503
J&M
Chp. 1
- ACL
- NLP demos
- Ambiguity
2
Sep 14 Tu English Morphology and Finite State Machines: FSA and FST
J&M
Chp. 2&3
Applications of FSTs in NLP, Lauri Karttunen, CIAA, 2000.
Assignment 1 (due Sep 21)
3
Sep 16 Th Finish FST
Stemming
J&M
Chp. 2&3
- Recent book and software
- Xerox: Finite State Technology
- Finite State Utilities (Van Noord)
- The Porter Stemmer (includes Perl implementation)
4
Sep 21 Tu Spelling: Bayesian method and Minimum
J&M
Chp. 3&4
- ProbInfoTheory Handout
- min-edit-dist demo
- A spelling correction program based on a noisy channel model, Kernighan et al., COLING, 1990.
- minimal Python implementation of spelling correction (by P. Norvig)
5
Sep 23 Th Edit Distance + Probabilistic Models: N-grams
J&M
Chp. 4
Google ngrams model
An empirical study of smoothing techniques for NLP, S.F. Chen, J. Goodman, TR CS Harvard Univ, 1998
6
Sep 28 Tu Model Evaluation - Markov Models
J&M
Chp. 4-5
7
Sep 30 Th Part-of-speech Tagging
J&M
Chp. 5-6
Why tagging can be challenging for humans: Penn tagging scheme
Assignment 2 (due Oct 14)
Corpora: wsj-p.txt wsj-ps.txt atis3.pos.tags.txt cmpt-hw2-3.txt
8
Oct 5 Tu English Syntax and Context-free Grammars
J&M
Chp. 12
Interactive tutorials on English grammar, English Dept., University of Calgary.
9
Oct 7 Th Parsing Algorithms / Chunking / Dependency Grammars / Treebank
J&M Chp. 13
- NLTK (demos) - look at *Getting Started*
- Some public parsers (including Stanford and MINIPAR visualization tools)
10
Oct 12 Tu Probabilistic CFGs
J&M Chp. 14
- Penn Treebank
- Stanford Parser
- Popular Stat Parser
11
Oct 14 Th Representing Meaning and Semantic Analysis
J&M Chp. 17-18
Assignment 3 (out Oct 15, due Oct 28), needed files
12
Oct 19 Tu Lexical Semantics
J&M Chp. 19
- WordNet
- FrameNet
- PropBank (adding semantic annotations to the Penn Treebank)
13
Oct 21 Th Computational Lexical Semantics
J&M Chp. 20
- SENSEVAL (Evaluation for WSD)
- WSD online public systems
- Dependency-based word similarity demo
- TREC (Text REtrieval Conference)
- Semantic Labeling (ASSERT)
14
Oct 26 Tu Pragmatics: Discourse & Dialog
J&M Chp. 21 & 24
- DAMSL
- RST annotation tool
15
Oct 28 Th Natural Language Generation (NLG)
Sample system: Generator of Evaluative Arguments (GEA), handout
- SIGGEN
- NLG systems book, STOP system, SimpleNLG
- NLG companies: data2text, CoGenTex
16
Nov 2 Tu Project Proposal Presentations
17
Nov 4 Th Cancelled
READINGS (what to do?)
18
Nov 9 Tu (data2text) Natural Language Generation (1)
F Portet, E Reiter, A Gatt, J Hunter, S Sripada, Y Freer, C Sykes. Automatic Generation of Textual Summaries from Neonatal Intensive Care Data. Artificial Intelligence 173:789-816, 2009. (pdf) [Masour]
Ryuichiro Higashinaka et al. Learning to generate naturalistic utterances using reviews in spoken dialogue systems. Proceedings of ACL, 2006. (pdf) [Misha]
19
Nov 11 Th Holiday - Remembrance Day
20
Nov 16 Tu Summarization (1)
(Biographies) Fadi Biadsy, Julia Hirschberg, Elena Filatova, "An Unsupervised Approach to Biography Production using Wikipedia", ACL-08: HLT, Columbus, Ohio, Jun 2008. (pdf) [Misha]
Lin, Chin-Yew and E.H. Hovy. Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics. In Proceedings of Language Technology Conference (HLT-NAACL), 2003. (pdf) [Anika]
21
Nov 18 Th Summarization (2)
Regina Barzilay, Kathleen McKeown, "Sentence Fusion for Multidocument News Summarization", Computational Linguistics, 2005. (ps) [Simona]
Ani Nenkova et al. The Pyramid Method: Incorporating human content selection variation in summarization evaluation. ACM Trans. on Speech and Language Processing (TSLP), 2007. (pdf) [Oliver]
22
Nov 23 Tu Summarization (3)
Gabriel Murray and Giuseppe Carenini. Summarizing Spoken and Written Conversations. EMNLP 2008. (pdf) [Ziyu]
Giuseppe Carenini, Raymond Ng, Xiaodong Zhou. Summarizing Emails with Conversational Cohesion and Subjectivity. ACL 2008. (pdf) [Fahimeh]
23
Nov 25 Th Info Extraction from Evaluative Text (1)
Theresa Wilson, Janyce Wiebe and Rebecca Hwa (2006). Recognizing strong and weak opinion clauses. Computational Intelligence, 22 (2), pp. 73-99. [Tianyu]
Theresa Wilson, Janyce Wiebe, and Paul Hoffmann (2009). Recognizing Contextual Polarity: An exploration of features for phrase-level sentiment analysis. Computational Linguistics, 35:3, pages 399-433. [Alex]
24
Nov 30 Tu Natural Language Generation / Question-Answering (2)
A. Stent, R. Prasad and M. Walker. Trainable sentence planning for complex information presentations in spoken dialog systems. In Proceedings of ACL, 2004. (pdf) [David]
Y. Chali, S. R. Joty and S. A. Hasan (2009). "Complex Question Answering: Unsupervised Learning Approaches and Experiments", JAIR, Volume 35, pages 1-47, 2009. (pdf) [Naresh]
25
Dec 2 Th Project Update Presentations
26
Dec 15 Wed 9:30-12:30 Project Final Presentations, same room DMP 101