CPSC 503 - Winter 2010 - Computational Linguistics

Readings, Syllabus, Assignments, Software&Data


Readings
Required
Reference


Syllabus, Assignments, Software& Data

1
Sep 9 Th  Intro and Course Overview

subscribe to course mailing list - send a message to majordomo@cs.ubc.ca with body: subscribe cpsc503

J&M
Chp. 1  
- ACL
- NLP demos
-Ambiguity
2
Sep 14Tu English Morphology and Finite State Machines: FSA and FST
J&M
Chp. 2&3
Applications of FSTs in NLP Lauri Karttunen, CIAA, 2000.
Assignment1 (due Sep 21)
3
Sep 16Th Finish FST
Stemming

J&M
Chp. 2&3
4
Sep 21Tu  Spelling: Bayesian method and Minimum
J&M
Chp. 3&4
5
Sep 23Th Edit Distance + Probabilistic Models: N-grams
J&M
Chp. 4


Google ngrams model

An empirical study of smoothing techniques for NLP S.F. Chen, J. Goodman - TR CS Harvard Univ - 1998

6
Sep 28Tu Model Evaluation -  Markov Models J&M
Chp. 4-5
 
7
Sep 30Th  Part-of-speech Tagging - J&M
Chp. 5-6


why tagging can be challenging for humans: Penn tagging scheme

Assignment2(due Oct 14)
Corpora: wsj-p.txt  wsj-ps.txt  atis3.pos.tags.txt cmpt-hw2-3.txt

 8
Oct 5Tu English Syntax and Context-free Grammars
 J&M
Chp. 12
Interactive tutorials on the English grammar 
English Dept. University of Calgary.
 9
Oct 7Th Parsing Algorithms / Chunking / Dependency Grammars/ Treebank J&M Chp. 13  - NLTK (demos) - look at *Getting Started*
 - Some public parsers (inlcuding Stanford and MINIPAR visualization  tools)
 10 Oct 12Tu
Probabilistic CFGs J&M Chp. 14 -Penn Treebank - Stanford Parser -
-Popular Stat Parser
11
Oct 14Th
Representing Meaning and
Semantic Analysis

J&M Chp. 17-18 Assignment3 (out Oct 15 due Oct 28)  needed files

book on Computational Semantics

12
Oct 19Tu Lexical Semantics J&M Chp.19
- Wordnet
- FrameNet
- ProbBank (adding semantic annotations to the Penn Treebank)
13
 Oct 21Th Computational Lexical Semantics
J&M Chp. 20
 - SENSEVAL(Evaluation for WSD)

- WSD online public systems
- Dependency-based word similarity demo
- TREC (Text REtrieval Conference)
- Semantic Labeling (ASSERT)

-Illinois Semantic Role Labeler

14
Oct 26Tu Pragmatics: Discourse&Dialog
J&M Chp.  21 & 24
- DAMSL
- RST annotation tool
15
Oct 28Th Natural Language Generation (NLG)
 sample system: Generator Evaluative Arguments (GEA)
handout
- SIGGEN
- NLG systems book,   
STOP system, SimpleNLG
- NLG companies:   data2text  CoGenTex
16 Nov 2 Tu Project Proposal Presentations -    
17
Nov 4Th  Cancelled      
    READINGS (what to do?)
18
Nov  9Tu (data2text) Natural Language Generation (1)
F Portet, E Reiter, A Gatt, J Hunter, S Sripada, Y Freer, C Sykes  Automatic Generation of Textual Summaries from Neonatal Intensive Care Data. Artificial Intelligence 173:789-816. 2009 (pdf)    [Masour]
Ryuichiro Higashinaka  et al. Learning to generate naturalistic utterances using reviews in spoken dialogue systems Proceeding of ACL 2006 pdf [Misha]
19
Nov 11Th Holiday - Remembrance Day
20
Nov 16Tu

Summarization (1)
(Biographies) Fadi Biadsy, Julia Hirschberg, Elena Filatova, "An Unsupervised Approach to Biography Production using Wikipedia", ACL-08: HLT, Columbus, Ohio, Jun 2008 pdf [Misha]


Lin, Chin-Yew and E.H. Hovy. Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics. In Proceedings of Language Technology Conference (HLT-NAACL ),  2003. (pdf)     [Anika]

21
Nov 18Th

Summarization (2)
Regina Barzilay, Kathleen McKeown "Sentence Fusion for Multidocument News Summarization",
Computational Linguistics, 2005. [ps] [Simona]


Ani Nenkova et al. The Pyramid Method: Incorporating human content selection variation in summarization evaluation ACM Trans. on Speech and Language Processing (TSLP), 2007 pdf  [Oliver]

22
Nov 23Tu   Summarization(3)
Gabriel Murray and Giuseppe Carenini Summarizing Spoken and Written Conversations EMNLP 2008 [pdf]  [Ziyu]

Giuseppe Carenini , Raymond NG, Xiaodong Zhou, Summarizing Emails with Conversational Cohesion and Subjectivity ACL 2008 [pdf] [Fahimeh]

23 Nov 25Th Info  Extraction from Evaluative Text (1)
Theresa Wilson, Janyce Wiebe and Rebecca Hwa (2006). Recognizing strong and weak opinion clauses. Computational Intelligence, 22 (2), pp. 73-99.   [Tianyu]

Theresa Wilson, Janyce Wiebe, and Paul Hoffmann (2009). Recognizing Contextual Polarity: An exploration of features for phrase-level sentiment analysis. Computational Linguistics, 35:3, pages 399-433.  [Alex]

24 Nov 30Tu

Natural Language Generation / Question-Answering (2)
A. Stent, R. Prasad and M. Walker. Trainable sentence planning for complex information presentations in spoken dialog systems. In Proceedings of ACL , 2004. pdf   [David]


Y. Chali, S. R. Joty and S. A. Hasan (2009) "Complex Question Answering: Unsupervised Learning Approaches and Experiments", JAIR, Volume 35, pages 1-47, 2009  pdf   [Naresh]

25 Dec 2Th Project Update Presentations    
26
Dec 15 Wed
9:30-12:30
Project Final Presentations

same room DMP 101






carenini at cs.ubc.ca