CS 519-001, Natural Language Processing, Spring 2017, Paper Review

Requirements

Your Paper Review (min. 4 pages, max. 6 pages, 12 pt font, single-spaced, figs/tables at most 0.75 pages), due on Saturday, June 10, must cover at least:

Use LaTeX if possible.

List of Candidate Papers

Note: if you want to propose a paper outside of this list, get the instructor's approval by the June 1st lecture. Generally, the paper should be either a hardcore NLP paper from a top conference or journal (ACL, NAACL, EMNLP, CL, TACL), or an application of NLP in another field such as computational structural biology, computer vision, or music processing, and it should use techniques taught in this class (sequences, HMMs, LMs, Viterbi, syntax, CFG, CKY, EM, forward-backward, entropy/perplexity, etc.).

IMPORTANT:

PAPERS:

    creative language generation

  1. phonetic spelling (Chinglish): How to Speak a Language without Knowing It, ACL 2014. data.
  2. couplet generation (Chinese): Generating Chinese Couplets using a Statistical MT Approach, COLING 2008, demo.
  3. poetry generation 1 (sonnet): Generating Topical Poetry, EMNLP 2016.
  4. poetry generation 2 (Chinese): Chinese Poetry Generation with RNNs, EMNLP 2014.

    side note: Shakespearean sonnets (e.g., Shall I compare thee to a summer's day) have the following rhyme pattern: abab cdcd efef gg (the last two lines form a couplet), while Chinese four-line poems follow the pattern aaca or abcb, i.e., the second and fourth lines must rhyme, and rhyming the first line with them is optional. The American poet Robert Frost largely used the Chinese pattern, e.g., in Stopping by Woods on a Snowy Evening.
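The rhyme patterns in the side note can be checked mechanically. The following is a minimal sketch, not taken from any of the listed papers: it assumes each line has already been reduced to a "rhyme key" (a hypothetical hand-assigned label for its final rhyming sound), and the function name fits_scheme is made up for illustration. Two lines rhyme exactly when their keys are equal.

```python
def fits_scheme(scheme, rhyme_keys):
    """Return True if the lines' rhyme keys follow the given letter scheme.

    scheme     -- e.g. "abab cdcd efef gg" or "aaca" (spaces are ignored)
    rhyme_keys -- one hand-assigned rhyme label per line (an assumption;
                  a real system would derive these from pronunciations)
    """
    letters = scheme.replace(" ", "")
    if len(letters) != len(rhyme_keys):
        return False
    letter_to_key = {}
    key_to_letter = {}
    for letter, key in zip(letters, rhyme_keys):
        # Same letter must always map to the same rhyme key ...
        if letter_to_key.setdefault(letter, key) != key:
            return False
        # ... and different letters must map to different rhyme keys.
        if key_to_letter.setdefault(key, letter) != letter:
            return False
    return True


# First stanza of "Stopping by Woods on a Snowy Evening":
# line endings know / though / here / snow (keys assigned by hand).
frost_keys = ["oh", "oh", "eer", "oh"]

# Sonnet 18 line endings (keys assigned by hand):
# day, temperate, May, date, shines, dimm'd, declines, untrimm'd,
# fade, ow'st, shade, grow'st, see, thee.
sonnet_keys = ["ay", "ate", "ay", "ate",
               "ines", "immd", "ines", "immd",
               "ade", "owst", "ade", "owst",
               "ee", "ee"]

frost_is_chinese_style = (fits_scheme("aaca", frost_keys)
                          or fits_scheme("abcb", frost_keys))
sonnet_is_shakespearean = fits_scheme("abab cdcd efef gg", sonnet_keys)
```

Because the first-line rhyme is optional, a four-line poem is accepted if it fits either aaca or abcb. The check is strict in both directions: lines sharing a letter must rhyme, and lines with different letters must not.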

    language and music

  5. music 1 (EM): Harmonic Analysis with Probabilistic Graphical Models, ISMIR 2003
  6. music 2 (CFG): Automatic Melodic Reduction using a Supervised Probabilistic Context-Free Grammar, ISMIR 2016

    language and structural biology

  7. contrafold (CFG): CONTRAfold: CONditional TRAining for RNA Secondary Structure Prediction without Energy-based Models, 2006.

    language and vision

  8. show and tell: Show and Tell: A Neural Image Caption Generator, 2014--2015
  9. show, attend, and tell: Show, Attend, and Tell: Neural Image Caption Generation with Visual Attention, 2015--2016

    parsing

  10. parsing 1 (MST): Non-projective Dependency Parsing using Spanning Tree Algorithms, EMNLP 2005
  11. parsing 2 (forest): Forest Reranking: Discriminative Parsing with Non-Local Features, ACL 2008
  12. parsing 3 (span): Span-based Constituency Parsing with a Structure-Label System and Provably Optimal Dynamic Oracle, EMNLP 2016

    translation

  13. MT 1 (pharaoh): Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models, AMTA 2004
  14. MT 2 (hiero): A Hierarchical Phrase-Based Model for Statistical Machine Translation, ACL 2005

    semantics

  15. language game: Learning Language Games through Interaction, ACL 2016
  16. algebra word problem: Learning to Automatically Solve Algebra Word Problems, ACL 2014