"Machine Reading" reading group


Motivation

We want to familiarize ourselves with the research issues in natural language processing, inference, learning, and related issues. This touches on a variety of other topics such as inductive logic programming, graphical models, language processing technology, structured prediction, etc.


Meeting time

Every thursday 4 PM in KEC 2057.

Schedule


Reading list for machine reading project

Some basic ILP and SRL papers
Learning Inference Rules (papers suggested by Prasad)

1. Discovery of Inference Rules for Question Answering (Lin, D. and Pantel, P.) Natural Language Engineering, 7(4), 343-360, 2001. -- The rules are generated using similarities between templates of paths. The similarities are calculated based on a version of "mutual information". High ranking similarities between paths are used to generate inference rules. As a rule, the  recall is good, but precision is low. Moreover, inference rules are symmetric here. X eats Y <=> X likes Y. 
 
2. LEDIR: An unsupervised algorithm for learning directionality of inference rules (Bhagat, R. Pantel, P., Hovy, E. ) Proceedings of the 2007 joint Conference on EMNLP&CoNLL pp 161-170, Prague, June 2007.  -- Learned directional inference rules based on the frequencies of occurence of each side of the inference rule. Learns that X eats Y => X likes Y. The directionality of learning  has improved, but recognizing valid vs invalid inferences was not. So the precision still suffers. For example, x likes y <=> x hates y might be learned as a rule. The problem, it seems to me, is that the x and y are abstracted to "person" before the inference rule is learned. I.e., the learner has not seen any evidence for (x likes y) and (x hates y) for the same x and y! It has only seen someone liking someone and someone else hating someone else. So in fact, there is only evidence for believing someone likes someone <=> someone hates someone. This seems reasonable enough, but it is much weaker than  the inference rule that is actually learned from this!  Another issue: inference was not used during learning process  to learn additional constraints. 
 
3. Harabagiu, S. and Hickl, A. Methods for using Textual Entailment in Open-Domain Question Answering. In Proceedings of ACL 2006, pp 905-912, Sydney Australia. -- Have not read this. Apparently showed that directional textual entailment alone can improve the question answering without other inference mechanisms (according to Bhagat et al. )
 
4. Szpektor, I; Tamev, H.; Dagan, I; and Coppola, B; 2004. Scaling web-based acquisition of entailment relations. In Proceedings of EMNLP 2004. pp 41-48. Barcelona, Spain. 
 
5. Chklovski, T. and Pantel, P. 2004. VerbOCEAN: Mining the  Web for Fine-Granied Semantic Verb Relations. In Proceedings of EMNLP 2004, Barcelona, Spain. 
 
6. Rodrigo de Salvo Braz, Roxana Girju, Vasin Punyakanok, Dan Roth,  ark Sammons: An Inference Model for Semantic Entailment in Natural Language. Lecture Notes in Computer Science, Springer Berlin / Heidelberg Volume 3944/2006, Book: Machine Learning Challenges. --  This paper treats inference as optimization and does not discuss  learning inference rules. 
 
7. Claire Nedellec: Corpus-Based Learning of Semantic Relations by the ILP System, Asium. Learning Language in Logic 1999: 259-278  http://www.eecs.orst.edu/~tadepall/lbr/asium 

More papers

8. A Paper by Ritter, Etzioni et al. on learning functional relationshipshttp://turing.cs.washington.edu/papers/Ritter_emnlp08.pdf  e.g., emplyoeeOf(person,Company) is a function but colleagueOf(x,y)  is not. 
 
9. Subgroup discovery:  Gamberger, D. and Lavrac, N. 2002. Descriptive Induction through Subgroup Discovery: A Case Study in a Medical Domain. In /Proceedings of the Nineteenth international Conference on Machine Learning/ (July 08 - 12, 2002). C. Sammut and A. G. Hoffmann, Eds. Morgan Kaufmann Publishers, San Francisco, CA, 163-170
 
10. Markov Logic Networks paper by Richardson and Domingos. MLNs are schematized versions of undirected graphical models
over relational atoms. There is a lot of current work on using these  in lifted inference and comparisons to directed relational models like probailistic relational models. This is  a 
basic MLN paper.  http://www.springerlink.com/content/w55p98p426l6405q/fulltext.pdf
 
11. Claudien paper - learning from interpretations  An interpretation is an assignment of truth values to all
ground predicates, e.g., author(paper23,JohnDoe). Given a theory, a positive interpretation satisfies the theory. a negative interpretation does not. Claudien learns a clausal theory (conjunction of Horn clauses) from a set of positive and negative examples. http://www.springerlink.com/content/j30702810h758166/fulltext.pdf
 
12. Natural logic for textual inference describes the NatLog  system that does textual inference. 
http://www.springerlink.com/content/j30702810h758166/fulltext.pdf -- This system describes a set of inference rules that can apply to natural language sentences to derive some natural inferences, e.g., John does not work in the US. => John does not work in New York. 
 
13. Natlan/Coling 2008 paper on extending NatLog 
http://nlp.stanford.edu/~wcmac/papers/natlog-coling08.pdf
 
14. Bill MacCartney's Stanford thesis on natural language inference 
http://nlp.stanford.edu/~wcmac/papers/nli-diss.pdf


(The below papers are from Nimar Arora's NLP reading list)

Logical Form Transformation


Some papers from Question Answering literature:

Systems evaluated on Remedia Corpus

Systems evaluated on TREC