BURC: Bootstrapping Using ResearchCyc

A Cyc based common sense extraction project developed for University of North Texas Seminar on Natural Language Processing (CSCE 6330)

2005-02-08

Project proposals (1 pg.)

BURC: Bootstrapping Using ResearchCyc

[http://www.daxtron.com/csce6330/burc_proposal.htm]

2005-02-22

Literature overview

Deriving General World Knowledge from Texts and Taxonomies [http://www.cs.rochester.edu/~schubert/projects/world-knowledge-mining.html]

Lenhart K. Schubert and Matthew Tong, "Extracting and evaluating general world knowledge from the Brown Corpus", Proc. of the HLT-NAACL Workshop on Text Meaning, May 31, 2003, Edmonton, Alberta, pp. 7-13.

Lenhart K. Schubert, "Can we derive general world knowledge from texts?", Human Language Technology Conference (HLT 2002), San Diego, CA, March 24-27, 2002, pp. 94-97.

D. Rajesh Duthie & Rajendra Akerkar. 2002. Knowledge Representation in KRIS Using Link Grammar Parser. [PDF]

Lei Zhang and Yong Yu. 2001. Learning to Generate CGs from Domain Specific Sentences. In The Proceedings of the 9th International Conference on Conceptual Structures, LNAI 2120, July 30-August 3, 2001, Stanford, CA, USA. [Abstract] [Full text (PDF)]

Molla-Aliod,D and Hutchinson,B. 2002, Dependency-based semantic interpretation for answer extraction,In 2002 Australasian Natural Language Processing Workshop [PDF]

Harsha V. Madhyastha, N. Balakrishnan, & K. R. Ramakrishnan. 2003. Event Information Extraction Using Link Grammar. 13th International WorkShop on Research Issues in Data Engineering: Multi-lingual Information Management (RIDE'03). [Abstract]

Chirag Shah and Pushpak Bhattacharyya. 2003. Improving Document Vectors Representation using Semantic Links and Attributes. International Conference on Natural Language Processing (ICON), Mysore, India, December 2003. [PDF]

Ted Pedersen, Weidong Chen. 1995. Lexical Acquisition via Constraint Solving. In Working Notes of the AAAI Spring Symposium on Representation and Acquisition of Lexical Knowledge [Citeseer]

Timothy Chklovski and Patrick Pantel. 2004. VerbOcean: Mining the Web for Fine-Grained Semantic Verb Relations. In Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP-04). pp. 33-40. Barcelona, Spain. [PDF][PS]

Patrick Pantel, Deepak Ravichandran and Eduard Hovy. 2004. Towards Terascale Knowledge Acquisition. In Proceedings of Conference on Computational Linguistics (COLING-04). pp. 771-777. Geneva, Switzerland. [PDF][PS]

Deepak Ravichandran, Patrick Pantel, and Eduard Hovy. 2004. The Terascale Challenge. In Proceedings of KDD Workshop on Mining for and from the Semantic Web (MSW-04). pp. 1-11. Seattle, WA. [PDF][PS]

Dekang Lin and Patrick Pantel. 2001. DIRT - Discovery of Inference Rules from Text. In Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD-01). pp. 323-328. San Francisco, CA. [PDF][PS]
Deepak Ravichandran and Eduard Hovy. 2002. Learning Surface Text Patterns for a Question Answering system. In Proceedings of the 40th ACL conference. Philadelphia, PA. [ ps, pdf ]

Eduard Hovy, Ulf Hermjakob, and Deepak Ravichandran. 2002. A Question/Answer Typology with Surface Text Patterns. In Proceedings of the DARPA Human Language Technology Conference (HLT). San Diego, CA. [ ps, pdf ]

Daniel Gildea and Daniel Jurafsky, ACL 2000 . Automatic labeling of semnantic roles, [pdf]

2005-03-08

Project proposal presentations (10-15 minutes)

[ppt]

2005-04-12

Project draft (5-6 pg.)

[PDF]

2005-05-03

Final project paper (8 pg.)

[PDF]

Final project presentation

[PPT]

Project Outputs

Adjective and Subject-verb-object link data[ 12Meg zip]

Mined Predicate Hypothesis [450K zip]

Raw Fragment counts from BNC [117 Meg zip]