Using Natural Language to Integrate, Evaluate, and Optimize Extracted Knowledge Bases

Doug Downey, Chandra Sekhar Bhagavatula

Jun 29, 2013 (modified: Jun 29, 2013) AKBC 2013 submission readers: everyone
  • Decision: conferencePoster
  • Abstract: Web Information Extraction (WIE) systems can extract billions of unique facts, but integrating the assertions into a coherent knowledge base and evaluating across different WIE techniques remains a challenge. We propose a framework that utilizes natural language to integrate and evaluate extracted knowledge bases (KBs). In the framework, KBs are integrated by exchanging probability distributions over natural language, and evaluated by how well the output distributions predict held-out text. We describe the advantages of the approach, and detail remaining research challenges.
