Applications of Latent Entity Networks in Information Retrieval

  1. Applications of Latent Entity Networks in Information Retrieval. Andreas Spitz, Michael Gertz. Heidelberg University, Institute of Computer Science, Database Systems Research Group. {spitz,gertz}@informatik.uni-heidelberg.de. Workshop Internationale Klima- und Energiediskurse, Darmstadt, May 26, 2017

  2. Motivation Latent Network Extraction Contextual Entity Topics Network Information Retrieval Summary Latent Entity Networks in Information Retrieval Andreas Spitz 1 of 11

  4. Motivation
     Definition: Event. “Something that happens at a given place and time between a group of actors.” [CSG+02]
     For large document collections such as corpora of newspapers, how can we...
     • obtain events from unstructured text?
     • identify connections across documents?
     • support entity-centric event search?

  9. Information Network Extraction from Text [SG16]

  11. Edge Weight Generation [SG16]
      For edges (x, y) for which y is a page or sentence, count only (co-)occurrences:
      $$\omega(x, y) = \begin{cases} 1 & \text{if } y \text{ contains } x \\ 0 & \text{otherwise} \end{cases}$$
      For edges (x, y) between entity types and terms, aggregate the co-occurrence instances I by summing over similarities derived from sentence distances s:
      $$\omega(x, y) := \sum_{i \in I} \exp(-s(x, y, i))$$
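
The two weighting rules on the edge-weight slide can be illustrated with a small Python sketch. The function names and the toy distances are hypothetical and are not taken from the authors' code; the sketch only assumes that each co-occurrence instance i contributes exp(−s), where s is its sentence distance.

```python
import math
from typing import Iterable, Set


def containment_weight(x: str, contents_of_y: Set[str]) -> int:
    """Binary weight for an edge (x, y) where y is a page or sentence.

    `contents_of_y` is a hypothetical representation of y as the set of
    entities and terms it contains.
    """
    return 1 if x in contents_of_y else 0


def cooccurrence_weight(sentence_distances: Iterable[float]) -> float:
    """Weight for an edge (x, y) between entities and/or terms.

    Each element of `sentence_distances` is the distance s(x, y, i) of one
    co-occurrence instance i; the weight sums exp(-s) over all instances.
    """
    return sum(math.exp(-s) for s in sentence_distances)


if __name__ == "__main__":
    # Toy example: x and y co-occur three times, at distances 0, 1 and 2.
    print(cooccurrence_weight([0, 1, 2]))  # approximately 1.503
    print(containment_weight("Q192", {"Q192", "Q145"}))
```

Under this reading, co-occurrences within the same sentence (distance 0) contribute 1 each, and the contribution decays exponentially as the mentions move further apart.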

  12. Entity Topics: Brexit
      [Figure: Topics for David Cameron (Q192) − UK (Q145). x-axis: date (Jun to Oct); y-axis: relative frequency of mentions (0.00 to 1.00). Terms (stemmed): brexit, nation, favour, referendum, ukip, vote, prime, minist, leader, demand, govern, westminst, campaign, resign, pro-brexit]

  13. Entity Topics: Olympic Games
      [Figure: Topics for Brazil (Q155) − IOC (Q40970). x-axis: date (Jun to Oct); y-axis: relative frequency of mentions (0.00 to 1.00). Terms (stemmed): region, decad, crisis, olymp, game, athlet, silver, bronz, gold, insist, corrupt, sport, event, medal, medalist]

  14. Event Extraction and Search
      Intuition:
      • Events correspond to patterns in the network (e.g., triangular structures)
      • Participating entities can be used to complete events
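
One possible reading of the intuition above, sketched in Python: given a partial event (some known entities), candidates that close a triangle with all of them are ranked by their summed edge weights. The graph layout, the function names, and the scoring by summed weights are assumptions made for illustration; the slides do not specify the ranking used by the authors.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Graph = Dict[str, Dict[str, float]]


def build_graph(edges: Iterable[Tuple[str, str, float]]) -> Graph:
    """Build an undirected weighted graph from (x, y, weight) triples."""
    graph: Graph = defaultdict(dict)
    for x, y, w in edges:
        graph[x][y] = w
        graph[y][x] = w
    return graph


def complete_event(graph: Graph, query: List[str], k: int = 3) -> List[Tuple[str, float]]:
    """Rank nodes adjacent to every query entity by summed edge weight."""
    if not query:
        return []
    neighbour_sets = [set(graph.get(q, {})) for q in query]
    # Only nodes connected to all query entities close the pattern.
    candidates = set.intersection(*neighbour_sets) - set(query)
    scored = [(c, sum(graph[q][c] for q in query)) for c in candidates]
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scored, key=lambda item: (-item[1], item[0]))[:k]


if __name__ == "__main__":
    # Hypothetical toy network with one actor, one place and two dates.
    toy = build_graph([
        ("actor_a", "place_p", 2.0),
        ("actor_a", "date_d", 1.5),
        ("place_p", "date_d", 1.0),
        ("actor_a", "date_e", 0.5),
    ])
    print(complete_event(toy, ["actor_a", "place_p"]))
```

In the toy network only `date_d` is linked to both query entities, so it is the only candidate that completes the actor-place-date triangle.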

  15. Event and Entity Search and Exploration [SAG17]
      EVELIN: Exploration of Event and Entity Links in Information Networks
      Available for:
      Wikipedia: http://evelin.ifi.uni-heidelberg.de/
      News Corpus: http://evelin.ifi.uni-heidelberg.de:7777

  16. Summary
      Latent Entity Networks:
      • fast entity and event exploration
      • can support most entity-related Information Extraction tasks
      • can be extended to any kind of entity
      • scalable and fast
      • language-agnostic with entity linking

  17. Available for download:
      • Wikipedia latent entity networks
      • Code for generating latent entity networks
      • Code for the query interface
      http://dbs.ifi.uni-heidelberg.de/index.php?id=load

  19. Bibliography
      [CSG+02] Christopher Cieri, Stephanie Strassel, David Graff, Nii Martey, Kara Rennert, and Mark Liberman. Corpora for topic detection and tracking. In Topic Detection and Tracking. Springer, 2002.
      [SAG17] Andreas Spitz, Satya Almasian, and Michael Gertz. Evelin: Exploration of event and entity links in implicit networks. In WWW, 2017.
      [SG16] Andreas Spitz and Michael Gertz. Terms over LOAD: Leveraging named entities for cross-document extraction and summarization of events. In SIGIR, 2016.
