knowledge extraction in text
play

Knowledge Extraction in Text Marco Ponza Who am I? Last-year PhD - PowerPoint PPT Presentation

Knowledge Extraction in Text Marco Ponza Who am I? Last-year PhD student... Nov 2015 Aug 2017 Oct 2018 Mar 2019 Moved to PhD Thesis PhD Defense Started PhD Germany Submitted Advanced Algorithms & Applications Lab supervisor: Prof.


  1. Knowledge Extraction in Text Marco Ponza

  2. Who am I? Last-year PhD student... Nov 2015 Aug 2017 Oct 2018 Mar 2019 Moved to PhD Thesis PhD Defense Started PhD Germany Submitted Advanced Algorithms & Applications Lab supervisor: Prof. Paolo Ferragina RESEARCH Data Compression Searching & Mining Web & Social Media Natural Language Understanding

  3. Natural Language Understanding Easy for humans Hard for machines But, machines need today to access, read and understand information stored in very large data archives ...and this will get to be more and more crucial with Conversational AI systems!

  4. Natural Language Understanding ▷ Machines represent texts by their (possibly, ambiguous) words Leonardo is the scientist who painted Mona Lisa Leonardo scientist paint Mona Lisa

  5. Natural Language Understanding ▷ Machines represent texts by their (possibly, ambiguous) words Leonardo is the scientist who painted Mona Lisa Leonardo scientist paint Mona Lisa

  6. Natural Language Understanding ▷ Machines represent texts by their (possibly, ambiguous) words Leonardo is the scientist who painted Mona Lisa Leonardo Leonardo (Town) DiCaprio Leonardo scientist paint Mona Lisa Leonardo (Ninja Turtle) Leonardo da Vinci

  7. Knowledge Graph May 2012 https://www.blog.google/products/search/introducing-knowledge-graph-things-not

  8. Understanding the Text by Entities, not Strings Leonardo is the scientist who painted Mona Lisa Science Italy Renaissance Leonardo da Vinci Mona Lisa (painting) Louvre Cartography Art Florence Map ambiguous words into the real-world entities they refer to as well as contextualize them together with related entities

  9. Understanding the Text by Entities, not Strings Since 2010: Efficient & effective solutions for this problem! Two Editions: 2010 & 2013 Map ambiguous words into the real-world entities they refer to as well as contextualize them together with related entities

  10. Understanding the Text by Salient Entities Hilary Clinton George W Bush Barack Obama Hawaii People ...and more and more entities!

  11. Understanding the Text by Salient Entities ▷ Entity Salience Problem Relevant vs Non-Relevant Entities Hilary Clinton George W Bush Barack Obama Hawaii People ...and more and more entities!

  12. Understanding the Text by Salient Entities ▷ Entity Salience Problem Relevant vs Non-Relevant Entities Hilary Clinton ▷ Our Solution George W Bush Barack Obama Improvements of +12% ○ wrt CMU/Google system Hawaii ○ Published at People ...and more and more entities! Research Grant 2017

  13. Understanding the Text by Salient Entities ▷ How? Applying Graph Theory and Algorithms! Used to draw new features Classify entities into salient/non-salient via ML Text ▷ Problem: How can we weight the edges ?

  14. Understanding the Text by Extraction of Facts How to enrich the ▷ Knowledge Graph with information that comes from a text ? Identify salient entities Extract facts Text Knowledge Graph connecting them Leonardo is the scientist who painted Mona Lisa (“Leonardo”, “is”, “scientist”) Facts (“Leonardo”, “painted”, “Mona Lisa”) EMNLP 2018 Brussels

  15. Expert Finding & Profiling ~1.5K Authors ▷ ~65K Documents (papers’ abstracts) ▷ ~35K Research Topics ▷ More than 1K queries and ~2K profiles view in few months ▷ Currently used by UniPi’s Technology Transfer Office ▷

  16. 2019

  17. Future Directions Moving the paradigms: from text to conversations !

  18. SYSTEMS / swat.d4science.org http:/ / wiser.d4science.org http:/ https:/ /sobigdata.d4science.org/web/tagme Thanks!

Recommend


More recommend