extracting semantic relations from unlabeled text
play

Extracting semantic relations from unlabeled text Chandra Prakash - PowerPoint PPT Presentation

Introduction Problem Statement Related Work Infinite Relational Model References Extracting semantic relations from unlabeled text Chandra Prakash Vishal Kumar Gupta Mentor: Dr. Amitabha Mukerjee March 21, 2013 Chandra Prakash, Vishal


  1. Introduction Problem Statement Related Work Infinite Relational Model References Extracting semantic relations from unlabeled text Chandra Prakash Vishal Kumar Gupta Mentor: Dr. Amitabha Mukerjee March 21, 2013 Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  2. Introduction Problem Statement Related Work Infinite Relational Model References 1 Introduction Motivation Hardness 2 Problem Statement 3 Related Work 4 Infinite Relational Model Algorithm 5 References Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  3. Introduction Problem Statement Motivation Related Work Hardness Infinite Relational Model References Motivation A 14yo bxr owned by a reputable breeder is being treated for IBD with pred. Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  4. Introduction Problem Statement Motivation Related Work Hardness Infinite Relational Model References Motivation [A 14yo bxr] ANIMAL owned by [a reputable breeder] HUMAN is being treated for [IBD] DISEASE with [pred] DRUG . [4] Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  5. Introduction Problem Statement Motivation Related Work Hardness Infinite Relational Model References Why the problem is hard ? Huge amount of data available on the web No manual tags or labels available Don’t know exactly how many types of entities are present Presence of many irrelevant relations as noise Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  6. Introduction Problem Statement Related Work Infinite Relational Model References Problem Definition Given a corpus of data of extracted relational tuples of the form r ( a , b ), clusters the data using their relationship and also determine the best match for a given relation. Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  7. Introduction Problem Statement Related Work Infinite Relational Model References Related Work TextRunner: identifies relational tuples in one pass of the web [3] Semantic Network Extractor: Jointly cluster relation and object string [5] Infinite Relational Model [1] Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  8. Introduction Problem Statement Related Work Algorithm Infinite Relational Model References Algorithm Specification P ( z 1 , .... z n | R 1 , R 2 , .... R m ) Generative Model m n P ( R i | z 1 , ... z n ) � � P ( z j ) P ( R 1 , R 2 , .... R m , z 1 , .... z n ) = i =1 j =1 P ( z j ) is calculated using Chinese Restaurant Process R ( i , j ) | z , η ( a , b ) is calculated using Bernoulli Distribution Chinese Restaurant Process (CRP) also determines the number of clusters Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  9. Introduction Problem Statement Related Work Algorithm Infinite Relational Model References Output Matrics Figure: Output Matrics Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  10. Introduction Problem Statement Related Work Infinite Relational Model References References 1 Kemp Charles, Tenenbaum Joshua B, Griffiths Thomas L, Yamada Takeshi, and Ueda Naonori. Learning systems of concepts with an infinite relational model. 21(1):381, 2006. 2 Turney Peter D, Pantel Patrick, et al. From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research, 37(1):141–188, 2010. 3 Banko Michele. Open information extraction for the web. PhD thesis, University of Washington, 2009. 4 Huang Ruihong and Riloff Ellen. Inducing domain specific semantic class taggers from (almost) nothing. Proceedings of the Association for Computational Linguistics (ACL), 2010. 5 Kok Stanley and Domingos Pedro. Extracting semantic networks from text via relational clustering. Proceedings of ECML, 2008. Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  11. Introduction Problem Statement Related Work Infinite Relational Model References Source code and Dataset The Source code for IRM is publicly available at http://www.psy.cmu.edu/˜ckemp/code/irm.html The dataset is available at http://knight.cis.temple.edu/˜yates/data/resolver data.tar.gz Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  12. Introduction Problem Statement Related Work Infinite Relational Model References Questions !! Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  13. Introduction Problem Statement Related Work Infinite Relational Model References Formulae Specifications Generative Model m n P ( R i | z 1 , ... z n ) P ( R 1 , R 2 , .... R m , z 1 , .... z n ) = � � P ( z j ) i =1 j =1 Generating Clusters (CRP) n a P ( z i = a | z 1 , ..., z i − 1 ) = i − 1+ γ if n a > 0 P ( z i = a | z 1 , ..., z i − 1 ) = i − 1+ γ if a is a new cluster γ Generating Relations from clusters z | γ ∼ CRP ( γ ) η ( a , b ) | β ∼ Beta ( β, β ) R ( i , j ) | z , η ∼ Bernoulli ( η ( z i , z j )) Inference Beta ( m ( a , b ) + β ) , Beta ( ¯ m ( a , b ) + β ) � P ( R | z ) = Beta ( β, β ) a , b ǫ N Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  14. Introduction Problem Statement Related Work Infinite Relational Model References Semantic Network Extractor Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

  15. Introduction Problem Statement Related Work Infinite Relational Model References Thank You Chandra Prakash, Vishal Kumar Gupta CS365: Course Project

Recommend


More recommend