Learning Expressive Ontological Concept Descriptions via Neural Networks
Marco Rospocher
First things first
University of Trento - September 21, 2018


  1. Framing the Problem: the Transformation Function. “Syntactic transformation of natural language definitions into description logic axioms.” (Völker J., 2008) Example: “A bee is an insect that produces honey.” → bee ⊑ insect ⊓ ∃produces.honey. All the extralogical symbols come from the sentence.

  2. Framing the Problem. Descriptive Language → syntactic transformation → ALCQ. We need: datasets; architecture.

  3. Structure and Meaning. Machine Learning means examples, good examples, many examples.

  4. Structure and Meaning. Machine Learning means examples, good examples, many examples. Every bee is an insect and it also produces honey. A bee is an insect that produces honey. Bees are insects that produce also honey.

  5. Structure and Meaning. Machine Learning means examples, good examples, many examples. Every bee is an insect and it also produces honey. A bee is an insect that produces honey. Bees are insects that produce also honey. Every bee is an insect that produces also honey.

  6. Structure and Meaning. Machine Learning means examples, good examples, many examples. Every bee is an insect and it also produces honey. A bee is an insect that produces honey. Bees are insects that produce also honey. Every bee is an insect that produces also honey.

  7. Structure and Meaning. Machine Learning means examples, good examples, many examples. Every bee is an insect and it also produces honey. A bee is an insect that produces honey. Bees are insects that produce also honey. Every bee is an insect that produces also honey. Many structures, one meaning.

  8. Meaning and Structure. Machine Learning means examples, good examples, many examples.

  9. Meaning and Structure. Machine Learning means examples, good examples, many examples. Bees are insects that produce honey. A bee is also an insect that produces honey. Every bee is an insect and it also produces honey. A cow is a mammal that eats grass.

  10. Meaning and Structure. Machine Learning means examples, good examples, many examples. Bees are insects that produce honey. A bee is also an insect that produces honey. Every bee is an insect and it also produces honey. A cow is a mammal that eats grass. A cow is a mammal that produces milk.

  12. Meaning and Structure. Machine Learning means examples, good examples, many examples. Bees are insects that produce honey. A bee is also an insect that produces honey. Every bee is an insect and it also produces honey. A cow is a mammal that eats grass. A cow is a mammal that produces milk. Many meanings, one structure. (Note: other semantic phenomena are outside the scope of a syntactic transformation approach.)

  13. First Challenge: the Dataset. Desiderata for the dataset (G. Petrucci, “Information Extraction for Learning Expressive Ontologies”, ESWC 2015 Ph.D. Symp.): covers many syntactic constructs (structure); covers many domains (meaning); has annotated <sentence, axiom> pairs.

  14. First Challenge: the Dataset. Desiderata for the dataset (G. Petrucci, “Information Extraction for Learning Expressive Ontologies”, ESWC 2015 Ph.D. Symp.): covers many syntactic constructs (structure); covers many domains (meaning); has annotated <sentence, axiom> pairs. List of suitable datasets:

  15. First Challenge: the Dataset. Desiderata for the dataset (G. Petrucci, “Information Extraction for Learning Expressive Ontologies”, ESWC 2015 Ph.D. Symp.): covers many syntactic constructs (structure); covers many domains (meaning); has annotated <sentence, axiom> pairs. List of suitable datasets: ? Following other notable approaches in the literature, we started building a dataset to train our model.

  16. First Challenge: the Dataset. “A bee is an insect that produces honey.” → bee ⊑ insect ⊓ ∃produces.honey

  17. First Challenge: the Dataset. “A cow is a mammal that eats grass.” → cow ⊑ mammal ⊓ ∃eats.grass

  18. First Challenge: the Dataset. “A [NP] is an [NP] that [VB] [NP]” → C0 ⊑ C1 ⊓ ∃R0.C2

  19. First Challenge: the Dataset. “A [NP] is an [NP] that [VB] [NP]” → C0 ⊑ C1 ⊓ ∃R0.C2. Sentence template: A NP is a NP that VB NP. Formula template: C0 ⊑ C1 ⊓ ∃R0.C2.

  20. First Challenge: the Dataset. “A [NP] is an [NP] that [VB] [NP]” → C0 ⊑ C1 ⊓ ∃R0.C2. Sentence template: A NP is a NP that VB NP. Formula template: C0 ⊑ C1 ⊓ ∃R0.C2. Templates: structural regularities beyond meaning.

  21. Data Generation Process. Context-Free Grammar → sentence template “Every C0 R0 at least NUM C1” with formula template C0 ⊑ ≥ NUM R0.C1. Example: “Every PhD student defends at least 1 thesis.” with PhD student ⊑ ≥ 1 defends.thesis.

  24. Data Generation Process. Context-Free Grammar → sentence template “Every C0 R0 at least NUM C1” with formula template C0 ⊑ ≥ NUM R0.C1. Random example: “Every innocent exile craves at least 100 towers.” with innocent exile ⊑ ≥ 100 craves.towers.
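
The template actualization shown on this slide can be sketched in a few lines of Python. This is only an illustrative sketch, not the generator used in the talk: the names `SENTENCE_TEMPLATE`, `FORMULA_TEMPLATE`, `fill` and `random_pair`, as well as the toy word lists, are assumptions made for the example. The point it shows is that the sentence and the formula are filled from the same slot values, so the pair stays aligned even when the words are drawn at random.

```python
import random
from typing import Dict, Tuple

# One hypothetical template pair, modelled on the slide's example.
SENTENCE_TEMPLATE = "Every {C0} {R0} at least {NUM} {C1}."
FORMULA_TEMPLATE = "{C0} ⊑ ≥ {NUM} {R0}.{C1}"

# Toy vocabularies (assumptions for illustration only).
NOUNS = ["PhD student", "thesis", "innocent exile", "towers"]
VERBS = ["defends", "craves"]


def fill(slots: Dict[str, object]) -> Tuple[str, str]:
    """Actualize both templates with the same slot values."""
    return SENTENCE_TEMPLATE.format(**slots), FORMULA_TEMPLATE.format(**slots)


def random_pair(rng: random.Random) -> Tuple[str, str]:
    """Draw slot fillers at random, ignoring plausibility on purpose."""
    c0, c1 = rng.sample(NOUNS, 2)
    slots = {"C0": c0, "C1": c1, "R0": rng.choice(VERBS), "NUM": rng.randint(1, 100)}
    return fill(slots)


sentence, formula = fill({"C0": "PhD student", "R0": "defends", "NUM": 1, "C1": "thesis"})
print(sentence)
print(formula)
print(random_pair(random.Random(0)))
```

Random fillers produce implausible sentences such as the “innocent exile” one, which is acceptable here because only the structure is meant to be learned.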

  25. Data Generation Process. Descriptive Language; Context-Free Grammar. Steps: template; actualization; (repeat); approximation; sampling... and parsing.

  26. Data Generation Process. Descriptive Language; Context-Free Grammar. Steps: template; actualization; (repeat); approximation; sampling... and parsing. Template: “A C0 is a C1”.

  27. Data Generation Process. Descriptive Language; Context-Free Grammar. Steps: template; actualization; (repeat); approximation; sampling... and parsing. Actualizations: “A cow is a mammal”; “A bee is an insect”; “A shark is a fish”.

  28. Data Generation Process. Descriptive Language; Context-Free Grammar. Steps: template; actualization; (repeat); approximation; sampling... and parsing. Templates: “A C0 is a C1”; “C0 R0 C1”; “C0 as also C1”.

  31. Data Generation Process. Descriptive Language ∼ Context-Free Grammar. Steps: template; actualization; (repeat); approximation; sampling... and parsing.

  34. Data Generation Process. Minimal input preprocessing: lower-cased; “an” → “a”; “doesn’t”, “does not”, “don’t” → “do not”; lemmatised nouns and verbs; numbers → NUM.
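
The preprocessing rules listed above can be sketched as follows. This is a minimal sketch under stated assumptions: the function name `preprocess`, the tokenising regular expression and the tiny `LEMMAS` table are hypothetical, and the table only stands in for whatever lemmatiser was actually used (it deliberately leaves “is” untouched).

```python
import re
from typing import List

# Toy lemma table standing in for a real lemmatiser (assumption).
LEMMAS = {"bees": "bee", "insects": "insect", "produces": "produce", "towers": "tower"}

# The three negation forms named on the slide, all mapped to "do not".
NEGATIONS = re.compile(r"\b(?:doesn't|does not|don't)\b")


def preprocess(sentence: str) -> List[str]:
    """Apply the slide's normalisation steps and return a token list."""
    text = sentence.lower().replace("’", "'")  # lower-case, unify apostrophes
    text = NEGATIONS.sub("do not", text)
    tokens = re.findall(r"[a-z]+|\d+|[^\sa-z\d]", text)
    result = []
    for token in tokens:
        if token.isdigit():
            result.append("NUM")  # numbers -> NUM
        elif token == "an":
            result.append("a")  # "an" -> "a"
        else:
            result.append(LEMMAS.get(token, token))  # toy lemmatisation
    return result


print(preprocess("A bee is an insect that produces honey."))
print(preprocess("An insect doesn't produce 100 towers."))
```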

  35. Second Challenge: the Model. A 2-chapter adventure in the world of Recurrent Neural Networks:
      2014-2015, Tag&Transduce: G. Petrucci, C. Ghidini, and M. Rospocher, “Ontology Learning in the Deep”, EKAW 2016.
      2016-2017, Translate: G. Petrucci, M. Rospocher, and C. Ghidini, “Expressive Ontology Learning as Neural Machine Translation Task” (under review).
      Code & Datasets: https://github.com/dkmfbk/dket

  36. Tagging and Transducing. “A [NP] is an [NP] that [VB] [NP]” → C0 ⊑ C1 ⊓ ∃R0.C2

  37. Tagging and Transducing. “A [NP] is an [NP] that [VB] [NP]” → C0 ⊑ C1 ⊓ ∃R0.C2. Transduction from sentence to formula template;

  38. Tagging and Transducing. “A [NP] is an [NP] that [VB] [NP]” → C0 ⊑ C1 ⊓ ∃R0.C2. Transduction from sentence to formula template; Tagging extralogical symbols at the right place;

  39. Tagging and Transducing. “A bee is an insect that produces honey.”

  40. Tagging and Transducing. “A bee is an insect that produces honey.” Transduction (F): C0 ⊑ C1 ⊓ ∃R0.C2

  41. Tagging and Transducing. “A bee is an insect that produces honey.” Transduction (F): C0 ⊑ C1 ⊓ ∃R0.C2. (?) Bee ⊑ Insect ⊓ ∃produces.Honey

  42. Tagging and Transducing. “A bee is an insect that produces honey.” Transduction (F): C0 ⊑ C1 ⊓ ∃R0.C2. (?) Bee ⊑ Insect ⊓ ∃produces.Honey. Tagging (T): A bee [C0] is an insect [C1] that produces [R0] honey [C2]

  43. Tagging and Transducing. “A bee is an insect that produces honey.” Transduction (F): C0 ⊑ C1 ⊓ ∃R0.C2. (×) Bee ⊑ Insect ⊓ ∃produces.Honey. Tagging (T): A bee [C0] is an insect [C1] that produces [R0] honey [C2]
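
How the two outputs fit together can be sketched as below. This is an illustrative reading of the slides rather than the authors' code: the function name `combine` is hypothetical, and the sketch assumes the transducer yields a list of template tokens while the tagger yields one tag per word, with `w` marking words that carry no placeholder.

```python
from typing import Dict, List, Tuple


def combine(template: List[str], tagged: List[Tuple[str, str]]) -> str:
    """Replace each placeholder in the formula template with its tagged words."""
    fillers: Dict[str, List[str]] = {}
    for word, tag in tagged:
        if tag != "w":  # "w" marks a plain word (assumption)
            fillers.setdefault(tag, []).append(word)
    pieces = [" ".join(fillers[t]) if t in fillers else t for t in template]
    return " ".join(pieces)


template = ["C0", "⊑", "C1", "⊓", "∃", "R0", ".", "C2"]  # transduction (F)
tagged = [  # tagging (T)
    ("A", "w"), ("bee", "C0"), ("is", "w"), ("an", "w"),
    ("insect", "C1"), ("that", "w"), ("produces", "R0"), ("honey", "C2"),
]
print(combine(template, tagged))
```

Because the placeholders are numbered, the template fixes in advance how many concepts and roles a sentence may contain.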

  44. The Transducing Network. Layers, from input to prediction:
      Input: $V(\text{A}), V(\text{bee}), V(\text{is}), \dots, V(\text{<EOS>})$, in general $V(x_i)$.
      Embedding: $x_i = E e_i$ (nodes labelled $x_1 \pm w, x_2 \pm w, x_3 \pm w, \dots, x_n \pm w$).
      Hidden: $h_i = g(x_i, h_{i-1})$, with $h_n = c$.
      Decoding: $h_j = g(c, h_{j-1})$.
      Output: $y_j = \mathrm{softmax}(W h_j + b)$, for $j = 1, \dots, m$.
      Prediction: $\hat{y}_k = \arg\max(y_k)$, e.g. C0 ⊑ ... <EOS>.

  45. The Tagging Network. Layers, from input to tags:
      Input-Windowing: $V(\text{A} \pm w), V(\text{bee} \pm w), V(\text{is} \pm w), \dots, V(\text{<EOS>} \pm w)$, in general $V(x_i \pm w)$.
      Embedding: $x_i = E e_i$ (nodes labelled $x_1 \pm w, x_2 \pm w, x_3 \pm w, \dots, x_n \pm w$).
      Hidden: $h_i = g(x_i, h_{i-1})$.
      Output: $y_i = \mathrm{softmax}(W h_i + b)$, for $i = 1, \dots, n$.
      Tag: $\hat{y}_i = \arg\max(y_i)$, e.g. $t_1 = w$, $t_2 = C_0$, $t_3 = w$, ..., $t_n = \text{<EOS>}$.
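
The input-windowing step, where each word is presented together with its ± w neighbours, can be sketched as follows. The function name `windows` and the `<PAD>` symbol used at the sentence boundaries are assumptions; the slides do not say how the boundaries are handled.

```python
from typing import List


def windows(tokens: List[str], w: int, pad: str = "<PAD>") -> List[List[str]]:
    """Return, for each token, the token with its w left and w right neighbours."""
    padded = [pad] * w + list(tokens) + [pad] * w
    return [padded[i:i + 2 * w + 1] for i in range(len(tokens))]


for window in windows(["A", "bee", "is", "<EOS>"], w=1):
    print(window)
```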

  46. Tagging and Transducing: Evaluation.
      RQ1. To what degree is the network able to generalize over the syntactic structures of descriptive language? (many structures, one meaning)
      RQ2. To what degree is the network able to tolerate words that were not seen during the training phase? (many meanings, one structure)

  47. Tagging and Transducing: Evaluation. Evaluation Metrics:
      Avg. Per-Formula Acc. (fully automated): $FA(\hat{F}, F) = \frac{CF}{M} = \frac{1}{M} \sum_{k=1}^{M} \begin{cases} 1, & \text{if } f_k \equiv \hat{f}_k \\ 0, & \text{otherwise} \end{cases}$
      Avg. Edit Distance (semi-automated): $ED(\hat{F}, F) = \frac{1}{M} \sum_{k=1}^{M} \delta(f_k, \hat{f}_k)$
      Avg. Per-Token Acc. (quick control): $TA(\hat{F}, F) = \frac{\sum_{k=1}^{M} \sum_{j=1}^{T_{f_k}} \begin{cases} 1, & \text{if } f_k^j = \hat{f}_k^j \\ 0, & \text{otherwise} \end{cases}}{\sum_{k=1}^{M} T_{f_k}}$
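
The three metrics can be computed with a short script. This is a sketch under assumptions: formulas are represented as token lists, the equivalence test in FA is taken to be exact token equality, and δ is taken to be the token-level Levenshtein distance; the slides do not spell out either choice. The names `edit_distance` and `evaluate` are hypothetical.

```python
from typing import List, Tuple

Formula = List[str]


def edit_distance(a: Formula, b: Formula) -> int:
    """Token-level Levenshtein distance (assumed reading of delta)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def evaluate(gold: List[Formula], pred: List[Formula]) -> Tuple[int, float, float, float]:
    """Return (CF, FA, ED, TA) over M aligned gold/predicted formulas."""
    m = len(gold)
    cf = sum(g == p for g, p in zip(gold, pred))  # correct formulas
    fa = cf / m
    ed = sum(edit_distance(g, p) for g, p in zip(gold, pred)) / m
    total_tokens = sum(len(g) for g in gold)
    hits = sum(
        1
        for g, p in zip(gold, pred)
        for j, token in enumerate(g)
        if j < len(p) and p[j] == token  # position-wise token match
    )
    return cf, fa, ed, hits / total_tokens


gold = [["C0", "⊑", "C1"], ["C0", "⊑", "C1", "⊓", "C2"]]
pred = [["C0", "⊑", "C1"], ["C0", "⊑", "C2"]]
print(evaluate(gold, pred))
```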

  48. Tagging and Transducing: Evaluation. Different training set sizes; 2M test examples; <UNK> between 20% and 40%.

  49. Tagging and Transducing: Evaluation. Different training set sizes; 2M test examples; <UNK> between 20% and 40%.
      training set size    CF     FA         ED      TA
      1000                 10     0.5e-5     2.67    0.90
      2000                 161    8.05e-5    1.34    0.95
      3000                 60     3.00e-5    1.22    0.96
      4000                 22     1.10e-5    1.07    0.97

  50. Tagging and Transducing: Evaluation. Different training set sizes; 2M test examples; <UNK> between 20% and 40%.
      training set size    CF     FA         ED      TA
      1000                 10     0.5e-5     2.67    0.90
      2000                 161    8.05e-5    1.34    0.95
      3000                 60     3.00e-5    1.22    0.96
      4000                 22     1.10e-5    1.07    0.97
      Many limitations: we dropped the project and moved forward.

  51. Moving forward (aka 1 > 2). The placeholders are numbered in the training set, so by design there is no way to overcome this limit, namely to generalize over the length of the sentence.

  52. Moving forward (aka 1 > 2). “Negramaro is a red and strong wine.” → negramaro ⊑ red wine ⊓ strong wine

  53. Moving forward (aka 1 > 2). “Negramaro is a red and strong wine.” → negramaro ⊑ red wine ⊓ strong wine. Transduction (F): C0 ⊑ C1 ⊔ C2. (×)

  54. Moving forward (aka 1 > 2). “Negramaro is a red and strong wine.” → negramaro ⊑ red wine ⊓ strong wine. Transduction (F): C0 ⊑ C1 ⊔ C2. (×) Tagging (T): Negramaro [C0] is a red [C1] and strong [C2] wine [C2].

  55. Moving forward (aka 1 > 2). “Negramaro is a red and strong wine.” → negramaro ⊑ red wine ⊓ strong wine. Transduction (F): C0 ⊑ C1 ⊔ C2. (×) negramaro ⊑ red ⊔ strong wine. Tagging (T): Negramaro [C0] is a red [C1] and strong [C2] wine [C2].

  56. Translate.

  57. Translate. Input: “A bee is an insect that produces honey.”

  58. Translate. Input: “A bee is an insect that produces honey.” Actions: copy(#2). Output so far: bee

  59. Translate. Input: “A bee is an insect that produces honey.” Actions: copy(#2), emit(⊑). Output so far: bee ⊑

  60. Translate. Input: “A bee is an insect that produces honey.” Actions: copy(#2), emit(⊑), copy(#5). Output so far: bee ⊑ insect

  61. Translate. Input: “A bee is an insect that produces honey.” Actions: copy(#2), emit(⊑), copy(#5), emit(⊓). Output so far: bee ⊑ insect ⊓

  62. Translate. Input: “A bee is an insect that produces honey.” Actions: copy(#2), emit(⊑), copy(#5), emit(⊓), emit(∃). Output so far: bee ⊑ insect ⊓ ∃

  63. Translate. Input: “A bee is an insect that produces honey.” Actions: copy(#2), emit(⊑), copy(#5), emit(⊓), emit(∃), copy(#7). Output so far: bee ⊑ insect ⊓ ∃produces

  64. Translate. Input: “A bee is an insect that produces honey.” Actions: copy(#2), emit(⊑), copy(#5), emit(⊓), emit(∃), copy(#7), emit(.). Output so far: bee ⊑ insect ⊓ ∃produces.

  65. Translate. Input: “A bee is an insect that produces honey.” Actions: copy(#2), emit(⊑), copy(#5), emit(⊓), emit(∃), copy(#7), emit(.), copy(#8). Output: bee ⊑ insect ⊓ ∃produces.honey. Quasi-zero vocabulary setting.
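
The copy/emit sequence built up over these slides can be replayed with a small interpreter. This sketch only executes a given action list; it does not model how the network chooses the actions. The function name `decode` and the tuple encoding of actions are assumptions, and positions are 1-based as in the slide's `copy(#2)` notation.

```python
from typing import List, Tuple, Union

Action = Tuple[str, Union[int, str]]


def decode(actions: List[Action], sentence: List[str]) -> List[str]:
    """Build the output formula by copying input words or emitting logical symbols."""
    output = []
    for kind, arg in actions:
        if kind == "copy":
            output.append(sentence[arg - 1])  # 1-based position in the input
        elif kind == "emit":
            output.append(arg)  # logical symbol from the small output vocabulary
        else:
            raise ValueError(f"unknown action: {kind}")
    return output


sentence = "A bee is an insect that produces honey".split()
actions = [
    ("copy", 2), ("emit", "⊑"), ("copy", 5), ("emit", "⊓"),
    ("emit", "∃"), ("copy", 7), ("emit", "."), ("copy", 8),
]
print(" ".join(decode(actions, sentence)))
```

Since every extralogical symbol is copied from the input, the output vocabulary only needs the logical symbols, which matches the quasi-zero vocabulary setting named on the slide.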

  66. Translate. Two actions, selected by $z_t$: emit logical symbol; copy input word (attention over $x_1, x_2, \dots, x_n$).

  67. Translate. Input sequence: $x_1, x_2, \dots, x_i, \dots, x_{T_x}$.

  68. Translate. Input sequence: $x_1, x_2, \dots, x_i, \dots, x_{T_x}$. Encoder states: $h_1 = g_e(x_1, h_0)$, $h_2 = g_e(x_2, h_1)$, ..., $h_i = g_e(x_i, h_{i-1})$, ..., $h_{T_x} = g_e(x_{T_x}, h_{T_x - 1})$.

  69. Translate. Input sequence: $x_1, x_2, \dots, x_i, \dots, x_{T_x}$. Encoder states: $h_1 = g_e(x_1, h_0)$, $h_2 = g_e(x_2, h_1)$, ..., $h_i = g_e(x_i, h_{i-1})$, ..., $h_{T_x} = g_e(x_{T_x}, h_{T_x - 1})$. Attention over $h_1, h_2, \dots, h_i, \dots, h_{T_x}$, given $d_{j-1}$.

  70. Translate. Input sequence: $x_1, x_2, \dots, x_i, \dots, x_{T_x}$. Encoder states: $h_1 = g_e(x_1, h_0)$, $h_2 = g_e(x_2, h_1)$, ..., $h_i = g_e(x_i, h_{i-1})$, ..., $h_{T_x} = g_e(x_{T_x}, h_{T_x - 1})$. Attention over $h_1, h_2, \dots, h_i, \dots, h_{T_x}$, given $d_{j-1}$, yields $c_j$. Decoder state: $d_j = g_d(\dots, d_{j-1})$, with inputs $\tilde{y}_{j-1}$, $d_{j-1}$ and $c_j$.
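
The attention step in this diagram can be illustrated with a small pure-Python sketch. The slides do not give the scoring function, so a plain dot product between the previous decoder state and each encoder state is assumed here, followed by a softmax; the function name `attention` is hypothetical.

```python
import math
from typing import List, Tuple

Vector = List[float]


def attention(d_prev: Vector, encoder_states: List[Vector]) -> Tuple[List[float], Vector]:
    """Return attention weights over the encoder states and the context vector c_j."""
    # Dot-product scores (assumed; the talk does not specify the scoring function).
    scores = [sum(a * b for a, b in zip(d_prev, h)) for h in encoder_states]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]  # numerically stable softmax
    norm = sum(exps)
    weights = [e / norm for e in exps]
    dim = len(encoder_states[0])
    context = [sum(w * h[k] for w, h in zip(weights, encoder_states)) for k in range(dim)]
    return weights, context


weights, context = attention([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
print(weights, context)
```

In a copy mechanism of this kind, the same weights can also indicate which input position to copy from.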
