Mining Rhetorical Devices by means of Natural Language Processing
Viorel Morari (viorel.morari@uni-weimar.de), Chair of Web Technology and Information Systems
Master Thesis Defense, January 23rd, 2018
Prof. Dr. Benno Stein; Advisor: Khalid


  1. Omission: Hypozeugma. Hypozeugma: placing last, in a construction containing several words or phrases of equal value, the word or words on which all of them depend (Silva Rhetoricae). Example: "A rooster, a prince and a lion walk into a bar…" Stanford Dependencies: governor-dependent relations over the example sentence.

  2. Omission: Hypozeugma. Hypozeugma: placing last, in a construction containing several words or phrases of equal value, the word or words on which all of them depend (Silva Rhetoricae). Example: "A rooster, a prince and a lion walk into a bar…" Stanford Dependencies: parse of the example sentence. UIMA Ruta match: "rooster, a prince and a lion walk".
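
The slides detect hypozeugma with Stanford Dependencies and a UIMA Ruta rule, neither of which is reproduced here. As a rough illustration of the idea only, the following Python sketch works on hand-written (governor, relation, dependent) triples for the example sentence and reports a verb whose subject is a coordinated list that precedes it. The function name `find_hypozeugma`, the `min_items` threshold, and the triple layout are assumptions made for this sketch, not part of the thesis.

```python
from collections import defaultdict

# Hand-written dependency triples for the example sentence (an assumption:
# a real system would obtain these from a parser). Tokens are (index, word).
EXAMPLE_DEPS = [
    ((9, "walk"), "nsubj", (2, "rooster")),
    ((2, "rooster"), "conj", (5, "prince")),
    ((2, "rooster"), "conj", (8, "lion")),
    ((2, "rooster"), "cc", (6, "and")),
    ((9, "walk"), "prep", (10, "into")),
    ((10, "into"), "pobj", (12, "bar")),
]


def find_hypozeugma(deps, min_items=2):
    """Return (coordinated subjects, governing verb) pairs.

    A hit needs at least `min_items` coordinated subjects, all of which
    come before the verb they depend on.
    """
    # Collect the conjuncts attached to each first conjunct.
    conjuncts = defaultdict(list)
    for governor, relation, dependent in deps:
        if relation == "conj":
            conjuncts[governor].append(dependent)

    hits = []
    for governor, relation, dependent in deps:
        if relation != "nsubj":
            continue
        items = sorted([dependent] + conjuncts[dependent])
        # The shared verb must follow the last coordinated item.
        if len(items) >= min_items and items[-1][0] < governor[0]:
            hits.append(([word for _, word in items], governor[1]))
    return hits


print(find_hypozeugma(EXAMPLE_DEPS))
```

The sketch only covers coordinated subjects; other dependents that share a final governor would need additional relations.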

  9. Repetition: Epanalepsis. Epanalepsis: repeats the word or words that begin a sentence at its end. Example: "Our eyes saw it, but we could not believe our eyes."

  11. Repetition: Epanalepsis. Epanalepsis: repeats the word or words that begin a sentence at its end. Example: "Our eyes saw it, but we could not believe our eyes." Highlighted tokens: "Our eyes saw" at the start and "believe our eyes." at the end.
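
A minimal Python sketch of the epanalepsis check, assuming a simple regular-expression tokenizer and a comparison of the first and last few words of a sentence. The function name `epanalepsis` and the `max_len` limit are hypothetical; the slides do not show the actual rule.

```python
import re


def epanalepsis(sentence, max_len=3):
    """Return the opening words repeated at the end of the sentence, or None."""
    # Lowercase and keep only word-like tokens, dropping punctuation.
    words = re.findall(r"[a-z']+", sentence.lower())
    # Try the longest candidate first; the two spans must not overlap.
    for k in range(min(max_len, len(words) // 2), 0, -1):
        if words[:k] == words[-k:]:
            return words[:k]
    return None


print(epanalepsis("Our eyes saw it, but we could not believe our eyes."))
```

A check this simple will also fire on sentences that merely start and end with the same function word, so a real detector would likely need a filter for such cases.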

  18. Custom: If-conditional 2. If-conditional 2: expresses consequences that are unrealistic or unlikely to happen in the future. Example: "If I were president, I would cut taxes."

  19. Custom: If-conditional 2. If-conditional 2: expresses consequences that are unrealistic or unlikely to happen in the future. Example: "If I were president, I would cut taxes." Stanford Dependencies: parse of the example sentence.

  20. Custom: If-conditional 2. If-conditional 2: expresses consequences that are unrealistic or unlikely to happen in the future. Example: "If I were president, I would cut taxes." Stanford Dependencies: governor-dependent relations over the example sentence.

  22. Custom: If-conditional 2. If-conditional 2: expresses consequences that are unrealistic or unlikely to happen in the future. Example: "If I were president, I would cut taxes." Stanford Dependencies: the parse separates the P-clause ("If I were president") from the Q-clause ("I would cut taxes").

  23. Custom: If-conditional 2. If-conditional 2: expresses consequences that are unrealistic or unlikely to happen in the future. Example: "If I were president, I would cut taxes." Stanford Dependencies: parse of the example sentence. UIMA Ruta match: "If I were president I would cut".
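
The P-clause / Q-clause split can be approximated without a parser. The sketch below is a crude regular-expression heuristic in Python, not the Stanford Dependencies plus UIMA Ruta rule from the slides: it assumes the condition comes first, is closed by a comma, and contains a past-tense cue, and that the consequence uses "would", "could" or "might" followed by a word other than "have". The pattern `IF2` and the function `split_if_conditional_2` are names invented for this illustration.

```python
import re

# Assumed surface pattern: "If <past-tense clause>, <would/could/might + verb>".
IF2 = re.compile(
    r"^\s*if\s+(?P<p>[^,]*\b(?:were|was|had|\w+ed)\b[^,]*),\s*"
    r"(?P<q>.*\b(?:would|could|might)\s+(?!have\b)\w+.*)$",
    re.IGNORECASE,
)


def split_if_conditional_2(sentence):
    """Return (P-clause, Q-clause) if the heuristic matches, else None."""
    match = IF2.match(sentence)
    if match is None:
        return None
    return match.group("p").strip(), match.group("q").strip()


print(split_if_conditional_2("If I were president, I would cut taxes."))
```

The negative lookahead for "have" is there to keep sentences such as "If I had known, I would have left." out of this class. Irregular past forms other than the few listed are not covered.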

  29. Evaluation dataset. Sources of rhetorical devices: literature, political speeches, expert websites, commercials, Bible.

  30. Evaluation dataset. Sources of rhetorical devices: literature, political speeches, expert websites, commercials, Bible. Evaluation measures: $\text{Precision} = \frac{tp}{tp + fp}$, $\text{Recall} = \frac{tp}{tp + fn}$, $F_1\text{-score} = 2 \cdot \frac{\text{precision} \cdot \text{recall}}{\text{precision} + \text{recall}}$.
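
The three evaluation measures translate directly into code. The following Python sketch uses the same symbols (tp, fp, fn); the zero-denominator guards and the counts in the usage lines are illustrative assumptions, not values from the thesis.

```python
def precision(tp, fp):
    """Share of detected instances that are correct: tp / (tp + fp)."""
    return tp / (tp + fp) if tp + fp else 0.0


def recall(tp, fn):
    """Share of true instances that were detected: tp / (tp + fn)."""
    return tp / (tp + fn) if tp + fn else 0.0


def f1_score(p, r):
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if p + r else 0.0


# Made-up counts, for illustration only.
p = precision(tp=8, fp=2)
r = recall(tp=8, fn=8)
print(p, r, f1_score(p, r))
```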

  31. Evaluation Results. [Figure: bar charts for the panels "Balance schemes" and "Omission schemes"; devices shown: Enumeration, Hypozeugma, Epizeugma, Isocolon, Pysma, Asyndeton; y-axis from 0 to 1.]

  32. Evaluation Results. [Figure: precision and recall bar charts for the panels "Balance schemes" and "Omission schemes"; devices shown: Enumeration, Hypozeugma, Epizeugma, Isocolon, Pysma, Asyndeton; y-axis from 0 to 1.]

  33. Evaluation Results. [Figure: precision and recall bar charts for the panels "Balance schemes" and "Omission schemes"; devices shown: Enumeration, Hypozeugma, Epizeugma, Isocolon, Pysma, Asyndeton; y-axis from 0 to 1.] Example for Enumeration and Asyndeton: "Old farmer had a pig, a dog, a cow and a horse."

  34. Evaluation Results. [Figure: bar charts for the panels "Balance schemes", "Omission schemes", "Custom schemes" and "Repetition schemes"; y-axis from 0 to 1. Devices in the first two panels: Enumeration, Hypozeugma, Epizeugma, Isocolon, Pysma, Asyndeton. Devices in the last two panels: Whether-cond, If-counterFact, Unless-cond, If-cond 0, If-cond 1, If-cond 2, If-cond 3, Passive Voice, Comp. Adv., Super. Adv., Comp. Adj., Super. Adj., Polysyndeton, Epanalepsis, Anadiplosis, Mesarchia, Epiphora, Epizeuxis, Diacope, Mesod.]

  36. Evaluation Results: F1-Score. [Figure: F1-score bar charts for the panels "Balance schemes", "Omission schemes", "Custom schemes" and "Repetition schemes"; y-axis from 0 to 1. Devices in the first two panels: Enumeration, Hypozeugma, Epizeugma, Isocolon, Pysma, Asyndeton. Devices in the last two panels: Whether-cond, If-counterFact, Unless-cond, If-cond 0, If-cond 1, If-cond 2, If-cond 3, Passive Voice, Comp. Adv., Super. Adv., Comp. Adj., Super. Adj., Polysyndeton, Epanalepsis, Anadiplosis, Mesarchia, Epiphora, Epizeuxis, Diacope, Mesod.]

  37. Pipeline output

  39. 2 Analysis of Rhetorical Devices

  40. Pipeline output

  41. Pipeline output: group distribution, frequency, combinations
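
Frequency and combination statistics of this kind can be sketched in Python with two counters. The input format (one list of device names per sentence) and the function name `device_statistics` are assumptions for illustration; the slides do not specify how the pipeline output is stored.

```python
from collections import Counter
from itertools import combinations


def device_statistics(annotated_sentences):
    """Count how often each device occurs and which pairs co-occur.

    `annotated_sentences` is an iterable of lists of device names,
    one list per sentence (hypothetical format).
    """
    frequency = Counter()
    pair_counts = Counter()
    for devices in annotated_sentences:
        unique = sorted(set(devices))  # count each device once per sentence
        frequency.update(unique)
        pair_counts.update(combinations(unique, 2))
    return frequency, pair_counts


# Made-up annotations, for illustration only.
sample = [
    ["Epanalepsis", "Enumeration"],
    ["Enumeration"],
    ["Enumeration", "Epanalepsis", "Isocolon"],
]
frequency, pair_counts = device_statistics(sample)
print(frequency.most_common())
print(pair_counts.most_common())
```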

  42. Pipeline output: precision and recall per device. Confidence interval for a device rd detected n times: from n * precision(rd) to n / recall(rd) (Al-Khatib et al. [2017]).
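
Read as bounds on the true number of occurrences, the interval can be computed as in the Python sketch below: the lower bound discounts likely false positives, the upper bound compensates for likely misses. The function name `count_interval` and the numbers in the usage line are illustrative assumptions.

```python
def count_interval(n, precision, recall):
    """Return (n * precision, n / recall) for a device detected n times."""
    if recall <= 0:
        raise ValueError("recall must be positive to compute the upper bound")
    return n * precision, n / recall


# Made-up values, for illustration only.
low, high = count_interval(n=100, precision=0.8, recall=0.5)
print(low, high)
```

Because precision and recall are at most 1, the lower bound never exceeds n and the upper bound never falls below it.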

  43. Pipeline output: significance test (A, B) and effect-size test (A, B)

  44. Pipeline output: type, topic

  45. Pipeline

  46. Analysis | 3 Experiments | Data Preparation

  47. Experiments: datasets. The New York Times; US Presidential Debates 2016. Ben Wiseman [2016]

  48. Data dimensionality. Language: English. Mode: Written. Communication: Monological. Author: Identity. Audience: U.S. Type: Descriptive, Argumentative. Genre: Editorial, Review, Biography, Debate. Topic: Education, Science, Art, Politics. Medium: Newspaper, Presidential Debates.

  49. NYT Experiment: data subsampling. NYT Dataset subsampling steps: Random (1000 articles), Article-length based (600 articles), Matching (343 articles).
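
Of the three steps, only the random one is specified well enough on the slide to illustrate. A minimal Python sketch, assuming articles are identified by IDs and that a fixed seed is wanted for reproducibility (both assumptions); the criteria for the article-length and matching steps are not given and are therefore not sketched.

```python
import random


def random_subsample(article_ids, k=1000, seed=0):
    """Draw k distinct article IDs uniformly at random, reproducibly."""
    rng = random.Random(seed)  # local generator, leaves global state untouched
    return rng.sample(list(article_ids), k)


# Hypothetical ID range standing in for the full dataset.
subset = random_subsample(range(50_000))
print(len(subset))
```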
