leveraging joint interactions for credibility analysis in
play

Leveraging Joint Interactions for Credibility Analysis in News - PowerPoint PPT Presentation

Leveraging Joint Interactions for Credibility Analysis in News Communities Subhabrata Mukherjee and Gerhard Weikum Max Planck Institute for Informatics CIKM 2015 Motivation Media plays a crucial role in public dissemination of information


  1. Leveraging Joint Interactions for Credibility Analysis in News Communities Subhabrata Mukherjee and Gerhard Weikum Max Planck Institute for Informatics CIKM 2015

  2. Motivation ➢ Media plays a crucial role in public dissemination of information ➢ However, people believe there is substantial media bias in news in view of inter-dependencies and cross-ownerships of media companies and other industries (like energy) ➢ 4 out of 5 Americans among younger generations do not trust major news networks [Gallup poll, 2013] ➢ This work: Credibility Analysis of News Communities

  3. News Community ➢ A news community is a news aggregator site (e.g., reddit.com, digg.com, newstrust.net) where: ➢ Users can give explicit feedback (e.g., rate, review, share) on the quality of news ➢ Interact (e.g., comment, vote) with each other ➢ However, this adds user subjectivity as users incorporate their own bias and perspectives in the framework ➢ Controversial topics create polarization among users which influence their evaluation

  4. Contributions ● A model to capture joint interaction between language , topics , users and sources leading to better prediction than the ones in isolation ● User expertise , source trustworthiness , language objectivity , topical perspective and article credibility mutually reinforce each other ● A supervised Conditional Random Field model that can capture these interactions, and handle real-valued ratings

  5. Example FACTORS C 1 C 2 Source s 1 s 1 Article d 2 d 1 Review r 22 r 11 r 12 u 1 u 2 u 2 User y 1 y 2

  6. Example FACTORS Instantiation C 1 C 2 Alternet.org Source s 1 s 1 (progressive/liberal) Why do conservaties hate your children? Article d 2 d 1 Topic: Climate Review Ratings r 22 r 11 r 12 Discussions u 1 u 2 u 2 User (liberal vs.conservative) y 1 y 2

  7. Example FACTORS FEATURES C 1 C 2 Viewpoint, Expertise Source s 1 s 1 Why do conservaties hate your children? Article d 2 d 1 Topic: Climate Review Ratings r 22 r 11 r 12 Discussions u 1 u 2 u 2 User (liberal vs.conservative) y 1 y 2

  8. Example FACTORS FEATURES C 1 C 2 Viewpoint, Expertise Source s 1 s 1 Emotionality, Discourse Article d 2 d 1 Topic: Climate Review Ratings r 22 r 11 r 12 Discussions u 1 u 2 u 2 User (liberal vs.conservative) y 1 y 2

  9. Example FACTORS FEATURES C 1 C 2 Viewpoint, Expertise Source s 1 s 1 Emotionality, Discourse Article d 2 d 1 Topic Review Ratings r 22 r 11 r 12 Discussions u 1 u 2 u 2 User (liberal vs.conservative) y 1 y 2

  10. Example FACTORS FEATURES C 1 C 2 Viewpoint, Expertise Source s 1 s 1 Emotionality, Discourse Article d 2 d 1 Topic Review Ratings r 22 r 11 r 12 Bias, Viewpoint, u 1 u 2 u 2 User Expertise y 1 y 2

  11. Task FACTORS ATTRIBUTES C 1 C 2 Trustworthiness Source s 1 s 1 Objectivity Article d 2 d 1 Credibility Review r 22 r 11 r 12 u 1 u 2 Expertise u 2 User Article Credibility y 1 y 2 Rating?

  12. Credibility Analysis ➢ Given a set of news sources generating news articles, and users reviewing them on different qualitative aspects with mutual interactions: ➢ Jointly rank the sources , articles , and users based on their trustworthiness , credibility ,and expertise

  13. Credibility of Statements in Health Communities [S. Mukherjee et al.: KDD‘14]

  14. Language Features C 1 C 2 Source s 1 s 1 Objectivity Article d 2 d 1 Review r 22 r 11 r 12 A s s e r t i v e s , F a c t i v e s , H e d g e s , I m p l i c a t i v e s , R e p o r t , D i s c o u r s e , S u b j e c t i v i t y e t c . u 1 u 2 User u 2 y 1 y 2 1. M. Recasens, C. Danescu-Niculescu-Mizil, and D. Jurafsky. Linguistic models for analyzing and detecting biased language. In ACL, 2013. 2. S. Mukherjee, G. Weikum, and C. Danescu-Niculescu-Mizil. People on drugs: Credibility of user statements in health communities. KDD, 2014.

  15. Topic Features ➢ Only 33% of the articles have explicit tags ➢ Use Latent Dirichlet Allocation to learn the latent topic distribution in the corpus of news articles

  16. Source Features

  17. User Features Category Elements Engagement answers, ratings (given / received), comments etc. Agreement Inter-user agreement Topics perspective and expertise Interactions user-user, user-item, user-source

  18. Given a factor, with its features, use Support Vector Regression to learn a model that will predict its rating for an article. C 1 C 2 Source Models Source s 1 s 1 Article Language Article d 2 d 1 Model, Topic Model Review r 22 Review Language r 11 r 12 Model, Topic Model User Models u 1 u 2 User u 2 How to Article Credibility y 1 y 2 aggregate? Rating?

  19. Conditional Random Field Probability Mass Function for discrete labels: Probability Density Function for continuous ratings:

  20. Energy Function Clique potential User Potential Topic Potential Language Potential Source Potential Clique: source, article, <users>, <reviews>

  21. partitions the user space user expertise error of predictor SVR

  22. Energy Function language objectivity source trustworthiness topical perspective

  23. The joint p.d.f is a multivariate gaussian distribution Σ needs to be positive definite for inverse to exist → {α, β, γ} > 0 Makes sense: predictor reliability should be positive

  24. Constrained optimization problem. Gradient ascent cannot be directly used. Maximize log-likelihood with respect to log λ k instead of λ k Prediction is the expected value of the function given by the mean of the Multivariate Gaussian distribution:

  25. Experiments: NewsTrust Data available at: http://www.mpi-inf.mpg.de/impact/credibilityanalysis/

  26. Predicting User Ratings Users, Articles, Ratings +Time +Review Text +Review Text and Interactions 1. Y. Koren. Factorization meets the neighborhood: A multifaceted collaborative filtering model. KDD, 2008. 2. J. McAuley and J. Leskovec. Hidden factors and hidden topics: Understanding rating dimensions with review text. RecSys, 2013. 3. J. J. McAuley and J. Leskovec. From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. In WWW, 2013.

  27. Predicting Article Credibility Ratings

  28. Predicting Article Credibility Ratings

  29. Predicting Article Credibility Ratings

  30. Predicting Article Credibility Ratings

  31. Ranking Trustworthy News Sources Ranking Expert Users:

  32. Sample Output: Most and Least Trust Sources on Sample Topics

  33. Conclusions ➢ Joint interaction between language , topics , users and sources lead to better prediction in multiple tasks ➢ User expertise , source trustworthiness , language objectivity , topical perspective and article credibility mutually reinforce each other

  34. Ongoing Work ➢ Analyze temporal evolution of these factors ➢ Communities are inherently dynamic in nature ➢ Source trustworthiness, and user expertise change with time ➢ To this end we propose an Experience-aware Item Recommendation for Evolving Review Communities, ICDM 2015.

Recommend


More recommend