a text based model of foreign affairs sentiment
play

A text-based model of foreign-affairs sentiment Sean Gerrish and - PowerPoint PPT Presentation

A text-based model of foreign-affairs sentiment Sean Gerrish and David Blei Princeton University Computer Science 17 December 2011 These news articles tell a story. A spatial model of foreign relations sentiment This work develops a model of


  1. A text-based model of foreign-affairs sentiment Sean Gerrish and David Blei Princeton University Computer Science 17 December 2011

  2. These news articles tell a story.

  3. A spatial model of foreign relations sentiment This work develops a model of the sentiment between countries over time. • It models dynamic relationships in an interpretable way • It infers sentiment from printed media • Sentiment is defined by Mechanical Turkers

  4. A spatial model of foreign relations sentiment To do this, our plan is to: • Collect a bunch of newspaper articles • Define a latent variable model to capture interesting structure in these articles • Perform posterior inference to estimate the value of these random variables

  5. Countries take latent positions ¯ x ct over time x 2 1 0 Time -1 -2 x c , t − 1 , σ 2 x c , t | ¯ ¯ x c , t − 1 ∼ N (¯ K )

  6. The relationship between countries is observed in the news. x 2 1 0 Time -1 -2 x c 1 , t , σ 2 x c 1 , d ∼ N (¯ D ) x c 2 , t , σ 2 x c 2 , d ∼ N (¯ D ) := x c 1 , dT x c 2 , d Sentiment s d

  7. The relationship between countries is observed in the news. x 2 1 0 Time -1 -2 x c 1 , t , σ 2 x c 1 , d ∼ N (¯ D ) x c 2 , t , σ 2 x c 2 , d ∼ N (¯ D ) := x c 1 , dT x c 2 , d Sentiment s d

  8. The relationship between countries over time Regularization zero zero zero x x x Latent position c,t-1 c,t c,t+1 C Position during x x x c,t-1 c,t c,t+1 interaction 2 2 2 β s s s Sentiment w w w Observed text D D D t-1 t t+1

  9. Labeling sentiment 1. We found all pairs of paragraphs from the New York Times which discussed exactly two countries 2. A random sample of 3607 paragraphs from New York Times articles from 1988 to 2008 were labeled by Amazon Mechanical Turk workers 3. Raters rated news articles on the scale − 5 , − 3 , − 1 , 1 , 3 , 5

  10. Labeling sentiment: typical task

  11. Sentiment and news articles: text regression s d = w T d β + ε • w d ∈ R V is the text of a news paragraph • s d ∈ R is the sentiment between two countries • β ∈ R V is the “weight” of each word

Recommend


More recommend