Online Collaborative Prediction of Regional Vote Results
Vincent Etter, Emtiyaz Khan, Matthias Grossglauser, Patrick Thiran
DSAA, October 17, 2016, Montréal, Canada

Data Opportunity
• Many countries adopt open government initiatives
• Several datasets published
  • Demographics
  • State affairs
  • Votes and elections
• Unique opportunity
  • Get a better understanding
  • Build tools useful to others

Voting Data
• News agencies, political parties, and polling institutes are all interested in understanding voting behaviors
  • Will the next vote pass easily?
  • What makes two regions vote similarly?
  • Where should we focus our efforts?

Dataset
• Vote results from Switzerland
  • Issue votes between 1981 and 2014
  • Outcome (% of "yes") at the municipality level
• 281 votes
  • 13 features: voting recommendation of the main parties
• 2352 regions
  • 25 features: languages spoken, demographics, etc.
• Data available at http://vincent.etter.io/dsaa16

Similarities Between Results

Online Predictions
• On the day of the vote, regional results are released in sequence
• Use published results to predict others
• … and refine the prediction as more results are published?

Our Approach
• Use a matrix-factorization model to capture the bi-clustering
• Add region and vote features
  • Reduce the cold-start problem
  • More interpretable
• Build the model incrementally to assess the effect of each component

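Our Model
$$y_{dn} = z_{dn} + \epsilon$$
$$z_{dn} = \underbrace{\mu_n}_{\text{bias}} + \underbrace{f_n(x_d)}_{\text{regression on region}} + \underbrace{f_d(w_n)}_{\text{regression on vote}} + \underbrace{v_d^T u_n}_{\text{matrix factorization}}$$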
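To make the bias and matrix-factorization terms concrete, here is a minimal sketch that fits $\mu_n$, $v_d$ and $u_n$ by stochastic gradient descent on the squared error with L2 penalties. The slides do not say how the authors train the model, so the optimizer, the learning rate, the rank, the function names `fit_mf` and `mse`, and the use of outcomes expressed as fractions in [0, 1] are all assumptions made for illustration. It takes d to index regions and n to index votes, as the feature names $x_d$ (region) and $w_n$ (vote) suggest.

```python
import random
from typing import Dict, List, Tuple

Obs = Dict[Tuple[int, int], float]  # (region d, vote n) -> outcome as a fraction


def predict(mu: List[float], v: List[List[float]], u: List[List[float]],
            d: int, n: int) -> float:
    """z_dn = mu_n + v_d^T u_n (bias plus matrix-factorization term)."""
    return mu[n] + sum(a * b for a, b in zip(v[d], u[n]))


def mse(obs: Obs, mu, v, u) -> float:
    """Mean squared error over the observed (region, vote) pairs."""
    return sum((y - predict(mu, v, u, d, n)) ** 2
               for (d, n), y in obs.items()) / len(obs)


def fit_mf(obs: Obs, n_regions: int, n_votes: int, rank: int = 2,
           lam_u: float = 0.1, lam_v: float = 0.1,
           lr: float = 0.05, epochs: int = 300, seed: int = 0):
    """Fit mu, v, u by SGD (an assumed training scheme, not the authors')."""
    rng = random.Random(seed)
    v = [[rng.gauss(0.0, 0.1) for _ in range(rank)] for _ in range(n_regions)]
    u = [[rng.gauss(0.0, 0.1) for _ in range(rank)] for _ in range(n_votes)]
    mu = [0.0] * n_votes
    items = list(obs.items())
    for _ in range(epochs):
        rng.shuffle(items)
        for (d, n), y in items:
            err = y - predict(mu, v, u, d, n)
            mu[n] += lr * err  # bias update
            for k in range(rank):
                v_dk, u_nk = v[d][k], u[n][k]
                # Gradient steps with L2 shrinkage on both factor vectors.
                v[d][k] += lr * (err * u_nk - lam_v * v_dk)
                u[n][k] += lr * (err * v_dk - lam_u * u_nk)
    return mu, v, u
```

Calling `fit_mf(obs, n_regions, n_votes)` on a dictionary of observed outcomes returns the three parameter sets, and `predict` can then fill in any unobserved (region, vote) pair.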

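Our Models
General form: $z_{dn} = \mu_n + f_n(x_d) + f_d(w_n) + v_d^T u_n$
• LIN(r): $z_{dn} = \mu_n + \beta_n^T x_d$
• LIN(v): $z_{dn} = \mu_n + \gamma_d^T w_n$
• LIN(r) + LIN(v): $z_{dn} = \mu_n + \beta_n^T x_d + \gamma_d^T w_n$
• MF: $z_{dn} = \mu_n + v_d^T u_n$
• MF + LIN(r): $z_{dn} = \mu_n + \beta_n^T x_d + v_d^T u_n$
• MF + GP(r): $z_{dn} = \mu_n + \mathrm{GP}(x_d) + v_d^T u_n$
• MF + GP(r) + LIN(v): $z_{dn} = \mu_n + \mathrm{GP}(x_d) + \gamma_d^T w_n + v_d^T u_n$
Hyperparameters listed on the slide: $\lambda_\beta, \lambda_\gamma, \lambda_u, \lambda_v$ and $\theta, \sigma_s, \lambda_\gamma$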
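The linear and matrix-factorization variants above differ only in which terms they include, which can be sketched as a single function with optional arguments. The function name `latent_score` and its keyword arguments are hypothetical, chosen to mirror the symbols on the slide. The GP(r) term is not covered here, since the slides do not specify its kernel.

```python
from typing import Optional, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return sum(ai * bi for ai, bi in zip(a, b))


def latent_score(
    mu_n: float,                                # per-vote bias
    x_d: Optional[Sequence[float]] = None,      # region features
    beta_n: Optional[Sequence[float]] = None,   # per-vote weights, LIN(r)
    w_n: Optional[Sequence[float]] = None,      # vote features
    gamma_d: Optional[Sequence[float]] = None,  # per-region weights, LIN(v)
    v_d: Optional[Sequence[float]] = None,      # region latent factors, MF
    u_n: Optional[Sequence[float]] = None,      # vote latent factors, MF
) -> float:
    """Sum the bias and whichever components are supplied."""
    z = mu_n
    if x_d is not None and beta_n is not None:
        z += dot(beta_n, x_d)    # LIN(r): beta_n^T x_d
    if w_n is not None and gamma_d is not None:
        z += dot(gamma_d, w_n)   # LIN(v): gamma_d^T w_n
    if v_d is not None and u_n is not None:
        z += dot(v_d, u_n)       # MF: v_d^T u_n
    return z
```

For example, passing only `mu_n`, `x_d` and `beta_n` gives the LIN(r) score, while adding `v_d` and `u_n` gives MF + LIN(r).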

Performance Evaluation
• Last 50 votes as test data
• Simulate 500 random reveal orders
  • Last 10% of regions as test regions
  • Observe increasing number of regions
  • Predict result of test regions
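One simulated reveal order from this protocol can be sketched as follows. The names `simulate_reveal`, `mean_predictor` and `rmse` are hypothetical, and `mean_predictor` is only a stand-in baseline (the mean of the results observed so far), not one of the models from the slides. A full evaluation would repeat this for each test vote and each of the 500 random orders.

```python
import math
import random
from typing import Callable, Dict, List

# A predictor maps (observed results, test region ids) to predicted results.
Predictor = Callable[[Dict[int, float], List[int]], Dict[int, float]]


def mean_predictor(observed: Dict[int, float], targets: List[int]) -> Dict[int, float]:
    """Stand-in baseline: predict the mean of the results observed so far."""
    mean = sum(observed.values()) / len(observed)
    return {d: mean for d in targets}


def rmse(pred: Dict[int, float], truth: Dict[int, float]) -> float:
    """Root mean squared error over the predicted regions."""
    return math.sqrt(sum((pred[d] - truth[d]) ** 2 for d in pred) / len(pred))


def simulate_reveal(results: Dict[int, float], predictor: Predictor,
                    rng: random.Random, test_fraction: float = 0.1) -> List[float]:
    """Shuffle the regions, hold out the last 10% as test regions, and
    record the test RMSE after each newly revealed region."""
    order = list(results)  # region ids
    rng.shuffle(order)
    n_test = max(1, round(test_fraction * len(order)))
    revealed, test = order[:-n_test], order[-n_test:]
    truth = {d: results[d] for d in test}
    observed: Dict[int, float] = {}
    errors = []
    for d in revealed:
        observed[d] = results[d]  # one more region publishes its result
        errors.append(rmse(predictor(observed, test), truth))
    return errors
```

Averaging the returned error curves over many random orders gives an RMSE curve as a function of the number of observed regions.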

Results
[Figure: RMSE on the last 10% of regions [%] versus number of observed regions (log scale, 10^0 to 10^3). Curves: LIN(r), MF, MF + LIN(r).]

Bayesian vs. Non-Bayesian
[Figure: RMSE on the last 10% of regions [%] versus number of observed regions (log scale, 10^0 to 10^3). Curves: MF + LIN(r), MF + GP(r).]

Final Model
[Figure: RMSE on the last 10% of regions [%] versus number of observed regions (log scale, 10^0 to 10^3). Curves: LIN(v), MF + GP(r), MF + GP(r) + LIN(v).]

Interpretation
[Figure: relative importance (0.0 to 1.0) of the 25 region features, labeled "Röstigraben". Features shown: x, y, Election CVP, Election BDP, Election SVP, Age 20-64, Election other right, Election PST, Elevation, Election SP, Age 0-19, Election Greens, Election GL, Election FDP, Election PEV, Foreigners, Social aid, Age 65+, Speaks French, Population density, Speaks Romansh, Jobs, Population, Speaks German, Speaks Italian.]

Summary
• Individual models have different strengths
  • Vote features regression for cold start
  • Region features and bi-clustering when more observations
• Bayesian methods are useful
  • Proper hyperparameter setting
  • Accurate and interpretable results

Thank you!
Code and data available at http://vincent.etter.io/dsaa16
Any questions?
