Beating the bookie A look at statistical models for prediction of - PowerPoint PPT Presentation

Beating the bookie A look at statistical models for prediction of football matches Helge Langseth Norwegian University of Science and Technology SCAI 2013 1 Helge Langseth Beating the bookie

Building a model for match outcomes Suppose we want to build a model to predict the outcomes of games from the English Premier League: 20 teams , all play each other twice during a season. Each team plays 38 matches, 380 games per season in total. The quality is measured by the systems ability to win bets . A bet (e.g., “Liverpool to win” ) is offered with odds ω . The model generates the corresponding probability p . A bet is only rational whenever the expected gain is positive , i.e., p · ω ≥ 1 . Accurate predictions imply a useful betting agent, thus our goal is to generate good probability estimates for upcoming games based on the history of the season so far . 2 Helge Langseth Beating the bookie

Maher (1982) An early attempt at building a statistical model: X ij ∼ Poisson ( k · λ · α i β j ) , where: X ij is no. goals scored by Team i vs. Team j playing at home. k captures the home-team advantage. λ is a normalization constant. α i is the attacking strength of Team i . β j is the defending strength of Team j . Y ij ∼ Poisson ( λ · α j β i ) ; Y ij is no. goals scored by Team j . Crucially — and surprisingly — he assumes X ij ⊥ ⊥ Y ij | Model . The model is under-specified, so he requires avg ℓ ( α ℓ ) = avg ℓ ( β ℓ ) = 1 . 3 Helge Langseth Beating the bookie

Maher (1982) An early attempt at building a statistical model: X ij ∼ Poisson ( k · λ · α i β j ) , where: X ij is no. goals scored by Team i vs. Team j playing at home. k captures the home-team advantage. λ is a normalization constant. α i is the attacking strength of Team i . β j is the defending strength of Team j . Y ij ∼ Poisson ( λ · α j β i ) ; Y ij is no. goals scored by Team j . α i α j β i β j X ij Y ij k λ 3 Helge Langseth Beating the bookie

Predictions from the model We predict the result of the game between Team k and Team ℓ by looking at the probability distributions for X kℓ and Y kℓ . The maximum likelihood parameters for the abilities of the two best teams in the Premier League’s after 11 rounds, are: After 11 games Attack Defence Arsenal 1 . 4 0 . 9 Liverpool 1 . 4 0 . 8 We can use these parameters (plus ˆ k and ˆ λ ) to find, e.g., � � P X Liv , Ars > Y Liv , Ars . 4 Helge Langseth Beating the bookie

Predictions from the model We predict the result of the game between Team k and Team ℓ by looking at the probability distributions for X kℓ and Y kℓ . The maximum likelihood parameters for the abilities of the two best teams in the Premier League’s after 11 rounds, are: After 5 games After 11 games Attack Defence Attack Defence Arsenal 1 . 5 1 . 2 1 . 4 0 . 9 Liverpool 1 . 0 0 . 7 1 . 4 0 . 8 Abilities change over time, so we need a dynamic model!! 4 Helge Langseth Beating the bookie

Adding dynamics We follow, e.g., Rue & Salvesen (2000) and introduce dynamics at the “strength-level” : Let α ( t ) be the attack-strength for Team i at time t . i Then, α ( t ) � α ( t +∆ t ) is a random walk with st.dev. τ · ∆ t . i i Similarly for the defence-strength, β ( t ) i . One HMM/KF -structured model per ability: Latent and time-varying strengths; partially disclosed through goal-model. Assume we observe the result when Team i and Team j : The chains of these teams get correlated . Similarly, the strengths of all teams Team i and Team j have played previously get correlated, too! We use Markov Chain Monte Carlo to find estimators for the model parameters, and sample results for unseen matches. 5 Helge Langseth Beating the bookie

Adding dynamics We follow, e.g., Rue & Salvesen (2000) and introduce dynamics at the “strength-level” : Let α ( t ) be the attack-strength for Team i at time t . i Then, α ( t ) � α ( t +∆ t ) is a random walk with st.dev. τ · ∆ t . i i Similarly for the defence-strength, β ( t ) i . τ α ( t ) β ( t ) α ( t ) β ( t ) i i j j X ij Y ij k λ 5 Helge Langseth Beating the bookie

Looking behind the results Estimated defensive strength for Arsenal over the 2011-12 season. Small margins can significantly influence the result of a game. This inherit randomness makes the estimation of α ( t ) and β ( t ) i i difficult, as the “signal-to-noise-ratio” is typically small. More data, that “look behind the result” , e.g., No. chances created Shot statistics: On target, off target, hitting wood-work Passing accuracy . . . can be useful to uncover the teams’ underlying abilities . 6 Helge Langseth Beating the bookie

Data-intensive model Here we use: λ H i λ H i : Chance creation rate; home C ij : Number of chances . F ij : Number of shots . C ij X ij : Number of goals . β ( t ) α ( t ) j ℓ : The attacking strength. F ij β ( t ) ℓ : The defensive strength. α ( t ) γ ( t ) γ ( t ) ℓ : The goalkeeper strength. j i τ : The scaler in the step-size of X ij τ the random walk for the abilities. 7 Helge Langseth Beating the bookie

Money management Consider a bet with offered odds ω and estimated winning probability p . We require the expected gain to be non-negative , i.e., p · ω ≥ 1 . Consider the two bet-options Bet A: ω A = 11 . 0 , p A = 0 . 1 . Bet B: ω B = 1 . 10 , p B = 1 . 0 . Both bets have the same expected return of 1 . 1 unit per unit staked, but obviously Bet B is preferable. It is important to consider money management carefully! Many strategies exist, we have considered, e.g., Fixed Bet , Fixed Return , Kelly’s Rule and Rue’s Variance Adjustment . 8 Helge Langseth Beating the bookie

Results Premier League 2011-2012 Model Fixed Bet Fixed Return Kelly Var. Adjust Static 17.4% 17.4% 23 . 2 % 15 . 6 % Dynamic 22 . 7 % 14.3% 21.3% 12.0% DataIntensive 20.3% 24 . 2 % 23.0% 14.3% Premier League 2012-2013 Model Fixed Bet Fixed Return Kelly Var. Adjust Static -23.7% -24.9% -27.8% -21.2% Dynamic -17.1% -20.0% -22.9 % -15.9% DataIntensive − 6 . 3 % − 0 . 7 % − 3 . 4 % 0 . 4 % 2011-2012: Results are non-conclusive, but DataIntensive combined with FixedReturn gives the best result. 2012-2013: Only DataIntensive combined with Variance Adjustment beats the bookie. 9 Helge Langseth Beating the bookie

Future work Although we are looking at betting agents , and not simple classifiers , improving prediction quality is beneficial: Build models that incorporate more game-information ; data can be harvested, e.g., from http://www.whoscored.com/ . Combine the ensemble of different candidate models into one prediction-engine. Utilize pre-game information about line-ups to enhance the predictions. Generate results from more leagues – aiming to understand why some leagues are easier to generate profits from than others. Replace MCMC simulations with fast approximate Bayesian inference based on variational approximations . 10 Helge Langseth Beating the bookie

Beating the bookie A look at statistical models for prediction of - PowerPoint PPT Presentation

Beating the bookie A look at statistical models for prediction of football matches Helge Langseth Norwegian University of Science and Technology SCAI 2013 1 Helge Langseth Beating the bookie Building a model for match outcomes Suppose we

Beating The Best - The Santander Bank Kaggle Beating The Best - The Santander Bank Kaggle Beating

Beating the No Win Scenario Joe DeVivo @joedevivo Tuesday, 26 March 2013 Beating the No Win

Arbitrage Pricing What is an Equivalent Martingale Measure, and why should a bookie care?

Phototaxis in Volvox 18.S995 - L28 the beating of thousands of flagellated cells despite the

-Beating, dispersion and coupling correction in the LHC R. Toms, R. Calaga, O. Bruning, S.

The Stochastic Matching Problem: Beating Half with a Non-Adaptive Algorithm Sepehr Assadi

DEALING WITH ALIASING USING DEALING WITH ALIASING USING CONTRACTS CONTRACTS BEATING FORTRAN'S

Lecture 4: Spectrum Mark Hasegawa-Johnson ECE 401: Signal and Image Analysis, Fall 2020 Beating

Beating Stress into L2 Learners: Can Beat Gesture Awareness Impact Learning English ? Jeremy

Beating the Heat, Fighting the Bite, and More: Satellite Data Applications for Public Health in VT

Measurement and Correction of Beta Beating in the Fermilab Booster Meghan McAteer The University

Beating I/I With Effective 21st Century Sewer Rehabilitation Good Ideas And Successful

STOWA WORKSHOP BEATING MICROPOLLUTANTS Wastewater Treatment Plants Full scale plants and

Bad Sushi: Beating Phishers at their Own Game Nitesh Dhanjani & Billy Rios [BlackHat DC 2008]

BEATING THE TRAFFIC JAM using embedded devices, OPC UA, Akka, NoSQL Kristoffer Dyrkorn

Beating the odds Opportunities in demanding times May 2020 Agenda WHERE ARE WE IN THE ECONOMIC

D3 Tutorial Force-directed Layout Edit by Jiayi Xu and Han-Wei Shen, The Ohio State University

Drainage Installation and Inspection Hank Gottschalk Concrete Pipe & Precast, LLC Buried

JUST THE MATHS SLIDES NUMBER 19.1 PROBABILITY 1 (Definitions and rules) by A.J.Hobson

Measuring password strength by simulating password-cracking algorithms Saranga Komanduri Patrick

How to lift heavy things: a nerds guide to powerlifting Outline Intro to strength

Boosting Can we make dumb learners smart? Aarti Singh Machine Learning 10-701/15-781 Oct 11,

Better Habits for Healthier Backs Better Habits for Healthier Backs Protect Your Back with

Welcome! We cannot solve our problems with the same thinking we used when we created them. ~

Sambuz

Useful Links

Newsletter

Mail Us

Beating the bookie A look at statistical models for prediction of - PowerPoint PPT Presentation

Beating the bookie A look at statistical models for prediction of football matches Helge Langseth Norwegian University of Science and Technology SCAI 2013 1 Helge Langseth Beating the bookie Building a model for match outcomes Suppose we

Beating The Best - The Santander Bank Kaggle Beating The Best - The Santander Bank Kaggle Beating

Beating the No Win Scenario Joe DeVivo @joedevivo Tuesday, 26 March 2013 Beating the No Win

Arbitrage Pricing What is an Equivalent Martingale Measure, and why should a bookie care?

Phototaxis in Volvox 18.S995 - L28 the beating of thousands of flagellated cells despite the

-Beating, dispersion and coupling correction in the LHC R. Toms, R. Calaga, O. Bruning, S.

The Stochastic Matching Problem: Beating Half with a Non-Adaptive Algorithm Sepehr Assadi

DEALING WITH ALIASING USING DEALING WITH ALIASING USING CONTRACTS CONTRACTS BEATING FORTRAN'S

Lecture 4: Spectrum Mark Hasegawa-Johnson ECE 401: Signal and Image Analysis, Fall 2020 Beating

Beating Stress into L2 Learners: Can Beat Gesture Awareness Impact Learning English ? Jeremy

Beating the Heat, Fighting the Bite, and More: Satellite Data Applications for Public Health in VT

Measurement and Correction of Beta Beating in the Fermilab Booster Meghan McAteer The University

Beating I/I With Effective 21st Century Sewer Rehabilitation Good Ideas And Successful

STOWA WORKSHOP BEATING MICROPOLLUTANTS Wastewater Treatment Plants Full scale plants and

Bad Sushi: Beating Phishers at their Own Game Nitesh Dhanjani &amp; Billy Rios [BlackHat DC 2008]

BEATING THE TRAFFIC JAM using embedded devices, OPC UA, Akka, NoSQL Kristoffer Dyrkorn

Beating the odds Opportunities in demanding times May 2020 Agenda WHERE ARE WE IN THE ECONOMIC

D3 Tutorial Force-directed Layout Edit by Jiayi Xu and Han-Wei Shen, The Ohio State University

Drainage Installation and Inspection Hank Gottschalk Concrete Pipe &amp; Precast, LLC Buried

JUST THE MATHS SLIDES NUMBER 19.1 PROBABILITY 1 (Definitions and rules) by A.J.Hobson

Measuring password strength by simulating password-cracking algorithms Saranga Komanduri Patrick

How to lift heavy things: a nerds guide to powerlifting Outline Intro to strength

Boosting Can we make dumb learners smart? Aarti Singh Machine Learning 10-701/15-781 Oct 11,

Better Habits for Healthier Backs Better Habits for Healthier Backs Protect Your Back with

Welcome! We cannot solve our problems with the same thinking we used when we created them. ~

Sambuz

Useful Links

Newsletter

Mail Us

Bad Sushi: Beating Phishers at their Own Game Nitesh Dhanjani & Billy Rios [BlackHat DC 2008]

Drainage Installation and Inspection Hank Gottschalk Concrete Pipe & Precast, LLC Buried