sparsity with multi type lasso regularized glms
play

Sparsity with multi-type Lasso regularized GLMs Sander Devriendt - PowerPoint PPT Presentation

Sparsity with multi-type Lasso regularized GLMs Sander Devriendt (email: sander.devriendt@kuleuven.be) Joint work with K. Antonio, T. Reynkens, E. Frees, R. Verbelen eRum 2018, Budapest May 15, 2018 Motivation 2 Claim frequency and claim


  1. Sparsity with multi-type Lasso regularized GLMs Sander Devriendt (email: sander.devriendt@kuleuven.be) Joint work with K. Antonio, T. Reynkens, E. Frees, R. Verbelen eRum 2018, Budapest May 15, 2018

  2. Motivation 2 Claim frequency and claim severity as function of nominal / numeric ∼ ordinal / spatial features Sparse modeling with multi-type variables – Sander Devriendt

  3. Research questions 3 ◮ Generalized Linear Models (GLMs) for frequency ( ∼ Poisson) and severity ( ∼ Gamma). ◮ How to: (1) select variables or features? (2) cluster (or bin or fuse) levels within a variable? age groups / postal code clusters / clusters of car models ◮ Procedure should be data driven, scalable to large (big) data. ◮ End product is interpretable, within actuarial comfort zone. Sparse modeling with multi-type variables – Sander Devriendt

  4. Research questions rephrased 4 ◮ Generalized Linear Models (GLMs) for frequency ( ∼ Poisson) and severity ( ∼ Gamma). ◮ How to: (1) avoid overfitting with too many variables or levels? (2) avoid underfitting with a priori binning/selection? Sparse modeling with multi-type variables – Sander Devriendt

  5. A stepwise solution 5 Henckaerts, Antonio et al., 2018 (Scandinavian Actuarial Journal) Stepwise procedure 1 Do an exhaustive search through variables to find best GAM model. 2 Use well-chosen clustering algorithm to bin 2D spatial effect. Use evolutionary trees to bin 1D continuous effects and interactions. 3 Fit GLM with bins and clusters obtained in previous steps. 4 R packages: mgcv , classInt , evtree , rpart Sparse modeling with multi-type variables – Sander Devriendt

  6. 250 250 ^ GLM f 200 4 200 coefficients 0.5 −0.07 150 150 power power −0.021 0.0 0 100 100 0.035 −0.5 50 50 0.064 0 0 25 50 75 25 50 75 ageph ageph GLM ^ f 5 coefficients −0.329 0.2 −0.204 0.0 −0.155 −0.2 0 −0.4 0.199 Sparse modeling with multi-type variables – Sander Devriendt

  7. Sparsity with multi-type Lasso regularized GLMs Devriendt, Antonio, Reynkens, Frees, Verbelen, 2018 (in progress)

  8. Regularization 8 ✞ ☎ Standard GLM ✝ ✆ fit data as good as possible, no constraint on parameters. �    � ✞ ☎ Regularized GLM ✝ ✆ tradeoff between fit and interpretability/sparsity/stability, constraint on parameters. Sparse modeling with multi-type variables – Sander Devriendt

  9. Lasso 9 ◮ Less is more: (Hastie, Tibshirani & Wainwright, 2015) a sparse model is easier to estimate and interpret than a dense model. ◮ Regularize (with budget constraint t , or regularization parameter λ ): min β 0 , β {−L ( β 0 , β ) } subject to � β � 1 ≤ t , or equivalenty   p   � min  −L ( β 0 , β ) + λ · | β j |  . β 0 , β j =1 Shrinks coefficients and even sets some to zero. Sparse modeling with multi-type variables – Sander Devriendt

  10. Lasso visualization 10 Regularization = limited budget for β 1 , β 2 , β 3 . ‘Statistical Learning with Sparsity’ - Hastie et al. (2015) Sparse modeling with multi-type variables – Sander Devriendt

  11. Lasso plot 11 Package glmnet overfitting ← − − → underfitting λ 0.2 0.1 Coordinates of β 0.0 −0.1 −0.2 0 5 10 15 λ Sparse modeling with multi-type variables – Sander Devriendt

  12. Lasso and friends 12 ◮ Adjust lasso regularization to the type of variable: • Determine type (nominal / numeric ∼ ordinal / spatial); • Allocate logical penalty. ◮ Thus, for J variables, each with regularization term P j ( . ), we want to optimize: J � −L ( β 1 , . . . , β J ) + λ · P j ( β j ) . j =1 Sparse modeling with multi-type variables – Sander Devriendt

  13. Lasso and friends: visualization 13 Different variable type → different penalty budget. ‘Statistical Learning with Sparsity’ - Hastie et al. (2015) Sparse modeling with multi-type variables – Sander Devriendt

  14. Fused Lasso 14 Package genlasso overfitting ← − λ − → underfitting ordinal penalty example 0.20 var 1 var 6 var 2 var 7 var 3 var 8 var 4 var 9 0.15 var 5 var 10 Coordinates of β 0.10 0.05 0.00 −0.05 0 5 10 15 20 λ Sparse modeling with multi-type variables – Sander Devriendt

  15. Generalized Fused Lasso 15 Package genlasso overfitting ← − λ − → underfitting nominal penalty example 0.20 var 1 var 6 var 2 var 7 var 3 var 8 var 4 var 9 0.15 var 5 var 10 Coordinates of β 0.10 0.05 0.00 −0.05 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 λ Sparse modeling with multi-type variables – Sander Devriendt

  16. Unified GLM framework with multiple type of penalties 16 ◮ Gertheiss & Tutz (2010) and Oelker & Gertheiss (2017): • GLMs with various penalties. • R package available: gvcm.cat (not maintained). ◮ Uses local quadratic approximations of penalties and PIRLS: • non-exact selection or fusion; • computationally intensive. Sparse modeling with multi-type variables – Sander Devriendt

  17. Unified GLM framework with multiple type of penalties 17 ◮ Our contribution: • implements an efficient algorithm (with proximal operators); - code bottleneck in C++ ( Rcpp ) - efficient linear algebra ( RcppArmadillo ) - parallel computations ( parallel ) • scalable to big data (splits into smaller sub-problems); • flexible regularization - penalty takes type of variable into account; - works for all popular penalties; ⇒ Package under construction. Sparse modeling with multi-type variables – Sander Devriendt

  18. Case study: MTPL data 18 ◮ Frequency (and severity) information for n = 163 , 234 policyholders. ◮ 14 variables: binary, ordinal and nominal. ◮ Exposure modeled as offset. ◮ Fit Poisson GLM for frequency data with different penalties. • N i ∼ Poisson( µ i ) • log( µ i ) = log(exposure i ) + β 0 + � 14 j =1 X j β j • O ( β ) = −L ( β 0 , β 1 , . . . , β 14 ) + λ · � 14 j =1 P j ( β j ) Sparse modeling with multi-type variables – Sander Devriendt

  19. Case study: MTPL data 19 Payment Frequency 0.30 0.25 0.20 Parameters 0.15 0.10 0.05 0.00 1 10 100 1000 10000 Lambda Sparse modeling with multi-type variables – Sander Devriendt

  20. Case study: MTPL data 20 Age parameters 0.5 0.4 0.3 Parameter value 0.2 0.1 0.0 −0.1 −0.2 20 30 40 50 60 70 80 90 Lambda = 1 Age Sparse modeling with multi-type variables – Sander Devriendt

  21. Case study: MTPL data 21 ◮ Settings: • Incorporate adaptive (GLM) and standardization weights for better consistency and predictive performance. • Tune λ with out-of-sample MSE (ˆ λ = 380) ◮ Re-estimate the final sparse GLM with standard GLM routines (from 164 to 38 params.). Sparse modeling with multi-type variables – Sander Devriendt

  22. MTPL claim frequency with multiple type of penalties 22 1.0 ● ● ● 0.4 ● ● ● ● ● ● ● ● ● ● ● 0.5 ● ● 0.2 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● −0.5 −0.2 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 20 30 40 50 60 70 80 90 50 100 150 Age Power (kW) 1.0 0.5 ● ● 0.6 ● ● ● ● ● ● ● ● ● ● ● ● 0.0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 0.2 ● ● ● ● ● ● ● −0.5 ● ● ● ● ● ● ● ● ● ● ● −0.2 ● ● ● ● 0 5 10 15 20 0 5 10 15 20 25 Bonus−Malus scale Car age GAM fit, penalized GLM fit, GLM refit with new clusters. Sparse modeling with multi-type variables – Sander Devriendt

  23. MTPL claim frequency with multiple type of penalties 23 0.6 Parameter estimates 0.4 ● 0.2 ● ● ● ● 0.0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● −0.2 sex use fuel sport fleet monovolume 4x4 0.3 ● Parameter estimates ● ● ● ● ● ● ● 0.1 ● ● ● −0.1 ● ● ● ● payfreq2 payfreq3 payfreq4 coverage2 coverage3 GAM fit, penalized GLM fit, GLM refit with new clusters. Sparse modeling with multi-type variables – Sander Devriendt

  24. Wrap-up 24 ◮ Less is more. ◮ Flexible regularization can help predictive modeling. ◮ R package combines general framework with efficient algorithm. ◮ Package and working paper to be finalized. Sparse modeling with multi-type variables – Sander Devriendt

Recommend


More recommend