Model Validation: The Modeler's Perspective

  1. Model Validation: The Modeler's Perspective. Amber Popovitch, FCAS. CAS RPM Seminar, March 2012

  2. Disclaimer: The views expressed in this presentation are those of the author and do not necessarily reflect the views of The Travelers Companies, Inc. or any of its subsidiaries. This presentation is for general informational purposes only.

  3. What Is Model Validation? From a modeler's perspective, there are two parts:
     • Model Building
       – Have I chosen the right model? (e.g. are the assumptions valid?)
       – Have I selected the right variables?
       – Have I adhered to the principle of parsimony?
       – Have I selected the right factors?
     • Model Testing
       – Have I achieved the modeling objectives?
       – Have I avoided over-fitting my data?
       – Have I created a model that will predict future behavior?

  4. Data Partitioning
     • Training / Validation / Holdout Approach
     • Out-of-Time Validation
     • Bootstrapping Approach

       Original | Bootstrap 1 | Bootstrap 2 | Bootstrap 3
       1        | 1           | 3           | 2
       2        | 1           | 4           | 2
       3        | 2           | 5           | 3
       4        | 3           | 5           | 3
       5        | 3           | 5           | 4

     • Cross-Validation Approach

       Original | CrossValid1 | CrossValid2 | CrossValid3 | CrossValid4 | CrossValid5
       1        | 2           | 1           | 1           | 1           | 1
       2        | 3           | 3           | 2           | 2           | 2
       3        | 4           | 4           | 4           | 3           | 3
       4        | 5           | 5           | 5           | 5           | 4
       5        | 1           | 2           | 3           | 4           | 5
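The bootstrap and cross-validation partitions above can be generated with a few lines of code. The following is a minimal sketch, assuming a toy set of five record IDs as on the slide; the random seed, the number of bootstrap samples, and the leave-one-out style folds are illustrative choices, not part of the presentation.

```python
import numpy as np

rng = np.random.default_rng(seed=0)
records = np.arange(1, 6)          # five record IDs, as on the slide

# Bootstrap: resample with replacement; each sample is the same size as the
# original, so some records repeat and others drop out of a given sample.
bootstraps = [np.sort(rng.choice(records, size=records.size, replace=True))
              for _ in range(3)]

# Cross-validation: each fold holds out one record and trains on the other
# four, so every record is held out exactly once across the folds.
folds = [(np.delete(records, i), records[i]) for i in range(records.size)]

print("bootstrap samples:", bootstraps)
print("cv (train records, holdout record):", folds)
```

On a real dataset the same idea applies at the policy or claim level, and utilities such as scikit-learn's KFold and resample handle the bookkeeping.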

  5. Model Building Tools and Techniques
     • Type III statistics
     • p-values for variable levels
     • Factor assessment
       – Does it make business sense?
       – Does the relationship make sense? (e.g. monotonic)
     • Comparison with other techniques
       – Univariate analysis
       – Decision trees
     • Residual analysis
     • AIC / BIC / log-likelihood / deviance measures
     Callouts: "What happens when model assumptions are violated?" "The easy part is coming up with the story..." "Beware of correlations!"
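Several of these diagnostics (p-values for variable levels, AIC, BIC, log-likelihood, deviance) come directly out of a fitted GLM. Below is a minimal sketch using statsmodels on a hypothetical frequency dataset; the column names and values are made up for illustration, and Type III tests are not shown (they would typically come from SAS or from comparisons of nested models).

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

# Hypothetical rating data; column names and values are illustrative only.
df = pd.DataFrame({
    "claims":   [0, 1, 0, 2, 1, 0, 3, 1],
    "exposure": [1.0, 0.8, 1.2, 1.0, 0.5, 1.1, 0.9, 1.0],
    "age_band": ["A", "A", "B", "B", "C", "C", "C", "A"],
})

# Poisson frequency GLM with a log-exposure offset.
fit = smf.glm("claims ~ C(age_band)", data=df,
              family=sm.families.Poisson(),
              offset=np.log(df["exposure"])).fit()

print(fit.summary())                 # coefficient table with p-values by level
print("AIC:", fit.aic)               # information criteria and fit measures
print("BIC:", fit.bic)
print("log-likelihood:", fit.llf)
print("deviance:", fit.deviance)
```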

  6. Connecting Model Building and Model Testing
     [Chart: training error and validation error plotted against model complexity, with the optimal model complexity marked where validation error is lowest.]
     * From Elements of Statistical Learning by Hastie, Tibshirani, and Friedman
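The training-versus-validation error picture is easy to reproduce on synthetic data. A rough sketch follows, assuming a noisy sine curve and polynomial fits of increasing degree (none of this comes from the presentation): training error keeps falling as complexity grows, while validation error bottoms out and then turns back up.

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(0, 1, 60)
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.3, size=x.size)   # noisy signal

x_train, y_train = x[::2], y[::2]     # simple alternating train/validation split
x_valid, y_valid = x[1::2], y[1::2]

for degree in range(1, 10):           # model complexity = polynomial degree
    coefs = np.polyfit(x_train, y_train, degree)
    train_err = np.mean((np.polyval(coefs, x_train) - y_train) ** 2)
    valid_err = np.mean((np.polyval(coefs, x_valid) - y_valid) ** 2)
    print(f"degree {degree}: train MSE {train_err:.3f}, valid MSE {valid_err:.3f}")
```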

  7. Model Testing Tools and Techniques: The Lift Chart
     Questions:
     • How should lift be measured?
     • How many buckets?
     • How should reversals be interpreted?
     • Are there variable biases affecting the ordering? (e.g. size, policy year)
     • Is there over-fitting?
     • Fit vs. Lift?
     [Sample Lift Chart: actual vs. predicted loss ratio by decile (1 through 10).]
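A tabular version of the lift chart needs only premium, actual loss, and the model's predicted loss ratio. A minimal sketch follows; the function name, the equal-count deciles, and the toy data are choices made here for illustration (the slide's questions about bucket counts, reversals, and ordering biases apply to whichever variant is used).

```python
import numpy as np
import pandas as pd

def lift_table(premium, loss, predicted_lr, n_buckets=10):
    """Sort policies by predicted loss ratio, split into equal-count buckets,
    and compare actual vs. predicted loss ratio within each bucket."""
    df = pd.DataFrame({"premium": premium, "loss": loss, "pred_lr": predicted_lr})
    df = df.sort_values("pred_lr")                     # predictions low -> high
    df["bucket"] = pd.qcut(df["pred_lr"].rank(method="first"),
                           n_buckets, labels=False) + 1
    df["pred_loss"] = df["pred_lr"] * df["premium"]
    g = df.groupby("bucket")[["premium", "loss", "pred_loss"]].sum()
    g["actual_lr"] = g["loss"] / g["premium"]
    g["predicted_lr"] = g["pred_loss"] / g["premium"]
    return g[["actual_lr", "predicted_lr"]]

# Toy usage with random data (illustrative only).
rng = np.random.default_rng(0)
prem = rng.uniform(500, 5000, size=1000)
pred = rng.uniform(0.4, 1.0, size=1000)
loss = rng.poisson(2, size=1000) * prem * pred / 2     # loss loosely tied to the prediction
print(lift_table(prem, loss, pred))
```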

  8. Model Testing Tools and Techniques: The GINI Index
     • Gini = A / (A + B), where A is the area between the curve and the 45-degree line and B is the area beneath the curve
     • Commonly used to assess income inequality across countries
     • More granular assessment of model fit
     • Gives information on model segmentation
     • -1 ≤ Gini ≤ 1 (1 = more segmentation, better fit)
     [Chart: cumulative % of loss vs. cumulative % of exposure, with predictions sorted low to high; regions A and B as defined above.]
     Reference: http://en.wikipedia.org/wiki/Gini_index
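The slide's recipe (sort predictions low to high, accumulate exposure and loss, take A / (A + B)) translates directly into code. A small sketch, with function and argument names chosen here rather than taken from the presentation:

```python
import numpy as np

def gini_index(exposure, loss, predicted):
    """Model Gini: sort records by prediction (low to high), build the curve of
    cumulative % of loss vs. cumulative % of exposure, and return twice the
    area between that curve and the 45-degree line."""
    order = np.argsort(predicted)
    exposure = np.asarray(exposure, dtype=float)[order]
    loss = np.asarray(loss, dtype=float)[order]
    cum_expo = np.concatenate(([0.0], np.cumsum(exposure) / exposure.sum()))
    cum_loss = np.concatenate(([0.0], np.cumsum(loss) / loss.sum()))
    # Trapezoid rule for the area under the curve (region B); A + B = 1/2,
    # so Gini = A / (A + B) = 1 - 2 * B.
    area_b = np.sum((cum_expo[1:] - cum_expo[:-1])
                    * (cum_loss[1:] + cum_loss[:-1]) / 2.0)
    return 1.0 - 2.0 * area_b

# A model that orders risks well should score noticeably above zero.
pred = np.array([0.2, 0.4, 0.6, 0.8, 1.0])
print(gini_index(exposure=np.ones(5),
                 loss=np.array([0.0, 1.0, 1.0, 3.0, 5.0]),
                 predicted=pred))
```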

  9. Model Testing Tools and Techniques: Comparing Across Models
     • Which modeling technique is best?
     • How much better is this version vs. the last one?
     • Can use any measure you'd like: lift, GINI index, etc.
     • Some software packages have this capability built in (e.g. Enterprise Miner)
     • Be careful of over-fitting
     • Don't use this on the holdout data as a model-building technique!
     * From SAS Enterprise Miner documentation

  10. Food for Thought... Should there be an actuarial standard of practice addressing predictive modeling?
      – Topics such a standard might address:
        • When is out-of-time validation, rather than just out-of-sample validation, critical?
        • What steps should be taken to ensure knowledge of the holdout data has not crept into the model-building process?
          – For instance, split off the holdout data before or after EDA?
          – Splitting it too early makes balancing to control totals difficult
        • Auditing
          – "Lock up" holdout data?
          – Peer review standards
        • What should be done when the holdout data "disagrees"?
