Developments in MC tuning methods: Professor 2 and all that
Andy Buckley, University of Glasgow
MPI@LHC 2015, ICTP, Trieste, Nov 2015
Introduction
MC tuning is a necessary evil – data description is needed, and models aren't as predictive as we'd like. Data and models also aren't as perfect as we'd like: we need estimates of systematics. Hope that somewhere along the way we also gain better physical understanding...
Professor is numerical machinery frequently used these days to aid MC generator tuning. In use since 2009; many widespread tunes have used it... though not all!
Basic idea is to build fast parameterisations of observable response to parameters, from parallel sampling of the param space. Then optimise. Systematics should be derivable from the shape of the goodness-of-fit around the optimum. Right?
In this talk: Professor v2, handling systematics, other applications
The Professor method
◮ MC is slow: ∼ 1 day per run ⇒ can't use in serial optimisation.
◮ Solution is very simple: trivially parallelise MC runs throughout a "reasonable" range of parameter space, and use the sampled "anchor" points to interpolate each bin's param dependence. Up to O(15) params.
◮ We use SVD polynomial fits for robustness – requires that values vary in a polynomial fashion, or are transformed to do so. (A minimal sketch of the per-bin fit follows below.)
◮ Analytic interpolations ⇒ fast serial minimisation of an objective function.
◮ Much strength is in the "system" features to support the core machinery.
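To make the core idea concrete, here is a minimal sketch (not the Professor code itself) of fitting one bin's polynomial response from sampled anchor points with a least-squares solve (NumPy's lstsq, which is SVD-based), then evaluating it analytically. The toy response and all names are illustrative:

```python
import numpy as np
from itertools import combinations_with_replacement

def poly_features(p, order=2):
    """Monomial basis up to 'order' in the parameter vector p,
    e.g. [1, p0, p1, p0^2, p0*p1, p1^2] for 2 params at order 2."""
    feats = [1.0]
    for d in range(1, order + 1):
        for combo in combinations_with_replacement(range(len(p)), d):
            feats.append(np.prod([p[i] for i in combo]))
    return np.array(feats)

# Anchor points: MC runs at sampled parameter vectors, one bin value each
rng = np.random.default_rng(42)
anchors = rng.uniform(0.0, 2.0, size=(50, 2))            # 50 runs, 2 params
bin_vals = 1.0 + 0.5*anchors[:,0] - 0.3*anchors[:,1]**2  # toy "MC response"

# SVD-based least-squares fit of polynomial coefficients for this bin
A = np.array([poly_features(p) for p in anchors])
coeffs, *_ = np.linalg.lstsq(A, bin_vals, rcond=None)

# Fast analytic evaluation at any new parameter point
p_test = np.array([1.0, 0.5])
print(poly_features(p_test) @ coeffs)   # ~1.425 for the toy response
```

In the real workflow one such fit is made per bin, so the expensive MC runs are reused across every observable, and the subsequent minimisation touches only these cheap polynomials.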
Professor 1
◮ Python code, using scipy and pyminuit
◮ Heavy internal code framework, a hangover from early development work
◮ Parameterisation up to 5th order – manually encoded
◮ Many "magic" behaviours, coded in > 20 scripts... not all well maintained!
◮ Lots of "advanced" features, like uncertainty correlations, sensitivities, interactive GUI, etc.
◮ Hit the "maximum entropy" point of development!
Professor 2
◮ Start again from scratch, to make Professor lean, mean & flexible again. Thanks to Holger Schulz
◮ Part motivated by more generic applications, part by pure frustration!
◮ Core code now in C++; independent of concepts like "bins". Very generic
◮ Wrapped into Python, and used as the core of a library for data I/O ⇒ lessons learned from v1
◮ Works best integrated with the YODA data toolkit, but not tied to Rivet
◮ Now only a few scripts – and most work is done by the library, so scripts are easily customised
More on Professor 2
◮ All-orders parameterisations now possible, including zero (i.e. constant)
◮ SVD stability improvements through param mapping
◮ Not all old features yet: we still "need" envelopes, sensitivities, correlations, eigentunes, runcombs, kebabs, ...
◮ More general weighting system: matching patterns, bin ranges, etc. – see the sketch after this list
◮ Strengths are speed and programmability – less emphasis on pre-built magic, more on power & flexibility
◮ Interactive Web interface in development
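The exact weight-spec syntax is a Prof 2 implementation detail; as a purely hypothetical illustration of the idea, here is a sketch of pattern-plus-bin-range weighting, with first-match-wins semantics and a zero-weight default assumed for simplicity:

```python
import fnmatch

# Hypothetical weight spec: (histogram path pattern, optional bin range, weight)
weights = [
    ("/ATLAS_2010_S8918562/d03-x01-y01", None,    2.0),  # whole observable, upweighted
    ("/ATLAS_2011_S9126244/*",           None,    1.0),  # glob-style pattern match
    ("/ALICE_2010_S8625980/d01-x01-y01", (5, 20), 0.5),  # restricted to a bin range
]

def bin_weight(path, ibin):
    """Return the weight for bin 'ibin' of histogram 'path';
    the first matching pattern wins, unmatched bins get zero weight."""
    for pattern, brange, w in weights:
        if fnmatch.fnmatch(path, pattern):
            if brange is None or brange[0] <= ibin < brange[1]:
                return w
    return 0.0

print(bin_weight("/ATLAS_2011_S9126244/d02-x01-y01", 3))  # -> 1.0
```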
Param sampling
Prof 2's more powerful sampling script prof2-sample allows biased sampling using SymPy expressions.
◮ Programmable vetoes also supported, e.g. to ban regions where param A is greater than param B. Used in Sherpa 2.2 tuning. (A rejection-sampling sketch of the veto idea follows below.)
◮ Sampling can now be used to generate run scripts and any other file as well as the standard params list ⇒ easy to use "meta-params" to control multiple parameters.
◮ Still very much up to the user to ensure that sampling is dense enough and concentrated appropriately. To calculate where best to sample, you'd need to already know the answer!
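Not the prof2-sample implementation, but a minimal rejection-sampling sketch of the programmable-veto idea, here banning the region where param A exceeds param B as in the Sherpa 2.2 example; the ranges and names are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
ranges = {"A": (0.0, 2.0), "B": (0.5, 1.5)}  # illustrative param ranges

def veto(point):
    """Reject points in a banned region, e.g. where A exceeds B."""
    return point["A"] > point["B"]

# Uniformly sample anchor points, resampling any vetoed point
points = []
while len(points) < 100:
    point = {name: rng.uniform(lo, hi) for name, (lo, hi) in ranges.items()}
    if not veto(point):
        points.append(point)

print(points[0])
```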
Testing the parameterisation
Can't just assume that the parameterisation is working... but this is often done / inferred much later. New prof2-residuals tests it explicitly.
◮ Loop over runs and histogram absolute & relative residuals between ipol and MC, e.g. $(f(\vec p_i) - \mathrm{MC}_i)/\mathrm{MC}_i$
◮ Breakdown by observable, and by value / error. Easily extended.
◮ For better testing, train the interpolation on one run subset and test on the remainder – see the sketch below.
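A sketch of the train/test residual check under toy assumptions (reusing the quadratic toy response from the earlier fit sketch); prof2-residuals itself works on real run data:

```python
import numpy as np

# Toy anchor data: 60 runs of 2 params -> one noisy bin value each
rng = np.random.default_rng(1)
params = rng.uniform(0.0, 2.0, size=(60, 2))
mc_vals = 1.0 + 0.5*params[:,0] - 0.3*params[:,1]**2 + rng.normal(0, 0.01, 60)

# Train the interpolation on one run subset, test on the remainder
train, test = params[:40], params[40:]
ytrain, ytest = mc_vals[:40], mc_vals[40:]

def features(p):
    """Order-2 monomial basis in two parameters."""
    return np.column_stack([np.ones(len(p)), p[:,0], p[:,1],
                            p[:,0]**2, p[:,0]*p[:,1], p[:,1]**2])

coeffs, *_ = np.linalg.lstsq(features(train), ytrain, rcond=None)

# Relative residuals (f(p_i) - MC_i) / MC_i on the held-out runs
rel_residuals = (features(test) @ coeffs - ytest) / ytest
print(np.percentile(np.abs(rel_residuals), [50, 95]))
```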
Measuring goodness-of-fit
◮ Historically used a simple pseudo-$\chi^2$:
\[ \chi^2(\vec p) = \sum_b w_b \, \frac{\big(f_b(\vec p) - \mathrm{ref}_b\big)^2}{\Delta\mathrm{ref}_b^2 + \Delta f_b^2 + \epsilon^2} \tag{1} \]
◮ Several limitations: no stat/syst separation, weight has $\sqrt{\phantom{x}}$ of the intuitive effect, $\Delta f_b^2 = \mathrm{median}(\Delta\mathrm{MC})$, i.e. const!
◮ Expt correlations available in Prof 1.4; coming soon in v2
◮ Linearised weights also imminent – depending on feedback. Affects correlations via $\mathrm{cov}_{ij}$
◮ Prof 2 allows error parameterisation: greatly improves residuals. The denominator is of equal importance in the $\chi^2$! Needs to be regularised in the fit.
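A direct transcription of Eq. (1) into code, with the $\epsilon^2$ regulariser placed in the denominator as read from the slide; all input numbers are toy values:

```python
import numpy as np

def pseudo_chi2(f_vals, ref_vals, ref_errs, f_errs, weights, eps=1e-10):
    """Weighted pseudo-chi2 as in Eq. (1): sum_b w_b (f_b - ref_b)^2
    / (Dref_b^2 + Df_b^2 + eps^2); eps guards against empty-error bins."""
    num = (f_vals - ref_vals)**2
    den = ref_errs**2 + f_errs**2 + eps**2
    return np.sum(weights * num / den)

# Toy bins: interpolated prediction vs reference data
f_vals   = np.array([10.2, 5.1, 2.4])
ref_vals = np.array([10.0, 5.0, 2.5])
ref_errs = np.array([0.3, 0.2, 0.1])
f_errs   = np.full(3, np.median(ref_errs))  # constant ipol error, as noted
weights  = np.ones(3)

print(pseudo_chi2(f_vals, ref_vals, ref_errs, f_errs, weights))
```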
Eigentunes and coverage
Eigentunes have gained acceptance as a "robust" way to create systematic-variation tunes.
◮ Motivation cf. PDF Hessian errors.
◮ Directions are robust: physics in the components of the principal directions
◮ But the distance along the vectors is not well-defined. For a true $\chi^2$, expect $\Delta\chi^2 \sim$ num. params; actually more like num. bins for coverage. (See the Hessian sketch below.)
◮ Effect of large correlated systematics? Not experimental... but in the model??
◮ Can we define a statistically robust $\Delta\chi^2$ for tunes? Perhaps instead aim for iterated minimal data coverage.
◮ More robust dimensional reduction wanted / needed? cf. ATLAS A14 procedure
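A minimal sketch of the Hessian/eigentune construction (not the Professor implementation): diagonalise a numerical $\chi^2$ Hessian at the minimum and step along each eigenvector until the $\chi^2$ rises by a chosen $\Delta\chi^2$ – the choice of that tolerance being exactly the ill-defined part discussed above. The $\chi^2$ function here is a toy:

```python
import numpy as np

def chi2(p):
    """Toy quadratic chi2 with correlated parameters (illustrative)."""
    H = np.array([[4.0, 1.0], [1.0, 2.0]])
    return p @ H @ p

p_min = np.zeros(2)                      # toy best-fit point

# Numerical Hessian at the minimum (central finite differences)
h, n = 1e-3, len(p_min)
H = np.zeros((n, n))
for i in range(n):
    for j in range(n):
        ei, ej = np.eye(n)[i]*h, np.eye(n)[j]*h
        H[i, j] = (chi2(p_min+ei+ej) - chi2(p_min+ei-ej)
                   - chi2(p_min-ei+ej) + chi2(p_min-ei-ej)) / (4*h*h)

evals, evecs = np.linalg.eigh(H)
dchi2 = 1.0                              # the contentious tolerance choice
for lam, v in zip(evals, evecs.T):
    step = np.sqrt(2*dchi2/lam)          # chi2 rises by dchi2 along +/- v
    print("eigentune pair:", p_min + step*v, p_min - step*v)
```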
Prof4BSM – life beyond tuning
◮ Much recent development/use hasn't been for tuning at all...
◮ Fast parameterisation also finds use in BSM physics, e.g. arXiv:1506.08845, arXiv:1511.05170
◮ Use parameterisation of observables in Wilson-coefficient space to build confidence-limit contours – see the sketch below
◮ Speed important for marginalising limits in many dimensions for projections
[Figures: individual vs. marginalised limits on Wilson coefficients $\bar C_i = C_i v^2/\Lambda^2$, and a confidence contour in the $(C_{tW}\,v^2/\Lambda^2,\ C^3_{\phi q}\,v^2/\Lambda^2)$ plane]
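To illustrate the contour-building step (a generic sketch, not the Prof4BSM code): scan a grid in two Wilson-coefficient directions, evaluate a stand-in $\chi^2$ built from parameterised observables, and draw the $\Delta\chi^2 = 5.99$ (95% CL for 2 d.o.f.) contour:

```python
import numpy as np
import matplotlib.pyplot as plt

def chi2(c1, c2):
    """Toy chi2 in two Wilson-coefficient directions (illustrative
    stand-in for a chi2 from parameterised observables vs. data)."""
    return 80*(c1 - 0.02)**2 + 120*(c2 + 0.01)**2 + 60*c1*c2

# Grid scan over the two-coefficient plane
c1 = np.linspace(-0.3, 0.3, 200)
c2 = np.linspace(-0.3, 0.3, 200)
C1, C2 = np.meshgrid(c1, c2)
Z = chi2(C1, C2)
dchi2 = Z - Z.min()

# 95% CL contour for 2 degrees of freedom: delta-chi2 = 5.99
plt.contour(C1, C2, dchi2, levels=[5.99])
plt.xlabel(r"$C_{tW}\, v^2/\Lambda^2$")
plt.ylabel(r"$C^3_{\phi q}\, v^2/\Lambda^2$")
plt.savefig("contour.png")
```

Because the parameterised $\chi^2$ is cheap, the same scan can be repeated while profiling or marginalising over the remaining coefficient directions, which is where the speed matters.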
Summary & outlook
◮ Professor is a well-established tool to aid in many-parameter MC tuning. Not a replacement for physics awareness.
◮ Used by the majority of experiment tunes, also some MC-author tunes.
◮ The Prof 1 series had become unwieldy for developers, flaky for users: Prof 2 is a leaner, faster reboot.
◮ Not all functionality yet replaced – taking time to think about better-motivated heuristics.
◮ But already some advantages like speed, format support, and uncertainty parameterisation.
◮ New version has so far been used more for BSM than QCD; but not for long, I hope!