

  1. Online Joint GlueX-EIC-PANDA Machine Learning Workshop: Machine Learning for Beginners. Thomas Stibor, GSI Helmholtzzentrum für Schwerionenforschung GmbH, t.stibor@gsi.de. 21st September 2020 - 25th September 2020

  2. Organizational
  Machine Learning for Beginners I, September 21st, 14:00 - 14:45
  Machine Learning for Beginners II, September 21st, 15:00 - 15:45
  Machine Learning for Beginners III, September 22nd, 14:15 - 15:00
  Machine Learning for Beginners IV, September 23rd, 14:15 - 15:00
  Support Vector Machines, September 24th, 15:15 - 16:00

  3. Overview
  Literature, Introductory Example, Historical Overview, Linear Classifiers, Gradient Descent, Neural Networks, Learning (Backpropagation), Overfitting vs. Underfitting, Bias-Variance Dilemma, Support Vector Machines.
  Machine Learning is a large field; here we will focus on Neural Networks and Support Vector Machines.

  4. Literature: History of Artificial Intelligence & Machine Learning. Some figures are from: The Quest for Artificial Intelligence (Nils J. Nilsson)

  5. Literature: Machine Learning. Some figures are from: Pattern Recognition and Machine Learning (Christopher M. Bishop)

  6. Literature: Neural Networks

  7. Literature: Support Vector Machines

  8. Literature: Deep Learning

  9. An Introductory Example
  Suppose that a fish-packing factory wants to automate the process of sorting incoming fish (salmon and sea bass).
  [Figure: scatter plot of length vs. lightness for salmon and sea bass samples]
  After some preprocessing, each fish is characterized by a feature vector x = (x_1, x_2) ∈ R^2 (pattern), where the first component is the lightness and the second component the length.

  10. Pattern belongs to Class?
  [Figure: the scatter plot with an unseen pattern marked "?"]
  Given labeled training data (x_1, y_1), ..., (x_N, y_N) ∈ R^n × Y coming from some unknown probability distribution P(x, y). In this example, Y = {salmon, sea bass} and n = 2. Does the unseen (unlabeled) pattern belong to class salmon or sea bass?
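  A minimal sketch of this setup (the numbers are synthetic stand-ins, not the slide's data): store the labeled training patterns and assign an unseen pattern the label of its nearest training pattern (1-nearest-neighbor), one of the simplest possible classifiers.

```python
# 1-nearest-neighbor on synthetic fish data; feature order (lightness, length)
# and all values are illustrative assumptions.
import numpy as np

X = np.array([[3.0, 15.5], [4.0, 16.0],    # salmon-like patterns
              [7.5, 20.0], [8.5, 21.0]])   # sea-bass-like patterns
y = np.array(["salmon", "salmon", "sea bass", "sea bass"])

def classify(x):
    """Label of the training pattern closest to x (1-nearest neighbor)."""
    return y[np.argmin(np.linalg.norm(X - x, axis=1))]

print(classify(np.array([5.0, 17.0])))     # unseen pattern -> salmon
```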

  11. An Underfitted (too simple) Classifier
  [Figure: the scatter plot with a linear decision boundary]
  This linear separation suggests the rule: classify the fish as salmon if its feature vector falls below the decision boundary, otherwise as sea bass.

  12. An Overfitted (too complex) Classifier
  [Figure: the scatter plot with a highly irregular decision boundary]
  A too complex model leads to a decision boundary that gives perfect classification accuracy on the training set (seen patterns), but poor classification on unseen patterns.

  13. A Good Classifier
  [Figure: the scatter plot with a moderately curved decision boundary]
  Optimal tradeoff between performance on the training set and simplicity of the model. This gives high classification accuracy on unseen patterns, i.e. good generalization.
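  This tradeoff can be made measurable by comparing training and test accuracy while the model's complexity is varied. A minimal sketch (assuming scikit-learn and synthetic data, not an experiment from the slides), using decision-tree depth as the complexity knob:

```python
# Compare training vs. test accuracy for an underfitted, a balanced and an
# overfitted model; a large gap between the two scores signals overfitting.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=2, n_informative=2,
                           n_redundant=0, flip_y=0.1, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for depth in (1, 3, None):   # too simple, balanced, unbounded (too complex)
    tree = DecisionTreeClassifier(max_depth=depth, random_state=0).fit(X_tr, y_tr)
    print(depth, tree.score(X_tr, y_tr), tree.score(X_te, y_te))
```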

  14. An Optimal Classifier
  [Figure: the scatter plot partitioned into decision regions R1 (salmon) and R2 (sea bass)]

  15. History of Neural Networks
  Era of Kernel Methods (SVM, Kernel-PCA, Kernel-Fisher Discriminants, etc.); Neural Networks were however still used:
  1995: Support-Vector Networks (Cortes and Vapnik)
  1992: A Training Algorithm for Optimal Margin Classifiers (Boser, Guyon and Vapnik), first paper on SVM
  Era of Neural Networks:
  1986/1985: Backpropagation (Rumelhart, Hinton, Williams, Le Cun; actually first proposed by Werbos, 1974)
  1982: Hopfield Network (Hopfield), Recurrent Networks, Energy Function
  Decline of neural network research:
  1969: Book: Perceptrons (Minsky and Papert)
  1962/1960: Adaline (Widrow and Hoff), Perceptron (Rosenblatt)
  1943: Model of McCulloch and Pitts
  Note: this historical overview is far from being complete (cf. The Quest for Artificial Intelligence, Nils J. Nilsson)

  16. Neuron & Model of McCulloch and Pitts. Taken from: The Quest for Artificial Intelligence (Nils J. Nilsson)

  17. Book Perceptrons (Minsky and Papert). Taken from: Pattern Recognition and Machine Learning (Christopher M. Bishop)

  18. History of Neural Networks (cont.)
  2020: Deep Neural Networks are state-of-the-art classifiers; however, ensemble classifiers (XGBoost, Random Forest, etc.) and SVMs are still useful
  2018: ACM Turing Award: Bengio, Hinton and LeCun
  Era of Deep Neural Networks (also called Deep Learning):
  2012: ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky, Sutskever and Hinton)
  2009: ImageNet: A large-scale hierarchical image database (Deng et al.) (see Image Classification on ImageNet)
  Decline of neural network research: Bengio, Hinton, LeCun and others still worked on neural networks (see Deep Learning in Neural Networks: An Overview (Schmidhuber))
  2000: SVMs are state-of-the-art classifiers

  19. Overview ImageNet
  ≈ 14 million images, annotated to indicate which objects are pictured. Objects are categorized into 1000 classes (e.g. 'Tibetan mastiff', 'Great Dane', 'Eskimo dog, husky', ...).
  Top-1 score: check whether the predicted class with the highest probability is the same as the target label.
  Top-5 score: check whether the target label is among the 5 predictions with the highest probability.
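  Both scores are easy to compute from a matrix of predicted class probabilities. A minimal sketch with random stand-in data (assuming NumPy; not evaluation code from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)
probs = rng.random((100, 1000))                 # one row of class probabilities per image
probs /= probs.sum(axis=1, keepdims=True)
targets = rng.integers(0, 1000, size=100)       # true class labels

top1 = np.mean(probs.argmax(axis=1) == targets)
top5_sets = np.argsort(probs, axis=1)[:, -5:]   # 5 highest-probability classes per image
top5 = np.mean([t in row for t, row in zip(targets, top5_sets)])
print(f"top-1: {top1:.3f}, top-5: {top5:.3f}")
```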

  20. Why are Deep Neural Networks so successful?
  [Schematic: prediction accuracy vs. amount of data; deep neural networks keep improving while traditional machine learning algorithms plateau]
  Deep Neural Networks (Backpropagation) are universal, that is, applicable to a large class of problems (vision, speech, text, ...), and they scale with data. Backpropagation (forward + backward pass) is intrinsically linked to matrix multiplication (GPUs, TPUs).
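  To make the matrix-multiplication point concrete, here is a minimal sketch (shapes, data and initialization are illustrative assumptions) of a one-hidden-layer network in which both the forward pass and the backward pass of a squared loss reduce to matmuls:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((64, 10))          # batch of input patterns
T = rng.standard_normal((64, 1))           # targets
W1 = rng.standard_normal((10, 32))
W2 = rng.standard_normal((32, 1))

# Forward pass: two matrix multiplications plus an elementwise nonlinearity.
H = np.tanh(X @ W1)
Y = H @ W2
loss = 0.5 * np.mean((Y - T) ** 2)

# Backward pass: the same matrices reappear transposed, again as matmuls.
dY = (Y - T) / len(X)
dW2 = H.T @ dY
dH = dY @ W2.T
dW1 = X.T @ (dH * (1 - H ** 2))            # tanh'(a) = 1 - tanh(a)^2
```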

  21. Attendance at AI & ML Conferences (1984 - 2019). Taken from: Artificial Intelligence Index, 2019 Annual Report (p. 39)

  22. Machine Learning Framework
  Machine Learning ≡ Optimization & Statistics. Data ≡ (input data, target data).
  Optimization view: while not min Loss_Θ(target data, predicted data) { fit parameters Θ }
  Statistics view: while not max Prob(target data, input data | Θ) { fit parameters Θ }
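  A minimal sketch of the optimization view (synthetic data and a squared loss are my assumptions): the "while not min Loss" loop becomes gradient descent on the parameters Θ.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((200, 3))                     # input data
theta_true = np.array([1.0, -2.0, 0.5])
t = X @ theta_true + 0.1 * rng.standard_normal(200)   # target data

theta = np.zeros(3)                                   # parameters Θ
for _ in range(500):                                  # "while not min Loss_Θ"
    y_pred = X @ theta                                # predicted data
    grad = X.T @ (y_pred - t) / len(X)                # gradient of the squared loss
    theta -= 0.1 * grad                               # fit parameters Θ
print(theta)                                          # close to theta_true
```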

  23. Machine Learning Framework (Example SVM)
  Machine Learning ≡ Optimization & Statistics. Data ≡ (input data x_n, target data y_n).
  while not min Loss_Θ(target data, predicted data) { fit parameters Θ := w, b (normal vector, offset) }
  minimize (1/2)‖w‖² subject to y_n(w^T·x_n + b) ≥ 1, n = 1, ..., N
  [Figure: separating hyperplane {x | w^T·x + b = 0} with margin hyperplanes {x | w^T·x + b = ±1}, each at distance 1/‖w‖, giving a margin of width 2/‖w‖]
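  A minimal sketch of fitting this problem in practice (assuming scikit-learn, which solves the soft-margin formulation; a very large C approximates the hard-margin problem above): the fitted parameters Θ = (w, b) can be read off directly.

```python
import numpy as np
from sklearn.svm import SVC

# Tiny separable toy set; the numbers are illustrative assumptions.
X = np.array([[1.0, 1.0], [1.5, 1.2], [3.0, 3.0], [3.5, 3.2]])
y = np.array([-1, -1, 1, 1])

clf = SVC(kernel="linear", C=1e6).fit(X, y)   # large C approximates a hard margin
w, b = clf.coef_[0], clf.intercept_[0]
print("w:", w, "b:", b, "margin 2/||w||:", 2 / np.linalg.norm(w))
```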

  24. Machine Learning Framework (Example One-Class SVM)
  Machine Learning ≡ Optimization & Statistics. Data ≡ (input data x_n).
  while not min Loss_Θ(input data) { fit parameters Θ := c, r (sphere center, radius) }
  minimize r² subject to ‖x_n − c‖² ≤ r², n = 1, ..., N
  [Figure: smallest enclosing sphere around the unlabeled input data]
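  A minimal sketch of this enclosing-sphere problem (my own reformulation as an unconstrained problem, not the slides' solver): since the smallest feasible radius for a fixed center c is max_n ‖x_n − c‖, one can simply minimize that quantity over c.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
X = rng.uniform(1.0, 2.0, size=(50, 2))   # synthetic unlabeled input data

def radius(c):
    """Smallest feasible radius for center c: max distance to any x_n."""
    return np.max(np.linalg.norm(X - c, axis=1))

# Nelder-Mead is a crude but adequate choice for this tiny nonsmooth problem.
res = minimize(radius, x0=X.mean(axis=0), method="Nelder-Mead")
print("center:", res.x, "radius:", radius(res.x))
```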

  25. Machine Learning Framework (Example HMM)
  Machine Learning ≡ Optimization & Statistics. Data ≡ (input data).
  while not max Prob(input data | Θ) { fit parameters Θ := s, H, E (start vector, hidden matrix, emission matrix) }
  [Figure: HMM state diagram with start state S0, hidden states S1 and S2 (transition probabilities), and emission symbols E1, E2, E3 (emission probabilities)]
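  The quantity Prob(input data | Θ) being maximized can be evaluated with the forward algorithm. A minimal sketch (the matrices below are illustrative assumptions, not the values from the slide's diagram):

```python
import numpy as np

s = np.array([0.6, 0.4])            # start vector: P(first state = S1, S2)
H = np.array([[0.7, 0.3],           # hidden matrix: H[i, j] = P(S_j | S_i)
              [0.4, 0.6]])
E = np.array([[0.5, 0.4, 0.1],      # emission matrix: E[i, k] = P(E_k | S_i)
              [0.1, 0.2, 0.7]])

def likelihood(obs):
    """Prob(obs | s, H, E) via the forward recursion."""
    alpha = s * E[:, obs[0]]
    for o in obs[1:]:
        alpha = (alpha @ H) * E[:, o]
    return alpha.sum()

print(likelihood([0, 2, 1]))        # observation sequence E1, E3, E2
```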
