  1. Data Sciences – CentraleSupelec, Advanced Machine Learning, Course III – Stochastic approximation algorithms. Emilie Chouzenoux, Center for Visual Computing, CentraleSupelec. emilie.chouzenoux@centralesupelec.fr

  2. Motivation. Linear regression/classification:
  ◮ Dataset with n entries: x_i ∈ R^d, y_i ∈ R, i = 1, …, n
  ◮ Prediction of y as a linear model x^⊤ β
  ◮ Minimization of a penalized cost function:
      (∀ β ∈ R^d)   F(β) = (1/n) Σ_{i=1}^n ℓ(y_i, x_i^⊤ β) + λ R(β)
  Examples of losses/regularizers:
  ◮ Quadratic loss: ℓ(y, x) = (1/2)(x − y)²
  ◮ Logistic loss: ℓ(y, x) = log(1 + exp(−y x))
  ◮ Ridge penalty: R(β) = (1/2)‖β‖²
  ◮ Lasso penalty: R(β) = ‖β‖₁
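The penalized objective above can be sketched directly in NumPy. A minimal example, assuming the logistic loss with a ridge penalty (the function names `logistic_loss` and `objective` are illustrative, not from the course):

```python
import numpy as np

def logistic_loss(y, z):
    """Logistic loss ℓ(y, z) = log(1 + exp(−y z)), computed stably."""
    return np.logaddexp(0.0, -y * z)

def objective(beta, X, y, lam):
    """Penalized cost F(β) = (1/n) Σ ℓ(y_i, x_i^⊤ β) + λ R(β),
    here with the ridge penalty R(β) = ‖β‖²/2."""
    data_term = logistic_loss(y, X @ beta).mean()
    ridge = 0.5 * np.dot(beta, beta)
    return data_term + lam * ridge
```

At β = 0 every margin is zero, so the data term equals log 2 regardless of the dataset, a convenient sanity check.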

  3. Motivation. Large n, small d ⇒ minimization of F assuming that, at each iteration, only a subset of the data is available.
  Loss for a single observation:
      (∀ β ∈ R^d)   f_i(β) = ℓ(y_i, x_i^⊤ β) + λ R(β),
  so that F = (1/n) Σ_{i=1}^n f_i.
  Loss for a subset of observations (mini-batch):
      (∀ β ∈ R^d)   F_j(β) = Σ_{i ∈ B_j} ℓ(y_i, x_i^⊤ β) + λ R(β),
  with (B_j)_{1 ≤ j ≤ k} forming a partition of {1, …, n}.
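Building the partition (B_j)_{1≤j≤k} amounts to shuffling the indices and splitting them into k disjoint blocks. A small sketch (the helper name `make_minibatches` is an assumption, not from the slides):

```python
import numpy as np

def make_minibatches(n, k, seed=None):
    """Return k disjoint index blocks (B_j) forming a partition of {0,…,n−1}.

    The indices are shuffled so that each block is a random subset;
    np.array_split allows blocks of nearly equal size when k does not divide n.
    """
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n)
    return np.array_split(perm, k)
```

Each pass over all k blocks then touches every observation exactly once, which is what makes F the average of the per-batch terms.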

  4. Stochastic gradient descent. We assume that F is differentiable on R^d. For every t ∈ N, we sample uniformly an index i_t ∈ {1, …, n} and update:
      β^(t+1) = β^(t) − γ_t ∇f_{i_t}(β^(t))
  ◮ The randomly chosen gradient ∇f_{i_t}(β^(t)) yields an unbiased estimate of the true gradient ∇F(β^(t)).
  ◮ γ_t > 0 is called the stepsize or learning rate. Its choice influences the convergence properties of the algorithm. Typical choice: γ_t = C t^{−1}.
  ◮ More stable results using averaging:
      β̄^(t) = (1/t) Σ_{k=1}^t β^(k)   ⇔   β̄^(t) = (1 − 1/t) β̄^(t−1) + (1/t) β^(t)
    New choice: γ_t = C t^{−α} with α ∈ [1/2, 1].
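The SGD update with iterate averaging can be sketched in a few lines. A minimal implementation, assuming the logistic loss with ridge penalty from the earlier slides (the function name `sgd_averaged` and the default constants are illustrative choices, not prescribed by the course):

```python
import numpy as np

def sgd_averaged(X, y, lam, C=1.0, alpha=0.75, n_iter=2000, seed=0):
    """SGD on F(β) = (1/n) Σ ℓ(y_i, x_i^⊤ β) + λ‖β‖²/2 (logistic ℓ),
    with stepsize γ_t = C t^{−α} and running average of the iterates."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    beta = np.zeros(d)
    beta_bar = np.zeros(d)
    for t in range(1, n_iter + 1):
        i = rng.integers(n)                       # sample i_t uniformly in {0,…,n−1}
        z = X[i] @ beta
        # ∇f_i(β) = −y_i σ(−y_i x_i^⊤ β) x_i + λ β   (logistic loss + ridge)
        grad = -y[i] / (1.0 + np.exp(y[i] * z)) * X[i] + lam * beta
        beta = beta - C * t ** (-alpha) * grad     # SGD step with γ_t = C t^{−α}
        beta_bar += (beta - beta_bar) / t          # recursive form of the average
    return beta_bar
```

The recursive update of `beta_bar` is exactly the equivalence stated above: β̄^(t) = (1 − 1/t) β̄^(t−1) + (1/t) β^(t), so no history of iterates needs to be stored.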

  5. Accelerated variants. There is a large variety of approaches available to accelerate the convergence of SG methods. We list the most famous ones here:
  ◮ Momentum:
      β^(t+1) = β^(t) − γ_t ∇f_{i_t}(β^(t)) + θ_t (β^(t) − β^(t−1))
  ◮ Gradient averaging (see also SAG/SAGA):
      β^(t+1) = β^(t) − (γ_t/t) Σ_{k=1}^t ∇f_{i_k}(β^(k))
  ◮ ADAGRAD:
      β^(t+1) = β^(t) − γ_t W_t ∇f_{i_t}(β^(t))
    with a specific diagonal matrix W_t related to the ℓ₂ norm of past gradients.
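The momentum and ADAGRAD updates above can each be written as a one-step function. A sketch, assuming the common diagonal form of W_t (inverse square root of accumulated squared gradients, plus a small ε for numerical safety); the function names are illustrative:

```python
import numpy as np

def momentum_step(beta, beta_prev, grad, gamma, theta):
    """Momentum: β⁺ = β − γ ∇f_{i_t}(β) + θ (β − β_prev)."""
    return beta - gamma * grad + theta * (beta - beta_prev)

def adagrad_step(beta, grad, accum, gamma, eps=1e-8):
    """ADAGRAD: per-coordinate stepsize scaled by the root of the
    accumulated squared past gradients (a diagonal W_t).
    Returns the new iterate and the updated accumulator."""
    accum = accum + grad ** 2
    return beta - gamma * grad / (np.sqrt(accum) + eps), accum
```

On a simple quadratic f(β) = ‖β‖²/2 (so ∇f(β) = β), iterating either step drives β towards 0, which is a quick way to check the signs of the updates.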
