CS485/685 Lecture 16: March 1, 2012
Agnostic Learning
[BDSS] Chapters 2, 3

Agnostic PAC Learning
• Definition: A learner that does not assume that $\mathcal{H}$ contains an error-free hypothesis, and that simply finds the hypothesis with minimum training error, is often called an agnostic learner.
Agnostic PAC Learnability
• Definition: A hypothesis class $\mathcal{H}$ is agnostic PAC learnable if for any $\epsilon > 0$, $\delta \in (0,1)$, there exist a sample size $N_{\mathcal{H}}(\epsilon, \delta)$ and a learning algorithm such that, for any distribution $\mathcal{D}$ and any $N \ge N_{\mathcal{H}}(\epsilon, \delta)$ i.i.d. samples from $\mathcal{D}$, it returns $h \in \mathcal{H}$ such that with probability at least $1 - \delta$
$$L_{\mathcal{D}}(h) \;\le\; \min_{h' \in \mathcal{H}} L_{\mathcal{D}}(h') + \epsilon$$

$\epsilon$-representative
• Definition: A training set $S$ is called $\epsilon$-representative if $\forall h \in \mathcal{H}$, $|L_S(h) - L_{\mathcal{D}}(h)| \le \epsilon$.
• Lemma: Assume that a training set $S$ is $\frac{\epsilon}{2}$-representative. Then any output $h_S$ of an empirical risk minimizing algorithm satisfies
$$L_{\mathcal{D}}(h_S) \;\le\; \min_{h \in \mathcal{H}} L_{\mathcal{D}}(h) + \epsilon$$
• Proof: For any $h \in \mathcal{H}$,
$$L_{\mathcal{D}}(h_S) \;\le\; L_S(h_S) + \tfrac{\epsilon}{2} \;\le\; L_S(h) + \tfrac{\epsilon}{2} \;\le\; L_{\mathcal{D}}(h) + \tfrac{\epsilon}{2} + \tfrac{\epsilon}{2} \;=\; L_{\mathcal{D}}(h) + \epsilon$$
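A minimal Python sketch of empirical risk minimization over a finite hypothesis class may help make the lemma concrete. It is not from the lecture: the erm helper, the class of threshold classifiers, and all parameters (sample size, noise level, threshold grid) are hypothetical choices for illustration.

import numpy as np

def erm(hypotheses, X, y):
    # Return the hypothesis in the finite class with minimum training error L_S(h).
    def empirical_risk(h):
        return np.mean(h(X) != y)  # 0-1 training loss L_S(h)
    return min(hypotheses, key=empirical_risk)

rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=200)
# Labels from a threshold at 0.6, flipped with probability 0.1 (so no h is error-free).
y = ((X > 0.6) ^ (rng.uniform(size=200) < 0.1)).astype(int)
# Finite class: 21 threshold classifiers h_t(x) = 1[x > t].
H = [lambda X, t=t: (X > t).astype(int) for t in np.linspace(0.0, 1.0, 21)]
h_S = erm(H, X, y)
print("training error of the ERM hypothesis:", np.mean(h_S(X) != y))

Because the labels are noisy, no hypothesis in the class has zero error; the ERM output simply has the smallest training error, which is exactly the agnostic setting above.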
Uniform Convergence
• Definition: A hypothesis class $\mathcal{H}$ has the uniform convergence property if there exists a function $m^{UC}_{\mathcal{H}}: (0,1)^2 \to \mathbb{N}$ such that for every probability distribution $\mathcal{D}$, if $S$ is a sample of $m \ge m^{UC}_{\mathcal{H}}(\epsilon, \delta)$ examples drawn i.i.d. according to $\mathcal{D}$, then with probability at least $1 - \delta$, $S$ is $\epsilon$-representative.

Uniform Convergence
• Corollary 2: If a class $\mathcal{H}$ has the uniform convergence property with a function $m^{UC}_{\mathcal{H}}$, then the class is agnostically PAC learnable with sample complexity $N_{\mathcal{H}}(\epsilon, \delta) \le m^{UC}_{\mathcal{H}}(\frac{\epsilon}{2}, \delta)$. Furthermore, an empirical risk minimization algorithm is a successful agnostic PAC learner for $\mathcal{H}$.
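The slide states Corollary 2 without proof; the short derivation below is my own filling-in, using only the definitions and the lemma above.

% Proof sketch of Corollary 2 (filled in; not from the slides).
% Draw $m \ge m^{UC}_{\mathcal{H}}(\epsilon/2, \delta)$ i.i.d. examples. By the
% uniform convergence property, with probability at least $1 - \delta$ the
% sample $S$ is $\epsilon/2$-representative; on that event the lemma gives,
% for any ERM output $h_S$,
\[
  L_{\mathcal{D}}(h_S) \;\le\; \min_{h \in \mathcal{H}} L_{\mathcal{D}}(h) + \epsilon .
\]
% Hence ERM agnostically PAC learns $\mathcal{H}$ with
% $N_{\mathcal{H}}(\epsilon, \delta) \le m^{UC}_{\mathcal{H}}(\epsilon/2, \delta)$.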
Uniform Convergence
• To show that uniform convergence holds, show that:
  1. $|L_S(h) - L_{\mathcal{D}}(h)|$ is likely to be small for any fixed hypothesis (chosen before seeing the data).
  2. Think of $L_S(h)$ as a random variable with mean $L_{\mathcal{D}}(h)$. Then the distribution of $L_S(h)$ is concentrated around its mean for all $h \in \mathcal{H}$.

Measure Concentration
• Let $\theta_i$ be random variables with mean $\mu$. Then as $m \to \infty$, $\frac{1}{m}\sum_{i=1}^{m} \theta_i \to \mu$.
• Use measure concentration inequalities to quantify the deviation of $\frac{1}{m}\sum_{i=1}^{m} \theta_i$ from $\mu$ for finite $m$.
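A quick simulation can illustrate this concentration effect. It is only a demo with assumed parameters (Bernoulli variables with mean 0.3, 1000 repetitions); nothing here comes from the slides.

import numpy as np

rng = np.random.default_rng(1)
mu = 0.3  # assumed true mean of each Bernoulli theta_i
for m in (10, 100, 1000, 10000):
    # 1000 independent repetitions: draw m i.i.d. samples and average them.
    sample_means = rng.binomial(1, mu, size=(1000, m)).mean(axis=1)
    worst = np.max(np.abs(sample_means - mu))
    print(f"m = {m:6d}   largest |sample mean - mu| over 1000 runs = {worst:.4f}")

As m grows, the sample mean stays closer and closer to mu; the inequalities on the next slides quantify how fast.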
Markov's Inequality
• Markov's inequality: for a non-negative random variable $Z$ and any $a > 0$,
$$\Pr[Z \ge a] \;\le\; \frac{E[Z]}{a}$$
• Derivation:
$$E[Z] \;=\; \int_0^{\infty} \Pr[Z \ge x]\,dx \;\ge\; \int_0^{a} \Pr[Z \ge x]\,dx \;\ge\; \int_0^{a} \Pr[Z \ge a]\,dx \;=\; a \Pr[Z \ge a]$$
Dividing both sides by $a$ gives the inequality.

Chebyshev's Inequality
• Bound the deviation from the mean on both sides by applying Markov's inequality to $(Z - E[Z])^2$:
$$\Pr[|Z - E[Z]| \ge a] \;=\; \Pr[(Z - E[Z])^2 \ge a^2] \;\le\; \frac{E[(Z - E[Z])^2]}{a^2} \;=\; \frac{Var[Z]}{a^2}$$
• Since $Var\!\left[\frac{1}{m}\sum_{i=1}^m \theta_i\right] = \frac{Var[\theta]}{m}$ for i.i.d. $\theta_i$'s, then
$$\Pr\!\left[\left|\frac{1}{m}\sum_{i=1}^m \theta_i - \mu\right| \ge a\right] \;\le\; \frac{Var[\theta]}{m a^2}$$
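The bounds can be checked numerically. The snippet below is an illustrative sanity check with an assumed distribution (exponential with mean 1 and variance 1), not part of the lecture.

import numpy as np

rng = np.random.default_rng(2)
Z = rng.exponential(scale=1.0, size=1_000_000)  # non-negative, E[Z] = 1, Var[Z] = 1
for a in (2.0, 4.0, 8.0):
    empirical = np.mean(Z >= a)       # empirical tail Pr[Z >= a]
    markov = 1.0 / a                  # E[Z] / a
    chebyshev = 1.0 / (a - 1.0) ** 2  # Var[Z] / (a - E[Z])^2; the two-sided bound also covers the upper tail
    print(f"a = {a}: empirical {empirical:.4f}   Markov {markov:.4f}   Chebyshev {chebyshev:.4f}")

Both bounds hold but are loose for this distribution, which is one motivation for the sharper Hoeffding bound on the following slides.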
Chebyshev's Inequality
• Lemma: Let $\theta_1, \dots, \theta_m$ be i.i.d. with $E[\theta_i] = \mu$ and $Var[\theta_i] \le 1$ for all $i$. Then for any $\delta \in (0,1)$, with probability at least $1 - \delta$ we have
$$\left|\frac{1}{m}\sum_{i=1}^m \theta_i - \mu\right| \;\le\; \sqrt{\frac{1}{\delta m}}$$
• Proof: By Chebyshev's inequality,
$$\Pr\!\left[\left|\frac{1}{m}\sum_{i=1}^m \theta_i - \mu\right| \ge a\right] \;\le\; \frac{Var[\theta]}{m a^2} \;\le\; \frac{1}{m a^2}$$
Setting $\delta = \frac{1}{m a^2}$ gives $a = \sqrt{\frac{1}{\delta m}}$, hence the deviation exceeds $\sqrt{\frac{1}{\delta m}}$ with probability at most $\delta$.

Hoeffding's Inequality
• Tighter bound than Chebyshev's inequality.
• Let $\theta_1, \dots, \theta_m$ be i.i.d. variables with mean $\mu$.
• Assume that $\Pr[a \le \theta_i \le b] = 1$.
• Then
$$\Pr\!\left[\left|\frac{1}{m}\sum_{i=1}^m \theta_i - \mu\right| > \epsilon\right] \;\le\; 2 e^{-2 m \epsilon^2 / (b-a)^2}$$
• Hence, when the $\theta_i$ take values in $[0,1]$ (so $b - a = 1$),
$$\Pr\!\left[\left|\frac{1}{m}\sum_{i=1}^m \theta_i - \mu\right| > \epsilon\right] \;\le\; 2 e^{-2 m \epsilon^2}$$
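To see why Hoeffding's bound is tighter, the snippet below compares the two bounds on the deviation of a sample mean of [0,1]-valued variables; the parameters (eps = 0.05, worst-case variance 1/4) are assumed for illustration.

import math

eps = 0.05
var = 0.25  # worst-case variance of a [0, 1]-valued variable
for m in (100, 1000, 10000):
    chebyshev = var / (m * eps ** 2)             # Pr[|mean - mu| >= eps] <= Var[theta] / (m eps^2)
    hoeffding = 2 * math.exp(-2 * m * eps ** 2)  # Pr[|mean - mu| > eps] <= 2 exp(-2 m eps^2), since b - a = 1
    print(f"m = {m:6d}   Chebyshev {chebyshev:.4f}   Hoeffding {hoeffding:.6f}")

Chebyshev's bound shrinks only like 1/m, while Hoeffding's shrinks exponentially in m; this exponential decay is what yields the logarithmic dependence on |H| and delta in the theorem on the next slide.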
Agnostic PAC Learnability
• Theorem: Let $\mathcal{H}$ be finite, $\delta \in (0,1)$, $\epsilon > 0$ and $m \ge \frac{2 \log(2|\mathcal{H}|/\delta)}{\epsilon^2}$. Then with probability at least $1 - \delta$ we have
$$L_{\mathcal{D}}(h_S) \;\le\; \min_{h \in \mathcal{H}} L_{\mathcal{D}}(h) + \epsilon$$
• Proof: From Corollary 2, it suffices to show that
$$\Pr\!\left[\exists h \in \mathcal{H},\ |L_S(h) - L_{\mathcal{D}}(h)| > \tfrac{\epsilon}{2}\right] \;\le\; \delta$$
Using the union bound and Hoeffding's inequality (the loss is bounded in $[0,1]$):
$$\Pr\!\left[\exists h \in \mathcal{H},\ |L_S(h) - L_{\mathcal{D}}(h)| > \tfrac{\epsilon}{2}\right] \;\le\; \sum_{h \in \mathcal{H}} \Pr\!\left[|L_S(h) - L_{\mathcal{D}}(h)| > \tfrac{\epsilon}{2}\right] \;\le\; 2|\mathcal{H}|\, e^{-2m(\epsilon/2)^2} \;=\; 2|\mathcal{H}|\, e^{-m\epsilon^2/2} \;\le\; \delta$$
since $m \ge \frac{2 \log(2|\mathcal{H}|/\delta)}{\epsilon^2}$.
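The theorem translates directly into a sample-size calculator. A minimal sketch (the function name and the example numbers are my own, for illustration only):

import math

def sample_complexity(num_hypotheses, eps, delta):
    # m >= 2 * log(2|H|/delta) / eps^2, from the theorem above (natural log).
    return math.ceil(2 * math.log(2 * num_hypotheses / delta) / eps ** 2)

# Example: |H| = 1000 hypotheses, accuracy eps = 0.05, confidence delta = 0.05.
print(sample_complexity(1000, eps=0.05, delta=0.05))  # about 8478 examples

Note the sample size grows only logarithmically in |H| and in 1/delta, but quadratically in 1/eps.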