Sample Complexity Bounds for Active Learning
Paper by Sanjoy Dasgupta
Presenter: Peter Sadowski

Passive PAC Learning Complexity
- Based on VC dimension. To get error < ε with probability ≥ 1 − δ:
  num samples ≥ Õ( (1/ε) · (VC(H) + log(1/δ)) )
- Is there some equivalent for active learning?

Example: Reals in 1-D
- P = underlying distribution of points
- H = space of possible hypotheses: thresholds H = { h_w : w ∈ ℝ }, where h_w(x) = 1 if x ≥ w and 0 if x < w
- O(1/ε) random labeled examples from P are needed to get error rate < ε

Example: Reals in 1-D
  h_w(x) = 1 if x ≥ w, 0 if x < w
- Passive learning: O(1/ε) random labeled examples needed from P to get error rate < ε
- Active learning (binary search): O(log(1/ε)) label queries needed to get error < ε
- Active learning gives us an exponential improvement!

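The exponential gap can be seen directly in code: below is a minimal Python sketch of the binary-search active learner for the threshold class, assuming the learner may query the label of any point in [0, 1] (the function names and oracle interface are hypothetical, not from the paper).

```python
# Hypothetical sketch: active learning of a 1-D threshold by binary search.
# query_label(x) is assumed to return the true label of x on demand.

def active_learn_threshold(query_label, epsilon=1e-3):
    """Binary-search for w in h_w(x) = 1{x >= w} using O(log 1/epsilon) queries."""
    lo, hi = 0.0, 1.0                 # the threshold w is known to lie in [lo, hi]
    while hi - lo > epsilon:          # stop once the uncertainty region has mass < epsilon
        mid = (lo + hi) / 2
        if query_label(mid) == 1:     # label 1 => threshold is at or below mid
            hi = mid
        else:                         # label 0 => threshold is above mid
            lo = mid
    return (lo + hi) / 2              # any threshold in [lo, hi] has error < epsilon


if __name__ == "__main__":
    true_w = 0.37
    oracle = lambda x: 1 if x >= true_w else 0
    print(active_learn_threshold(oracle))   # close to 0.37 after about log2(1/epsilon) queries
```

Under the uniform distribution on [0, 1], the error of the returned threshold is exactly its distance from the true w, so an interval of width ε guarantees error below ε; a passive learner would instead need on the order of 1/ε random labeled draws.
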
Example 2: Points on a Circle
- P = some density on the circle perimeter
- H = linear separators in R²
(figure: several candidate linear separators drawn across the circle)

Example 2: Points on a Circle
- Worst case: the hypotheses differ only on a small ε-slice of the circle
- Passive learning: O(1/ε)
- Active learning: still O(1/ε) in the worst case, no improvement!

Active Learning Abstracted
- Goal: Narrow down the version space (the hypotheses that fit the known labels)
- Idea: Think of hypotheses as points
(figure: observing x's label cuts the version space; the side consistent with the observed label, x = 0 or x = 1, becomes the new version space)

Shrinking the Version Space
- Define a distance between hypotheses: d(h, h′) = P{ x : h(x) ≠ h′(x) }
- Ignore distances less than ε: let Q = H × H and Q_ε = { (h, h′) ∈ Q : d(h, h′) > ε }
(figure: a good cut removes many edges of Q_ε)

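For concreteness, here is a minimal sketch of building Q_ε empirically (hypothetical names, not from the paper): the disagreement probability d(h, h′) is estimated from an unlabeled sample drawn from P, and only the pairs that disagree on more than an ε fraction of the sample are kept as edges.

```python
# Hypothetical sketch: hypotheses are callables x -> {0, 1}; unlabeled_xs is an
# i.i.d. sample from P used to estimate d(h, h') = P{x : h(x) != h'(x)}.
from itertools import combinations

def build_Q_eps(hypotheses, unlabeled_xs, epsilon):
    """Return the edges of Q_epsilon: pairs with estimated disagreement > epsilon."""
    def d(h, hp):
        return sum(h(x) != hp(x) for x in unlabeled_xs) / len(unlabeled_xs)

    return [(h, hp) for h, hp in combinations(hypotheses, 2) if d(h, hp) > epsilon]
```

Pairs at distance at most ε are deliberately left out: distinguishing them is unnecessary for getting error below ε.
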
Quick Example
- What is the best cut? Recall Q_ε = { (h, h′) ∈ Q : d(h, h′) > ε }

Quick Example
- Cutting edges shrinks the version space
- After this cut, we have a solution! The hypotheses left are insignificantly different (all remaining pairs are within distance ε of each other)

Quantifying “Usefulness” of Points
- A point x ∈ X is said to ρ-split Q_ε if observing its label reduces the number of edges of Q_ε by at least a fraction ρ > 0, whichever label is observed
(figure: example points whose queries give a ¼-split, a ¾-split, and a 1-split)

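A version-space view makes this concrete: after observing (x, y), only hypotheses with h(x) = y survive, so an edge (h, h′) of Q_ε survives only if both endpoints label x as y. The sketch below (hypothetical names, not from the paper) computes how large a split a candidate query point achieves in the worse of its two possible labels.

```python
# Hypothetical sketch: Q_eps_edges is a list of hypothesis pairs (edges of Q_epsilon),
# and each hypothesis is a callable x -> {0, 1}.

def split_fraction(x, Q_eps_edges):
    """Worst-case (over the two labels) fraction of Q_epsilon edges removed by querying x.

    x rho-splits Q_epsilon exactly when this value is at least rho."""
    total = len(Q_eps_edges)
    if total == 0:
        return 1.0                      # nothing left to split
    survive = {0: 0, 1: 0}              # edges that survive each possible label
    for h, hp in Q_eps_edges:
        for y in (0, 1):
            if h(x) == y and hp(x) == y:
                survive[y] += 1
    return 1.0 - max(survive.values()) / total
```
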
Quantifying the Difficulty of Problems
- Definition: A subset S of hypotheses is (ρ, ε, τ)-splittable if P{ x : x ρ-splits Q_ε } ≥ τ
- “At least a fraction τ of samples are ρ-useful in splitting S.”
- ρ small ⇒ smaller splits
- ε small ⇒ small error
- τ small ⇒ lots of samples needed to get a good split

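Splittability can then be checked empirically: τ is the probability mass of useful query points, estimated here (hypothetical names) as the fraction of pool points whose query would ρ-split Q_ε, reusing split_fraction from the previous sketch.

```python
# Hypothetical sketch: sample_xs is an i.i.d. sample from P, Q_eps_edges is the edge set
# of Q_epsilon for the hypothesis subset S, and split_fraction is the function sketched above.

def estimate_tau(sample_xs, Q_eps_edges, rho, split_fraction):
    """Empirical estimate of P{x : x rho-splits Q_epsilon}."""
    useful = sum(split_fraction(x, Q_eps_edges) >= rho for x in sample_xs)
    return useful / len(sample_xs)

# S is declared (rho, epsilon, tau)-splittable when the estimate is at least tau.
```
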
Lower Bound Result
Suppose that for some hypothesis space H there are hypotheses h_0, h_1, ..., h_N such that:
- d(h_0, h_i) > ε for each i, and
- the “disagree sets” { x : h_0(x) ≠ h_i(x) } are disjoint.
Then: for any τ and any ρ > 1/N, H is not (ρ, ε, τ)-splittable.
(The circle example has exactly this structure with N on the order of 1/ε, which is why active learning gave no improvement there.)

An Interesting Result
There is a constant c > 0 such that for any dimension d ≥ 2, if
1. H is the class of homogeneous linear separators in R^d, and
2. P is the uniform distribution over the surface of the unit sphere,
then H is (1/4, ε, cε)-splittable for all ε > 0.
⇒ For any h ∈ H and any ε ≤ 1/(32π√d), the ball B(h, 4ε) is (1/4, ε, τ)-splittable with τ on the order of ε/√d.

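A rough Monte Carlo illustration of this result (not from the paper: the cap-sampling scheme is a crude stand-in for the ball B(h, 4ε), and all names and parameter choices are hypothetical). It samples homogeneous separators within angle 4πε of a target direction, uses the identity d(h_w, h_v) = angle(w, v)/π that holds under the uniform distribution on the sphere, and estimates the fraction of uniformly drawn query points that 1/4-split the resulting Q_ε.

```python
# Hypothetical sketch: estimate how often a random query point 1/4-splits Q_epsilon
# for homogeneous linear separators near a fixed target, under the uniform sphere distribution.
import numpy as np

def mc_splittability(d=3, eps=0.02, n_hyp=60, n_pts=500, rho=0.25, seed=0):
    rng = np.random.default_rng(seed)
    w_star = np.zeros(d)
    w_star[0] = 1.0

    # Sample separator normals at angle <= 4*pi*eps from w_star, i.e. within B(h_{w*}, 4*eps).
    ws = []
    for _ in range(n_hyp):
        u = rng.standard_normal(d)
        u -= u.dot(w_star) * w_star                  # direction orthogonal to w_star
        u /= np.linalg.norm(u)
        theta = rng.uniform(0, 4 * np.pi * eps)
        ws.append(np.cos(theta) * w_star + np.sin(theta) * u)
    ws = np.array(ws)

    # Edges of Q_epsilon via the distance d(h_w, h_v) = angle(w, v) / pi.
    dist = np.arccos(np.clip(ws @ ws.T, -1.0, 1.0)) / np.pi
    edges = [(i, j) for i in range(n_hyp) for j in range(i + 1, n_hyp) if dist[i, j] > eps]
    if not edges:
        return None

    # Query points drawn uniformly from the unit sphere; labels[n, k] = h_{w_k}(x_n).
    xs = rng.standard_normal((n_pts, d))
    xs /= np.linalg.norm(xs, axis=1, keepdims=True)
    labels = (xs @ ws.T >= 0).astype(int)

    def split(n):                                    # worst-case edge reduction when querying x_n
        surv = [sum(labels[n, i] == y and labels[n, j] == y for i, j in edges) for y in (0, 1)]
        return 1.0 - max(surv) / len(edges)

    return float(np.mean([split(n) >= rho for n in range(n_pts)]))

# The returned fraction estimates tau; the stated result predicts it stays bounded
# below by something on the order of eps / sqrt(d).
print(mc_splittability())
```
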
Conclusions
- Active learning is not always much better than passive learning.
- “Splittability” plays the role for active learning that the VC dimension plays for passive learning.
- We can use this framework to derive bounds for specific problems.