Examples of the VC Dimension prof. dr Arno Siebes Algorithmic Data - PowerPoint PPT Presentation

Examples of the VC Dimension prof. dr Arno Siebes Algorithmic Data Analysis Group Department of Information and Computing Sciences Universiteit Utrecht

Recall: VC dimension The previous time we introduced the VC dimension of a hypothesis class H as: The VC dimension of a set of hypotheses H is the size of the largest set C ⊆ X such that C is shattered by H . If H can shatter arbitrarily sized sets, its VC dimension is infinite. Where a finite set is shattered by H if |H C | = 2 | C | We now study the VC dimension of some finite classes, more in particular: classes of boolean functions.

Finite Hypothesis Classes If a finite hypothesis class H shatters a finite set C then |H| ≥ |H C | = 2 | C | This immediately implies that VC ( H ) ≤ log ( |H| ) Clearly, the VC dimension can be smaller ◮ consider threshold functions that can take thresholds in { 1 , . . . k } ◮ |H| = k , while VC ( H ) = 1 In other words, ◮ the difference between VC ( H ) and log ( |H| ) can be arbitrarily large ◮ but log ( |H| ) is never the smallest

Monotone Monomials Recall the class C n of boolean expressions over n literals. A smaller class C + n (sometimes denoted by M + n ) consists of the monotone (positive) monomials ◮ no negations , just conjunctions of the variables Clearly, a variable is either in such an expression or not. Hence, | C + n | = 2 n Hence, by the previous page: C + ≤ log (2 n ) = n � � VC n But, as we noted on the previous page, it could be smaller, a lot smaller. ◮ however, it isn’t. To prove that we are going to create a set of n elements that is shattered by C + n .

VC ( C + n ) = n Let S consist of all 0/1-vectors of length n that have exactly ◮ n − 1 1’s ◮ and 1 0. Denote by x i that element of S that has 0 for the i-th coordinate. ◮ if j = i : π j ( x i ) = 0 ◮ if j � = i : π j ( x i ) = 1 Let R ⊆ S be any subset of S . Define h R ∈ C + n as ◮ the conjunction of all variables u j such that x j �∈ R Then we have: � 1 if x ∈ R h R ( x ) = if x ∈ S \ R 0 That is, we have a classifier for any R ⊆ S : S is shattered. Hence, C + � � VC = n n

How About C n ? It is easy to see that ◮ VC ( C 1 ) = 2 the monomials ◮ x and ¬ x will do that for you. Moreover, since C + n ⊂ C n : VC ( C + n ) ≤ VC ( C n ) ◮ any set that can be shattered by C + n can be shattered by C n So, it may appear that by allowing negations we increase the VC dimension, because we now have that n ≤ VC ( C n ) ≤ log ( | C n | ) = log(3 n ) = n log(3) But, we don’t ◮ except for the case n = 1 No set of size n + 1 can be shattered by C n if n ≥ 2

VC ( C n ) = n Let S = { s 1 , . . . , s n +1 } be a set of n + 1, 0/1 vectors of length n , that is shattered by C n ◮ define S i = S \ { s i } Because S is shattered by C n there exists a m i ∈ C n such that ◮ S i = S ∩ m i , thus, ∀ i , j : m i ( s j ) = 0 ↔ i = j (0 = false) But this means that: ◮ each s i contains a component s i h ( i ) ◮ each m i contains a literal l k ( i ) ◮ such that l k ( i ) is false on s i h ( i ) , i.e., l k ( i ) ( s i h ( i ) ) = 0 Given that there are only n variables ◮ at least 2 of these literals l k (1) , . . . , l k ( i +1) ◮ must refer to the same variable, say l k (1) and l k (2) ◮ either l k (1) = l k (2) , then l k (1) ( s 1 h (1) ) = l k (1) ( s 2 h (2) ) = 0, i.e, m 1 ( s 1 ) = m 1 ( s 2 ) = 0. Contradiction ◮ or l k (1) = ¬ l k (2) , then either l k (1) or l k (2) is false on s 3 . Either m 1 ( s 3 ) = 0 or m 2 ( s 3 ) = 0. Again a contradiction

D (+) by Duality n Denote by ◮ D + n the set of all disjunctions over at most n variables, again no negations ◮ D n the set of disjunctions over at most n literals Note that for φ ∈ C n and x ∈ { 0 , 1 } n we have φ ( x ) ↔ ¬ φ ( ¬ x ) That is we have a duality between C n and D n and similarly between C + n and D + n By this duality we immediately have: ◮ VC ( D n ) = n and ◮ VC ( D + n ) = n In the end, it is just consistently switching ◮ 1’s to 0’s and vice versa

Monotone Formulas We have seen that both ◮ C + n , conjunctions of variables, has VC dimension n ◮ and D + n , disjunctions of variables, has VC dimension n The natural follow up question is ◮ what happens if we allow both conjunctions and disjunctions ◮ but no negations This is the class of monotone boolean formulas , ◮ sometimes denoted by M n ◮ note, without a +; perhaps because allowing negations as well yields the class of all boolean functions ◮ which we will discuss later The problem is thus: determine VC ( M n )

Sperner’s Theorem To compute the VC dimension of M n we need a result from combinatorics known as Sperner’s Theorem. Let X be a set of n elements ◮ a chain of subsets of X is a family of subsets A i such that ∅ ⊆ A 1 ⊂ A 2 ⊂ · · · ⊂ A k ⊆ X ◮ an antichain is a family of subsets F such that for any two elements A , B ∈ F : A �⊂ B ∧ B �⊂ A Sperner: if F is an antichain of X , then � n � | F | ≤ ⌊ n / 2 ⌋ Note, an antichain is also known as a Sperner family of subsets.

Maximal Chains Without loss of generality we assume that X = { 1 , . . . , n } . A maximal chain in X obviously has length n + 1 ∅ = A 0 ⊂ A 1 ⊂ · · · ⊂ A n = X Such a maximal chain puts a total order on the elements of X ◮ the smallest element is the single element of A 1 ◮ the one-but-smallest is the new element in A 2 ◮ and so on and so on Similarly, each total order on X defines a chain ◮ A 1 consists of the smallest element ◮ A 2 consists of the two smallest elements ◮ and so on and so on That is, the total number of maximal chains equals the number of permutations: n !

Maximal Chains and Antichains Let A ⊆ X , with | A | = k . A maximal chain that contains A ◮ i.e., A = A k in that chain consists of ◮ A maximal chain for the set A ◮ followed by a chain for X \ A ◮ each set in the latter chained is extended by the union with A , of course This means that there are k !( n − k )! maximal chains containing A . Note that if F is an antichain, than any chain can contain at most one element of F ◮ If A and B are in a chain, then either A ⊂ B or B ⊂ A ◮ If A and B are in F , then both A �⊂ B and B �⊂ A

Proving Sperner Recall that F is an antichain. The number of maximal chains that contain an element of F (and thus exactly 1) is A ∈ F n ! | A | !( n −| A | )! 1 ◮ � A ∈ F | A | !( n − | A | )! = � = n ! � A ∈ F ( n n ! | A | ) Because there are in total n ! maximal chains, we have 1 ◮ � | A | ) ≤ 1 A ∈ F ( n For binomial coefficients, the middle ones are the largest, hence 1 1 ◮ � ⌊ n / 2 ⌋ ) ≤ � | A | ) ≤ 1 A ∈ F n A ∈ F ( n ( Since | F | 1 ◮ � ⌊ n / 2 ⌋ ) = A ∈ F n n ( ( ⌊ n / 2 ⌋ ) We have that � n � | F | ≤ ⌊ n / 2 ⌋

Back to Monotone Formula’s Let S be the set of all assignments to { x 1 , . . . , x n } such that exactly ◮ ⌊ n / 2 ⌋ variables are mapped to 1 (true) n � � Clearly, | S | = ⌊ n / 2 ⌋ � a ◮ this is the definition of � b Now choose some 0/1 labelling on S ◮ i.e., choose an arbitrary function g : S → { 0 , 1 } ◮ we need to show that M n contains that function Define T (from true) by T = { A ∈ S | g ( A ) = 1 } We need to construct a monotone formula f such that f ( A ) = 1 ↔ A ∈ T ↔ g ( A ) = 1

Two Special Cases and f g maps al variables to 0 (false) ◮ iff S = ∅ Clearly, the function false ∈ M n . Hence we can assume S � = ∅ If n = 1, we have only 1 variable which is either mapped to 1 or to 0 ◮ a function that is obviously in M 1 Hence we may assume that n > 1 Let f be the monotone function � � f ( z 1 , . . . x n ) = x i A ∈ T i : A ( x i )=1 Given the assumptions made above, the disjunction isn’t empty and neither is the conjunction

n � � VC ( M n ) ≥ ⌊ n / 2 ⌋ Let B ∈ T , then the monomial � x i i : B ( x i )=1 is mapped to 1 by B and, thus, by f For B ∈ S \ T , note that each monomial � x i i : A ( x i )=1 in f assigns 1 to exactly ⌊ n / 2 ⌋ variables and 0 to the rest. Since B ∈ S \ T ◮ it assigns 0 to at least one of these ⌊ n / 2 ⌋ variables Which means that f assigns 0 to B , n � � In other words, M n shatters S which has elements. Hence ⌊ n / 2 ⌋ n VC ( M n ) ≥ � � . ⌊ n / 2 ⌋

n � � VC ( M n ) ≤ ⌊ n / 2 ⌋ n � � Let S be a set of assignments such that | S | > . For each ⌊ n / 2 ⌋ A ∈ S define: V A = { i | A ( x i ) = 1 } Because of the size of S, Sperner’s theorem tells us the V A ’a cannot be an antichain. Hence, there are A 1 , A 2 ∈ S such that A 1 ( x i ) = 1 → A 2 ( x i ) = 1 Since the functions in M n are monotone, this means: ∀ f ∈ M n : f ( A 1 ) = 1 → f ( A 2 ) = 1 In other words a labelling that maps A 1 to 1 and A 2 to 0 cannot n � � be constructed in M n . In other words: VC ( M n ) ≤ Hence ⌊ n / 2 ⌋ � � n VC ( M n ) = ⌊ n / 2 ⌋

Adding Negations In the case of C n and D n we saw that ◮ adding negation did not increase the VC dimension So, it is reasonable to expect that ◮ the VC dimension of all boolean functions is the same as that of M n This is, however, not true! The VC dimension of that set of hypotheses is strictly bigger. Computing the exact dimension is pretty hard ◮ in fact, I am not aware of an exact expression Bounding the dimension is easier ◮ for k -DNF we can compute a Θ bound For the general case, we need some extra machinery. But first we look at k -DNF

Examples of the VC Dimension prof. dr Arno Siebes Algorithmic Data - PowerPoint PPT Presentation

Examples of the VC Dimension prof. dr Arno Siebes Algorithmic Data Analysis Group Department of Information and Computing Sciences Universiteit Utrecht Recall: VC dimension The previous time we introduced the VC dimension of a hypothesis class

VC-dimension and Erd os-P osa property Nicolas Bousquet LIRMM, University Montpellier II

The Human Dimension Sue Manns Regional Director Pegasus The Human Dimension The Human

Dimension Reduction and Nearest Neighbor Search Advanced Algorithms Nanjing University, Fall

Dimension Reduction CSE 6242 / CX 4242 Thanks : Prof. Jaegul Choo , Dr. Ramakrishnan Kannan,

The Metric Dimension Problem. J. D az Monash U., May 2018 The Metric Dimension problem

Packing Dimension Results for Anisotropic Gaussian Random Fields Dongsheng Wu Department of

OPTIMAL CONTROL METHODS IN SHAPE OPTIMIZATION Dan TIBA INSTITUTE OF MATHEMATICS OF THE ROMANIAN

Developing the intercultural dimension Developing the intercultural dimension in teaching and

1 In this lecture we discuss Pansus conformal dimension . Definition (Pansu, 1989) Let X be a

Portland Vancouver ULTRA-Ex Ecological Dimension Social Dimension Riparian greenspaces Land use

Boolean Function Jean Vuillemin ENS, Paris Dimension d= D (f) Bound d |DD(f)| on all

Linear Dimension Reduction (in L 2 ) Linear Dimension Reduction: R D R d Goal: Find a low-dim.

Crossed product C -algebras and nuclear dimension Jianchao Wu University of M unster Aug

The theory of essential dimension was born in 1997 with the publication of On the essential

Technical Information Rack Slide Dimension Drawing and Usage Table Dimension Diagram Rack

Computing Singularity Dimension Mark Pollicott 12 December 2012 1 / 27 Introduction Self

One Sketch to Rule Them All: Rethinking Network Flow Monitoring with UnivMon Zaoxing Liu, Antonis

Shai Halevi IBM CRYPTO 2011 Wouldnt it be nice to be able to Encrypt my data before

Leading Between Meetings Session Four Overview Pray Read Scripture prayerfully, and pray

Modeling, Identification, & Deep Learning of Batteries Scott Moura Assistant Professor | eCAL

Why You Should Run TPC-DS: A Workload Analysis Meikel Poess Raghunath Othayoth Nambiar Dave

Recent Advances in Two-loop Superstrings Eric DHoker Institut des Hautes Etudes Scientifiques,

Multigrid methods for zero-sum two player stochastic games with mean reward Sylvie Detournay and

Introduction to Computational Topology Ahmed Abdelkader Guest Lecture CMSC 754 Spring 2020

Sambuz

Useful Links

Newsletter

Mail Us

Examples of the VC Dimension prof. dr Arno Siebes Algorithmic Data - PowerPoint PPT Presentation

Examples of the VC Dimension prof. dr Arno Siebes Algorithmic Data Analysis Group Department of Information and Computing Sciences Universiteit Utrecht Recall: VC dimension The previous time we introduced the VC dimension of a hypothesis class

VC-dimension and Erd os-P osa property Nicolas Bousquet LIRMM, University Montpellier II

The Human Dimension Sue Manns Regional Director Pegasus The Human Dimension The Human

Dimension Reduction and Nearest Neighbor Search Advanced Algorithms Nanjing University, Fall

Dimension Reduction CSE 6242 / CX 4242 Thanks : Prof. Jaegul Choo , Dr. Ramakrishnan Kannan,

The Metric Dimension Problem. J. D az Monash U., May 2018 The Metric Dimension problem

Packing Dimension Results for Anisotropic Gaussian Random Fields Dongsheng Wu Department of

OPTIMAL CONTROL METHODS IN SHAPE OPTIMIZATION Dan TIBA INSTITUTE OF MATHEMATICS OF THE ROMANIAN

Developing the intercultural dimension Developing the intercultural dimension in teaching and

1 In this lecture we discuss Pansus conformal dimension . Definition (Pansu, 1989) Let X be a

Portland Vancouver ULTRA-Ex Ecological Dimension Social Dimension Riparian greenspaces Land use

Boolean Function Jean Vuillemin ENS, Paris Dimension d= D (f) Bound d |DD(f)| on all

Linear Dimension Reduction (in L 2 ) Linear Dimension Reduction: R D R d Goal: Find a low-dim.

Crossed product C -algebras and nuclear dimension Jianchao Wu University of M unster Aug

The theory of essential dimension was born in 1997 with the publication of On the essential

Technical Information Rack Slide Dimension Drawing and Usage Table Dimension Diagram Rack

Computing Singularity Dimension Mark Pollicott 12 December 2012 1 / 27 Introduction Self

One Sketch to Rule Them All: Rethinking Network Flow Monitoring with UnivMon Zaoxing Liu, Antonis

Shai Halevi IBM CRYPTO 2011 Wouldnt it be nice to be able to Encrypt my data before

Leading Between Meetings Session Four Overview Pray Read Scripture prayerfully, and pray

Modeling, Identification, &amp; Deep Learning of Batteries Scott Moura Assistant Professor | eCAL

Why You Should Run TPC-DS: A Workload Analysis Meikel Poess Raghunath Othayoth Nambiar Dave

Recent Advances in Two-loop Superstrings Eric DHoker Institut des Hautes Etudes Scientifiques,

Multigrid methods for zero-sum two player stochastic games with mean reward Sylvie Detournay and

Introduction to Computational Topology Ahmed Abdelkader Guest Lecture CMSC 754 Spring 2020

Sambuz

Useful Links

Newsletter

Mail Us

Modeling, Identification, & Deep Learning of Batteries Scott Moura Assistant Professor | eCAL