Multiplicative Updates & the Winnow Algorithm
Machine Learning
Where are we?
• Still looking at linear classifiers
• Still looking at mistake-bound learning
• We have seen the Perceptron update rule (a small sketch follows this slide):
  – Receive an input (x_i, y_i)
  – If sgn(w_t^T x_i) ≠ y_i, update w_{t+1} ← w_t + y_i x_i
• The Perceptron update is an example of an additive weight update
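For concreteness, here is a minimal Python sketch of the additive, mistake-driven Perceptron update described above (the function and variable names are illustrative, not from the slides):

    import numpy as np

    def perceptron_update(w, x, y):
        """Mistake-driven additive update: if sgn(w.x) != y, add y*x to the weights."""
        if np.sign(w @ x) != y:      # prediction disagrees with the label
            w = w + y * x            # additive correction: shift w toward (or away from) x
        return w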
This lecture
• The Winnow algorithm
• Winnow mistake bound
• Generalizations
The setting
• Recall linear threshold units (a one-line sketch follows this slide):
  – Prediction = +1 if w^T x ≥ θ
  – Prediction = -1 if w^T x < θ
• The Perceptron mistake bound is (R/γ)^2
  – For Boolean functions with n attributes, R^2 = n, so the bound is essentially O(n)
• Motivating question: suppose we know that even though the number of attributes is n, the number of relevant attributes is k, which is much smaller than n. Can we improve the mistake bound?
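As a reference implementation of the prediction rule above (a sketch; theta stands for the threshold θ):

    import numpy as np

    def ltu_predict(w, x, theta):
        """Linear threshold unit: predict +1 if w.x >= theta, else -1."""
        return 1 if w @ x >= theta else -1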
Learning when irrelevant attributes abound: an example
• Suppose we know that the true concept is a disjunction of only a small number of features
  – Say only x_1 and x_2 are relevant
• The elimination algorithm will work (a sketch follows this slide):
  – Start with h(x) = x_1 ∨ x_2 ∨ ⋯ ∨ x_1024
  – Mistake on a negative example: eliminate from h all attributes that are 1 in that example
    • Suppose we have an example with x_100 = 1, x_301 = 1, label = -1
    • Simple update: just eliminate these two variables from the function
  – It will never make a mistake on a positive example. Why?
  – It makes O(n) updates
• But we know that our function is a k-disjunction (here k = 2)
  – There are only C(n, k) · 2^k ≈ n^k · 2^k such functions
  – The Halving algorithm will make O(k log n) mistakes
  – Can we realize this bound with an efficient algorithm?
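A minimal sketch of the elimination algorithm in Python (the set-of-candidate-indices representation of the hypothesis is an assumption made for illustration):

    def eliminate(n, examples):
        """Learn a monotone disjunction over n Boolean variables by elimination.

        Start with the disjunction of all n variables; on a mistake on a
        negative example, drop every variable that is 1 in that example.
        """
        hypothesis = set(range(n))               # h(x) = x_1 v x_2 v ... v x_n (0-indexed)
        for x, y in examples:                    # x is a 0/1 list, y is +1 or -1
            prediction = 1 if any(x[i] for i in hypothesis) else -1
            if prediction == 1 and y == -1:      # mistake on a negative example
                hypothesis -= {i for i in range(n) if x[i] == 1}
        return hypothesis

Relevant variables are never 1 in a true negative example, so they are never eliminated; that is why the algorithm never errs on a positive example, and each mistake removes at least one irrelevant variable, giving the O(n) bound.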
Multiplicative updates
• Let's use linear classifiers with a different update rule
  – Remember: the Perceptron will make O(n) mistakes on Boolean functions
• The idea: weights should be promoted and demoted via multiplicative, rather than additive, updates (the two styles are contrasted in the sketch below)
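The two update styles side by side, as a sketch (the fixed factor of 2 matches the slides; only features with x_i = 1 change under the multiplicative rule):

    def additive_update(w, x, y):
        """Perceptron-style: add y * x_i to each weight (inactive features get +0)."""
        return [wi + y * xi for wi, xi in zip(w, x)]

    def multiplicative_update(w, x, y):
        """Winnow-style: double (y = +1) or halve (y = -1) each active weight."""
        factor = 2.0 if y == 1 else 0.5
        return [wi * factor if xi == 1 else wi for wi, xi in zip(w, x)]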
The Winnow algorithm (Littlestone, 1988)
Given a training set D = {(x, y)}, x ∈ {0,1}^n, y ∈ {-1, +1}:
1. Initialize: w = (1, 1, ..., 1) ∈ ℝ^n, θ = n
2. For each training example (x, y):
   – Predict y' = sgn(w^T x − θ)
   – If y = +1 and y' = -1 (promotion):
     • Update w_i ← 2 w_i, only for those features x_i that are 1
   – Else if y = -1 and y' = +1 (demotion):
     • Update w_i ← w_i / 2, only for those features x_i that are 1
(A transcription into code follows this slide.)
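A direct transcription of Winnow into Python (a sketch; the streaming interface over (x, y) pairs is an assumption):

    def winnow(n, examples):
        """Winnow (Littlestone, 1988) for x in {0,1}^n and y in {-1,+1}."""
        w = [1.0] * n                       # step 1: all weights start at 1
        theta = float(n)                    # fixed threshold θ = n
        for x, y in examples:               # step 2: one pass over the stream
            y_pred = 1 if sum(wi * xi for wi, xi in zip(w, x)) >= theta else -1
            if y == 1 and y_pred == -1:     # false negative: promotion
                w = [wi * 2 if xi == 1 else wi for wi, xi in zip(w, x)]
            elif y == -1 and y_pred == 1:   # false positive: demotion
                w = [wi / 2 if xi == 1 else wi for wi, xi in zip(w, x)]
        return w

Note that only the weights of active features (x_i = 1) are ever touched, and the threshold θ never changes.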
Example run of the algorithm
Target: f = x_1 ∨ x_2 ∨ x_1023 ∨ x_1024
Initialize: θ = 1024, w = (1, 1, 1, ..., 1)

Example                          Prediction   Error?   Weights
x = (1,1,1,...,1),     y = +1    w^T x ≥ θ    No       w = (1, 1, 1, 1, ..., 1)
x = (0,0,0,...,0),     y = -1    w^T x < θ    No       w = (1, 1, 1, 1, ..., 1)
x = (0,0,1,1,1,...,0), y = -1    w^T x < θ    No       w = (1, 1, 1, 1, ..., 1)
x = (1,0,0,...,0),     y = +1    w^T x < θ    Yes      w = (2, 1, 1, 1, ..., 1)
x = (0,1,0,...,0),     y = +1    w^T x < θ    Yes      w = (2, 2, 1, 1, ..., 1)
x = (1,1,1,...,0),     y = +1    w^T x < θ    Yes      w = (4, 4, 2, 1, ..., 1)
x = (1,0,0,...,1),     y = +1    w^T x < θ    Yes      w = (8, 4, 2, 1, ..., 2)
...                              ...          ...      w = (512, 256, 512, 512, ..., 512)
x = (0,0,1,1,...,0),   y = -1    w^T x ≥ θ    Yes      w = (512, 256, 256, 256, ..., 512)
x = (0,0,0,...,1),     y = +1    w^T x < θ    Yes      w = (512, 256, 256, 256, ..., 1024)

The final weight vector could be w = (1024, 1024, 128, 32, ..., 1024, 1024)
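Using the winnow sketch from earlier, a short driver can reproduce the first few rows of this trace (the helper names and the truncated example stream are illustrative):

    n = 1024
    relevant = [0, 1, 1022, 1023]           # f = x_1 v x_2 v x_1023 v x_1024, 0-indexed

    def example(on_bits, y):
        """Build a length-n 0/1 vector with the given bits set, paired with its label."""
        x = [0] * n
        for i in on_bits:
            x[i] = 1
        return x, y

    stream = [
        example(range(n), +1),              # all-ones positive: w.x = 1024 >= θ, no mistake
        example([0], +1),                   # only x_1 on: w.x = 1 < θ, promotion doubles w_1
        example([1], +1),                   # only x_2 on: promotion doubles w_2
        example([2, 3], -1),                # two irrelevant bits on: w.x = 2 < θ, no mistake
    ]
    w = winnow(n, stream)
    print(w[0], w[1], w[2])                 # 2.0 2.0 1.0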