Parallel Online Learning


  1. Parallel Online Learning
     Daniel Hsu, Nikos Karampatziakis, John Langford
     University of Pennsylvania / Cornell University / Yahoo! Research / Rutgers University
     Workshop on Learning on Cores, Clusters and Clouds

  2. Online Learning
     ◮ The learner gets the next example x_t, makes a prediction p_t, receives the actual label y_t, suffers loss ℓ(p_t, y_t), and updates itself.
     ◮ Predictions and updates are simple and fast: p_t = w_t^⊤ x_t and w_{t+1} = w_t − η_t ∇ℓ(p_t, y_t) (sketched below).
     ◮ Online gradient descent asymptotically attains optimal regret.
     ◮ Online learning scales well . . .
     ◮ . . . but it is a sequential algorithm.
     ◮ What if we want to train on huge datasets?
     ◮ We investigate ways of distributing predictions and updates while minimizing communication.
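A minimal sketch of the sequential pass described above, assuming a linear predictor with squared loss and a decaying step size η_t = η/√t; the loss choice and the names (`stream`, `eta`) are illustrative, not from the slides.

```python
import numpy as np

def online_gradient_descent(stream, dim, eta=0.1):
    """Sequential online learning: predict, suffer loss, update.

    `stream` yields (x_t, y_t) pairs. Squared loss is assumed purely for
    illustration, so the gradient of l(p_t, y_t) with respect to the
    weights is (p_t - y_t) * x_t.
    """
    w = np.zeros(dim)
    for t, (x, y) in enumerate(stream, start=1):
        p = w @ x                        # prediction: p_t = w_t^T x_t
        step = eta / np.sqrt(t)          # decaying step size eta_t
        w = w - step * (p - y) * x       # w_{t+1} = w_t - eta_t * grad
    return w
```

The 1/√t step size is the standard choice behind the regret guarantee mentioned on the slide; note that every update depends on the previous one, which is why the plain algorithm is sequential.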

  3. Delay
     ◮ Parallelizing online learning leads to delay problems: gradients end up being applied to weights that are already several updates stale.
     ◮ Temporally correlated or adversarial examples make this delay especially costly.
     ◮ We investigate no-delay and bounded-delay schemes (a bounded-delay sketch follows).
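To make the delay issue concrete, the following sketch (an illustration, not the scheme from the talk) applies each gradient a fixed number of steps after it is computed, as happens when several workers share one weight vector asynchronously; all names and the squared loss are assumptions.

```python
from collections import deque
import numpy as np

def delayed_gradient_descent(stream, dim, delay=4, eta=0.1):
    """Online gradient descent where each gradient is applied `delay`
    steps after it was computed (linear predictor, squared loss).

    delay=0 recovers the sequential algorithm; larger values model
    workers whose gradients arrive late, which is exactly what hurts on
    temporally correlated or adversarial streams.
    """
    w = np.zeros(dim)
    pending = deque()                        # gradients not yet applied
    for x, y in stream:
        p = w @ x                            # prediction with (possibly stale) weights
        pending.append((p - y) * x)          # gradient computed now ...
        if len(pending) > delay:
            w -= eta * pending.popleft()     # ... applied `delay` steps later
    while pending:                           # flush leftover gradients at the end
        w -= eta * pending.popleft()
    return w
```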

  4. Tree Architectures
     [Figure: a two-level tree. Leaf nodes see feature subsets x_{F_1}, x_{F_2}, x_{F_3}, x_{F_4} and emit predictions ŷ_{1,1}, ŷ_{1,2}, ŷ_{1,3}, ŷ_{1,4}; internal nodes combine these into ŷ_{2,1} and ŷ_{2,2}; the root outputs the final prediction ŷ.]

  5. Local Updates
     Each node in the tree:
     ◮ Computes its prediction p_{i,j} based on its weights and inputs
     ◮ Sends ŷ_{i,j} = σ(p_{i,j}) to its parent¹
     ◮ Updates its weights based on ∇ℓ(p_{i,j}, y)
     No delay. Representation power: between Naive Bayes and centralized linear model (a per-node sketch follows).
     ¹ The nonlinearity introduced by σ has an interesting effect
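A sketch of what one leaf node might do per example, assuming a sigmoid σ, logistic loss, and labels y ∈ {0, 1}; the class and parameter names are illustrative, not from the talk.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class LeafNode:
    """One node of the tree: it only sees its own feature block x_F.

    Per example it (1) predicts p_ij = w . x_F, (2) sends
    yhat_ij = sigmoid(p_ij) up to its parent, and (3) immediately
    updates its own weights against the true label y (logistic loss
    assumed), so it never waits for feedback from the root.
    """
    def __init__(self, dim, eta=0.1):
        self.w = np.zeros(dim)
        self.eta = eta

    def process(self, x_block, y):
        p = self.w @ x_block                 # local prediction p_{i,j}
        yhat = sigmoid(p)                    # nonlinearity sent to the parent
        grad = (yhat - y) * x_block          # grad of logistic loss l(p_{i,j}, y) wrt w
        self.w -= self.eta * grad            # local update, no delay
        return yhat                          # parent treats yhat as an input feature
```

An internal node would apply the same logic to the vector of ŷ values received from its children; as the slide notes, this fully local scheme avoids delay at some cost in representation power.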

  6. Global Updates
     ◮ The local update can help or hurt.
     ◮ More communication improves representation power.
     ◮ Delayed global training
     ◮ Delayed backprop
     For details and experiments, come see the poster.
