Frontiers and Open Challenges (CS330)
Logistics: Final project presentations are next week; the schedule is on Piazza. The final project report is due next Friday at midnight. This is the last lecture! We'll leave time for course evaluations at the end.
Today: What doesn't work very well? (and how might we fix it)
- Meta-learning for addressing distribution shift: capturing equivariances with meta-learning; adapting to distribution shift
- What does it take to run multi-task & meta-RL across distinct tasks? What set of distinct tasks do we train on? What challenges arise?
- Open challenges
Why address distribution shift?
Our current paradigm vs. our current reality. [Diagram: ML research assumes a fixed dataset -> model -> evaluation pipeline; the real world of stocks, supply & demand, and robots keeps changing.] Can our algorithms handle the changing world?
How does industry cope? Chip Huyen on misperceptions about ML production: the way our techniques are being used is not the way we intend them to be used.
One solution to distribution shift: build structure into the model to solve the problem, e.g. convolutions. + Great when we know the structure & how to build it in! - Not great when we don't. Can we discover equivariant and invariant structure (i.e. symmetries) via meta-learning?
Does MAML already do this? MAML can learn equivariant initial features, but equivariance may not be preserved in the gradient update! Goal: can we decompose weights into equivariant structure & corresponding parameters? If so, we can update only the parameters in the inner loop, retaining equivariance.
How are equivariances represented in neural networks? Let's look at an example: a 1D convolution layer, represented as a fully connected (FC) layer. Zhou, Knowles, Finn. Meta-Learning Symmetries by Reparametrization. Under review '20.
Representing equivariance by reparametrization. Key idea: reparametrize the weight matrix W as a product of an underlying filter-sharing matrix U, which captures the symmetries, and a vector v of underlying shared filter parameters. For a 1D convolution represented as an FC layer, U is a 0/1 sharing pattern that places each filter weight at its shifted positions. Theoretically, this can directly represent a decoupled equivariant sharing pattern + filter parameters for all G-convolutions with finite group G. Zhou, Knowles, Finn. Meta-Learning Symmetries by Reparametrization. Under review '20.
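The sharing-matrix idea can be made concrete in a few lines of NumPy. This is a minimal sketch: the names U, v, W follow the slide, but the construction below (circular sharing for a length-2 filter on 4 units) is illustrative, not the paper's code.

```python
import numpy as np

n, k = 4, 2                      # layer width, filter length
v = np.array([1.0, -1.0])        # shared filter parameters
U = np.zeros((n * n, k))         # 0/1 sharing ("symmetry") matrix

# Place filter weight j at column (i + j) mod n of row i: circular sharing.
for i in range(n):
    for j in range(k):
        U[i * n + (i + j) % n, j] = 1.0

# The FC weight matrix is reconstructed as vec(W) = U @ v.
W = (U @ v).reshape(n, n)

# Each row of W is a shifted copy of v: W implements a circular 1D
# convolution even though it is stored as a dense FC weight matrix.
for i in range(n):
    assert np.allclose(W[i], np.roll(W[0], i))
```

Because W depends on v only through U, changing v can never break whatever sharing pattern U encodes, which is exactly the decoupling the slide describes.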
Meta-learning equivariance: meta-learning symmetries by reparametrization (MSR). Inner loop: only update the parameters v -> v', keeping the equivariance (sharing matrix U) fixed. Outer loop: learn the equivariance U and the initial parameters v. Important assumption: some symmetries are shared by all tasks. Zhou, Knowles, Finn. Meta-Learning Symmetries by Reparametrization. Under review '20.
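The inner-loop property can be checked in a toy example. Here I hand-code a translation-symmetry U (in MSR itself U is meta-learned in the outer loop); the point is that one gradient step on v alone leaves the weights circulant, while a raw gradient step on W would not.

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 6, 3

# Hand-coded circular (translation) sharing matrix, as a stand-in for
# a meta-learned U.
U = np.zeros((n * n, k))
for i in range(n):
    for j in range(k):
        U[i * n + (i + j) % n, j] = 1.0

def is_circulant(W):
    """True iff every row is a circular shift of the first row."""
    return all(np.allclose(W[i], np.roll(W[0], i)) for i in range(n))

v = rng.normal(size=k)
X = rng.normal(size=(n, 16))     # a batch of inputs (columns)
Y = rng.normal(size=(n, 16))     # regression targets

# Mean-squared-error gradient w.r.t. W for predictions W @ X.
W = (U @ v).reshape(n, n)
dW = (2.0 / X.shape[1]) * ((W @ X - Y) @ X.T)

# Inner loop: update only v; the chain rule through vec(W) = U @ v
# gives dL/dv = U.T @ vec(dL/dW).
v_adapted = v - 0.1 * (U.T @ dW.reshape(-1))
W_adapted = (U @ v_adapted).reshape(n, n)

# Equivariance is preserved through the update...
assert is_circulant(W_adapted)
# ...whereas a raw gradient step on W itself destroys the structure.
assert not is_circulant(W - 0.1 * dW)
```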
Can we recover convolutions from translationally equivariant data? Metric: mean-squared error on held-out test tasks. MAML-X: X corresponds to the architecture (fully connected, locally connected, convolution). MSR-FC: MSR applied to fully connected layer weights W. [Figure: the recovered weight matrix.] Zhou, Knowles, Finn. Meta-Learning Symmetries by Reparametrization. Under review '20.
Can we recover something better than convolutions? …from data with partial translation symmetry (k: rank of a locally connected layer), and …from data with translation + rotation + reflection symmetry. MSR-Conv: MSR applied to convolution layer weights W. Zhou, Knowles, Finn. Meta-Learning Symmetries by Reparametrization. Under review '20.
Can we learn symmetries from augmented data? Baking data augmentation into the architecture / update rule. Zhou, Knowles, Finn. Meta-Learning Symmetries by Reparametrization. Under review '20.
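One way to sketch this idea: rather than applying an augmentation (here, circular shifts) to individual training examples, each example is expanded into a small task of transformed copies, so that meta-learning over such tasks can absorb the symmetry into the learned sharing structure. The helper name `augment_to_task` is mine, not from the paper.

```python
import numpy as np

def augment_to_task(x, y, n_shifts=4):
    """Turn one (input, target) pair into a tiny 'task' of circularly
    shifted copies. Meta-training on tasks built this way pushes the
    learned sharing matrix toward translation symmetry, i.e. the
    augmentation gets baked into the architecture / update rule."""
    xs = np.stack([np.roll(x, s) for s in range(n_shifts)])
    ys = np.stack([np.roll(y, s) for s in range(n_shifts)])  # equivariant targets
    return xs, ys

xs, ys = augment_to_task(np.arange(6.0), np.arange(6.0) ** 2)
assert xs.shape == (4, 6)
```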
MSR provides a framework for understanding the interplay of features & structure in meta-learning. Zhou, Knowles, Finn. Meta-Learning Symmetries by Reparametrization. Under review '20.
Today: What doesn't work very well? (and how might we fix it)
- Meta-learning for addressing distribution shift: capturing equivariances with meta-learning; adapting to distribution shift
- What does it take to run multi-task & meta-RL across distinct tasks? What set of distinct tasks do we train on? What challenges arise?
- Open challenges
What kind of distribution shift should we adapt to? We'll now focus on group shift: training data come from p(x, y | z) with z ~ p_tr(z), and test data come from p(x, y | z) with z ~ p_ts(z), where z is a categorical group variable, e.g. user, location, or time of day (it can be derived from metadata). Group shift can capture label shift and most covariate shift, and it captures problems like federated learning. Group DRO (distributionally robust optimization; Ben-Tal et al. '13, Duchi et al. '16): form an adversarial distribution q(z) over groups. + Can enable robust solutions. + Less pessimistic than adversarial robustness. - Often sacrifices average/empirical group performance.
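The group DRO objective can be sketched in a few lines. This is a minimal illustration, not a full training loop: the function names are mine, and `dro_weights` uses the exponentiated-gradient style update on q(z) popularized by Sagawa et al.

```python
import numpy as np

def dro_weights(group_losses, q, step=1.0):
    """Adversarial update of the group distribution q(z): multiplicatively
    upweight high-loss groups, then renormalize."""
    q = q * np.exp(step * np.asarray(group_losses))
    return q / q.sum()

def dro_objective(group_losses, q):
    """Expected loss under the adversarial group distribution q(z)."""
    return float(np.dot(q, group_losses))

q = np.ones(3) / 3               # start uniform over 3 groups
losses = [0.2, 0.5, 1.5]         # per-group training losses

for _ in range(5):
    q = dro_weights(losses, q)

# q concentrates on the worst group, so the weighted objective approaches
# the worst-group loss: robust, but pessimistic about average performance.
assert q[2] > 0.9
```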
Can we aim to adapt instead of aiming for robustness? At test time, we receive unlabeled data from the test sub-distribution (e.g. a new user, a different time of day, a new place); we adapt the model & infer labels. Assumption: test inputs from one group are available in a batch or streaming. Adaptive risk minimization (ARM). Zhang*, Marklund*, Dhawan, Gupta, Levine, Finn. Adaptive Risk Minimization: A Meta-Learning Approach for Tackling Group Shift. '20.
Adaptive risk minimization (ARM): 1. Construct sub-distributions of the training data. 2. Train for adaptation to those sub-distributions. How do we adapt with unlabeled data? Via MAML with a learned loss, or via meta-learning with a context variable. Simplest setting: the context is just the batch norm (BN) statistics. Zhang*, Marklund*, Dhawan, Gupta, Levine, Finn. Adaptive Risk Minimization: A Meta-Learning Approach for Tackling Group Shift. '20.
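The "context = BN statistics" setting can be sketched in a few lines. Illustrative only: `bn_adapt` is my name, and this omits BN's learned affine parameters; the point is that adaptation requires no labels, just the test batch's own statistics.

```python
import numpy as np

def bn_adapt(features, eps=1e-5):
    """Normalize a batch of features using its own statistics. For a test
    batch drawn from one group, this adapts the model to that group's
    input distribution without any labels, in place of the (mismatched)
    training-set statistics."""
    mu = features.mean(axis=0)
    var = features.var(axis=0)
    return (features - mu) / np.sqrt(var + eps)

# A shifted test group (e.g. a new user whose features are offset and
# rescaled): per-batch normalization removes the group-specific shift.
batch = np.random.default_rng(0).normal(loc=5.0, scale=2.0, size=(64, 8))
z = bn_adapt(batch)
assert abs(z.mean()) < 1e-6 and abs(z.var() - 1.0) < 1e-2
```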
Experimental comparisons. ERM: standard deep network training. DRNN: distributional robustness (Sagawa, Koh et al. ICLR '20). UW: ERM, but upweighting groups to the uniform distribution. ARM: adaptive risk minimization, in three variants: ARM-CML (adapt with a context variable), ARM-BN (adapt using batch norm statistics), ARM-LL (adapt with a learned loss).
Experiment 1: Federated Extended MNIST (Cohen et al. 2017, Caldas et al. 2019). Distribution shift: adapt to new users with only unlabeled data. ARM achieves +5% improvement in average accuracy and +10% improvement in worst-case accuracy over the comparisons: ERM, UW, DRNN (Sagawa, Koh et al. ICLR '20), and q-FedAvg (Li et al. 2020), a federated learning method. Zhang*, Marklund*, Dhawan, Gupta, Levine, Finn. Adaptive Risk Minimization: A Meta-Learning Approach for Tackling Group Shift. '20.
Experiment 2: CIFAR-C and TinyImageNet-C (Hendrycks & Dietterich, 2019). Distribution shift: adapt to new image corruptions (train using 56 corruptions, test using 22 disjoint corruptions). ARM achieves +3-10% improvement in average accuracy and +8-21% improvement in worst-case accuracy over ERM, UW, and DRNN (Sagawa, Koh et al. ICLR '20). Zhang*, Marklund*, Dhawan, Gupta, Levine, Finn. Adaptive Risk Minimization: A Meta-Learning Approach for Tackling Group Shift. '20.
Today: What doesn't work very well? (and how might we fix it) Meta-learning for addressing distribution shift: capturing equivariances with meta-learning; adapting to distribution shift.
Takeaways:
- Preliminary evidence that meta-learning can capture equivariances via reparametrized weight matrices.
- Adaptation / fine-tuning without labeled target data is possible via adaptive risk minimization.
Today: What doesn't work very well? (and how might we fix it)
- Meta-learning for addressing distribution shift: capturing equivariances with meta-learning; adapting to distribution shift
- What does it take to run multi-task & meta-RL across distinct tasks? What set of distinct tasks do we train on? What challenges arise?
- Open challenges
Have MAML, RL², PEARL, and DREAM accomplished our goal of making policy adaptation fast? Sort of… Can we adapt to entirely new tasks? We need a broad (and not sparse) distribution of tasks for meta-training, with the meta-train task distribution = the meta-test task distribution. A few options: Brockman et al. OpenAI Gym. 2016. Bellemare et al. Atari Learning Environment. 2016. Fan et al. SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark. CoRL 2018.
Our desiderata: 50+ qualitatively distinct tasks; shaped reward functions & success metrics; all tasks individually solvable (to allow us to focus on the multi-task / meta-RL component); a unified state & action space and environment (to facilitate transfer). The Meta-World benchmark. T Yu, D Quillen, Z He, R Julian, K Hausman, C Finn, S Levine. Meta-World. CoRL '19.
Results: meta-learning algorithms seem to struggle… even on the 45 meta-training tasks! Multi-task RL algorithms also struggle. T Yu, D Quillen, Z He, R Julian, K Hausman, C Finn, S Levine. Meta-World. CoRL '19.
Why the poor results? An exploration challenge? No: all tasks are individually solvable. Data scarcity? No: all methods are given a budget with plenty of samples. Limited model capacity? No: all methods have plenty of capacity, and training models independently performs the best. Our conclusion: it must be a multi-task optimization challenge.