The Big Problem with Meta-Learning and How Bayesians Can Fix It
Chelsea Finn, Stanford


  1. The Big Problem with Meta-Learning and How Bayesians Can Fix It. Chelsea Finn, Stanford

  2. Training data: paintings by Braque and by Cezanne. Test datapoint: by Braque or Cezanne?

  3. How did you accomplish this? Through previous experience.

  4. How might you get a machine to accomplish this task? Modeling image formation and geometry → SIFT features, HOG features + SVM → fine-tuning from ImageNet features, domain adaptation from other painters → ??? (fewer human priors, more data-driven priors; greater success). Can we explicitly learn priors from previous experience that lead to efficient downstream learning? Can we learn to learn?

  5. Outline 1. Brief overview of meta-learning 2. The problem: peculiar, lesser-known, yet ubiquitous 3. Steps towards a solution

  6. How does meta-learning work? An example. Given 1 example of each of 5 classes (training data), classify new examples (test set).

  7. How does meta-learning work? An example. During meta-training, tasks are built from the meta-training classes; at meta-testing (task T_test), given 1 example of each of 5 new classes (training data), classify new examples (test set).

  8. How does meta-learning work? One approach: parameterize the learner by a neural network, y^ts = f(𝒟^tr, x^ts; θ). (Hochreiter et al. '01, Santoro et al. '16, many others)
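A minimal sketch of this black-box family, y^ts = f(𝒟^tr, x^ts; θ): the support set is encoded and fed to the network alongside the query point, so all adaptation happens inside the forward pass. The module name, layer sizes, and mean-pooled support encoding below are my placeholders, not the talk's architecture.

```python
import torch
import torch.nn as nn

class BlackBoxLearner(nn.Module):
    """y_ts = f(D_tr, x_ts; theta): one network reads the task's training
    data together with the query input and emits the prediction."""

    def __init__(self, input_dim, n_classes, hidden=128):
        super().__init__()
        # Encode each (x, y) pair from the support set D_tr.
        self.pair_encoder = nn.Sequential(
            nn.Linear(input_dim + n_classes, hidden), nn.ReLU())
        # Predict from the aggregated support encoding plus the query x_ts.
        self.predictor = nn.Sequential(
            nn.Linear(hidden + input_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_classes))

    def forward(self, support_x, support_y_onehot, query_x):
        # support_x: (K, input_dim), support_y_onehot: (K, n_classes), query_x: (input_dim,)
        pairs = torch.cat([support_x, support_y_onehot], dim=-1)
        task_embedding = self.pair_encoder(pairs).mean(dim=0)  # permutation-invariant pooling
        return self.predictor(torch.cat([task_embedding, query_x], dim=-1))
```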

  9. How does meta-learning work? Another approach: embed optimization inside the learning process, y^ts = f(𝒟^tr, x^ts; θ) where f includes a gradient step ∇_θ ℒ on 𝒟^tr. (Maclaurin et al. '15, Finn et al. '17, many others)
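A hedged sketch of the optimization-embedded approach in the spirit of MAML (Finn et al. '17): the prediction function runs one inner gradient step on 𝒟^tr before predicting on x^ts. The function name, step size, and the use of `torch.func.functional_call` are illustrative choices, not the talk's code.

```python
import torch
import torch.nn.functional as F
from torch.func import functional_call

def adapted_prediction(model, theta, support_x, support_y, query_x, inner_lr=0.01):
    """y_ts = f(D_tr, x_ts; theta), where f embeds one gradient step on D_tr.
    theta is a dict of parameter tensors, e.g. dict(model.named_parameters())."""
    # Inner loss on the task's training data, evaluated at the meta-parameters theta.
    support_logits = functional_call(model, theta, (support_x,))
    inner_loss = F.cross_entropy(support_logits, support_y)
    grads = torch.autograd.grad(inner_loss, list(theta.values()), create_graph=True)
    # phi = theta - alpha * grad_theta L(D_tr); create_graph keeps the outer loop differentiable.
    phi = {name: p - inner_lr * g for (name, p), g in zip(theta.items(), grads)}
    # Predict on the query point with the adapted parameters phi.
    return functional_call(model, phi, (query_x,))
```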

  10. The Bayesian perspective: meta-learning ↔ learning priors p(φ | θ) from data. (Grant et al. '18, Gordon et al. '18, many others)
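One way to make this correspondence concrete is the hierarchical-Bayes view from the cited papers: θ parameterizes a prior over per-task parameters φ_i, and prediction for a new task marginalizes φ under its posterior given that task's training data. The notation below is a schematic, not taken from the slide itself.

```latex
% Meta-learning as learning a prior over per-task parameters phi_i.
\begin{gather*}
\theta \sim p(\theta), \qquad
\phi_i \sim p(\phi_i \mid \theta), \qquad
\mathcal{D}_i \sim p(\mathcal{D}_i \mid \phi_i) \\
% Prediction for a new task marginalizes phi under its posterior:
p(y^{ts} \mid x^{ts}, \mathcal{D}^{tr}, \theta)
  = \int p(y^{ts} \mid x^{ts}, \phi)\, p(\phi \mid \mathcal{D}^{tr}, \theta)\, d\phi
\end{gather*}
```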

  11. Outline 1. Brief overview of meta-learning 2. The problem: peculiar, lesser-known, yet ubiquitous 3. First steps towards a solution

  12. How we construct tasks for meta-learning. Randomly assign class labels to image classes for each task → tasks are mutually exclusive. Algorithms must use the training data 𝒟^tr to infer the label ordering for the test input x^ts.
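A sketch of this episode construction, assuming a hypothetical `images_by_class` dict from class name to examples; the N-way/K-shot sizes are placeholders. The key point is that each episode samples its own class order, so the class-to-label assignment changes from task to task (mutually exclusive tasks); fixing that order across episodes gives the non-mutually-exclusive setting of the next slide.

```python
import random

def sample_task(images_by_class, n_way=5, k_shot=1, n_query=1):
    """Build one few-shot episode with a per-episode random label assignment."""
    # random.sample returns the chosen classes in a fresh random order,
    # so label 0..n_way-1 means something different in every episode.
    classes = random.sample(list(images_by_class), n_way)
    support, query = [], []
    for label, cls in enumerate(classes):
        examples = random.sample(images_by_class[cls], k_shot + n_query)
        support += [(x, label) for x in examples[:k_shot]]
        query += [(x, label) for x in examples[k_shot:]]
    return support, query  # D_tr and the test datapoints for this task
```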

  13. What if label order is consistent? Tasks are non-mutually exclusive: a single function can solve all tasks. The network can simply learn to classify inputs, irrespective of 𝒟^tr.

  14. The network can simply learn to classify inputs, irrespective of 𝒟^tr.

  15. What if label order is consistent? For new image classes (task T_test), the network can't make predictions without 𝒟^tr.

  16. Is this a problem? - No: for image classification, we can just shuffle labels* - No, if we see the same image classes as in training (& don't need to adapt at meta-test time) - But, yes, if we want to be able to adapt with data for new tasks.

  17. Another example: meta-training tasks "hammer", "close drawer", "stack", …, T_50 "close box"; meta-test task T_test. If you tell the robot the task goal, the robot can ignore the trials. T Yu, D Quillen, Z He, R Julian, K Hausman, C Finn, S Levine. Meta-World. CoRL '19

  18. Another example: the model can memorize the canonical orientations of the training objects. Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. '19

  19. Can we do something about it?

  20. If tasks are mutually exclusive: a single function cannot solve all tasks (i.e. due to label shuffling, hiding information). If tasks are non-mutually exclusive: a single function y^ts = f_θ(𝒟^tr_i, x^ts) can solve all tasks → multiple solutions to the meta-learning problem. One solution: memorize canonical pose info in θ & ignore 𝒟^tr_i. Another solution: carry no info about canonical pose in θ, acquire it from 𝒟^tr_i. An entire spectrum of solutions based on how information flows. Suggests a potential approach: control information flow. Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. '19

  21. If tasks are non-mutually exclusive: a single function y^ts = f_θ(𝒟^tr_i, x^ts) can solve all tasks → multiple solutions to the meta-learning problem. One solution: memorize canonical pose info in θ & ignore 𝒟^tr_i. Another solution: carry no info about canonical pose in θ, acquire it from 𝒟^tr_i. An entire spectrum of solutions based on how information flows. One option: max I(ŷ^ts; 𝒟^tr | x^ts). Meta-regularization: minimize the meta-training loss + information in θ, i.e. ℒ(θ, 𝒟^meta-train) + β D_KL(q(θ; θ_μ, θ_σ) ‖ p(θ)). Places precedence on using information from 𝒟^tr over storing info in θ. Can combine with your favorite meta-learning algorithm. Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. '19
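A sketch of the meta-regularization objective on this slide, under the assumption of a diagonal Gaussian q(θ; θ_μ, θ_σ) and a standard normal prior p(θ) = 𝒩(0, I); the function names and the β value are placeholders.

```python
import torch

def sample_theta(theta_mu, theta_log_sigma):
    """Reparameterized draw theta ~ q(theta; mu, sigma), used to compute the task loss."""
    return theta_mu + torch.exp(theta_log_sigma) * torch.randn_like(theta_mu)

def meta_regularized_loss(task_loss, theta_mu, theta_log_sigma, beta=1e-4):
    """L(theta, D_meta-train) + beta * KL( q(theta; mu, sigma) || N(0, I) )."""
    # Closed-form KL between a diagonal Gaussian and a standard normal prior:
    # 0.5 * sum(sigma^2 + mu^2 - 1 - log sigma^2)
    kl = 0.5 * torch.sum(
        torch.exp(2 * theta_log_sigma) + theta_mu ** 2
        - 1.0 - 2 * theta_log_sigma)
    return task_loss + beta * kl
```

The penalty limits how many bits about the meta-training tasks can be stored in θ, which is what pushes the learner toward extracting that information from 𝒟^tr instead.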

  22. Results: Omniglot without label shuffling ("non-mutually-exclusive" Omniglot) and the pose prediction task (and it's not just as simple as standard regularization). TAML: Jamal & Qi. Task-Agnostic Meta-Learning for Few-Shot Learning. CVPR '19. Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. '19

  23. Does meta-regularization lead to better generalization? Let P(θ) be an arbitrary distribution over θ that doesn't depend on the meta-training data (e.g. P(θ) = 𝒩(θ; 0, I)). For MAML, with probability at least 1 − δ, for all θ_μ, θ_σ: generalization error ≤ error on the meta-training set + meta-regularization term. With a Taylor expansion of the RHS + a particular value of β → recover the MR-MAML objective. Proof: draws heavily on Amit & Meir '18. Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. '19
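The bound on this slide is only sketched schematically here, in the PAC-Bayes style of Amit & Meir '18 that the proof draws on; the exact constants, complexity term, and the sample count n are deferred to the paper.

```latex
% Schematic PAC-Bayes-style bound; exact constants and complexity term are in the paper.
\[
\underbrace{\mathrm{er}\big(q(\theta;\theta_\mu,\theta_\sigma)\big)}_{\text{generalization error}}
\;\le\;
\underbrace{\widehat{\mathrm{er}}\big(q(\theta;\theta_\mu,\theta_\sigma)\big)}_{\text{error on the meta-training set}}
\;+\;
\underbrace{O\!\left(\sqrt{\frac{D_{\mathrm{KL}}\big(q(\theta;\theta_\mu,\theta_\sigma)\,\big\|\,P(\theta)\big)+\log\frac{1}{\delta}}{n}}\right)}_{\text{meta-regularization term}}
\]
```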

  24. Want to learn more? CS330: Deep Multi-Task & Meta-Learning; lecture videos coming out soon! Working on meta-RL? Try out the Meta-World benchmark. Collaborators: T Yu, D Quillen, Z He, R Julian, K Hausman, C Finn, S Levine. Meta-World. CoRL '19. Yin, Tucker, Yuan, Levine, Finn. Meta-Learning without Memorization. '19
