Generalization via Modularity Deepak Chris Trevor Phillip - PowerPoint PPT Presentation

Learning to Control Self-Assembling Morphologies Generalization via Modularity Deepak Chris Trevor Phillip Alyosha Pathak* Lu* Darrell Isola Efros * equal contribution

How do we train a robot?

Multiple tasks   Expert demonstrations Rewards, labels  … 

Self-supervision Multiple tasks    Curious exploration  Expert demonstrations Learning “common sense” Rewards, labels   … …  

. . . … even earlier?

Single to Multicellular

Single to Multicellular competition  collaboration

Single to Multicellular competition  collaboration shared objective

Compositionality has been useful in language … [Andreas et. al. 2016]

How to implement compositionality in hardware?

Modular Co-evolution of Control and Morphology

Modular Co-evolution of Control and Morphology Cylindrical Limb

Modular Co-evolution of Control and Morphology Cylindrical Limb Configurable Motor Joint

Modular Co-evolution of Control and Morphology

Modular Co-evolution of Control and Morphology Potential Magnetic Joint

Modular Co-evolution of Control and Morphology Acts as single agent upon joining Rewards are shared! Potential Magnetic Joint

Modular Co-evolution of Control and Morphology Acts as single agent upon joining Rewards are shared!  Input = Local Sensory State  Output = Torques, Link, Unlink Potential Magnetic Joint

Consider the task of “standing up” …

How to learn compositional controllers?

Idea: Shared policy network across limbs Node Node Node Node Nod Node Node in Node Node Node Node Node Node

Idea: Shared policy network across limbs output Node Node shared Node Node Nod Node Node policy in 𝜌 𝜄 Node Node Node Node Node Node input

How to adapt when morphology changes?

Network as reusable LEGO Blocks

Network as reusable LEGO Blocks output shared policy 𝜌 𝜄 input

Network as reusable LEGO Blocks message output output shared policy 𝜌 𝜄 input message input

Network as reusable LEGO Blocks message output output shared same policy dimension 𝜌 𝜄 input message input

Network as reusable LEGO Blocks message output output shared policy 𝜌 𝜄 input message input

Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared policy 𝜌 𝜄 input message input

Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared policy cut 𝜌 𝜄 input message input

Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared policy cut and paste 𝜌 𝜄 𝜌 𝜄 input message input 𝜌 𝜄 𝜌 𝜄

Network as reusable LEGO Blocks 𝜌 𝜄 𝜌 𝜄 message output output 𝜌 𝜄 shared adaptation by policy cut and paste conditioning 𝜌 𝜄 𝜌 𝜄 input message input 𝜌 𝜄 𝜌 𝜄

Dynamic Graph Networks

BTW, basically curriculum learning but in hardware

How well does it generalize?

. . . a bit crazy… and totally useless!

Self-Assembling Robots in the Real World [Mark Yim’s Lab at UPenn] [Daniela Rus's Lab at MIT] Also: [Modular Snake Robot – Howie Choset’s Lab at CMU]

code & data at https://people.eecs.berkeley.edu/~pathak/ Poster # 197 …today!! (Multi-agent RL) Thank You!

Generalization via Modularity Deepak Chris Trevor Phillip - PowerPoint PPT Presentation

Learning to Control Self-Assembling Morphologies Generalization via Modularity Deepak Chris Trevor Phillip Alyosha Pathak* Lu* Darrell Isola Efros * equal contribution How do we train a robot? Multiple tasks Expert

Walk Modularity: Graph partitioning based on a generalization of modularity David Mehrle 1 Amy

Generalization via Modularity Deepak Chris Trevor Phillip Alyosha Pathak* Lu* Darrell

Higher-Order (Non-)Modularity Claus Appel & Vincent van Oostrom & Jakob Grue Simonsen

4. What Is Modularity? butterfillS@ceu.hu butterfillS@ceu.hu Outline Why we need a notion of

4. What Is Modularity? butterfillS@ceu.hu butterfillS@ceu.hu Outline Why we need a notion of

Chapter 3: Data Abstraction Modularity and Abstraction Abstraction, modularity, information

Modularity (1): Childhood Activity Modularity Abstract Data Types (ADTs) EECS3311 A & E:

Modularity Modularity Also a structured programming topic: Can replace a rectangle with a

Local Substitutability for Sequence Generalization Fran cois Coste , Ga elle Garet , Jacques

Data Anonymization - Generalization Algorithms Li Xiong, Slawek Goryczka CS573 Data Privacy and

Data Anonymization - Generalization Algorithms Li Xiong CS573 Data Privacy and Anonymity

CSC321 Lecture 9: Generalization Roger Grosse Roger Grosse CSC321 Lecture 9: Generalization 1 /

VC GENERALIZATION BOUND VC GENERALIZATION BOUND Matthieu Bloch March 12, 2020 1 LOGISTICS (AND

Deep learning: Challenges in learning and generalization Tomas Mikolov, Facebook AI What is

Generalization of Cycle-Covering Heuristics Clemens B uchner Department of Mathematics and

Generalization Bounds and Stability Lorenzo Rosasco Tomaso Poggio 9.520 Class 6 February, 23

Urdu and the Modular Architecture of ParGram Tina B ogel, Miriam Butt, Annette Hautli,

Improving UD processing via satellite resources for morphology Kaja Dobrovoljc Toma Erjavec

M OTIVATING E XAMPLE 2 Other languages display still more variation C ZECH T URKISH PRODUCTIVE

Lecture 2: Finite-state methods for morphology Julia Hockenmaier juliahmr@illinois.edu 3324

Introduction to Computational Linguistics Frank Richter fr@sfs.uni-tuebingen.de. Seminar f

Generalizing paerns in Instrumented Item-and-Paern Morphology Sarah Beniamine and Olivier

Linguistics in a nutshell by hook or by crook Jeremy G. Kahn Signal, Speech & Language

An Unsupervised Method for Uncovering Morphological Chains Karthik Narasimhan Regina Barzilay

Generalization via Modularity Deepak Chris Trevor Phillip - PowerPoint PPT Presentation

Learning to Control Self-Assembling Morphologies Generalization via Modularity Deepak Chris Trevor Phillip Alyosha Pathak* Lu* Darrell Isola Efros * equal contribution How do we train a robot? Multiple tasks Expert

Walk Modularity: Graph partitioning based on a generalization of modularity David Mehrle 1 Amy

Generalization via Modularity Deepak Chris Trevor Phillip Alyosha Pathak* Lu* Darrell

Higher-Order (Non-)Modularity Claus Appel &amp; Vincent van Oostrom &amp; Jakob Grue Simonsen

4. What Is Modularity? butterfillS@ceu.hu butterfillS@ceu.hu Outline Why we need a notion of

4. What Is Modularity? butterfillS@ceu.hu butterfillS@ceu.hu Outline Why we need a notion of

Chapter 3: Data Abstraction Modularity and Abstraction Abstraction, modularity, information

Modularity (1): Childhood Activity Modularity Abstract Data Types (ADTs) EECS3311 A &amp; E:

Modularity Modularity Also a structured programming topic: Can replace a rectangle with a

Local Substitutability for Sequence Generalization Fran cois Coste , Ga elle Garet , Jacques

Data Anonymization - Generalization Algorithms Li Xiong, Slawek Goryczka CS573 Data Privacy and

Data Anonymization - Generalization Algorithms Li Xiong CS573 Data Privacy and Anonymity

CSC321 Lecture 9: Generalization Roger Grosse Roger Grosse CSC321 Lecture 9: Generalization 1 /

VC GENERALIZATION BOUND VC GENERALIZATION BOUND Matthieu Bloch March 12, 2020 1 LOGISTICS (AND

Deep learning: Challenges in learning and generalization Tomas Mikolov, Facebook AI What is

Generalization of Cycle-Covering Heuristics Clemens B uchner Department of Mathematics and

Generalization Bounds and Stability Lorenzo Rosasco Tomaso Poggio 9.520 Class 6 February, 23

Urdu and the Modular Architecture of ParGram Tina B ogel, Miriam Butt, Annette Hautli,

Improving UD processing via satellite resources for morphology Kaja Dobrovoljc Toma Erjavec

M OTIVATING E XAMPLE 2 Other languages display still more variation C ZECH T URKISH PRODUCTIVE

Lecture 2: Finite-state methods for morphology Julia Hockenmaier juliahmr@illinois.edu 3324

Introduction to Computational Linguistics Frank Richter fr@sfs.uni-tuebingen.de. Seminar f

Generalizing paerns in Instrumented Item-and-Paern Morphology Sarah Beniamine and Olivier

Linguistics in a nutshell by hook or by crook Jeremy G. Kahn Signal, Speech &amp; Language

An Unsupervised Method for Uncovering Morphological Chains Karthik Narasimhan Regina Barzilay

Higher-Order (Non-)Modularity Claus Appel & Vincent van Oostrom & Jakob Grue Simonsen

Modularity (1): Childhood Activity Modularity Abstract Data Types (ADTs) EECS3311 A & E:

Linguistics in a nutshell by hook or by crook Jeremy G. Kahn Signal, Speech & Language