ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta - PowerPoint PPT Presentation

ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta Assistant Prof. (ECE, CS, CSL)

Today, we will… • Course outline • Course logistics • Get to know each other

Understand how we can build intelligent machines Source: The Atlantic

Understand how we can build intelligent machines … that can favorably change the state of the physical world around them.

Understand how we can build intelligent machines … that can favorably change the state of the physical world around them. Video credit: Boston Dynamics, CNN

Yet, Research Robots Keep Falling… Video credit: IEEE Spectrum. DARPA Robotics Challenge Finals 2015.

State-of-the-art Results in Object Pushing Video credit: Pulkit Agrawal 2016

Understand how we can build intelligent machines … that can favorably change the state of cluttered real-world environments to solve a variety of tasks. Household Robots: Understand how far we are from making this PR1 showcase a reality. Video credit: Pieter Abbeel

Goals of the Course • Understand the state of the art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks

Successes in Computer Vision “in the Wild” Image Labeling Tasks (e.g. person, motorcycle, car, chair) K. He et al. Mask R-CNN. ICCV 2017

Successes in Computer Vision “in the Wild” Shape and Pose Estimation for Objects and Humans S. Goel et al. Shape and Viewpoint without Keypoints. ECCV 2020 A. Kanazawa et al. End-to-end Recovery of Human Shape and Pose. CVPR 2018

Factors Leading to Success in Computer Vision • Hand-crafted features to end-to-end trained features (A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012) • Large-scale labeled data (J. Deng et al. ImageNet: A Large-Scale Hierarchical Image Database. CVPR 2009)

Factors Leading to Success in Computer Vision • Hand-crafted features: features (e.g. HOG) → mid-level features (e.g. DPM, Felzenszwalb et al.) → classifier (e.g. SVM) • End-to-end trained features: end-to-end training → “cat” (A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012) • Can large-scale learning enable robots to execute a variety of tasks in cluttered real-world environments?

Robotic Tasks: Navigation • Goal: “Go 300 feet North, 400 feet East” or “Go Find a Chair” • Robot with a first-person camera • Dropped into a novel environment • Navigate around

Robotic Tasks Manipulation

Typical Classical Robotics Pipeline Observations → State Estimation → Planning → Low-level Controller → Control. Slide adapted from S. Levine.

Typical Classical Robotics Pipeline Observations → State Estimation → Planning → Low-level Controller → Control. Manipulation: Observed Images → 6DOF Pose → Grasp / Motion Planning. Slide adapted from S. Levine.

Typical Classical Robotics Pipeline Observations → State Estimation → Planning → Low-level Controller → Control. End-to-end training: Observations → Control. But why? Slide adapted from S. Levine.

Robot Navigation • Goal: “Go 300 feet North, 400 feet East” or “Go Find a Chair” • Robot with a first-person camera • Dropped into a novel environment • Navigate around

Observations → State Estimation → Planning → Low-level Controller → Control. Navigation: Observed Images → Mapping → Geometric Reconstruction → Planning → Path Plan. Mapping: Hartley and Zisserman. 2000. Multiple View Geometry in Computer Vision. Thrun, Burgard, Fox. 2005. Probabilistic Robotics. Planning: Canny. 1988. The complexity of robot motion planning. Kavraki et al. RA 1996. Probabilistic roadmaps for path planning in high-dimensional configuration spaces. LaValle and Kuffner. 2000. Rapidly-exploring random trees: Progress and prospects. Video credits: Mur-Artal et al., Palmieri et al.
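The planning stage of this classical pipeline can be illustrated with a very small sketch. It assumes the mapping stage has already produced a 2D occupancy grid (0 = free, 1 = occupied) and uses plain breadth-first search, which is a deliberately simpler stand-in for the roadmap and tree planners cited on the slide. The function name `plan_path` and the grid format are hypothetical.

```python
from collections import deque


def plan_path(grid, start, goal):
    """Shortest 4-connected path on an occupancy grid via breadth-first search.

    grid  : list of rows, 0 = free cell, 1 = occupied cell (assumed format)
    start : (row, col) of the robot
    goal  : (row, col) of the target
    Returns the list of cells from start to goal, or None if unreachable.
    """
    rows, cols = len(grid), len(grid[0])
    parent = {start: None}          # also serves as the visited set
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            # Walk parent pointers back to the start, then reverse.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] == 0 and (nr, nc) not in parent:
                parent[(nr, nc)] = cell
                queue.append((nr, nc))
    return None
```

Note that the planner can only route through cells the map marks as free, so it inherits every limitation of the reconstruction that produced the grid.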

Geometric 3D Reconstruction of the World: Unnecessary. Do we need to tediously reconstruct everything on this table? Video Credit: Mur-Artal and Tardos, TRobotics 2016. ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras.

Geometric 3D Reconstruction of the World: Insufficient. Can’t speculate about space not directly observed.

Geometric 3D Reconstruction of the World: Insufficient. Can’t exploit patterns in layout of indoor spaces.

Visual Learning vs Robot Learning • Hand-crafted features: features (e.g. HOG) → mid-level features (e.g. DPM, Felzenszwalb et al.) → classifier (e.g. SVM) • End-to-end trained features: end-to-end training → “cat” (A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012)

Visual Learning vs Robot Learning • How do we get supervision? • Non-stationarity • Exploration vs exploitation

Formalism for Modeling Behavior Reinforcement Learning

Markov Decision Process • Observation $o_t$, action $a_t$ (e.g. “Step Back”), state $s_t$ (e.g. 3D relative pose) • Transition function $p(s_{t+1} \mid s_t, a_t)$: how you move, how the tiger moves? • Reward function $r_t = R(s_{t+1}, s_t, a_t)$: survived? • Goal: $\arg\max_{a_0, \ldots, a_T} \sum_t \gamma^t r_t$
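A minimal sketch of the objective on this slide, under loud assumptions: the MDP here is a hypothetical 1-D line world with a deterministic transition standing in for $p(s_{t+1} \mid s_t, a_t)$, the discount $\gamma = 0.9$ is an arbitrary choice, and the argmax over $a_0, \ldots, a_T$ is done by brute-force enumeration, which is only feasible for tiny horizons.

```python
from itertools import product

GAMMA = 0.9          # discount factor gamma (assumed value)
ACTIONS = (-1, +1)   # hypothetical actions: step left / step right
GOAL = 3             # hypothetical goal state on a line of states 0..3


def transition(s, a):
    """Deterministic stand-in for p(s_{t+1} | s_t, a_t): move and clamp to [0, GOAL]."""
    return max(0, min(GOAL, s + a))


def reward(s_next, s, a):
    """r_t = R(s_{t+1}, s_t, a_t): 1 on the step that first enters the goal, else 0."""
    return 1.0 if (s_next == GOAL and s != GOAL) else 0.0


def discounted_return(s0, actions, gamma=GAMMA):
    """Compute sum_t gamma^t * r_t for a fixed action sequence starting at s0."""
    s, total = s0, 0.0
    for t, a in enumerate(actions):
        s_next = transition(s, a)
        total += gamma ** t * reward(s_next, s, a)
        s = s_next
    return total


def best_action_sequence(s0, horizon):
    """Brute-force argmax over all action sequences of the given length."""
    return max(product(ACTIONS, repeat=horizon),
               key=lambda seq: discounted_return(s0, seq))
```

Starting from state 0 with a horizon of 3, the only sequence that reaches the goal is three steps to the right, so `best_action_sequence(0, 3)` returns `(1, 1, 1)` with a discounted return of $0.9^2$. The enumeration grows exponentially with the horizon, which is one reason practical methods do not solve the objective this way.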

Goals of the Course • Understand the state of the art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks

Challenges with Markov Decision Process • Observation $o_t$, action $a_t$ (e.g. “Step Back”), state $s_t$ (e.g. 3D relative pose) • Transition function $p(s_{t+1} \mid s_t, a_t)$: how you move, how the tiger moves? • Reward function $r_t = R(s_{t+1}, s_t, a_t)$: survived? • Goal: $\arg\max_{a_0, \ldots, a_T} \sum_t \gamma^t r_t$ • Need to live many, many lives to learn how to live.

Credit assignment problem in RL • $o_t$: B B B B B B B B B F B B F B B B B B B B B B B B F B … • Yann LeCun’s Cake

Alternatives to Solving MDPs M. Andrychowicz et al. Hindsight Experience Replay. NIPS 2017. Pieter Abbeel’s Cake
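The idea behind hindsight relabeling can be sketched in a few lines. This is a simplified illustration, not the paper's full algorithm: it assumes a goal-conditioned task with a sparse reward (0 when the goal is reached, -1 otherwise), relabels every transition with the state the episode actually ended in, and omits the replay buffer and the off-policy learner entirely. The tuple layout and function names are hypothetical.

```python
def sparse_reward(next_state, goal):
    """Assumed sparse reward: 0 when the goal is reached, -1 otherwise."""
    return 0.0 if next_state == goal else -1.0


def relabel_with_final_state(episode):
    """Relabel an episode as if its final achieved state had been the goal.

    episode : list of (state, action, next_state, goal) tuples (assumed layout)
    Returns a list of (state, action, next_state, new_goal, new_reward) tuples.
    """
    achieved = episode[-1][2]   # state reached at the end of the episode
    return [
        (s, a, s_next, achieved, sparse_reward(s_next, achieved))
        for (s, a, s_next, _original_goal) in episode
    ]
```

For an episode that never reaches its original goal, every original reward is -1 and carries little learning signal. After relabeling, the last transition earns reward 0, so even a failed episode supplies a rewarded example for some goal.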

Solve a Related but Supervision-rich Problem S. Levine et al. Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection. ISER 2017.

Build Models and Plan with Them PILCO - Inverting a pendulum

Build Models and Plan with Them PILCO - Inverting a pendulum [PILCO] M. Deisenroth et al. PILCO: A Model-based and Data-Efficient Approach to Policy Search. ICML 2011

Learn by Imitating Experts S. Levine et al. End-to-End Training of Deep Visuomotor Policies. JMLR 2016.

Learn by Observing Experts A. Kumar et al. Learning Navigation Subroutines by Watching Videos. CoRL 2019.

Hierarchies • Think about going to the airport. • Highest level: Take an Uber down to the airport • Next level: Request Uber, Wait for Uber, Take Uber to airport • Next level: App, Dest., FB Check, Get Into Car, Talk to the Uber driver, Get Off Car • Lowest level: tension in various muscles (over time)

Course Outline • Understand the state of the art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks

Typically, useful to incorporate problem-specific insights. Neural Network: Mapper → Spatial Representation of the World → Planner (given Goal (300, 400)) → Action to Execute. S. Gupta et al. Cognitive Mapping and Planning for Visual Navigation. CVPR 2017, IJCV 2020.

Locomotion: Combining with low-level control Deep Drone Racing: Learning Agile Flight in Dynamic Environments Kaufmann, et al. CoRL 2018

Manipulation: Use of specialized hardware Learning to Grasp and Re-grasp using Vision and Touch Calandra, et al. RAL 2018

Course Outline • Understand the state of the art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks • Perspectives

Perspectives • Representations vs Behaviors • Big Data vs Clever Algorithms • Lessons from Cognitive Science, Psychology, Neuroscience

Course Outline • Understand the state of the art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks • Perspectives

Course Logistics http://saurabhg.web.illinois.edu/teaching/ece598sg/fa2020/ Instructor: Saurabh Gupta TA: Rishabh Goyal

Thank you
