ECE 598 SG Special Topics in Learning-based Robotics Saurabh Gupta Assistant Prof. (ECE, CS, CSL)
Today, we will… • Course outline • Course logistics • Get to know each other
Understand how we can build intelligent machines Source: The Atlantic
Understand how we can build intelligent machines … that can favorably change the state of the physical world around them. Video credit: Boston Dynamics, CNN
Yet, Research Robots Keep Falling… Video credit: IEEE Spectrum. DARPA Robotics Challenge Finals 2015.
State-of-the-art Results in Object Pushing Video credit: Pulkit Agrawal 2016
Understand how we can build intelligent machines … that can favorably change the state of cluttered real-world environments to solve a variety of tasks. Household Robots: understand how far we are from making this PR1 showcase a reality. Video credit: Pieter Abbeel
Goals of the Course • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks
Successes in Computer Vision “in the Wild” Image Labeling Tasks person, motorcycle, car, chair person, motorcycle, car, chair K. He et al. Mask R-CNN ICCV 2017
Successes in Computer Vision “in the Wild” Shape and Pose Estimation for Objects and Humans S. Goel et al. Shape and Viewpoint without Keypoints. ECCV 2020 A. Kanazawa et al. End-to-end Recovery of Human Shape and Pose. CVPR 2018
Factors Leading to Success in Computer Vision • Large-scale labeled data. J. Deng et al. ImageNet: A Large-Scale Hierarchical Image Database. CVPR 2009 • Hand-crafted features to end-to-end trained features. A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012
Factors Leading to Success in Computer Vision • Hand-crafted features: features (e.g. HOG) → mid-level features (e.g. DPM, Felzenszwalb et al.) → classifier (e.g. SVM) • End-to-end trained features: features → “cat”, with end-to-end training. A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012 • Can large-scale learning enable robots to execute a variety of tasks in cluttered real-world environments?
Robotic Tasks: Navigation. Goal: “Go 300 feet North, 400 feet East” or “Go Find a Chair”. Robot with a first-person camera → Dropped into a novel environment → Navigate around
Robotic Tasks Manipulation
Typical Classical Robotics Pipeline: Observations → State Estimation → Planning → Low-level Controller → Control. Slide adapted from S. Levine.
Typical Classical Robotics Pipeline: Observations → State Estimation → Planning → Low-level Controller → Control. Manipulation: Observed Images → 6DOF Pose → Grasp and Motion Planning. Slide adapted from S. Levine.
Typical Classical Robotics Pipeline: Observations → State Estimation → Planning → Low-level Controller → Control. End-to-end training: Observations → Control. But why? Slide adapted from S. Levine.
Robot Navigation. Goal: “Go 300 feet North, 400 feet East” or “Go Find a Chair”. Robot with a first-person camera → Dropped into a novel environment → Navigate around
Observations → State Estimation → Planning → Low-level Controller → Control. Mapping: Observed Images → Geometric Reconstruction. Planning: Path Plan. Hartley and Zisserman. 2000. Multiple View Geometry in Computer Vision. Thrun, Burgard, Fox. 2005. Probabilistic Robotics. Canny. 1988. The complexity of robot motion planning. Kavraki et al. T-RA 1996. Probabilistic roadmaps for path planning in high-dimensional configuration spaces. Lavalle and Kuffner. 2000. Rapidly-exploring random trees: Progress and prospects. Video credits: Mur-Artal et al., Palmieri et al.
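The planning stage of such a pipeline can be illustrated with a very small example. The sketch below is not one of the planners cited on the slide (probabilistic roadmaps or rapidly-exploring random trees); it assumes the mapping stage has already produced a 2D occupancy grid and uses plain breadth-first search as a stand-in. The function name `plan_path` and the grid layout are hypothetical.

```python
from collections import deque

def plan_path(grid, start, goal):
    """Breadth-first search on a 4-connected occupancy grid.

    grid[r][c] == 1 marks an obstacle, 0 marks free space.
    Returns the shortest list of (row, col) cells from start to goal,
    or None if the goal cannot be reached.
    """
    rows, cols = len(grid), len(grid[0])
    parent = {start: None}          # also serves as the visited set
    frontier = deque([start])
    while frontier:
        cell = frontier.popleft()
        if cell == goal:
            # Walk parent pointers back to the start, then reverse.
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1]
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] == 0 and (nr, nc) not in parent:
                parent[(nr, nc)] = cell
                frontier.append((nr, nc))
    return None

# Hypothetical map: a wall in the middle row with a gap on the right.
occupancy = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]
path = plan_path(occupancy, (0, 0), (2, 0))
```

The planner only ever sees cells that the map marks as free or occupied, so it has nothing to say about space that was never observed.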
Geometric 3D Reconstruction of the World: Unnecessary. Do we need to tediously reconstruct everything on this table? Video Credit: Mur-Artal and Tardos, IEEE Transactions on Robotics 2016. ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras.
Geometric 3D Reconstruction of the World Insufficient Can’t speculate about space not directly observed.
Geometric 3D Reconstruction of the World Insufficient Can’t exploit patterns in layout of indoor spaces.
Visual Learning vs Robot Learning • Hand-crafted features: features (e.g. HOG) → mid-level features (e.g. DPM, Felzenszwalb et al.) → classifier (e.g. SVM) • End-to-end trained features: features → “cat”, with end-to-end training. A. Krizhevsky et al. ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012
Visual Learning vs Robot Learning • How do we get supervision? • Non-stationarity • Exploration vs exploitation
Formalism for Modeling Behavior Reinforcement Learning
Markov Decision Process. Example: observations $o_t$, $o_{t+1}$; action $a_t$ (Step Back); state $s_t$ (3D Relative Pose); Transition Function (how you move, how the tiger moves?); Reward Function (Survived?).
Trajectory: $\ldots, o_t, a_t, o_{t+1}, a_{t+1}, \ldots$ with states $s_t, s_{t+1}, s_{t+2}$
Transition Function: $p(s_{t+1} \mid s_t, a_t)$, $p(s_{t+2} \mid s_{t+1}, a_{t+1})$
Reward Function: $r_t = R(s_{t+1}, s_t, a_t)$, $r_{t+1} = R(s_{t+2}, s_{t+1}, a_{t+1})$
Goal: $\arg\max_{a_0, \ldots, a_T} \sum_t \gamma^t r_t$
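To make the goal on this slide concrete, here is a minimal sketch of a toy MDP in the spirit of the tiger example. Everything specific is an assumption made for illustration: the state is a single integer distance to the tiger (0 to 3), the dynamics are deterministic (a special case of p(s_{t+1} | s_t, a_t)), and the action names, reward values and discount are hypothetical. The objective is maximized by brute force over all action sequences, which only works for tiny horizons.

```python
from itertools import product

GAMMA = 0.9                        # discount factor (assumed value)
ACTIONS = ("stay", "step_back")    # hypothetical action set

def transition(state, action):
    """Deterministic toy dynamics; state is the distance to the tiger, 0..3."""
    if action == "step_back":
        return min(state + 1, 3)
    return max(state - 1, 0)       # staying put lets the tiger close in

def reward(next_state, state, action):
    """R(s_{t+1}, s_t, a_t): 1 if still at a safe distance, else 0.

    Only next_state is used here; the other arguments mirror the slide.
    """
    return 1.0 if next_state > 0 else 0.0

def discounted_return(state, actions):
    """Sum over t of gamma^t * r_t along the rollout of an action sequence."""
    total = 0.0
    for t, action in enumerate(actions):
        next_state = transition(state, action)
        total += GAMMA ** t * reward(next_state, state, action)
        state = next_state
    return total

def best_action_sequence(state, horizon):
    """argmax over a_0..a_T by exhaustive enumeration (exponential in horizon)."""
    return max(product(ACTIONS, repeat=horizon),
               key=lambda seq: discounted_return(state, seq))

best = best_action_sequence(state=1, horizon=3)
```

Several sequences tie for the best return in this toy problem, so `best` is just one of the maximizers; all of them begin by stepping back.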
Goals of the Course • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks
Challenges with Markov Decision Process. Example: observations $o_t$, $o_{t+1}$; action $a_t$ (Step Back); state $s_t$ (3D Relative Pose); Transition Function (how you move, how the tiger moves?); Reward Function (Survived?).
Trajectory: $\ldots, o_t, a_t, o_{t+1}, a_{t+1}, \ldots$ with states $s_t, s_{t+1}, s_{t+2}$
Transition Function: $p(s_{t+1} \mid s_t, a_t)$, $p(s_{t+2} \mid s_{t+1}, a_{t+1})$
Reward Function: $r_t = R(s_{t+1}, s_t, a_t)$, $r_{t+1} = R(s_{t+2}, s_{t+1}, a_{t+1})$
Goal: $\arg\max_{a_0, \ldots, a_T} \sum_t \gamma^t r_t$
Need to live many, many lives to learn how to live.
Credit assignment problem in RL o t B B B B B B B B B F B B F B B B B B B B B B B B F B … Yann LeCun’s Cake
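One way to see why credit assignment is hard is to look at what a sparse reward gives each earlier step. The sketch below computes the discounted return-to-go for one episode. The five-step episode, the reward values and the discount of 0.5 are all made up for illustration.

```python
def returns_to_go(rewards, gamma):
    """G_t = r_t + gamma * G_{t+1}, computed backwards over one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns

# Sparse reward: nothing until the final step of a 5-step episode.
episode_rewards = [0.0, 0.0, 0.0, 0.0, 1.0]
credit = returns_to_go(episode_rewards, gamma=0.5)
```

Every step receives a share of the final reward that depends only on how far it is from the end, not on whether the action taken there actually mattered.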
Alternatives to Solving MDPs M. Andrychowicz et al. Hindsight Experience Replay. NIPS 2017. Pieter Abbeel’s Cake
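The core idea of hindsight relabeling can be sketched in a few lines: a failed episode is stored a second time with the goal replaced by a state that was actually reached, so that some transitions carry a success signal. This is a simplified illustration, not the paper's implementation. It relabels only with the final state, and the integer states, the 0 / -1 reward and the function names are assumptions.

```python
def sparse_reward(achieved, goal):
    """0 when the goal is reached, -1 otherwise (an assumed sparse reward)."""
    return 0.0 if achieved == goal else -1.0

def relabel_with_final_state(episode, goal):
    """Return the original transitions plus copies relabeled in hindsight.

    episode: list of (state, action, next_state) tuples.
    Each stored transition is (state, action, next_state, goal, reward).
    """
    original = [(s, a, s2, goal, sparse_reward(s2, goal))
                for s, a, s2 in episode]
    hindsight_goal = episode[-1][2]   # pretend the final state was the goal
    relabeled = [(s, a, s2, hindsight_goal, sparse_reward(s2, hindsight_goal))
                 for s, a, s2 in episode]
    return original + relabeled

# Hypothetical 1D episode that never reaches the intended goal of 5.
episode = [(0, +1, 1), (1, +1, 2), (2, -1, 1)]
buffer = relabel_with_final_state(episode, goal=5)
```

In the original half of the buffer every reward is -1; in the relabeled half, the transitions that land on the final state get reward 0.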
Solve a Related but Supervision-rich Problem S. Levine et al. Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection. ISER 2017.
Build Models and Plan with Them PILCO - Inverting a pendulum [PILCO] M. Deisenroth et al. PILCO: A Model-based and Data-Efficient Approach to Policy Search. ICML 2011
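The "build a model, then plan with it" recipe can be shown on a toy 1D system. This sketch is not PILCO, which uses Gaussian process dynamics models and gradient-based policy search. Here both parts are swapped for the simplest stand-ins: a linear model fitted by least squares and exhaustive search over short control sequences. The dynamics used to generate data, the control set and all names are hypothetical.

```python
from itertools import product

def fit_linear_model(transitions):
    """Least-squares fit of s' = a*s + b*u from (s, u, s') samples.

    Solves the 2x2 normal equations in closed form.
    """
    sss = sum(s * s for s, u, _ in transitions)
    ssu = sum(s * u for s, u, _ in transitions)
    suu = sum(u * u for s, u, _ in transitions)
    ssy = sum(s * y for s, _, y in transitions)
    suy = sum(u * y for _, u, y in transitions)
    det = sss * suu - ssu * ssu
    a = (ssy * suu - suy * ssu) / det
    b = (suy * sss - ssy * ssu) / det
    return a, b

def plan(model, state, target, controls, horizon):
    """Pick the control sequence whose predicted final state is closest to target."""
    a, b = model

    def final_state(seq):
        s = state
        for u in seq:
            s = a * s + b * u      # roll the learned model forward
        return s

    return min(product(controls, repeat=horizon),
               key=lambda seq: abs(final_state(seq) - target))

# Data from hypothetical true dynamics s' = 1.0*s + 0.5*u.
data = [(s, u, 1.0 * s + 0.5 * u)
        for s in (-1.0, 0.0, 2.0) for u in (-1.0, 0.0, 1.0)]
model = fit_linear_model(data)
best_controls = plan(model, state=1.0, target=0.0,
                     controls=(-1.0, 0.0, 1.0), horizon=2)
```

Nine transitions are enough here because the assumed model class matches the true dynamics exactly; with a mismatched model the plan would only be as good as the model's predictions.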
Learn by Imitating Experts S. Levine et al. End-to-End Training of Deep Visuomotor Policies. JMLR 2016.
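The simplest form of learning from an expert is behavior cloning: treat the expert's (state, action) pairs as a supervised dataset and fit a policy to them. The sketch below does this with a 1D linear policy. It is not the method of the paper cited on this slide, and the `expert` function and demonstration states are invented for illustration.

```python
def fit_linear_policy(demos):
    """Least-squares fit of action = w * state + c from (state, action) pairs."""
    n = len(demos)
    mean_s = sum(s for s, _ in demos) / n
    mean_a = sum(a for _, a in demos) / n
    cov = sum((s - mean_s) * (a - mean_a) for s, a in demos)
    var = sum((s - mean_s) ** 2 for s, _ in demos)
    w = cov / var
    c = mean_a - w * mean_s
    return lambda state: w * state + c

def expert(state):
    """Hypothetical expert controller that we only observe through demos."""
    return -2.0 * state + 1.0

demos = [(s, expert(s)) for s in (-2.0, -1.0, 0.0, 1.0, 2.0)]
policy = fit_linear_policy(demos)
```

The cloned policy matches the expert here because the expert is itself linear and noise-free; with a real expert, the policy can drift into states the demonstrations never covered.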
Learn by Observing Experts A. Kumar et al. Learning Navigation Subroutines by Watching Videos. CoRL 2019.
Hierarchies. Think about going to the airport. Top level: Take an Uber down to the airport. Next level: Request Uber → Wait for Uber → Take Uber to airport. Next level: App, Dest. → FB Check → Get Into Car, Talk to the Uber driver, Get Off Car. Lowest level: tension in various muscles over time.
Course Outline • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks
Typically, useful to incorporate problem-specific insights. Neural Network: Goal (300, 400) → Mapper → Spatial Representation of the World → Planner → Action to Execute. S. Gupta et al. Cognitive Mapping and Planning for Visual Navigation. CVPR 2017, IJCV 2020.
Locomotion: Combining with low-level control Deep Drone Racing: Learning Agile Flight in Dynamic Environments Kaufmann, et al. CoRL 2018
Manipulation: Use of specialized hardware Learning to Grasp and Re-grasp using Vision and Touch Calandra, et al. RAL 2018
Course Outline • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks • Perspectives
Perspectives • Representations vs Behaviors • Big Data vs Clever Algorithms • Lessons from Cognitive Science, Psychology, Neuroscience
Course Outline • Understand state-of-the-art in robotics and robot learning • Formulate robot learning problems as MDPs • Investigate alternative ways of solving MDPs • Apply these techniques to solve robotic tasks • Perspectives
Today, we will… • Course outline • Course logistics • Get to know each other
Course Logistics http://saurabhg.web.illinois.edu/teaching/ece598sg/fa2020/ Instructor: Saurabh Gupta TA: Rishabh Goyal
Today, we will… • Course outline • Course logistics • Get to know each other
Thank you