Efficient Grasping from RGBD Images: Learning Using a New Rectangle - PowerPoint PPT Presentation

Efficient Grasping from RGBD Images: Learning Using a New Rectangle Representation Yun Jiang, Stephen Moseson, Ashutosh Saxena Cornell University

Problem Goal:  Figure out a way to pick up the object.  Approach  Grip  Pick up Question: where and how to grasp? Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

How to Perceive Objects  RGBD cameras give RGB image plus depth information  Stereo cameras ($1000): Bumblebee  Kinect Camera ($140) RGB image Depth map 3D point cloud Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Our Formulation  Input: RGBD image  Output: a proper grasp -- the configuration of the gripper at the final grasp stage  3D location, 3D orientation, opening width. Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Traditional Approaches  Control/Planning  Force and form closure (Nguyen1986, Lakshminarayana1978)  Requires full 3D knowledge of grippers and objects  Disadvantages:  Complete 3D model is not always available  Noise sensors.  Difficult to model friction.  Search in enormous configuration space Does not apply to deformable grippers! Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Learning Approaches  Learning  provides generalization on novel objects  Robust to noise and variations of environment (Saxena et al. , NIPS 2006)  Previous learning approaches  Representation problem  3D orientation of gripper not represented well. (Le at al. , ICRA 2010) Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Representation  Should contain full 7-dimensional gripper configuration (3D location, 3D orientation, gripper opening width)  Specifically model gripper’s physical size Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

New Representation  Grasping Rectangle  Contains full 7-dimensional gripper configuration  Specifically model gripper’s physical size.  Strictly constraints the boundary of features. Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Define the Score Function  : the feature vector for a possible grasp G  Score of grasp G:  Best grasp: the highest-score rectangle in the image Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Learning the Score Function  Learning algorithm: SVM-Rank  Ranking not classification:  because the boundary between ‘good’/‘bad’ grasps is vague  Training data: Labeled rectangles for pictures. Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Inference  Search for all possible rectangles Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Search Highest-score Rectangles  Image: n x m  Features: k (per rectangle)  Brute-force search?  O(n 2 m 2 ) rectangles, O(nmk) to compute features  O( n 3 m 3 k) for one orientation  To accelerate:  Compute features incrementally  O(n 2 m 2 k)  Even faster? φ ( G ) φ + G ∆ = ( G ) ? Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Fast search  Condition: features are independent in pixel level, i.e.  The score of a rectangle can be decomposed to the scores of pixels  Classical problem: maximum-sum submatrix!  In one dimension, array 3 -4 5 2 -5 5 9 -8 sum 3 0 5 7 2 7 16 8  In our problem, reduce the time complexity to O(nmk+n 2 m) Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Histogram Features for Fast Search  Histograms from 15 filters to capture color, textures and edges  Spatial Histogram Features Divide a rectangle into 3 sub-rectangles Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Advanced Features  Histogram is fast but not able to capture the correlations among the 3 sub-rectangles d 1  E.g., One criteria: d 1 >d 2 and d 2 <d 3 d 2 d 3  Non-linear features  E.g., d = d 1 d 3 /(d 2 ) 2  Expressive but not applicable to fast search Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Two-step Process  Algorithm: Two models:  First step: Fast, but not accurate (good for pruning).  Second step: Accurate, but slow. Step2: Re-ranking Top 100 rectangles after the 1st step Top 3 rectangles after the 2nd step Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Summary  RGBD images  Representation  Oriented rectangle  Learning using Efficient two-step process  Fast search with histogram features  Re-rank with more sophisticated features Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Experiments  Tested on novel objects  Offline: 128 images  Robot: 12 objects, multiple tries Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Results on offline test  Evaluation-1: rectangle metric Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Results on offline test  Evaluation-2: point metric [Saxena2008] Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Robotic experiments  Adept Viper s850  Parallel plate gripper Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Results on robotic experiments Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Universal Jamming gripper: Robotic Experiment and Analysis Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

After Grasp: Learning to Place  Challenges:  Enormous search space  Placing under preference  Efficient learning approach to identify good placements  Results on robotic experiment  Goal: correct location and preferred orientation  92% for New Objects in New Environments. Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Thank you! Yun Jiang, Stephen Moseson and Ashutosh Saxena, Efficient Grasping from RGBD Images: Learning using a new Rectangle Representation, ICRA 2011. Learning to Place New Objects:  Yun Jiang, Changxi Zheng, Marcus Lim, Ashutosh Saxena, Learning to Place New Objects, ICRA 2012. First appeared in RSS workshop on mobile manipulation, June 2011.

Video Efficient Grasping from RGBD Image: Learning Using a New Rectangle Representation 4/18/2012

Future Work

Advanced Features  Histogram is fast but not able to capture the correlations among the 3 sub-rectangles d 1  E.g., One criteria: d 1 >d 2 and d 2 <d 3 d 2  Non-linear features d 3 Histogram of a non-linear feature d = d 1 d 3 /(d 2 ) 2

Spatial Histogram for Fast Search  Time complexity is only multiplied by 3

Efficient Grasping from RGBD Images: Learning Using a New Rectangle - PowerPoint PPT Presentation

Efficient Grasping from RGBD Images: Learning Using a New Rectangle Representation Yun Jiang, Stephen Moseson, Ashutosh Saxena Cornell University Problem Goal: Figure out a way to pick up the object. Approach Grip Pick up

Learning To Grasp Jake Varley Overview - What is a grasping pipeline? - A current grasping

HUMAN GRASPING Can robots grasp as well? DATA-DRIVEN GRASPING OF UNKNOWN OBJECTS Arsalan

Depth Prediction and RGBD Images for Recognition Yihui He, Metehan Ozten yihuihe@foxmail.com, m

Self-Supervised Deep Learning for Robotic Grasping Lars Berscheid | KUKA Roboter GmbH | 10/10/2017

Novel Gaits for a Novel Novel Gaits for a Novel Crawling/Grasping Mechanism Crawling/Grasping

Human-Robot Interaction Elective in Artificial Intelligence Lecture 7 RGBD Perception Luca

CS395T paper review Indoor Segmentation and Support Inference from RGBD Images Chao Jia Sep

Depth Sensing and Deep Learning: Grasping and Segmenting 3D Objects from Real Depth Images using

CS4495/6495 Introduction to Computer Vision 2A-L1 Images as functions Images as functions Images

RGBD Occlusion Detection via Deep Convolutional Neural Networks Soumik Sarkar 1,2 , Vivek

Eye Gaze Tracking Usin ing an RGBD Camera: A Comparison with an RGB Solu lution Xuehan Xiong

RGBD Tutorial 14210240041 Gu Pan Image RGB YUV Lab Depth Image RGB image Depth image Each pixel in

Learning to Place New Objects Yun Jiang , Changxi Zheng, Marcus Lim and Ashutosh Saxena Cornell

Using Vision for Pre- and Post-grasping Object Localization for Soft Hands Changhyun Choi,

Using Geometry to Detect Grasp Poses in 3D Point Clouds ten Pas, Platt Northeastern University

Bitmap (Raster) Images CO2016 Multimedia and Computer Graphics Roy Crole: Bitmap Images (CO2016,

Presentation Highlights Prof. Gordon Cheng Director of Institute for Cognitive Systems Faculty

Earnings Presentation H1FY14 www.repcohome.com Agenda H1FY 14 Performance Business

Investor Presentation Q1 FY21 July 20 th , 2020 1 Performance Highlights (Q1 FY21) PAT

Investors Presentation February 2019 1 Mission Statement 2 Strictly Private and confidential

INVESTOR PRESENTATION 9M FY19 February 5, 2019 (CIN: L85110KA1995PLC019126) 1 FINANCIAL

IIFL BONDS / NCDs Tranche II Issue PRESENTATION TITLE IN India Infoline Finance Ltd. ARIAL

Outreach Roundtable July 11, 2017 $1.5 billion Expanded Footprint Improved Patient Experience

Nonlinear Filter Design for Pose and IMU Bias Estimation Glauco Garcia Scandaroli, Pascal Morin.