Hierarchical Task Structure
[Figure: hierarchical task network for an IKEA chair assembly, decomposing "Assemble Chair" into subtasks (Orient Frame, Attach Front Supports, Attach Rear Frame, Add Seat) and down to primitives such as Get/Place Frame, Get/Place Peg, Get/Place Support, and Get/Screw Nut and Bolt actions for the left and right supports.]
Collaborative robots need to recognize human activities
• Nearly all collaboration models depend on some form of activity recognition
• Collaboration imposes real-time constraints on classifier performance and tolerance to partial trajectories
Interpretable Models for Fast Activity Recognition and Anomaly Explanation During Collaborative Robotics Tasks [ICRA 17]
Common Activity Classifier Pipeline
Training: Feature Extraction → Keyframe Clustering (usually KNN) → Point-to-Keyframe Classifier (usually SVM) → HMM trained on keyframe sequences
Testing: Feature Extraction → Keyframe Classification → HMM Likelihood Evaluation (Forward Algorithm) → Choose model with greatest posterior probability
• P. Koniusz, A. Cherian, and F. Porikli, "Tensor representations via kernel linearization for action recognition from 3D skeletons."
• I. Gori, J. Aggarwal, L. Matthies, and M. Ryoo, "Multitype activity recognition in robot-centric scenarios."
• E. Cippitelli, S. Gasparrini, E. Gambi, and S. Spinsante, "A human activity recognition system using skeleton data from RGBD sensors."
• L. Xia, C. Chen, and J. Aggarwal, "View invariant human action recognition using histograms of 3D joints."
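A rough sketch of this generic pipeline's testing phase (not any one cited paper's implementation), assuming hmmlearn for the per-class HMMs and a dict `train_seqs` of per-class [n_frames x n_features] keyframe-feature arrays:

```python
from hmmlearn import hmm  # assumed library, not necessarily what the cited papers used

def train_models(train_seqs, n_states=5):
    """train_seqs: dict mapping class name -> [n_frames x n_features] array
    of concatenated keyframe-feature sequences. n_states is arbitrary here."""
    return {name: hmm.GaussianHMM(n_components=n_states).fit(X)
            for name, X in train_seqs.items()}

def classify(models, keyframe_feats):
    """Score a test sequence under each class HMM (forward algorithm)
    and return the class with the greatest log-likelihood."""
    return max(models, key=lambda name: models[name].score(keyframe_feats))
```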
Rapid Activity Prediction Through Object-oriented Regression (RAPTOR)
A highly parallel ensemble classifier that is resilient to temporal variations
Pipeline: Feature Extraction → Temporal Segmentation → Feature-wise Segmentation → Local Model Training → Ensemble Weight Learning
Activity Model Training Pipeline: Feature Extraction
Inputs: Kinect skeletal joints, VICON markers, or a learned feature extractor
Output: a [Timestep × Feature] matrix
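A minimal sketch of producing the [Timestep × Feature] matrix, assuming per-timestep joint positions as (x, y, z) tuples; the function name and input layout are illustrative:

```python
import numpy as np

def to_feature_matrix(frames):
    """frames: list of dicts mapping joint name -> (x, y, z) per timestep.
    Returns a [timestep x feature] matrix with a fixed joint ordering."""
    joints = sorted(frames[0])
    return np.array([[c for j in joints for c in frame[j]] for frame in frames])
```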
Activity Model Training Pipeline: Temporal Segmentation
[Figure: displacement vs. time over a 0–12 second demonstration trajectory]
Activity Model Training Pipeline: Temporal Segmentation
[Figure: the same trajectory with time normalized to 0–100%]
Activity Model Training Pipeline: Temporal Segmentation
Two temporal segment parameters: Width and Stride
[Figure: displacement vs. normalized time]
Activity Model Training Pipeline: Temporal Segmentation
{Width = 0.2, Stride = 1.0}: five non-overlapping segments (1–5) tile the normalized trajectory
[Figure: displacement vs. normalized time with segment boundaries]
Activity Model Training Pipeline: Temporal Segmentation
{Width = 0.2, Stride = 0.5}: nine half-overlapping segments (1–9) cover the normalized trajectory
[Figure: displacement vs. normalized time with overlapping segment boundaries]
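A small sketch of this segmentation scheme, assuming width is a fraction of trajectory length and stride is a fraction of the segment width (a parameterization inferred from the two examples above; for a 100-step trajectory it reproduces the slides' 5 and 9 segments):

```python
def temporal_segments(n_steps, width=0.2, stride=1.0):
    """Yield (start, end) index pairs over a trajectory of n_steps timesteps.
    width  -- segment length as a fraction of the trajectory
    stride -- step between segment starts, as a fraction of the segment width"""
    seg_len = max(1, int(round(width * n_steps)))
    step = max(1, int(round(stride * seg_len)))
    for start in range(0, n_steps - seg_len + 1, step):
        yield (start, start + seg_len)

# list(temporal_segments(100, 0.2, 1.0)) -> 5 segments
# list(temporal_segments(100, 0.2, 0.5)) -> 9 segments
```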
Activity Model Training Pipeline: Feature-wise Segmentation
Object Map: a dictionary that maps object IDs to sets of column indices, e.g., {"Hands": [0, 1, 2, 5, 6, 7]}
Activity Model Training Pipeline: Feature-wise Segmentation
Within each temporal segment (see the sketch below):
• Isolate columns of each demonstration trajectory according to the (pre-defined) object map
• Create a local model for each object
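A small sketch of this isolation step, assuming NumPy arrays and the segment tuples from the previous sketch:

```python
# Example object map from the slides: object ID -> feature-column indices.
object_map = {"Hands": [0, 1, 2, 5, 6, 7]}

def object_blocks(trajectory, object_map, segments):
    """Split a [timestep x feature] NumPy array into per-object,
    per-temporal-segment blocks (one block per local model)."""
    for start, end in segments:
        for obj, cols in object_map.items():
            yield obj, (start, end), trajectory[start:end][:, cols]
```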
Activity Model Training Pipeline: Local Model Training
Within each temporal-object segment:
• Ignore temporal information for each data point
• Treat it as a general pattern recognition problem
• Model the resulting distribution using a GMM
Result: an activity classifier ensemble across objects and time!
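A compact sketch of this step using scikit-learn (GaussianMixture is my stand-in for the paper's GMMs; `trajectory`, `object_map`, and `segments` come from the earlier sketches, and n_components=3 is an arbitrary choice):

```python
from sklearn.mixture import GaussianMixture

def fit_local_model(block, n_components=3):
    """Fit a GMM to one temporal/object block. Each row (a timestep's
    object features) is treated as an i.i.d. sample: time is discarded."""
    return GaussianMixture(n_components=n_components,
                           covariance_type="full").fit(block)

# gmms[(obj, segment)] -> fitted GaussianMixture, one per ensemble member
gmms = {(obj, seg): fit_local_model(block)
        for obj, seg, block in object_blocks(trajectory, object_map, segments)}
```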
Activity Model Training Pipeline: Ensemble Weight Learning
Need to find the most discriminative object GMMs per time segment
[Figure: per-segment object GMMs, each initially weighted 1.0]
Activity Model Training Pipeline: Ensemble Weight Learning
Need to find the most discriminative object GMMs per time segment, using a Random Forest classifier
[Figure: per-segment object GMMs feeding a Random Forest classifier]
Activity Model Training Pipeline: Ensemble Weight Learning
Train the Random Forest classifier on likelihood vectors computed from target-class demonstrations and off-target-class demonstration trajectories
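A hedged sketch of this training step, reusing `gmms`, `object_map`, and `segments` from the earlier sketches and assuming all demonstrations are resampled to a common length so segments share index ranges; `target_demos` and `off_target_demos` are illustrative names:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def likelihood_vector(trajectory, gmms, object_map, segments):
    """One average log-likelihood per (object, segment) local GMM."""
    return np.array([
        gmms[(obj, seg)].score(trajectory[seg[0]:seg[1]][:, object_map[obj]])
        for seg in segments for obj in object_map
    ])

# segments must be a concrete list, e.g. list(temporal_segments(100, 0.2, 0.5))
X = np.vstack([likelihood_vector(t, gmms, object_map, segments)
               for t in target_demos + off_target_demos])
y = np.array([1] * len(target_demos) + [0] * len(off_target_demos))
rf = RandomForestClassifier(n_estimators=100).fit(X, y)
```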
Activity Model Training Pipeline: Ensemble Weight Learning
• Choose the top-N most discriminative features from the Random Forest classifier
• Weight each GMM proportional to its discriminative power
[Figure: object GMMs re-weighted, e.g., 0.28, 0.22, 0.5, 0.0]
Activity Model Training Pipeline: Ensemble Weight Learning
• Choose the top-N most discriminative object-based classifiers
• Weight each object proportionally to its discriminative power
Result: a trained, highly parallel ensemble learner with temporal/object-specific sensitivity (see the sketch below)
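Continuing the sketch, the ensemble weights can be read off the forest's feature importances; `top_n = 10` is a hypothetical cutoff:

```python
top_n = 10                                  # hypothetical value for "top-N"
keep = np.argsort(rf.feature_importances_)[::-1][:top_n]
weights = rf.feature_importances_[keep]
weights = weights / weights.sum()           # proportional to discriminative power

def ensemble_score(trajectory):
    """Weighted combination of the most discriminative local GMMs."""
    v = likelihood_vector(trajectory, gmms, object_map, segments)
    return float(weights @ v[keep])
```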
Results: Three Datasets
• UTKinect: publicly available benchmark (Kinect joints)
• Dynamic Actor: industrial manufacturing task (joint positions)
• Static Actor: industrial manufacturing task (joint positions)
[Images: UTKinect, automotive final assembly, sealant application]
Recognition Results: UTKinect-Action3D
Results: Online Prediction
Interpretability: Explaining Classifications
Key insight:
• Apply outlier detection methods across internal activity classifiers
• Use outliers, or the lack thereof, to explain issues across time and objects
Asking a "carry" classifier about a "walk" trajectory: "In the middle and end of the trajectory, the left hand and right hand features were very poorly matched to my template."
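A minimal sketch of how such explanations could be generated from the ensemble's local models (the outlier threshold and phrasing are illustrative, not the paper's method; it reuses `likelihood_vector`, `gmms`, `object_map`, and `segments` from the earlier sketches):

```python
def explain_mismatch(trajectory, threshold=-25.0):
    """Flag (object, segment) local models whose log-likelihood falls below
    a hypothetical outlier threshold, yielding human-readable cues like the
    'carry' vs. 'walk' example above."""
    keys = [(obj, seg) for seg in segments for obj in object_map]
    v = likelihood_vector(trajectory, gmms, object_map, segments)
    return [f"{obj} poorly matched in segment {seg}"
            for (obj, seg), ll in zip(keys, v) if ll < threshold]
```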
Supportive Behaviors by Demonstration
• Support a task network by associating supportive behaviors with subgoals
• Explicitly learned from demonstration during task execution
• Support policy can be propagated to higher-level task nodes
Hayes & Scassellati, "Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork," Machine Learning for Interactive Systems, AAAI 2014
Context-sensitive Supportive Behavior Policies
Supportive Behaviors by Demonstration: Issues
• Only learns before deployment
• Fixed behavior, reactive-only during execution
• Difficult to generalize across tasks
What happens if you're not the one programming the support policy?
Learning from Demonstration Breaks Down in Team Scenarios!
Traditional LfD is optimal if the reference demonstrations are "expert" demonstrations… but execution happens in isolation!
Expert demonstrations are not always the most effective teaching strategy: sometimes it's better to learn the landscape of the problem than to see optimal demonstrations.
Properly crafted "imperfect" demonstrations can better communicate information about the objective, leading to one all-important question…
Can we do better than learning from examples?
Demonstration-based methods: the human figures out how and when the robot can be helpful
• Quickly enables useful, helpful actions
• Does not scale with task count! Requires a human expert
Goal-driven methods: the robot figures out how and when it can be helpful
• Allows novel behaviors to be discovered
• Enables deeper task comprehension and action understanding
Autonomously Generating Supportive Behaviors: A Task and Motion Planning Approach
Symbolic planning + motion planning + perspective taking → autonomously generated supportive behaviors
Supportive Behavior Pipeline: Intuition (see the sketch below)
1. Propose alternative environments: change one thing about the environment
2. Evaluate whether they facilitate the leader's task/motion planning: simulate policy execution(s) from the leader's perspective
3. Compute the cost of creating each target environment: simulate the support agent's plan execution
4. Choose the environment that maximizes [benefit − cost]: execute the supportive behavior plan
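A schematic sketch of the loop, with hypothetical callables standing in for the proposal, leader-simulation, and support-planning components:

```python
def best_support_action(current_env, propose, leader_cost, alteration_cost):
    """Greedy instantiation of steps 1-4; all four callables are
    hypothetical stand-ins for the TAMP components described above."""
    baseline = leader_cost(current_env)
    best_env, best_gain = None, 0.0
    for env in propose(current_env):               # 1. alternative environments
        benefit = baseline - leader_cost(env)      # 2. simulated leader savings
        cost = alteration_cost(current_env, env)   # 3. support agent's own cost
        if benefit - cost > best_gain:             # 4. maximize [benefit - cost]
            best_env, best_gain = env, benefit - cost
    return best_env                                # then execute the support plan
```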
Plan Evaluation
Choose the support policy ξ ∈ Ξ that minimizes the expected execution cost of the leader's policy π ∈ Π for solving the TAMP problem T from the current state s_c.
The cost estimate must account for:
• Resource conflicts (shared utilization/demand)
• Spatial constraints (the support agent's avoidance of the leader)
The weighting function makes a big difference!
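A plausible formalization of this objective, reconstructed from the slide text (w(π) is the plan-weighting function discussed on the following slides, and result(s_c, ξ) denotes the state after the support policy executes):

```latex
\xi^{*} \;=\; \operatorname*{arg\,min}_{\xi \in \Xi}
  \sum_{\pi \in \Pi} w(\pi)\,
  \mathrm{cost}\bigl(\pi \mid \mathrm{result}(s_c, \xi)\bigr)
```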
Weighting functions: Uniform, Greedy
• Uniform: w(π) = 1 for all π; consider all known solutions equally likely and important
• Greedy (min duration): only the best-known solution is worth planning against
Weighting functions: Uniform
Weighting functions: Optimality-Proportional
Weight plans proportional to their cost relative to the best-known solution: w(π) ∝ (best-known plan duration / plan duration of π)^p, with p = 2
Weighting functions: Error Mitigation
w_π ∝ f(π): plans more optimal than some cutoff ε are treated normally, per f. Suboptimal plans are negatively weighted, encouraging active mitigation behavior from the supportive robot.
The scaling term satisfies α < 1 / max_π ρ, where ρ is a normalization term to avoid harm due to plan overlap.
Weighting functions: Error Mitigation
Limitations
• Short forward lookahead (<10 seconds)
• The sampling problem is incredibly difficult
• Pushes some of the same problems that LfD has into the sampling mechanism
• A priori knowledge of the human policy space is necessary
• This is coordination, not planning!
The Promise of Collaborative Robots
The Reality of Mismatched Expectations
Shared Expectations are Critical for Teamwork
In close human-robot collaboration…
• The human must be able to plan around expected robot behaviors
• Understanding failure modes and policies is central to ensuring safe interaction and managing risk
Fluent teaming requires communication…
• When there's no prior knowledge
• When expectations are violated
• When there is joint action
Establishing Shared Expectations (approaches spanning short-term to long-term coordination)
• Coordination Graphs [Kalech 2010]
• Hierarchical Task Models [Hayes et al. 2016]
• Role-based Feedback [St. Clair et al. 2016]
• Legible Motion [Dragan et al. 2013]
• State Disambiguation [Wang et al. 2016]
• Cross-training [Nikolaidis et al. 2013]
• Collaborative Planning [Milliez et al. 2016]
• Policy Dictation [Johnson et al. 2006]
Semantics for Policy Transfer Under what conditions will you drop the bar?
Semantics for Policy Transfer I will drop the bar when the world is in the blue region of state space:
I will drop the bar when the world is in the blue region of state space:
[Figure: columns of example raw state vectors (e.g., 12.4827, 5.12893, …) that fall inside the region]