Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
Vihan Jain*, Gabriel Magalhaes*, Alexander Ku*, Ashish Vaswani, Eugene Ie, Jason Baldridge
Google Research (* equal contribution)
ACL, Florence, 29th July 2019
Vision-and-Language Navigation (VLN)
● Language
● Perception
● Planning
● Action
Vision-and-Language Navigation (VLN)
Example from the Room-to-Room (R2R) dataset [1]
[1] Anderson et al. Vision-and-Language Navigation: Interpreting Visually Grounded Navigation Instructions in Real Environments, CVPR, 2018.
Key Contributions
● Data
[Figure: example navigation instructions: "Make a left down at the narrow hall... Go out the door and wait." / "Turn around and enter the bedroom... Walk into the doorway and stop."]
Key Contributions
● Data
● Evaluation
Key Contributions
● Data
● Evaluation
● Agent training
[Figure: RL loop in which the Agent takes action a_t in the Environment and receives reward r_t, here derived from CLS.]
R2R → R4R
● R4R joins pairs of R2R paths: a path (a_1, ..., a_n) is concatenated with a path (b_1, ..., b_m) when the end of the first lies close to the start of the second, i.e. d(a_n, b_1) < d_th.
● Their instructions are concatenated as well: "Make a left down at the narrow hall... Go out the door and wait. Turn around and enter the bedroom... Walk into the doorway and stop."
● R2R-to-R4R code: https://github.com/google-research/google-research/tree/master/r4r
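Below is a minimal sketch of the joining rule just described. `dist` stands in for a shortest-path distance over the navigation graph, and the function names are illustrative only; the released r4r code linked above is the reference implementation.

```python
D_TH = 3.0  # R2R success threshold in meters, assumed here as the joining threshold

def can_join(path_a, path_b, dist, d_th=D_TH):
    """Two R2R paths are joinable when the end of the first lies within
    d_th of the start of the second: d(a_n, b_1) < d_th."""
    return dist(path_a[-1], path_b[0]) < d_th

def join_example(path_a, instr_a, path_b, instr_b):
    """Concatenate trajectories and instructions to form one R4R example."""
    return path_a + path_b, instr_a + " " + instr_b
```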
R2R vs. R4R
VLN Evaluation: Success Rate (SR)
● success = 1 if d(p_5, r_5) < d_th, else 0
[Figure: reference path from r_1 = p_1 to r_5, and an agent path ending at p_5.]
VLN Evaluation: Success Rate (SR)
● In this example the agent stops within d_th of r_5, so success = 1.
[Figure: reference path from r_1 = p_1 to r_5; the agent path ends near r_5.]
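A one-line sketch of the SR criterion, assuming `dist` is the navigation-graph distance and d_th = 3 meters as in R2R:

```python
def success(pred_path, ref_path, dist, d_th=3.0):
    """Success: the agent's final node lands within d_th of the
    reference path's final node."""
    return float(dist(pred_path[-1], ref_path[-1]) < d_th)
```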
VLN Evaluation: SPL
● Success weighted by Path Length [1]: spl = success · l / max(p, l), where l is the shortest-path length to the goal and p is the agent's path length.
● In this example: spl = 4/10 = 0.4.
[Figure: reference path from r_1 = p_1 to r_5 and a longer agent path.]
[1] Anderson et al. On Evaluation of Embodied Navigation Agents, arXiv, 2018.
VLN Evaluation: SPL
● Failure mode: two very different agent paths of the same length both get spl = 1, because SPL compares only path lengths, not the paths themselves.
[Figure: reference path from r_1 = p_1 to r_5; agent path 1 follows the reference while agent path 2 deviates, yet spl = 1 for both.]
[1] Anderson et al. On Evaluation of Embodied Navigation Agents, arXiv, 2018.
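A sketch of SPL following Anderson et al. [1], reusing `success` from the SR sketch above; `shortest_len` is the shortest-path distance from start to goal:

```python
def path_length(path, dist):
    """Total path length: sum of edge distances between consecutive nodes."""
    return sum(dist(u, v) for u, v in zip(path[:-1], path[1:]))

def spl(pred_path, ref_path, dist, shortest_len, d_th=3.0):
    """Success weighted by Path Length: success * l / max(p, l).
    Note it depends on pred_path only through its length, which is
    exactly the failure mode illustrated on this slide."""
    s = success(pred_path, ref_path, dist, d_th)
    p = path_length(pred_path, dist)
    return s * shortest_len / max(p, shortest_len)
```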
VLN Evaluation: SED
● Success weighted by Edit Distance [1].
● Agent path 1 matches the reference exactly: sed = 1 - 0 = 1. Agent path 2 has edit distance 3 over 4 reference edges: sed = 1 - 3/4 = 0.25.
[Figure: reference path from r_1 = p_1 to r_5 with two agent paths.]
[1] Chen et al. Touchdown: Natural Language Navigation and Spatial Reasoning in Visual Street Environments, CVPR, 2019.
VLN Evaluation: SED
● Failure mode: agent paths that track the reference closely but pass through different nodes receive no credit, scoring SED = 0.
[Figure: two near-miss agent paths, both with SED = 0.]
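A simplified sketch of SED. Touchdown [1] defines the edit distance over edge sequences; this version operates on node sequences, and the normalizer (the number of reference edges) is an assumption read off the 1 - 3/4 example above. It reuses `success` from the SR sketch.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (iterative DP, one row)."""
    m, n = len(a), len(b)
    dp = list(range(n + 1))
    for i in range(1, m + 1):
        prev, dp[0] = dp[0], i
        for j in range(1, n + 1):
            prev, dp[j] = dp[j], min(dp[j] + 1,      # deletion
                                     dp[j - 1] + 1,  # insertion
                                     prev + (a[i - 1] != b[j - 1]))  # substitution
    return dp[n]

def sed(pred_path, ref_path, dist, d_th=3.0):
    """Success weighted by Edit Distance; zero unless the episode succeeds."""
    s = success(pred_path, ref_path, dist, d_th)
    norm = max(max(len(pred_path), len(ref_path)) - 1, 1)  # assumed: edge count
    return s * (1.0 - edit_distance(pred_path, ref_path) / norm)
```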
CLS: New VLN Evaluation Metric
● Coverage weighted by Length Score (CLS): the product of Path Coverage (PC) and Length Score (LS), i.e. CLS(P, R) = PC(P, R) · LS(P, R)
● R: reference path; P: agent's predicted path
CLS: New VLN Evaluation Metric
● Path Coverage (PC): the average coverage of each node in the reference path with respect to the predicted path.
[Figure: distances d_1, d_2, d_3 from reference nodes to the agent's predicted path.]
CLS: New VLN Evaluation Metric
● Expected optimal path length (EPL) is a function of path coverage: EPL = PC(P, R) · PL(R).
● Length Score (LS) compares the path length of the predicted path P to EPL: LS(P, R) = EPL / (EPL + |EPL - PL(P)|).
[Figure: reference path and the agent's predicted path P.]
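Putting the last three slides together, here is a minimal sketch of CLS using the PC, EPL, and LS definitions above; it reuses `path_length` from the SPL sketch, and `dist` is again the navigation-graph distance.

```python
import math

def coverage(pred_path, ref_path, dist, d_th=3.0):
    """Path Coverage: PC(P, R) = (1/|R|) * sum_r exp(-d(r, P) / d_th),
    where d(r, P) = min_p d(r, p) is each reference node's distance
    to the nearest node of the predicted path."""
    return sum(math.exp(-min(dist(r, p) for p in pred_path) / d_th)
               for r in ref_path) / len(ref_path)

def cls_metric(pred_path, ref_path, dist, d_th=3.0):
    """CLS = PC * LS, with EPL = PC * PL(R) and
    LS = EPL / (EPL + |EPL - PL(P)|). Assumes a non-degenerate
    reference path (PL(R) > 0)."""
    pc = coverage(pred_path, ref_path, dist, d_th)
    epl = pc * path_length(ref_path, dist)  # expected optimal path length
    ls = epl / (epl + abs(epl - path_length(pred_path, dist)))
    return pc * ls
```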
CLS: Desirable Properties
● Path Similarity Measure: PC measures how well the predicted path covers the nodes of the reference path.
● Soft Penalties: both PC and LS are continuous measures.
● Unique Optimum: a predicted path achieves the maximum score if and only if it is equal to the reference path.
● Scale Invariance: both PC and LS are scale invariant, via the graph-dependent constant d_th.
● Tractability: computation time is O(|P|·|R|) for PC and O(|P|+|R|) for LS.
Training VLN Agents
● Architecture similar to the RCM model [1].
[Figure: a language encoder over instruction tokens x_1, ..., x_n and per-step visual encoders over scenes v_1, v_2, v_3, producing actions a_1, a_2, a_3.]
[1] Wang et al. Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation, CoRR, 2018.
Training VLN Agents
● Goal-oriented agents are encouraged to pursue the goal node only.
● Consider the immediate reward after taking action a_t at time step t in an episode of length T: at the final step, r_T = 1 on success and r_T = 0 otherwise.
[Figure: two episodes, one reaching the goal with r_T = 1 and one missing it with r_T = 0.]
Training VLN Agents
● Fidelity-oriented agents must reach the goal node and conform to the reference path R.
[Figure: two goal-reaching trajectories, one with CLS ~ 0 and one with CLS ~ 1.]
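A simplified terminal-reward sketch contrasting the two training regimes, reusing `success` and `cls_metric` from the earlier sketches. The paper's agents also receive intermediate rewards, so treat this as illustrative of the distinction only.

```python
def goal_reward(pred_path, ref_path, dist, done, d_th=3.0):
    """Goal-oriented: r_T = 1 on success, 0 otherwise; nothing in this
    signal rewards staying on the reference path."""
    return success(pred_path, ref_path, dist, d_th) if done else 0.0

def fidelity_reward(pred_path, ref_path, dist, done, d_th=3.0):
    """Fidelity-oriented: reward the CLS of the executed path, so the
    agent is paid for conforming to R, not just for reaching its end."""
    return cls_metric(pred_path, ref_path, dist, d_th) if done else 0.0
```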
R2R Performance
● Fidelity-oriented agents perform slightly better on SPL and CLS.
● SPL appears consistent with CLS.
Results on the Validation Unseen split.
R2R Performance
● Ablation studies
  ○ An agent optimized to reach the goal may incidentally appear to be conforming to the instructions.
Results on the Validation Unseen split.
R4R Performance
● Fidelity-oriented agents outperform goal-oriented agents.
Results on the Validation Unseen split.
R4R Performance
● Ablation studies
  ○ Fidelity-oriented agents attend more carefully to the instructions.
Results on the Validation Unseen split.
Recent Work
● Effective and General Evaluation for Instruction Conditioned Navigation using Dynamic Time Warping - https://arxiv.org/abs/1907.05446
● A suite of DTW-based [1] evaluation metrics for general instruction-conditioned robotic tasks, including VLN.
[1] Berndt and Clifford. Using Dynamic Time Warping to Find Patterns in Time Series, AAAI-94 Workshop, 1994.
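For completeness, a sketch of the DTW family: classic DTW [1] plus a normalized variant in an exponential form. The exact normalization here is an assumption; see the arXiv paper above for the definitive nDTW/SDTW definitions.

```python
import math

def dtw(pred_path, ref_path, dist):
    """Dynamic-time-warping cost between two node sequences: minimum
    total pairwise distance over all monotonic alignments."""
    m, n = len(pred_path), len(ref_path)
    inf = float("inf")
    dp = [[inf] * (n + 1) for _ in range(m + 1)]
    dp[0][0] = 0.0
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = dist(pred_path[i - 1], ref_path[j - 1])
            dp[i][j] = cost + min(dp[i - 1][j], dp[i][j - 1], dp[i - 1][j - 1])
    return dp[m][n]

def ndtw(pred_path, ref_path, dist, d_th=3.0):
    """Normalized DTW, assumed form: exp(-DTW / (|R| * d_th))."""
    return math.exp(-dtw(pred_path, ref_path, dist) / (len(ref_path) * d_th))
```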
Conclusion
● Data: R4R
● Evaluation: CLS
● Agent training: fidelity-oriented agents, rewarded via CLS
Thank You! Questions?