From heuristic to optimal models in naturalistic visual search
Angela Radulescu*, Bas van Opheusden*, Fred Callaway, Thomas Griffiths & James Hillis
Bridging AI and Cognitive Science workshop, ICLR, April 24th, 2020
An everyday problem… …where are the keys?
Resource allocation in visual search
• Main contribution: frame visual search as a reinforcement learning problem
‣ Fixations as information-gathering actions
‣ Do people employ optimal strategies?
• Challenges:
‣ Representing the state space: the world is high-dimensional; what features does the visual system have access to?
‣ Finding the optimal policy: the reward function is sparse; how to balance the cost of sampling against performance?
Naturalistic visual search in VR
• VR + gaze tracking, fixed camera location
• Cluttered room, 1 target among many distractors
• “Find the target within 8 seconds”
• 6 different rooms × 5 locations per room × 10 trials per location = 300 unique scenes
• Some trials assisted
[Figure: example trial, labeled from Start to End]
Meta-level Markov Decision Process (Callaway & Griffiths, 2018)
• Latent: {F_true, i_true}
‣ Scene features and target identity, unknown to the agent
• States: {F, J, f_target}
‣ Mean and precision of each feature for each object
• Actions: {o, ⊥}
‣ Fixate on object o, or terminate
• Transitions: measure X ~ N(F_true, J_meas)
‣ J_meas decreases with distance from o
‣ Integrate X into F and J with Bayesian cue combination
• Rewards: if fixating o, then R = -c; if ⊥, then R = 1 if argmax P(target | F, J) = i_true and 0 otherwise
‣ Reward the agent when the most probable target given the state matches the true target
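To make the transition and reward structure concrete, below is a minimal NumPy sketch of one step of these dynamics. It assumes a single Gaussian feature per object, a toy 1-D layout of objects, and a hypothetical precision falloff with distance from the fixated object; all function names and constants are illustrative, not the authors' implementation.

```python
import numpy as np

def fixate(F, J, F_true, fixated, positions, rng):
    """Meta-MDP transition for a fixation: sample a noisy measurement of every
    object's feature and fold it into the belief (F, J) by precision-weighted
    (Bayesian) cue combination. Measurement precision J_meas falls off with
    distance from the fixated object."""
    J_meas = 4.0 / (1.0 + (positions - positions[fixated]) ** 2)
    X = F_true + rng.standard_normal(F_true.shape) / np.sqrt(J_meas)  # X ~ N(F_true, 1/J_meas)
    J_new = J + J_meas
    F_new = (J * F + J_meas * X) / J_new
    return F_new, J_new

def posterior_over_target(F, J, f_target):
    """P(object i is the target | belief): Gaussian match between each object's
    believed feature and the known target feature."""
    log_lik = 0.5 * np.log(J) - 0.5 * J * (F - f_target) ** 2
    p = np.exp(log_lik - log_lik.max())
    return p / p.sum()

def terminal_reward(F, J, f_target, i_true):
    """R = 1 on terminating (⊥) if the most probable target under the belief
    is the true target, 0 otherwise."""
    return float(np.argmax(posterior_over_target(F, J, f_target)) == i_true)

# Illustrative episode fragment: 8 objects on a line, diffuse prior, one fixation
# (which would earn reward -c), then the posterior the agent would terminate on.
rng = np.random.default_rng(0)
n = 8
F_true, positions = rng.standard_normal(n), np.arange(n, dtype=float)
i_true = 3
F, J = np.zeros(n), np.full(n, 1e-3)
F, J = fixate(F, J, F_true, fixated=i_true, positions=positions, rng=rng)
print(posterior_over_target(F, J, F_true[i_true]))
```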
Challenge I: representing the belief space Challenge II: finding the optimal policy
Which features to include?
[Diagram: objects and target, described by attributes such as color and shape]
(Treisman & Gelade, 1980; Horowitz & Wolfe, 2017)
Which features to include?
• Shape: 3D mesh → D2 distribution → PCA
• Color: 2D texture → CIELAB → PCA
[Figure: similarity structure of objects A and B under the full vs. partial (3 PCs) representations]
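As a rough illustration of this pipeline, the sketch below computes a D2 shape distribution from a mesh and a CIELAB color histogram from a texture, then reduces a set of descriptors with PCA. It assumes the trimesh, scikit-image, and scikit-learn libraries; the sampling counts and bin sizes are illustrative, not the settings used in the slides.

```python
import numpy as np
import trimesh                                    # assumed dependency for mesh handling
from scipy.spatial.distance import pdist
from skimage.color import rgb2lab                 # assumed dependency for CIELAB conversion
from sklearn.decomposition import PCA

def d2_shape_descriptor(mesh_path, n_points=1024, n_bins=64):
    """D2 shape distribution (Osada et al.): histogram of pairwise distances
    between random points sampled on the object's surface."""
    mesh = trimesh.load(mesh_path)                # assumes the file loads as a single mesh
    points, _ = trimesh.sample.sample_surface(mesh, n_points)
    dists = pdist(points)
    hist, _ = np.histogram(dists / dists.max(), bins=n_bins, range=(0, 1), density=True)
    return hist

def color_descriptor(texture_rgb, n_bins=8):
    """CIELAB color histogram of an object's 2D texture (H x W x 3 RGB array)."""
    lab = rgb2lab(texture_rgb)
    hist, _ = np.histogramdd(lab.reshape(-1, 3), bins=n_bins)
    return hist.ravel() / hist.sum()

def reduce_descriptors(descriptors, n_components=3):
    """Project a stack of descriptors onto their first few principal components
    (the 'partial (3 PCs)' representation in the slide)."""
    return PCA(n_components=n_components).fit_transform(np.stack(descriptors))
```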
Shape and color predict gaze
[Figure: gaze on objects]
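The slides do not show the underlying analysis, but one simple way to test such a claim is a softmax choice model in which the probability of fixating each object depends on its shape and color similarity to the target. The sketch below, with synthetic data and illustrative variable names (not the authors' analysis), fits that model by maximum likelihood; positive fitted weights would indicate that the corresponding feature predicts gaze.

```python
import numpy as np
from scipy.optimize import minimize

def neg_log_likelihood(weights, shape_sim, color_sim, fixated):
    """Softmax choice model: P(fixate object i) ∝ exp(w_shape * shape_sim_i + w_color * color_sim_i).
    shape_sim, color_sim: (n_fixations, n_objects) similarity-to-target matrices.
    fixated: (n_fixations,) index of the object fixated at each step."""
    w_shape, w_color = weights
    logits = w_shape * shape_sim + w_color * color_sim
    logits = logits - logits.max(axis=1, keepdims=True)          # numerical stability
    log_p = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_p[np.arange(len(fixated)), fixated].sum()

# Illustrative synthetic data: 200 fixations over 8 candidate objects.
rng = np.random.default_rng(0)
shape_sim = rng.random((200, 8))
color_sim = rng.random((200, 8))
fixated = rng.integers(0, 8, size=200)

fit = minimize(neg_log_likelihood, x0=np.zeros(2), args=(shape_sim, color_sim, fixated))
print("fitted shape/color weights:", fit.x)   # positive weight => that feature predicts gaze
```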
Challenge I: representing the belief space Challenge II: finding the optimal policy
“Ideal observer” model of visual search (Najemnik & Geisler, 2005; Yang, Lengyel & Wolpert, 2017)
[Loop:] calculate/update posterior probabilities → if the maximum exceeds a criterion, STOP; otherwise move eyes to the object most likely to be the target → sample information at the fixated location → repeat
• Can be expressed as a policy in the meta-MDP, but not necessarily optimal
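Expressed over the meta-MDP's belief state, this heuristic reduces to a one-line decision rule per step: terminate once the posterior over targets is confident enough, otherwise fixate its current mode. A minimal sketch, with an arbitrarily chosen criterion value:

```python
import numpy as np

TERMINATE = -1   # stands in for the ⊥ action of the meta-MDP

def ideal_observer_policy(posterior, criterion=0.9):
    """One decision of the heuristic policy: stop once the posterior over
    targets exceeds the criterion, otherwise fixate its current mode."""
    if posterior.max() > criterion:
        return TERMINATE
    return int(np.argmax(posterior))

print(ideal_observer_policy(np.array([0.10, 0.70, 0.20])))   # -> 1 (keep looking)
print(ideal_observer_policy(np.array([0.02, 0.95, 0.03])))   # -> -1 (TERMINATE)
```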
Optimizing meta-level return with deep reinforcement learning
• Proximal Policy Optimization (PPO; Schulman et al., 2017), implemented with tf-agents
• 10 replications, manually tuned hyper-parameters
• Manual tweaking of input representation & initialization
[Architecture diagram: object locations, object features, target features, and the posterior feed into dense layers that output the policy π and value V]
[Learning curve: reward vs. simulated episodes (millions); the NG (Najemnik & Geisler) ideal-observer baseline shown for comparison]
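A minimal sketch of how such an agent might be set up with tf-agents is shown below. The observation layout, network sizes, learning rate, and number of epochs are illustrative guesses rather than the authors' settings, and the meta-MDP environment needed for the collection/training loop is omitted.

```python
import tensorflow as tf
from tf_agents.agents.ppo import ppo_agent
from tf_agents.networks import actor_distribution_network, value_network
from tf_agents.specs import tensor_spec
from tf_agents.trajectories import time_step as ts

n_objects = 8
# Observation: a flat encoding of the belief state (in the slides: object locations,
# object features, target features, and the posterior). The dimensionality is a guess.
observation_spec = tf.TensorSpec(shape=(4 * n_objects,), dtype=tf.float32, name='belief')
# Actions 0..n_objects-1 fixate an object; action n_objects stands for terminate (⊥).
action_spec = tensor_spec.BoundedTensorSpec(shape=(), dtype=tf.int32,
                                            minimum=0, maximum=n_objects)
time_step_spec = ts.time_step_spec(observation_spec)

actor_net = actor_distribution_network.ActorDistributionNetwork(
    observation_spec, action_spec, fc_layer_params=(128, 128))
value_net = value_network.ValueNetwork(observation_spec, fc_layer_params=(128, 128))

agent = ppo_agent.PPOAgent(
    time_step_spec, action_spec,
    optimizer=tf.keras.optimizers.Adam(learning_rate=3e-4),
    actor_net=actor_net, value_net=value_net,
    num_epochs=10)
agent.initialize()
# Training then alternates between collecting simulated meta-MDP episodes with
# agent.collect_policy (via a driver and replay buffer) and calling agent.train()
# on the collected trajectories.
```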
Does the optimal policy match humans?
[Figures: Model vs. Human example gaze traces and fixated objects, labeled from Start to End]
Which features drive human search?
[Figure: Model vs. Human gaze traces, from Start to End]
Ongoing work
• Alternative schemes for extracting low-dimensional feature representations of objects
‣ Deep convolutional neural network models of the human ventral visual stream (Yamins et al., 2014; Fan et al., 2019)
‣ MeshNet model of 3D shape representation (Feng et al., 2018)
• Investigating the learned policy
‣ Is it optimal?
Thank you!