Following Gaze in Video
A. Recasens et al.
Presented by: Keivaun Waugh and Kapil Krishnakumar
Background
● Given a face in one frame, how can we figure out where that person is looking?
● Target object might not be in the same frame
Sample Results
(Figure panels: Input Video | Gaze Density | Gazed Area)
Architecture
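The architecture figure does not survive text extraction. Based on the pathways named elsewhere in this deck (saliency, head pose, and a transformation pathway between source and target frames), here is a minimal PyTorch sketch of the split-pathway idea; every layer size and module name is an illustrative assumption, not the paper's exact design:

```python
import torch
import torch.nn as nn

class SplitPathwayGaze(nn.Module):
    """Illustrative split-pathway model: separate branches for saliency,
    head pose, and source->target transformation, fused into a 20x20
    gaze grid. All layer sizes are assumptions, not the paper's."""
    def __init__(self):
        super().__init__()
        # Saliency pathway: looks at the target frame only.
        self.saliency = nn.Sequential(
            nn.Conv2d(3, 16, 5, stride=4), nn.ReLU(),
            nn.AdaptiveAvgPool2d(20),
        )
        # Gaze pathway: head crop + head location -> coarse gaze "cone".
        self.gaze = nn.Sequential(
            nn.Linear(3 * 64 * 64 + 2, 256), nn.ReLU(),
            nn.Linear(256, 400),  # reshaped to a 20x20 direction map
        )
        # Transformation pathway: source and target frames stacked.
        self.transform = nn.Sequential(
            nn.Conv2d(6, 16, 5, stride=4), nn.ReLU(),
            nn.AdaptiveAvgPool2d(20),
        )
        self.fuse = nn.Conv2d(16 + 1 + 16, 1, 1)

    def forward(self, source, target, head_crop, head_xy):
        sal = self.saliency(target)                                   # (B,16,20,20)
        g = self.gaze(torch.cat([head_crop.flatten(1), head_xy], 1))  # (B,400)
        g = g.view(-1, 1, 20, 20)
        tr = self.transform(torch.cat([source, target], 1))           # (B,16,20,20)
        return self.fuse(torch.cat([sal, g, tr], 1))                  # (B,1,20,20)
```

The point the conclusions slide makes is visible in the structure itself: each input modality gets its own branch before fusion.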
VideoGaze Dataset
● 160k annotations of video frames from the MovieQA dataset
● Annotations:
  ○ Source Frame
  ○ Head Location
  ○ Body
  ○ Target Frame (5 per source frame)
    ■ Gaze Location
    ■ Time difference between Source and Target
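To make the annotation structure concrete, here is one record written out as a Python dict; all field names and values are hypothetical illustrations, not the dataset's actual schema:

```python
# One VideoGaze-style annotation record (hypothetical field names and
# values; the dataset's actual serialization may differ).
annotation = {
    "source_frame": "clip_0042/frame_000150.jpg",
    "head_bbox":    (0.31, 0.12, 0.45, 0.30),  # normalized x1, y1, x2, y2
    "body_bbox":    (0.28, 0.10, 0.55, 0.85),
    "targets": [                                # up to 5 per source frame
        {
            "target_frame": "clip_0042/frame_000175.jpg",
            "gaze_xy":      (0.62, 0.48),       # normalized gaze location
            "dt_frames":    25,                 # time offset from source
        },
        # ... up to 4 more target entries
    ],
}
```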
Experiments
● Naive network architecture
  ○ Don't segment network into different pathways
  ○ Concatenate all inputs and predict directly
● Replace transformation pathway with SIFT+RANSAC affine fitting
● Various neighboring frame prediction windows
● Examine failure cases
  ○ "Look cone" doesn't take into account the eye position
  ○ Other failures
Naive Model
Naive Architecture
● Use fusion of target frame and source frame to predict gaze location
(Figure: source and target frames fed through AlexNet and fused to predict a 20x20 gaze grid)
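A minimal PyTorch sketch of this baseline, assuming torchvision's AlexNet as a shared feature extractor and a flattened 20x20 output grid; the fusion layer sizes are assumptions:

```python
import torch
import torch.nn as nn
from torchvision.models import alexnet

class NaiveGaze(nn.Module):
    """Naive baseline sketch: run source and target frames through a shared
    AlexNet conv trunk, concatenate the features, and regress a 20x20 gaze
    grid directly. Fusion-layer sizes are assumptions."""
    def __init__(self):
        super().__init__()
        self.features = alexnet(weights=None).features  # conv trunk only
        self.pool = nn.AdaptiveAvgPool2d((6, 6))
        self.head = nn.Sequential(
            nn.Flatten(),
            nn.Linear(2 * 256 * 6 * 6, 1024), nn.ReLU(),
            nn.Linear(1024, 20 * 20),  # flattened 20x20 grid of gaze scores
        )

    def forward(self, source, target):
        f_src = self.pool(self.features(source))
        f_tgt = self.pool(self.features(target))
        logits = self.head(torch.cat([f_src, f_tgt], dim=1))
        return logits.view(-1, 20, 20)

# Usage: model = NaiveGaze(); out = model(src_batch, tgt_batch)  # (B, 20, 20)
```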
Alternate Transformation Pathway
Architecture
● Replace deep CNN pathway with traditional SIFT+RANSAC affine warp
(Figure: transformation pathway replaced by a SIFT + RANSAC block)
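A minimal OpenCV sketch of that replacement, assuming grayscale frames as NumPy arrays; it estimates the source-to-target affine transform from matched SIFT keypoints with a RANSAC fit:

```python
import cv2
import numpy as np

def estimate_affine(source_gray, target_gray):
    """Estimate a 2x3 affine warp from source to target frame using SIFT
    keypoints matched across frames and a RANSAC fit. Thresholds are
    common defaults, not necessarily the presenters' settings."""
    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(source_gray, None)
    kp2, des2 = sift.detectAndCompute(target_gray, None)
    if des1 is None or des2 is None:
        return None  # not enough texture to match

    # Lowe ratio test to discard ambiguous correspondences.
    matcher = cv2.BFMatcher()
    good = [m for m, n in matcher.knnMatch(des1, des2, k=2)
            if m.distance < 0.75 * n.distance]
    if len(good) < 3:
        return None  # an affine fit needs at least 3 correspondences

    src_pts = np.float32([kp1[m.queryIdx].pt for m in good])
    dst_pts = np.float32([kp2[m.trainIdx].pt for m in good])
    M, inliers = cv2.estimateAffine2D(src_pts, dst_pts,
                                      method=cv2.RANSAC,
                                      ransacReprojThreshold=3.0)
    return M  # 2x3 matrix, or None if RANSAC failed
```

The deck's "dense" variant presumably computes descriptors on a dense grid rather than at detected keypoints, which is consistent with the much longer runtime reported on the Runtimes slide.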
Quantitative Results
Results

AUC (higher better) | KL Divergence (lower better) | L2 Dist (lower better) | Description
73.7                | 8.048                        | 0.225                  | Normal model with transformation pathway
60.2                | 6.604                        | 0.294                  | Normal model with sparse affine
60.2                | 6.6604                       | 0.294                  | Normal model with dense affine
60.9                | 6.641                        | 0.242                  | Naive model
56.9                | 28.39                        | 0.437                  | Random
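For concreteness, a sketch of how these three metrics could be computed from a predicted gaze heatmap and a ground-truth gaze point; the paper's exact evaluation protocol (e.g. any smoothing of the ground-truth map) may differ:

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def gaze_metrics(pred_map, gt_xy, eps=1e-8):
    """pred_map: (H, W) non-negative predicted gaze heatmap (e.g. after
    softmax); gt_xy: normalized (x, y) ground-truth gaze point.
    Evaluation details here are assumptions, not the paper's protocol."""
    h, w = pred_map.shape
    gx, gy = int(gt_xy[0] * (w - 1)), int(gt_xy[1] * (h - 1))

    # AUC: treat the heatmap as scores against a one-hot ground-truth map.
    gt_map = np.zeros((h, w)); gt_map[gy, gx] = 1.0
    auc = roc_auc_score(gt_map.ravel(), pred_map.ravel())

    # KL divergence between normalized ground truth and prediction.
    p = gt_map.ravel() + eps; p /= p.sum()
    q = pred_map.ravel() + eps; q /= q.sum()
    kl = np.sum(p * np.log(p / q))

    # L2 distance between predicted argmax and ground truth (normalized).
    py, px = np.unravel_index(pred_map.argmax(), pred_map.shape)
    l2 = np.hypot(px / (w - 1) - gt_xy[0], py / (h - 1) - gt_xy[1])
    return auc, kl, l2
```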
Qualitative Results
Results
● Input video is 150 frames long
(Figure panels: Full Video | Cropped Head | What I'm looking at)
Results - Search 150 Neighboring Frames
(Figure: original transformation pathway vs. naive model)
Results - Search 150 Neighboring Frames
(Figure: sparse SIFT affine warp vs. dense SIFT affine warp)
Results - Search 25 Neighboring Frames
(Figure: original transformation pathway vs. naive model)
Results - Search 25 Neighboring Frames
(Figure: sparse SIFT affine warp vs. dense SIFT affine warp)
Target in Same Frame
(Figure: original video, original transformation pathway, and naive model)
Target in Same Frame
(Figure: sparse SIFT affine warp vs. dense SIFT affine warp)
Runtimes
● GTX 1070 and Haswell Core i5
● Generating results is CPU bound
● 5 second video with 150 frame search width:
  ○ Deep transformation pathway: 6.5 minutes
  ○ Sparse affine: 10.5 minutes
  ○ Dense affine: 32 minutes
(Figure: CPU at 100% usage, GPU at 0% usage when running the model with the transformation pathway)
Failure Cases
(Figures, two examples: input video alongside the original transformation pathway's output)
Conclusions
● Separating input modalities for Saliency and Head Pose provides significant information to the model.
  ○ Illustrates the importance of hand-crafted architecture even though features are automatically discovered
● Head Direction != Eye Direction
● Frame predictor window selection determines whether a match can be found or not.