The experiments for You -Do, I- Learn Presenter: Wenguang Mao - PowerPoint PPT Presentation

The experiments for “You -Do, I- Learn” Presenter: Wenguang Mao Instructor: Kristen Grauman Author for the paper: Dima Damen

Recap of the Paper Gaze attention Gaze point Clustering position Clustering TRO MOI Gaze area appearance Neighbor frames

Experiment Setup • Dataset: Bristol Egocentric Object Interactions Dataset

Experiment Setup • Dataset: Bristol Egocentric Object Interactions Dataset • Egocentric videos at 6 locations • Gaze point on each frame • Gaze positions in 3D space • Gaze fixation on each frame • Ground truth positions of TROs • 3D map for each location, 3D positions of the camera for each frame, …… • Code: VLFeat, Matlab toolboxes, and programs written by myself

Why Need Gaze Info • Given an egocentric image, which part of the image do you think I am focusing on? • Center of image? • Blue point: center of image • Red point: gaze point

Why Need Gaze Info • The distance between the center and the gaze point (a) Desk (b) Door

Why Need Gaze Info • The distance between the center and the gaze point Center of image is not good approximation for the gaze point (a) Desk (b) Door

Why Need Gaze Info • The distance between the center and the gaze point ( during gaze fixation ) (a) Desk (b) Door

Why Need Gaze Info • The distance between the center and the gaze point ( during gaze fixation ) Center of image is not good approximation for the gaze point Even during attention period (a) Desk (b) Door

How Gaze Fixation Helps • Do you think there is any TRO in the video clips • Red dot: gaze point

How Gaze Fixation Helps • Do you think there is any TRO in the video clips • Red dot: gaze point Gaze fixation helps identify a TRO

How Gaze Fixation Helps • Do you think there is any TRO in the video clips • Red dot: gaze point

How Gaze Fixation Helps • Do you think there is any TRO in the video clips • Red dot: gaze point Gaze fixation alone is far from enough to find TROs

How 3D Positions of Gaze Help • Blue circles: 3D positions of gazes in a video • Red cross: ground truth positions of TRO (a) Without gaze fixation filtering (a) With gaze fixation filtering

How 3D Positions of Gaze Help • Blue circles: 3D positions of gazes in a video • Red cross: ground truth positions of TRO 3D gaze positions are very helpful to identify TROs (a) Without gaze fixation filtering (a) With gaze fixation filtering

Clustering for Gaze 3D positions • Right number of clusters (kmeans) • Yellow square: cluster center (a) Without gaze fixation filtering (b) With gaze fixation filtering

Clustering for Gaze 3D positions • Right number of clusters (kmeans) • Yellow square: cluster center With the knowledge of right number of TROs, they can be easily identified using 3D gaze positions (a) Without gaze fixation filtering (b) With gaze fixation filtering

Clustering for Gaze 3D positions • Too less clusters • Yellow square: cluster center (a) Without gaze fixation filtering (b) With gaze fixation filtering

Clustering for Gaze 3D positions • Too less clusters • Yellow square: cluster center If underestimating the number, low precision and low recall for identifying TROs (a) Without gaze fixation filtering (b) With gaze fixation filtering

Clustering for Gaze 3D positions • Too much clusters • Yellow square: cluster center (a) Without gaze fixation filtering (b) With gaze fixation filtering

Clustering for Gaze 3D positions • Too much clusters • Yellow square: cluster center If overestimating the number, high recall and low precision (a) Without gaze fixation filtering (b) With gaze fixation filtering

Spectral Clustering • Right number of clusters (a) kmeans (b) spectral

Spectral Clustering • Right number of clusters Same with K-means (a) kmeans (b) spectral

Spectral Clustering • Too less clusters (a) kmeans (b) spectral

Spectral Clustering • Too less clusters Same with k-means (a) kmeans (b) spectral

Spectral Clustering • Too much clusters (a) kmeans (b) spectral

Spectral Clustering • Too much clusters Outperform k-means, high precision and high recall. (a) kmeans (b) spectral

What is the Limitation of Gaze Positions • Can we only use 3D gaze positions? • No, because of moving TRO • How to solve this problem? • Appearance

Appearance • How HoG features represent an image

Appearance • How HoG features represent an image HoG is good to describe the boundary

Identify TROs based on Appearance • Extract HoG from the region near the gaze point for each frame • Generate BoW representation for each frame • Perform clustering on frames • Use the frame closest to the center to represent each cluster • Compare the appearance of center frames with the ground truth

Appearance • Five TROs around the desk tape socket screwdriver charger box

Results Success (box) Success (tape) Duplicated (box) Success (charger) Failure

Results Success (box) Success (tape) Duplicated (box) Missing two TROs, the appearance is not as effective as the position Success (charger) Failure

Using Neighbor frames Failure Success (charger) Success (box) Success (driver) Success (tape)

Using Neighbor frames Failure Success (charger) Success (box) Missing one TRO, using neighbor frames is helpful to improve performance Success (driver) Success (tape)

Over-Estimating No. of Clusters Failure Success (box) Success (driver) Success (charger) Success (tape) Duplicated (box) Duplicated (box) Duplicated (driver)

Over-Estimating No. of Clusters Failure Success (box) Success (driver) Success (charger) Missing one TROs, over-estimating is helpful to identify more TROs Success (tape) Duplicated (box) Duplicated (box) Duplicated (driver)

Also Using Neighbor frames Failure Success (tape) Success (socket) Duplicated (socket) Success (driver) Success (box) Success (charger) Duplicated (box)

Also Using Neighbor frames Failure Success (tape) Success (socket) Duplicated (socket) Finding all TROs Success (driver) Success (box) Success (charger) Duplicated (box)

Conclusion • Gaze information is important and necessary for egocentric videos, and the center of image is not a good approximation • Gaze fixation is helpful for identifying TROs, but itself is not enough • 3D positions of gaze give rich information for TROs, but clustering method and the estimation on the number of TROs is critical • Use spectral clustering and do not worry about overestimating • Appearance is another important feature for identifying TROs • Using neighbor frames is beneficial to improve performance • Over-estimating No. of TROs is helpful to reduce false negative

The experiments for You -Do, I- Learn Presenter: Wenguang Mao - PowerPoint PPT Presentation

The experiments for You -Do, I- Learn Presenter: Wenguang Mao Instructor: Kristen Grauman Author for the paper: Dima Damen Recap of the Paper Gaze attention Gaze point Clustering position Clustering TRO MOI Gaze area appearance

Experiments on deflection of charged Experiments on deflection of charged Experiments on

Chapter 8. Experiments Chapter 8. Experiments Experimental Research Experimental Research

Experimental Design and the Search for Quasi-Experiments Department of Government London School

Experiments Philosophy of Economics University of Virginia Matthias Brinkmann Contents 1.

OBT Formation in Night Experiments and OBT Formation in Night Experiments and OBT Formation in

Team Introduction Experiments Outreach Problem Project Brainstorm Introduction Introduction

Designs Chapter 11 Quasi-Experimentation Quasi-experiments resemble experiments, but lack

WISP searches by Tokyo tabletop experiments group UTokyo tabletop experiments group Toshio

Collider Experiments and India Sunanda Banerjee January, 2019 Experiments in High Energy Physics

Feeding experiments with selected fatty acid Feeding experiments with selected fatty acid

Hagner experiments status after 22 years Per-Erik Wikberg SLU Ume Hagner experiments

Climate change experiments Climate change experiments with a Hi- - res. climate model res.

Remote Participation in in Remote Participation physical experiments physical experiments

Some aspects of Design of Experiments Nancy Reid University of Toronto June 28, 2007

Future SK- -Experiments Experiments Future SK US-Japan Seminar Decay and Mass

Planned OBT Experiments at CRL Planned OBT Experiments at CRL September 14, 2011 Sang Bog Kim

On Improving the Efficiency and Robustness of Table Storage Mechanisms for Tabled Evaluation

Acknowledgements Krzysztof Gajos Corin Anderson Mary Czerwinski Pedro Domingos

Matthew Series Lesson #191 March 11, 2018 Dean Bible Ministries www.deanbibleministries.org Dr.

Five Star Quality Rating System Design For Nursing Home Compare Nathan Shaw RN, BSN, MBA, LHRM,

Photon Detection System (PDS) and SN triggering Pierre Lasorak 1 Introduction Outline

Coat of arms Knights couldnt tell who was on their side and who the enemies were so they

Some thoughts on safety of machine learning Fabio Roli University of Cagliari, Italy HUML

Convergence and error estimates for the compressible Navier-Stokes equations Antonin Novotny ( 1 )