Daily Activity Recognition Combining Gaze Motion and Visual Features Yuki Shiga, Takumi Toyama, Yuzuko Utsumi, Andreas Dengel, Koichi Kise
Outline • Introduction • Proposed Method • Experiment • Conclusion
Introduction • Activity recognition has drawn public attention • We focus on vision-based and gaze motion-based methods • These methods deal with activities that involve eye movements [Figure: the two modalities, Vision and Gaze Motion]
Eye Tracker • An eye tracker is useful for recognizing activities that involve eye movements • Records a scene video as well as the gaze position data [Figure: scene image with the gaze position, i.e., where the user fixates]
Related Works • Gaze motion-based activity recognition: Bulling et al., "Eye movement analysis for activity recognition using electrooculography" [1] • Vision-based activity recognition: Hipiny and Mayol-Cuevas, "Recognising Egocentric Activities from Gaze Regions with Multiple-Voting Bag of Words" [2] • Each of these works used only a single modality (gaze motion or vision)
[1] Bulling, A., Ward, J., Gellersen, H., and Tröster, G. Eye movement analysis for activity recognition using electrooculography. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(4), 741-753, 2011.
[2] Hipiny, I. M. and Mayol-Cuevas, W. Recognising Egocentric Activities from Gaze Regions with Multiple-Voting Bag of Words. Technical Report CSTR-12-003, 2012.
Purpose • An activity can be expressed by "how eyes move" and also by "what eyes see" • We use both the vision-based and the gaze motion-based modality for activity recognition
Purpose • Propose a method combining a gaze motion-based method and a vision-based method • Verify the hypothesis: the combination of vision and gaze motion can improve recognition of activities that involve eye movements
Outline • Introduction • Proposed Method • Experiment • Conclusion
Overview • The eye tracker records gaze points and scene images • Gaze motion feature → classifier → output • Visual feature → classifier → output • The two classifier outputs are fused into the final result
Overview (step in focus: Gaze Motion Feature)
Gaze Motion Feature • The method proposed by Bulling et al. [1] • Fixations and saccades are detected from the gaze data • Each saccade is converted into a symbol representing its size and direction (e.g., "R R r r r L R r r r R") • N-gram and statistical features are computed from the resulting symbol sequence
[1] Bulling, A., Ward, J., Gellersen, H., and Tröster, G. Eye movement analysis for activity recognition using electrooculography. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(4), 741-753, 2011.
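As a rough illustration of this encoding, here is a minimal Python sketch; the thresholds, the horizontal-only handling, and the symbol alphabet are simplifying assumptions, not the exact parameters of Bulling et al. [1]:

```python
from collections import Counter

def encode_saccades(gaze_xy, small_thresh=30, large_thresh=100):
    """Encode horizontal gaze jumps as symbols: 'l'/'r' for small
    saccades, 'L'/'R' for large ones; fixations are dropped."""
    symbols = []
    for (x0, _), (x1, _) in zip(gaze_xy[:-1], gaze_xy[1:]):
        dx = x1 - x0
        if abs(dx) < small_thresh:   # below threshold: treat as fixation
            continue
        if abs(dx) < large_thresh:
            symbols.append('r' if dx > 0 else 'l')
        else:
            symbols.append('R' if dx > 0 else 'L')
    return symbols

def ngram_feature(symbols, n=2):
    """Count n-grams over the saccade symbol sequence."""
    return Counter(tuple(symbols[i:i + n]) for i in range(len(symbols) - n + 1))

# Example: gaze x-positions during a reading-like sweep with a return jump
gaze = [(0, 0), (40, 0), (80, 0), (85, 0), (200, 0), (20, 0)]
syms = encode_saccades(gaze)
print(syms, ngram_feature(syms))  # ['r', 'r', 'R', 'L'] and its bigram counts
```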
Overview (step in focus: Visual Feature)
Visual Feature • Crop a region around the gaze point to remove irrelevant regions
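A minimal sketch of this cropping step, assuming the 300 × 300 pixel window and the 1280 × 960 scene frames listed later in the experimental conditions (the function and variable names are ours):

```python
import numpy as np

def crop_around_gaze(frame, gaze_x, gaze_y, size=300):
    """Cut a size x size patch centred on the gaze point, clamping
    the window so it stays entirely inside the frame."""
    h, w = frame.shape[:2]
    half = size // 2
    x0 = int(np.clip(gaze_x - half, 0, w - size))
    y0 = int(np.clip(gaze_y - half, 0, h - size))
    return frame[y0:y0 + size, x0:x0 + size]

# 1280 x 960 scene image, gaze near the top-left corner
frame = np.zeros((960, 1280, 3), dtype=np.uint8)
patch = crop_around_gaze(frame, gaze_x=50, gaze_y=40)
print(patch.shape)  # (300, 300, 3)
```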
Local Feature Extraction • Interest points are obtained by dense sampling • Local features (PCA-SIFT) are extracted from each point
Convert to Global Feature • Learning images: k-means clustering over the local features yields k centroids (visual words) • Test image: nearest-neighbor search assigns each local feature to a visual word, yielding the global feature
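The bag-of-visual-words pipeline of the last two slides can be sketched as follows; since PCA-SIFT is not available in standard libraries, plain SIFT stands in for it here, and the grid step and codebook size are illustrative assumptions:

```python
import cv2
import numpy as np
from sklearn.cluster import KMeans

def dense_keypoints(img, step=16, size=16):
    """Interest points on a regular grid (dense sampling)."""
    h, w = img.shape[:2]
    return [cv2.KeyPoint(float(x), float(y), size)
            for y in range(step, h, step) for x in range(step, w, step)]

sift = cv2.SIFT_create()

def local_descriptors(img):
    """SIFT descriptors at the dense grid points (stand-in for PCA-SIFT)."""
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    _, desc = sift.compute(gray, dense_keypoints(gray))
    return desc

def build_codebook(train_images, k=100):
    """k-means over all training descriptors; centroids = visual words."""
    all_desc = np.vstack([local_descriptors(im) for im in train_images])
    return KMeans(n_clusters=k, n_init=10).fit(all_desc)

def global_feature(img, codebook):
    """Nearest-word assignment, then a normalized word histogram."""
    words = codebook.predict(local_descriptors(img))
    hist = np.bincount(words, minlength=codebook.n_clusters)
    return hist / max(hist.sum(), 1)
```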
Overview (step in focus: Classifier)
Classifier • SVM with probability estimation • Two classifiers are trained, one for the visual features and one for the gaze motion features [Figure: labelled feature vectors for learning, e.g., Read / Write / Type]
Classifier [Figure: a feature vector from the test data is fed to the classifier]
Classifier [Figure: the classifier outputs a probability for each activity, e.g., Read / Write / Type]
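A minimal sketch of this setup with scikit-learn, using random stand-in features in place of the real ones; SVC's probability=True enables Platt-scaled class probabilities via predict_proba:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Dummy stand-ins for the real features: 60 samples, 6 activities
y_train = np.repeat(np.arange(6), 10)    # activity labels
X_vision = rng.random((60, 100))         # e.g., visual-word histograms
X_gaze = rng.random((60, 40))            # e.g., gaze n-gram features

# One SVM per modality, each with probability estimation enabled
clf_vision = SVC(kernel="rbf", probability=True).fit(X_vision, y_train)
clf_gaze = SVC(kernel="rbf", probability=True).fit(X_gaze, y_train)

p_vision = clf_vision.predict_proba(X_vision[:1])  # shape (1, 6)
p_gaze = clf_gaze.predict_proba(X_gaze[:1])        # shape (1, 6)
```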
Overview (step in focus: Fusion)
Fusion • Inputs: per-activity probabilities from the gaze motion classifier and from the vision classifier
Fusion • The two probability distributions are averaged to obtain the combined probability
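A minimal sketch of this averaging fusion (the class order and the probability values are illustrative):

```python
import numpy as np

def fuse(p_vision, p_gaze, classes):
    """Average the per-class probabilities of the two modalities
    and pick the activity with the highest combined score."""
    p = (np.asarray(p_vision) + np.asarray(p_gaze)) / 2.0
    return classes[int(np.argmax(p))], p

classes = ["watch", "write", "read", "type", "chat", "walk"]
label, p = fuse([0.10, 0.50, 0.20, 0.10, 0.05, 0.05],
                [0.20, 0.30, 0.30, 0.10, 0.05, 0.05], classes)
print(label)  # "write": highest averaged probability
```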
Outline • Introduction • Proposed Method • Experiment • Conclusion
Experiments • Baseline: same user, same target objects / environments. Tests whether the combined method performs better than the individual vision-based and gaze motion-based methods. • Cross-scene: same user, different target objects / environments. Tests whether the combined method performs when target objects differ between training and test data. • Cross-user: different user, same target objects / environments. Tests whether the combined method performs when the test data contains a person different from the training data.
Conditions of All Experiments • Sampling rate of the eye tracker: 30 Hz • Resolution of the scene camera: 1280 × 960 pixels • Visual features are extracted from 300 × 300 pixels around the gaze points • Gaze motion features are extracted from 700 gaze samples
Activity List • Watch a video • Write text • Read text • Type text • Have a chat • Walk
Baseline Experiment • 1 person • Contains 4 different scenes (Scene 1 to Scene 4) for each activity (Watch a video, Write text, Read text, Type text, Have a chat, Walk) • The dataset was divided into 2 parts
Baseline Experiment [Chart: accuracy (%) per activity (Watch, Write, Read, Type, Chat, Walk, Avg.) for the Proposed, Visual, and Gaze motion methods] • The accuracy of the proposed method was the best
Cross-scene Experiment • 3 people • 4 scenes (Scene 1 to Scene 4) for each activity (Watch a video, Write text, Read text, Type text, Have a chat, Walk)
Cross-scene Experiment • 3 people • Leave-one-out cross-validation: one scene is left out as test data, the rest are used for training
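One way to set up this leave-one-scene-out protocol is scikit-learn's LeaveOneGroupOut with the scene index as the group; the arrays below are hypothetical stand-ins:

```python
import numpy as np
from sklearn.model_selection import LeaveOneGroupOut

X = np.arange(16).reshape(8, 2)              # dummy feature vectors
y = np.array([0, 1, 0, 1, 0, 1, 0, 1])       # dummy activity labels
scene = np.array([1, 1, 2, 2, 3, 3, 4, 4])   # scene index per sample

for train_idx, test_idx in LeaveOneGroupOut().split(X, y, groups=scene):
    held_out = scene[test_idx][0]
    # train on three scenes, test on the held-out one
    print(f"test scene {held_out}: train={train_idx}, test={test_idx}")
```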
Cross-scene Experiment [Chart: accuracy (%) per activity for Proposed (Cross-scene) vs. Proposed (Baseline)] • The recognition rate of Cross-scene is lower than Baseline
Cross-scene Experiment [Charts: accuracy (%) per activity for Visual (Cross-scene) vs. Visual (Baseline), and for Gaze motion (Cross-scene) vs. Gaze motion (Baseline)] • Both recognition rates dropped • Gaze motion also depends on the targets or environments
Cross-user Experiment • 7 people, 2 scenes (Scene 1, Scene 2) for each activity (Watch a video, Write text, Read text, Type text, Have a chat, Walk) • 1 person: test; the remaining 6 people: training
Cross-user Experiment [Chart: accuracy (%) per activity for Proposed (Cross-user) vs. Proposed (Baseline)] • The recognition rate of Cross-user is lower than Baseline
Cross-user Experiment [Chart: accuracy (%) per activity for Gaze motion (Cross-user) vs. Gaze motion (Baseline)] • Gaze motions differ between people • Gaze motions of the "Read" activity are similar between different people
Outline • Introduction • Proposed Method • Experiment • Conclusion
Conclusion • Combined the gaze motion feature and the visual feature to recognize daily activities that involve eye movements • The experimental results show that recognition accuracy is higher when the vision-based method and the gaze motion-based method are combined
Daily Activity Recognition Combining Gaze Motion and Visual Features Yuki Shiga, Takumi Toyama, Yuzuko Utsumi, Andreas Dengel, Koichi Kise
Cross-user Experiment [Chart: accuracy (%) per activity for Visual (Cross-user) vs. Visual (Baseline)]