Eye Gaze Tracking Using an RGBD Camera: A Comparison with an RGB Solution
Xuehan Xiong (CMU), Qin Cai, Zicheng Liu, Zhengyou Zhang
Microsoft Research, Redmond, WA, USA
zhang@microsoft.com
http://research.microsoft.com/~zhang/
The 4th International Workshop on Pervasive Eye Tracking and Mobile Eye-Based Interaction (PETMEI 2014)
Outline • Goal and motivation • Challenges • Approach • Results
Goals and motivations 1. Kinect-based eye tracking 2. Comparison between RGBD and RGB alone
Goals and motivations • Most commercial eye trackers are IR-based: short range, do not work outdoors • Non-IR-based systems: work outdoors, cheaper, easier to integrate, but less accurate
Outline • Motivation • Challenges • Approach • Results
Challenges • Eye images from IR-based approaches • Eye images from Kinect
Outline • Motivation • Challenges • Approach • Results
Approach • What is gaze (in our model)?
[Figure: eyeball model with pupil p, eyeball center, optical axis t (shown at several orientations t1-t4), visual axis v, eyeball radius r, and head center]
Notation: p -- pupil; v -- visual axis; t -- optical axis; S_vo -- rotation compensating between v and t, so that v = S_vo t; b -- head center; bf -- offset from head center to eyeball center; S_hp -- head rotation; r -- eyeball radius
Eyeball center: f = b + S_hp bf
Approach • What are fixed (in our model)? (Same eyeball model and notation as the previous slide; the fixed, person-specific quantities are S_vo, the offset bf, and the eyeball radius r.)
Approach • What is to be measured (in our model)? (Same eyeball model and notation; the per-frame measurements are the head pose S_hp, the head center b, and the pupil p.)
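To make the geometry concrete, here is a minimal numerical sketch of this eyeball model in Python/NumPy (not the authors' code): given a head pose (S_hp, b), the person-specific offset bf and compensation rotation S_vo, and a measured optical axis t, it computes the eyeball center f = b + S_hp bf, the visual axis v = S_vo t, and the intersection of the visual axis with the screen plane. All numeric values and the screen-plane parameterization are illustrative assumptions.

```python
import numpy as np

def gaze_point(S_hp, b, bf, S_vo, t_axis, screen_origin, screen_normal):
    """Point of regard from the eyeball model: f = b + S_hp @ bf, v = S_vo @ t."""
    f = b + S_hp @ bf                      # eyeball center in the camera frame
    v = S_vo @ t_axis                      # visual axis from the measured optical axis
    v = v / np.linalg.norm(v)
    # Intersect the ray f + s*v with the screen plane given in point-normal form.
    s = np.dot(screen_origin - f, screen_normal) / np.dot(v, screen_normal)
    return f + s * v

# Illustrative values only: identity head pose, no compensation rotation,
# screen assumed to lie in the z = 0 plane of the camera/world frame.
S_hp = np.eye(3)
b = np.array([0.0, 0.0, 650.0])            # head center (mm)
bf = np.array([-30.0, -40.0, -20.0])       # head center -> eyeball center offset (mm)
S_vo = np.eye(3)
t_axis = np.array([0.05, 0.02, -1.0])      # optical axis pointing back toward the screen
print(gaze_point(S_hp, b, bf, S_vo, t_axis,
                 screen_origin=np.zeros(3),
                 screen_normal=np.array([0.0, 0.0, 1.0])))
```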
Approach • System calibration • Head pose • Head center • Pupil • User calibration
System calibration • World frame = color camera: intrinsic parameters, centered at [0,0,0] • Depth camera: intrinsic and extrinsic parameters • Monitor screen: via screen-camera calibration
Screen-camera calibration • 4 images capturing screen + pattern • 1 image from Kinect camera capturing the pattern
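One building block of this step can be sketched with OpenCV: estimating the calibration pattern's pose in the Kinect color camera frame from the single Kinect image. The checkerboard size, square size, file name, and intrinsics below are illustrative assumptions, not values from the talk; combining this pose with the external-camera views of screen + pattern gives the screen pose in the same world frame.

```python
import cv2
import numpy as np

# Assumed pattern: 9x6 inner corners, 25 mm squares (illustrative values).
pattern_size = (9, 6)
square_mm = 25.0
obj_pts = np.zeros((pattern_size[0] * pattern_size[1], 3), np.float32)
obj_pts[:, :2] = np.indices(pattern_size).T.reshape(-1, 2) * square_mm

# Illustrative color-camera intrinsics; in practice use the calibrated values.
K = np.array([[525.0, 0.0, 320.0], [0.0, 525.0, 240.0], [0.0, 0.0, 1.0]])
dist = np.zeros(5)

img = cv2.imread("kinect_color_pattern.png", cv2.IMREAD_GRAYSCALE)  # hypothetical file
found, corners = cv2.findChessboardCorners(img, pattern_size)
if found:
    ok, rvec, tvec = cv2.solvePnP(obj_pts, corners, K, dist)
    R, _ = cv2.Rodrigues(rvec)  # pattern-to-camera rotation
    # (R, tvec) places the pattern in the color-camera (world) frame.
```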
Calibration results • [Figure: 3D visualization of the calibrated camera/screen geometry with x, y, z axes]
Head pose estimation • Build a person-specific 3D face model from the rigid facial points, averaged over 10 frames
Head pose estimation • For each frame t, estimate the rigid transform (R, T) aligning the reference model to the frame's 3D points via Procrustes analysis [Figure: reference model; red -- noisy, blue -- de-noised]
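A minimal sketch of this rigid alignment step, assuming point-to-point correspondences between the reference face model and the current frame's 3D points (classic SVD-based Procrustes/Kabsch; any weighting or outlier handling used in the talk is not shown here).

```python
import numpy as np

def rigid_procrustes(X, Y):
    """Rigid (R, T) minimizing ||R @ X + T - Y||_F for corresponding 3xN point sets."""
    mx, my = X.mean(axis=1, keepdims=True), Y.mean(axis=1, keepdims=True)
    U, _, Vt = np.linalg.svd((Y - my) @ (X - mx).T)           # cross-covariance SVD
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(U @ Vt))])   # avoid reflections
    R = U @ D @ Vt
    T = my - R @ mx
    return R, T

# Quick self-check with a synthetic rotation/translation of 13 rigid points.
rng = np.random.default_rng(0)
X = rng.normal(size=(3, 13))
c, s = np.cos(0.3), np.sin(0.3)
R_true = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
Y = R_true @ X + np.array([[10.0], [0.0], [-5.0]])
R_est, T_est = rigid_procrustes(X, Y)
assert np.allclose(R_est, R_true, atol=1e-8)
```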
Head center • The average of 13 landmarks
2D Iris detection
3D pupil estimation
[Figure: ray from the camera center through the 2D iris center, intersecting the eyeball sphere (center e, radius r) at the pupil p]
• Camera center o = [0, 0, 0]^T
• From the camera intrinsic parameters, back-project the detected 2D iris center to a ray with unit direction m = v / ||v||
• The 3D pupil is the intersection of this ray with the eyeball sphere
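A hedged sketch of the back-projection and ray-sphere intersection described above (the intrinsics, pixel location, and eyeball parameters below are illustrative; variable names are ours, not the talk's).

```python
import numpy as np

def pupil_3d(iris_px, K, eye_center, eye_radius):
    """Back-project the 2D iris center through intrinsics K and intersect the
    ray from the camera center o = [0,0,0]^T with the eyeball sphere."""
    v = np.linalg.inv(K) @ np.array([iris_px[0], iris_px[1], 1.0])
    m = v / np.linalg.norm(v)              # unit ray direction
    # Solve ||s*m - e||^2 = r^2 for the ray parameter s.
    b = np.dot(m, eye_center)
    disc = b * b - (np.dot(eye_center, eye_center) - eye_radius ** 2)
    if disc < 0.0:
        return None                        # ray misses the eyeball sphere
    s = b - np.sqrt(disc)                  # nearer of the two intersections
    return s * m

K = np.array([[525.0, 0.0, 320.0], [0.0, 525.0, 240.0], [0.0, 0.0, 1.0]])
p = pupil_3d((295.0, 207.0), K,
             eye_center=np.array([-30.0, -40.0, 630.0]), eye_radius=12.0)
```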
User calibration • Estimate the fixed, person-specific model parameters: S_vo, the offset bf, and the eyeball radius r
• min over S_vo, bf, r of Σ_j ( 1 - (S_vo u_j)^T w_j )^2, where u_j is the measured optical-axis direction and w_j is the visual-axis direction toward the j-th calibration point
[Figure: same eyeball model and notation as in the Approach slides]
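A sketch of a simplified version of this fit using SciPy: it estimates only S_vo (parameterized by a rotation vector) from precomputed unit vectors u_j (measured optical axes) and w_j (target visual-axis directions), i.e. it assumes bf and r are already known. The full calibration in the talk optimizes all three jointly; function and variable names here are ours.

```python
import numpy as np
from scipy.optimize import least_squares
from scipy.spatial.transform import Rotation

def fit_S_vo(U, W):
    """Minimize sum_j (1 - (S_vo u_j)^T w_j)^2 over the rotation S_vo.
    U, W: Nx3 arrays of unit vectors (measured optical axes, target visual axes)."""
    def residuals(rotvec):
        S_vo = Rotation.from_rotvec(rotvec).as_matrix()
        return 1.0 - np.sum((U @ S_vo.T) * W, axis=1)   # 1 - dot(S_vo u_j, w_j)
    sol = least_squares(residuals, x0=np.zeros(3))
    return Rotation.from_rotvec(sol.x).as_matrix()

# Synthetic check: recover a known 5-degree compensation rotation from 9 samples.
rng = np.random.default_rng(1)
S_true = Rotation.from_rotvec([0.0, np.deg2rad(5.0), 0.0]).as_matrix()
U = rng.normal(size=(9, 3))
U /= np.linalg.norm(U, axis=1, keepdims=True)
W = U @ S_true.T
S_est = fit_S_vo(U, W)
```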
Outline • Motivation • Challenges • Approach • Results
Results • Simulation
Error modeling • Assuming perfect calibration (system and user) • 3 sources of error, each modeled as zero-mean Gaussian: head pose, head center, pupil • Units: head pose in degrees, head center in mm, pupil in pixels
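The simulation itself is not spelled out on the slides; below is only a hedged Monte Carlo skeleton showing how the three zero-mean Gaussian noise sources could be drawn in the stated units and pushed through a gaze-estimation routine. `estimate_gaze_error` is a hypothetical placeholder for the full pipeline, not a function from the talk.

```python
import numpy as np

def simulate_errors(estimate_gaze_error, n_trials=10_000,
                    var_pose_deg=0.1, var_center_mm=0.1, var_pupil_px=0.1):
    """Draw per-trial noise for head pose (deg), head center (mm), pupil (px)
    and collect the resulting angular gaze errors."""
    rng = np.random.default_rng(0)
    errors = np.empty(n_trials)
    for i in range(n_trials):
        d_pose = rng.normal(0.0, np.sqrt(var_pose_deg), size=3)     # rotation noise
        d_center = rng.normal(0.0, np.sqrt(var_center_mm), size=3)  # head-center noise
        d_pupil = rng.normal(0.0, np.sqrt(var_pupil_px), size=2)    # 2D pupil noise
        errors[i] = estimate_gaze_error(d_pose, d_center, d_pupil)
    return errors.mean(), errors.std()
```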
Simulation results with low variances • Variances = 0.1
Back to reality • Variance = 0.25 • Variance = 0.5
Real data: free head movement • 9 calibration points • A subject wearing colored stickers
Experimental setup • The monitor measures 520 mm by 320 mm. • The distance between a test subject and the Kinect is between 600 mm and 800 mm. • Nine subjects participated in the data collection. • We collected three training sessions and two test sessions for each subject.
Best case scenario
Training error (left eye / right eye)
Testing error (left eye / right eye)
Testing error 2 (left eye / right eye)
Sample Results Without Stickers
Qin
Qin – training error (left eye / right eye)
Qin – testing error (left eye / right eye)
Qin – testing error 2 (left eye / right eye)
No (or little) head movement
Best case scenario
Training error (left eye / right eye)
Sample Results Without Stickers
Qin
Qin – training error (left eye / right eye)
Qin – testing error (left eye / right eye)
Gaze errors on real-world data (in degrees) • Average errors: 4.6 degrees with RGBD and 5.6 degrees with RGB
Lower bound of gaze errors (with colored stickers, in degrees) • Average errors: 2.1 degrees with RGBD and 3.2 degrees with RGB
Conclusions • Using depth information directly from the Kinect gives more accurate gaze estimation than using RGB images alone. • The lower bound of the gaze error is around 2 degrees with RGBD and 4 degrees with RGB. • Future work: a better RGBD sensor -> lower gaze error; leverage both eyes (Zhengyou Zhang and Qin Cai, "Improving Cross-Ratio-Based Eye Tracking Techniques by Leveraging the Binocular Fixation Constraint," ETRA 2014).
Thank You