Perceptive Context Awareness
Trevor Darrell, Vision Interface Group, MIT CSAIL

Awareness of the User -- Visual Conversation Cues:
Interfaces (kiosks, agents, robots…) are currently blind to users… machines should be aware of presence, pose, expression, and non-verbal dialog cues…

Awareness of the Environment -- Perceptive Devices:
Mobile devices (cellphones, PDAs, laptops) bring computing and communications with us wherever we go, but they are blind to their environment… they should be able to see things of interest in the environment just as we do…

Today
• Visually aware conversational interfaces (“read my body language!”)
- head modeling and pose estimation
- articulated body tracking
• Mobile devices that can see their environment (“what’s that thing there?”)
- mobile location specification
- image-based mobile web browsing

Head modeling and pose tracking -- 3D Head Pose Tracker
• Rigid stereo motion estimation between the current frame and a reference frame, using both range and intensity from a stereo camera

Face aware interfaces
• Agent should know when it’s being attended to
• Turn-taking discourse cues: who is talking to whom?
• Model attention of user
• Agreement: head nod and shake gestures
• Grounding: shared physical reference
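The rigid stereo motion estimation step can be illustrated with a least-squares rigid alignment between corresponding 3-D points of the reference and current frames. The Kabsch/Procrustes solution below is a generic stand-in, not the tracker's actual algorithm (which also uses intensity and drift-free reference frames):

```python
import numpy as np

def rigid_motion(ref_pts, cur_pts):
    """Least-squares rigid transform (R, t) with cur ~= R @ ref + t.

    Kabsch/Procrustes solution; a stand-in for the frame-to-frame
    motion step of a stereo head-pose tracker.
    ref_pts, cur_pts: (N, 3) arrays of corresponding 3-D points.
    """
    mu_r, mu_c = ref_pts.mean(0), cur_pts.mean(0)
    H = (ref_pts - mu_r).T @ (cur_pts - mu_c)      # cross-covariance
    U, _, Vt = np.linalg.svd(H)
    # reflection guard: force a proper rotation (det(R) = +1)
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])
    R = Vt.T @ D @ U.T
    t = mu_c - R @ mu_r
    return R, t
```

Accumulating these frame-to-frame transforms against a fixed reference frame yields the head pose used by the attention cues below.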
Face cursor / Face-responsive agent (SAM)
• Pose tracker drives the speech interface:
- Subject not looking at SAM: ASR turned off
- Subject looking at SAM: ASR turned on
• General conversational turn-taking
• Agreement (Nod/Shake)
• Grounding / Object reference…
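The ASR gating rule can be sketched as a simple attention test on the tracked head pose. The cone threshold and the function name are illustrative assumptions, not taken from the talk:

```python
import math

ATTENTION_CONE_DEG = 15.0   # assumed threshold, not from the talk

def asr_enabled(yaw_deg, pitch_deg, agent_yaw_deg=0.0, agent_pitch_deg=0.0):
    """Enable speech recognition only while the tracked head pose
    points toward the agent (hypothetical gating rule: the subject
    is 'looking at SAM' when the gaze direction falls inside a cone
    around the agent's direction)."""
    off_axis = math.hypot(yaw_deg - agent_yaw_deg,
                          pitch_deg - agent_pitch_deg)
    return off_axis <= ATTENTION_CONE_DEG
```

Running this per frame reproduces the behavior above: ASR on while the subject looks at SAM, off otherwise.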
Room tracking for Location Context
• Location is an important cue for pervasive computing applications…

Learning activity zones
• Location context should provide a finer-scale cue than room ID, but a more abstract one than 3-space position and orientation.
• Regions (“zones”) should be learned from observing actual user behavior.
• Pipeline: room camera → range → foreground → plan view → motion clustering → activity zones
• Zone map formed from observing user behavior

Using activity zones
• Current zone determines application context (e.g., zone 4 preferences) [Koile, Darrell, et al., UBICOMP 03]

Articulated pose sensing -- Model-based Approach
• Fit an articulated body model to the depth image (e.g., Interactive Wall)
• ICP with articulation constraint:
1. Find closest points
2. Update poses
3. Constrain…
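The motion-clustering step for activity zones can be sketched as clustering the user's accumulated plan-view positions. Plain k-means is my stand-in here; the UBICOMP 03 system's actual clustering method may differ:

```python
import numpy as np

def learn_zones(positions, k=3, iters=50, seed=0):
    """Cluster (x, y) plan-view positions into k activity zones with
    plain k-means (illustrative). Returns zone centers and labels."""
    rng = np.random.default_rng(seed)
    centers = positions[rng.choice(len(positions), k, replace=False)]
    for _ in range(iters):
        # assign each observed position to its nearest zone center
        d = np.linalg.norm(positions[:, None] - centers[None], axis=2)
        labels = d.argmin(1)
        # move each center to the mean of its assigned positions
        for j in range(k):
            pts = positions[labels == j]
            if len(pts):
                centers[j] = pts.mean(0)
    return centers, labels

def current_zone(pos, centers):
    """Map a live plan-view position to the nearest learned zone,
    which then selects the application context (e.g., zone prefs)."""
    return int(np.linalg.norm(centers - pos, axis=1).argmin())
```

At runtime, `current_zone` gives the coarse, behavior-derived location cue that sits between room ID and full 3-space pose.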
Multimodal studio

Articulated Pose from a single image?
• Model-based approach is difficult with more impoverished observations:
- contours
- edge features
- texture
- (noisy stereo…)
• Hard to fit a single image reliably! → Example-based learning paradigm

Example-based matching -- Parameter-sensitive hashing
• Match 2-D features against a large corpus of 2-D to 3-D example mappings
• Fast hashing for approximate nearest-neighbor search
• Feature selection using a paired classification problem
• Data collection: use motion-capture data, or exploit synthetic (but realistic) models
• 2D → 3D with parameter-sensitive hashing

Today
• Visually aware conversational interfaces -- read my body language!
- head modeling and pose estimation
- articulated body tracking
• Mobile devices that can see their environment -- what’s that thing there?
- mobile location specification
- image-based mobile web browsing
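The lookup mechanics of example-based 2D → 3D matching can be sketched as a hash table from image features to stored poses. Note the hedge: real parameter-sensitive hashing learns its hash bits from the paired-classification problem; the sketch below substitutes plain random-projection (LSH-style) bits, so it shows only the bucket-lookup structure, not the learned parameter sensitivity:

```python
import numpy as np

class HashPoseLookup:
    """Approximate NN lookup from 2-D feature vectors to stored 3-D poses.

    Simplified stand-in for parameter-sensitive hashing: the bit
    functions here are random hyperplane projections rather than
    bits selected by a paired-classification criterion.
    """
    def __init__(self, dim, n_bits=8, seed=0):
        rng = np.random.default_rng(seed)
        self.planes = rng.standard_normal((n_bits, dim))
        self.buckets = {}

    def _key(self, x):
        # one bit per hyperplane: which side of the plane x falls on
        return tuple((self.planes @ x > 0).astype(int))

    def add(self, feature, pose):
        """Store one 2-D-features -> 3-D-pose example mapping."""
        self.buckets.setdefault(self._key(feature), []).append((feature, pose))

    def query(self, feature):
        """Return the pose of the closest example in the query's bucket."""
        cands = self.buckets.get(self._key(feature), [])
        if not cands:
            return None
        f, p = min(cands, key=lambda fp: np.linalg.norm(fp[0] - feature))
        return p
```

Only the query's bucket is searched exhaustively, which is what makes the nearest-neighbor lookup fast on a large example corpus.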
Physical awareness
• How can a device be aware of what the user is looking at?
• Like asking a friend, “What’s this?” -- the user sends an image to a human expert, who answers (e.g., “MIT Dome”)

IDeixis
• Instead, use a CBIR (Content-Based Image Retrieval) system: the user sends an image (or video) query to a database and gets back a relevant link (e.g., http://mit.edu/..)
• For place recognition, many current matching methods can be successful:
- PCA
- Global orientation histograms [Torralba et al.]
- Local features (affine-invariant detectors/descriptors [Schmid], SIFT [Lowe], etc.)
• … where to get the database? The Web: many location images can be found on the web

First Prototype
1. Take an image
2. Send the image via MMS
3. View the search result (matching location images)
4. Browse a relevant webpage
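Of the matching methods listed, the global orientation histogram is the simplest to sketch: summarize each image by a histogram of edge orientations and match by histogram distance. This is a heavily simplified cue in the spirit of Torralba-style features, not the IDeixis system's actual matcher:

```python
import numpy as np

def orientation_histogram(img, n_bins=16):
    """Global edge-orientation histogram of a grayscale image,
    weighted by gradient magnitude and normalized to sum to 1."""
    gy, gx = np.gradient(img.astype(float))
    mag = np.hypot(gx, gy)
    ang = np.mod(np.arctan2(gy, gx), np.pi)        # orientation in [0, pi)
    hist, _ = np.histogram(ang, bins=n_bins, range=(0, np.pi), weights=mag)
    s = hist.sum()
    return hist / s if s else hist

def best_match(query_img, database):
    """Return the key of the database image whose orientation
    histogram is closest (L1 distance) to the query's."""
    qh = orientation_histogram(query_img)
    return min(database,
               key=lambda k: np.abs(orientation_histogram(database[k]) - qh).sum())
```

In the prototype's terms, `database` would hold location images harvested from the web, and `best_match` plays the role of the CBIR lookup behind the MMS round trip.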
Bootstrap image web search -- images → keywords (→ images)
• Hard to compile an image database of the entire web!
• But given CBIR matches in a subset of the web:
- extract salient keywords
- run a keyword-based image search
- apply a content-based filter to the keyword-matched pages
• And/or allow direct keyword search
• Weighted term/bigram frequency is sufficient for early experiments… (e.g., “Eiffel Tower”)

Advantages
• Recognizing distant locations (by taking a photo)
• Infrastructure-free (by using the web)
• Large-scale image-based web search (by bootstrapping keywords)
• With advances in segmentation, can apply to many other object recognition problems:
- mobile signs
- appliances
- product packaging

Visual Interfaces and Devices (summary)

Acknowledgements
David Demirdjian, Kimberlie Koile, Louis Morency, Greg Shakhnarovich, Mike Siracusa, Konrad Tollmar, Tom Yeh & many others…

END
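As a closing sketch, the weighted term/bigram frequency scoring used for the keyword-bootstrap step can be written as a small ranking function. The stopword list and scoring (summed counts across matched pages) are minimal assumptions of mine, not the experiment's exact weighting:

```python
import re
from collections import Counter

# assumed minimal stopword list, not from the talk
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "on", "for", "at"}

def salient_keywords(pages, top_n=5):
    """Rank terms and bigrams across the pages that matched the
    query image, by summed frequency (a stand-in for the weighted
    term/bigram scoring mentioned on the bootstrap-search slide)."""
    counts = Counter()
    for text in pages:
        words = [w for w in re.findall(r"[a-z]+", text.lower())
                 if w not in STOPWORDS]
        counts.update(words)                               # unigrams
        counts.update(" ".join(b) for b in zip(words, words[1:]))  # bigrams
    return [t for t, _ in counts.most_common(top_n)]
```

The top-ranked terms (e.g., a landmark name recurring across matched pages) then seed the keyword-based image search that expands coverage beyond the initial CBIR subset.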