Social Interactions: A First-Person Perspective. A. Fathi, J. - PowerPoint PPT Presentation

Social Interactions: A First-Person Perspective. A. Fathi, J. Hodgins, J. Rehg Presented by Jacob Menashe November 16, 2012

Social Interaction Detection Objective: Detect social interactions from video footage.

Social Interaction Detection Objective: Detect social interactions from video footage. ◮ Consider faces and attention

Social Interaction Detection Objective: Detect social interactions from video footage. ◮ Consider faces and attention ◮ Account for temporal context

Social Interaction Detection Objective: Detect social interactions from video footage. ◮ Consider faces and attention ◮ Account for temporal context ◮ Analyze first-person movements cues

Introduction Overview Features Temporal Context Experiments

Video Example Red Dialogue Yellow Walking Dialogue Green Discussion Light Blue Walking Discussion Dark Blue Monologue None Background Link

Features Features are constructed based on first- and third-person information.

Features Features are constructed based on first- and third-person information. 1. Dense optical flow (first-person movement).

Features Features are constructed based on first- and third-person information. 1. Dense optical flow (first-person movement). 2. Face locations (relative to first person)

Features Features are constructed based on first- and third-person information. 1. Dense optical flow (first-person movement). 2. Face locations (relative to first person) 3. Attention and Roles. For each person x :

Features Features are constructed based on first- and third-person information. 1. Dense optical flow (first-person movement). 2. Face locations (relative to first person) 3. Attention and Roles. For each person x : ◮ Faces looking at x

Features Features are constructed based on first- and third-person information. 1. Dense optical flow (first-person movement). 2. Face locations (relative to first person) 3. Attention and Roles. For each person x : ◮ Faces looking at x ◮ Whether first person looks at x

Features Features are constructed based on first- and third-person information. 1. Dense optical flow (first-person movement). 2. Face locations (relative to first person) 3. Attention and Roles. For each person x : ◮ Faces looking at x ◮ Whether first person looks at x ◮ Mutual attention between x and first person

Features Features are constructed based on first- and third-person information. 1. Dense optical flow (first-person movement). 2. Face locations (relative to first person) 3. Attention and Roles. For each person x : ◮ Faces looking at x ◮ Whether first person looks at x ◮ Mutual attention between x and first person ◮ Number of faces looking at where x is looking

Feature Example

Conditional Random Fields CRFs are described in Lafferty et al. [2001].

Conditional Random Fields CRFs are described in Lafferty et al. [2001]. ◮ Observations and labels form a Markov chain. ◮ Nodes pend on neighbors. y 1 y 2 y 3 x 1 x 2 x 3

Conditional Random Fields CRFs are described in Lafferty et al. [2001]. ◮ Observations and labels form a Markov chain. ◮ Nodes pend on neighbors. y 1 y 1 y 2 y 3 p ( y 1 | x 1 , y 2 ) x 1 x 2 x 3

Conditional Random Fields CRFs are described in Lafferty et al. [2001]. ◮ Observations and labels form a Markov chain. ◮ Nodes pend on neighbors. y 1 y 1 y 2 y 2 y 3 p ( y 2 | y 1 , y 3 , x 2 ) x 1 x 2 x 3

Conditional Random Fields CRFs are described in Lafferty et al. [2001]. ◮ Observations and labels form a Markov chain. ◮ Nodes pend on neighbors. y 1 y 2 y 3 y 3 p ( y 3 | y 2 , x 3 ) x 1 x 2 x 3

Hidden Conditional Random Fields A micro view of the HCRF model as described in Quattoni et al. [2007]. Y h 1 h 2 h 3 x i

Hidden Conditional Random Fields A micro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. Y h 1 h 2 h 3 x i

Hidden Conditional Random Fields A micro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. ◮ x i is a single observation in the sequence. Y h 1 h 2 h 3 x i

Hidden Conditional Random Fields A micro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. ◮ x i is a single observation in the sequence. ◮ Each h i is a possible hidden state. Y h 1 h 2 h 3 x i

Hidden Conditional Random Fields (cont.) A macro view of the HCRF model as described in Quattoni et al. [2007]. Y h 1 h 2 h 3 x 1 x 2 x 3

Hidden Conditional Random Fields (cont.) A macro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. Y h 1 h 2 h 3 x 1 x 2 x 3

Hidden Conditional Random Fields (cont.) A macro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. ◮ Each x i is a single observation in the sequence. Y h 1 h 2 h 3 x 1 x 2 x 3

Hidden Conditional Random Fields (cont.) A macro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. ◮ Each x i is a single observation in the sequence. ◮ Each h i is the hidden state label assigned to x i . Y h 1 h 2 h 3 x 1 x 2 x 3

Hidden Conditional Random Fields (cont.) A macro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. ◮ Each x i is a single observation in the sequence. ◮ Each h i is the hidden state label assigned to x i . Y p ( h 1 | Y , h 2 , x 1 ) h 1 h 1 h 2 h 3 x 1 x 2 x 3

Hidden Conditional Random Fields (cont.) A macro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. ◮ Each x i is a single observation in the sequence. ◮ Each h i is the hidden state label assigned to x i . Y p ( h 2 | Y , h 1 , h 3 , x 2 ) h 1 h 2 h 2 h 3 x 1 x 2 x 3

Hidden Conditional Random Fields (cont.) A macro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. ◮ Each x i is a single observation in the sequence. ◮ Each h i is the hidden state label assigned to x i . Y p ( h 3 | Y , h 2 , x 3 ) h 1 h 2 h 3 h 3 x 1 x 2 x 3

Hidden Conditional Random Fields (cont.) A macro view of the HCRF model as described in Quattoni et al. [2007]. ◮ Y is a label for the whole sequence. ◮ Each x i is a single observation in the sequence. ◮ Each h i is the hidden state label assigned to x i . Y Y p ( Y |{ h i } ) = p ( Y |{ x i } ) h 1 h 2 h 3 x 1 x 2 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). WDlg h 1 h 2 h 3 x 1 x 2 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). ◮ Each x i is now a feature extracted from video frames. WDlg h 1 h 2 h 3 x 1 x 1 x 2 x 2 x 3 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). ◮ Each x i is now a feature extracted from video frames. ◮ Each h i is determined from training: WDlg h 1 h 1 h 2 h 2 h 3 h 3 x 1 x 2 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). ◮ Each x i is now a feature extracted from video frames. ◮ Each h i is determined from training: ◮ h 1 : John wants to hear about my weekend. WDlg h 1 h 1 h 2 h 3 x 1 x 2 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). ◮ Each x i is now a feature extracted from video frames. ◮ Each h i is determined from training: ◮ h 2 : I’m feeling talkative. WDlg h 1 h 2 h 2 h 3 x 1 x 2 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). ◮ Each x i is now a feature extracted from video frames. ◮ Each h i is determined from training: ◮ h 3 : Mary wants to listen to her iPod. WDlg h 1 h 2 h 3 h 3 x 1 x 2 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). ◮ Each x i is now a feature extracted from video frames. ◮ Each h i is determined from training: ◮ h 1 : John wants to hear about my weekend. WDlg p ( h 1 | Y , h 2 , x 1 ) h 1 h 1 h 2 h 3 x 1 x 2 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). ◮ Each x i is now a feature extracted from video frames. ◮ Each h i is determined from training: ◮ h 2 : I’m feeling talkative. WDlg p ( h 2 | Y , h 1 , h 3 , x 2 ) h 1 h 2 h 2 h 3 x 1 x 2 x 3

HCRF Example Suppose we want to find the likelihood of “walking dialogue” ( WDlg ) vs “walking discussion” ( WDisc ). ◮ Each x i is now a feature extracted from video frames. ◮ Each h i is determined from training: ◮ h 3 : Mary wants to listen to her iPod. WDlg p ( h 3 | Y , h 2 , x 3 ) h 1 h 2 h 3 h 3 x 1 x 2 x 3

Social Interactions: A First-Person Perspective. A. Fathi, J. - PowerPoint PPT Presentation

Social Interactions: A First-Person Perspective. A. Fathi, J. Hodgins, J. Rehg Presented by Jacob Menashe November 16, 2012 Social Interaction Detection Objective: Detect social interactions from video footage. Social Interaction Detection

Smart Cameras Mark DiVelbiss, Selena Grant, Qing Liu Overview - First Person vs Third Person -

Perspective LanguaL Structured Vocabulary: USDA Perspective Joanne Holden Perspective: Earth

First-Person Animation Ryan Duffin Senior Animator at Electronic Arts Twitter: @AnimationMerc

English Language - IGCSE Touching the Void (I) Viewpoint 1 st person account vs 3 rd person

{ narrating it. Either as a first person, second person or third person. By: Una Bach Billy

New Defence Perspective New Defence Perspective New Defence Perspective New Defence Perspective

A legal perspective A legal perspective A legal perspective A legal perspective I. Engineers

Part II: (S)RG and Low-Momentum Interactions To understand the properties of complex nuclei from

WGCV/WGISS interactions Ric har d MORE NO WGISS Chair CNE S WGCV / WGISS interactions

Intermolecular interactions and scattering M.H.J. Koch 1 Intermolecular interactions

Person Re-Identification Yiheng Liu Outli line Background Image-Based Person

Social Science Social Science and Fuels and Fuels Management: Management: Management:

The First Perspective Image Perspective in Art Giotto, 1300 Campin, 1430 (before perspective)

European Social Network Social services in Europe Christian Fillet Chair, European Social

Management of interactions between aquaculture and wild salmonids a Scottish perspective Alan

How to Avoid Clique Culture Timnit Gebru What is a Clique? Social Interactions Poster

Redefiniendo la Universidad Nuevos modelos informales para el aprendizaje en red Universitat de

Developing Agility 22.09.2020 www.schoolleaderstraining.co.uk

Outline Introduction Related work iASK design iASK implementation Evaluation

Subbarao Kambhampati Arizona State University Pop Quiz: Magellan, the explorer, went around the

How do senators vote? -Issues - Strategies -Affiliation - Influence Voting Behavior 1

Identifying Causal Efgects in Experiments with Social Interactions and Non-compliance Francis J.

LinkMirage: Enabling Privacy-preserving Analytics on Social Relationships Changchang Liu, Prateek

Counting social interactions for discrete subsets of the plane Samantha Fairchild University of

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us