An Egocentric Perspective on Active Vision and Visual Object Learning in Toddlers. S. Bambach, D. Crandall, L. Smith, C. Yu. ICDL 2017. Experiment presenters: Arjun, Ginevra
Their Experiments Image source: paper
Their Experiments • The authors could not control their training set Image source: paper
Our Experiments • We generate images where – Labeled object occupies a fixed percentage of the view – Background objects do not move Image source: collages we made from Caltech 256 database
Our Experiments • Simulate toddler bringing object to face – We control scale to measure its effect on testing accuracy Image source: collages we made from Caltech 256 database
Our Dataset • 5 classes, 3633 images • Collages – Construct 'scenes of toys' using Caltech-256 – 1 positive image amongst many negatives – Simulate toddler perspective Image source: Caltech 256 database
Scene Generation • Scene dim: 224 x 224 – Scale largest image dimension to 70 px – Rotate randomly from -15° to 15° • 10 negatives – Select uniformly from Caltech-256 negatives – Placed randomly within scene boundary • 1 positive – Scale 0 (1x), 1 (1.5x), 2 (2x), 3 (3x) – Placed randomly within scene boundary (at scale 1) • 2 scenes per training instance (see sketch below)
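The recipe above maps to a short compositing script. Below is a minimal sketch of one way to generate such collage scenes, assuming PIL; the helper names (`prepare`, `paste_random`, `make_scene`), the white background, and RGBA cut-outs are our own illustrative choices, not the presenters' actual code.

```python
# Minimal sketch of the scene-generation recipe described on the slide.
import random
from PIL import Image

SCENE_SIZE = 224                        # scene is 224 x 224
BASE_DIM = 70                           # largest object dimension after rescaling
SCALE_FACTORS = [1.0, 1.5, 2.0, 3.0]    # positive-object scales 0-3

def prepare(img, scale=1.0):
    """Rescale so the largest dimension is BASE_DIM * scale, then rotate randomly."""
    factor = BASE_DIM * scale / max(img.size)
    img = img.resize((max(1, int(img.width * factor)),
                      max(1, int(img.height * factor))))
    return img.rotate(random.uniform(-15, 15), expand=True)

def paste_random(scene, obj):
    """Paste obj at a random position inside the scene boundary."""
    x = random.randint(0, max(0, SCENE_SIZE - obj.width))
    y = random.randint(0, max(0, SCENE_SIZE - obj.height))
    scene.paste(obj, (x, y), obj if obj.mode == "RGBA" else None)

def make_scene(positive, negatives, scale_idx):
    """Compose one 'scene of toys': 10 random negatives plus 1 positive."""
    scene = Image.new("RGB", (SCENE_SIZE, SCENE_SIZE), "white")
    for neg in random.sample(negatives, 10):
        paste_random(scene, prepare(neg))
    paste_random(scene, prepare(positive, SCALE_FACTORS[scale_idx]))
    return scene
```

Two such scenes would be generated per training instance, per the last bullet above.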
VGG 16 Image source, and source of some code used in the experiments: https://www.cs.toronto.edu/~frossard/post/vgg16/
VGG 16 for 5 classes Image source: https://www.cs.toronto.edu/~frossard/post/vgg16/, modified by us (see sketch below)
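Swapping VGG16's 1000-way ImageNet classifier for a 5-way softmax is the only architectural change the slide describes. A hedged Keras sketch of that modification (the experiments used Frossard's TensorFlow port; the Keras API here and the choice to freeze the convolutional layers are our own assumptions):

```python
# Sketch: adapt pretrained VGG16 to 5 toy classes by replacing the classifier head.
from tensorflow.keras.applications import VGG16
from tensorflow.keras import layers, models

base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False                      # assumption: keep pretrained conv features fixed

model = models.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(4096, activation="relu"),
    layers.Dense(4096, activation="relu"),
    layers.Dense(5, activation="softmax"),  # 5 classes instead of ImageNet's 1000
])
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])
```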
Experiment Setup • Experiment 1 – Train on different scales, test on clean image • Experiment 2 – Train on different scales and clean, test on different scales • Object size per scale: Scale 0 = 10% of view, Scale 1 = 20% of view, Scale 2 = 30% of view, Scale 3 = 60% of view, Clean = object image alone Image source: collages we made from Caltech 256 database
Experiment 1 - objective • Test effect of 'bringing object to face' for isolated classification • Questions to consider – What is the effect of viewing at multiple scales? – Is there a single ideal scale, or does the benefit come from multiple scales? Image source: https://en.wiktionary.org/wiki/question_mark
Experiment 1 - data Train0 Image source: collages we made from Caltech 256 database
Experiment 1 - data Train1 Image source: collages we made from Caltech 256 database
Experiment 1 - data Train2 Image source: collages we made from Caltech 256 database
Experiment 1 - data Train3 Image source: collages we made from Caltech 256 database
Experiment 1 - data Train3only Image source: collages we made from Caltech 256 database
Experiment 1 - data • Number of epochs corrected to compensate for the larger number of training examples (see sketch below) Image source: collages we made from Caltech 256 database
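The slides do not spell out this correction; one plausible sketch, under our own assumption that the goal is to keep the total number of examples seen constant across differently sized train sets:

```python
# Hypothetical epoch correction: hold (epochs * train-set size) constant so that
# cumulative train sets (Train1, Train2, ...) get no advantage from sheer size.
# TOTAL_EXAMPLES is an illustrative constant, not a value from the slides.
TOTAL_EXAMPLES = 100_000

def corrected_epochs(train_set_size):
    """Larger training sets train for proportionally fewer epochs."""
    return max(1, round(TOTAL_EXAMPLES / train_set_size))
```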
Experiment 1 - data Test Image source: collages we made from Caltech 256 database
Experiment 1 - results [Chart: testing accuracy on clean images by train set (Train0, Train1, Train2, Train3, Train3only)] Training only on larger-scale images yields the best test accuracy.
Experiment 1 - results • Images misclassified when the network is trained at low scales benefit from training at higher scales: misclassified after train0, train1, train2; correctly classified after train3 and train3only (Category: bag) Image source: Caltech 256 database
Experiment 1 - results • Images misclassified when the network is trained at low scales benefit from training at higher scales: misclassified after train0, train1, train2, train3; correctly classified only after train3only (Category: plane) Image source: Caltech 256 database
Experiment 1 - results • Images misclassified after train3only were also misclassified after all other trainings (Categories: bag, plane, plane) Image source: Caltech 256 database
Experiment 1 - conclusions • Toddler's data gives better training because the object is closer, not because it is 'brought to face' • Significant jump in accuracy if the object occupies >30% of the view in training • Training images where the object occupies <30% of the view do more harm than good Image source: collages we made from Caltech 256 database
Experiment Setup • Experiment 1 – Train on different scales, test on clean image • Experiment 2 – Train on different scales and clean, test on different scales • Object size per scale: Scale 0 = 10% of view, Scale 1 = 20% of view, Scale 2 = 30% of view, Scale 3 = 60% of view, Clean = object image alone Image source: collages we made from Caltech 256 database
Experiment 2 - objective • Effect of 'bringing to face' for object-in-scene detection • Questions to consider – Does 'cleaning' the scene decrease detection in a cluttered environment? Image source: https://en.wiktionary.org/wiki/question_mark
Experiment 2 - data Train0 Image source: collages we made from Caltech 256 database
Experiment 2 - data Train1 Image source: collages we made from Caltech 256 database
Experiment 2 - data Train2 Image source: collages we made from Caltech 256 database
Experiment 2 - data Train3 Image source: collages we made from Caltech 256 database
Experiment 2 - data TrainClean Image source: collages we made from Caltech 256 database
Experiment 2 - data • Number of epochs corrected to compensate for the larger number of training examples (same correction as in Experiment 1) Image source: collages we made from Caltech 256 database
Experiment 2 - data Test0 (on different images than the train sets) Image source: collages we made from Caltech 256 database
Experiment 2 - data Test1only (on different images than the train sets) Image source: collages we made from Caltech 256 database
Experiment 2 - data Test2only (on different images than the train sets) Image source: collages we made from Caltech 256 database
Experiment 2 - data Test3only (on different images than the train sets) Image source: collages we made from Caltech 256 database
Experiment 2 - results [Chart: testing accuracy of Test0, Test1only, Test2only, and Test3only for each train set (Train0, Train1, Train2, Train3, TrainClean)] Training by 'bringing to face' yields the best accuracy.
Experiment 2 - conclusions • The network can learn more from different scales than from clean images, as long as scale 3 is included • Learning from different scales gives better accuracy when testing on lower scales • Testing on clean images gives much better accuracy than testing on any scale Image source: collages we made from Caltech 256 database
Conclusions • With our controlled datasets, we could verify that the network learns better from larger scales • Testing needs to be done on clean images, no matter which scales were used in training • Training on scales >30% gives more robustness when testing on all scales • Training on scales <30% hurts accuracy