Objects and scenes Objects and scenes: Recognizing Multiple Object - PowerPoint PPT Presentation

Current View of Recognition Training Appearance Object Appearance Examples Representation Representation Model LAB Histogram x x x x x oo Textons oo x x o o HOG x x x Bag of SIFT A. Farhadi, I. Endres, and D. Hoiem 2010

Current View of Recognition g Training Appearance Object Appearance Examples Representation Representation Model LAB Histogram x x x x x oo Textons oo x x o o HOG x x x Bag of SIFT Lots of effort – fancy stuff A. Farhadi, I. Endres, and D. Hoiem 2010

Current View of Recognition Training Appearance Object Appearance Examples Representation Representation Model LAB Histogram x x x x x oo Textons oo x x o o HOG x x x Bag of SIFT Not much changed A. Farhadi, I. Endres, and D. Hoiem 2010

Value of basic categories Has head Is animal Is furry DOG Is small Can be pet Eats meat A. Farhadi, I. Endres, and D. Hoiem 2010

Limitations of basic categories They provide limited prediction and description DOG DOG A. Farhadi, I. Endres, and D. Hoiem 2010

Limitations of basic categories g They do not apply to objects from novel categories y pp y j g Familiar Objects New Object ??? Horse Dog Cat A. Farhadi, I. Endres, and D. Hoiem 2010

Limitations of basic categories g They do not make it easier to learn new categories y g Dog Appearance Classifier Features Appearance Zebra Features Classifier

Category-based representation • Limited description and prediction • No generalization to objects outside of learned categories g • Provides little guidance for learning So what would make a better So what would make a better representation? A. Farhadi, I. Endres, and D. Hoiem 2010

Attribute-based Representation Learn intermediate structure with object categories Multiple Categories ears fur animal, land animal, …, cat Viewpoint/pose eyes lying down, left side, facing camera mouth th F Function ti fast runner, climb trees, eat small tail animals, jump high, household pet scratch pet, scratch feet A. Farhadi, I. Endres, and D. Hoiem 2010

What we mean by attributes • Properties that we want to describe or predict • Shared across basic categories • Made explicit through supervision Multiple Categories ears fur animal, land animal, …, cat Viewpoint/pose eyes lying down, left side, facing camera mouth th F Function ti fast runner, climb trees, eat small tail animals, jump high, household pet scratch pet, scratch feet A. Farhadi, I. Endres, and D. Hoiem 2010

What do these attributes get us? Image Level Contains donkey Detailed Attributes Level Categories Animal Land animal d l Mammal Four legged animal Elk Pose Lying down = 1 Back = 1 … Object Level Object Level Functional Can see Horse Horse Can walk Herbivorous … Material Pixel segmentations A. Farhadi, I. Endres, and D. Hoiem 2010

Advantages of supervised attributes • Enables verbal description of objects and images p j g Large angry dog with pointy teeth A. Farhadi, I. Endres, and D. Hoiem 2010

Advantages of supervised attributes • Provides correspondence for objects from different categories categories STANDING HEAD HEAD SITTING LEG LEG LEG HEAD STANDING LEG A. Farhadi, I. Endres, and D. Hoiem 2010

Domain-based Recognition Basic-Level Superordinate Parts Parts Categories Categories … Cat Dog Detector Detector Head 4-Legged Animal Detector D t Detector t A. Farhadi, I. Endres, and D. Hoiem 2010

Domain-based Recognition Cat Detector 4-Legged Animal gg D Dog Detector Head 4-Legged Animal Detector D t t Head Walking Left Detector A. Farhadi, I. Endres, and D. Hoiem 2010

Domain-based recognition: overview Voting using Voting using Trained Detectors Shared Spatial Models Animal Vehicle Basic Level Categories Elephant, Dog, Eagle, Object Camel, Lizard, Bat, Localization Dog, Penguin, Monkey, … Broad Categories Four-legged Animal, Attribute ib Mammal, Water Animal, Animal Four-legged Predictors Animal Mammal Head Can run Can run Parts Object Can Jump Leg Leg, Horn, Wing, Head, Eye, Description Is Herbivorous Facing right Ear, Foot, Mouth, Nose, Tail A. Farhadi, I. Endres, and D. Hoiem 2010

CORE Dataset C ross-category O bject RE cognition • 2780 Images – from ImageNet 2780 I f I N • 3192 Objects – 28 Categories • 26695 Parts – 71 types • 30046 Attributes – 34 types • 1052 Material Images – 10 types Download or browse online: http://vision.cs.uiuc.edu/CORE http://vision.cs.uiuc.edu/CORE A. Farhadi, I. Endres, and D. Hoiem 2010

CORE Dataset Annotation Example Mirrors Vehicle Gas tank Two-wheeled Motorcycle Seat Headlight Lic. Plate Motorcycle Facing right Tail light On the street Metal Exhaust Has a rider Has a rider Rubber Rubber Engine Wheel Wheel A. Farhadi, I. Endres, and D. Hoiem 2010

Dataset examples: animals Categories Seen During Training and Testing Categories Seen Only During Testing A. Farhadi, I. Endres, and D. Hoiem 2010

Dataset examples: vehicles Categories Seen Only Categories Seen Only Categories Seen During Training and Testing During Testing A. Farhadi, I. Endres, and D. Hoiem 2010

Result: Part detectors can generalize across categories Part Detections for Novel Object Hump Head Leg Detectors trained using (Felzenszwalb Girshik McAllester Ramanan 2009) method

Result: Broad category detectors can generalize across basic categories Category Detections for Novel Object Four-legged Animal Mammal Animal Mammal Detectors trained using (Felzenszwalb Girshik McAllester Ramanan 2009) method

describe objects from familiar categories i Trunk u Trunk Leg Leg Foot Foot Foot

describe objects from familiar categories i ROC for Localization of Familiar Objects A. Farhadi, I. Endres, and D. Hoiem 2010

describe objects from familiar categories i AUC for Attribute Prediction for Familiar Objects Baseline: Infer from Basic Categories Our Method: Infer from All Animals Vehicles 1 1 0,9 0 9 0 9 0,9 0,8 0,8 0,7 0,7 0,6 0,6 0,5 0,5 Has Part Has Part Basic Basic Broad Function Broad Function Pose Pose Has Part Basic Cat Broad Function Has Part Basic Cat Broad Function Pose Pose Cat Cat Cat A. Farhadi, I. Endres, and D. Hoiem 2010

Result using only basic categories Elk Semi Truck Eagle Camel Snowmobile Dog A. Farhadi, I. Endres, and D. Hoiem 2010

Result 3: We can find and describe objects from novel categories Four-legged Animal Animal Mammal a a Vehicle V hi l Head Wheel Leg Can run Can Jump Is Herbivorous b Moves on road Facing right Facing right A. Farhadi, I. Endres, and D. Hoiem 2010

Result 3: We can find and describe objects from novel categories ROC for Localization of Unfamiliar Objects A. Farhadi, I. Endres, and D. Hoiem 2010

Result 3: We can find and describe objects from novel categories AUC for Attribute Prediction for Unfamiliar Objects Baseline: Infer from Basic Categories Our Method: Infer from All Animals Vehicles 0,8 0,8 0,7 0,7 0 6 0,6 0,6 0 6 0,5 0,5 Has Part Broad d Function Pose Has Part Broad d Function Pose Cat Cat

Objects and scenes Objects and scenes: Recognizing Multiple Object - PowerPoint PPT Presentation

Reconnaissance dobjets et vision artificielle 2010 Objects and scenes Objects and scenes: Recognizing Multiple Object Classes Josef Sivic and Ivan Laptev http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire

Recognizing objects and actions in Finding boundaries images and video Recognizing

Mosaics of Scenes with Moving Objects James Davis Computer Science Department Stanford

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Challenges in Recognizing Challenges in Recognizing NFL with DY NFL with DY Accessibility

Overview of the Recognizing Inference in TExt (RITE-2) at Recognizing Inference in

Recognizing object instances 3. Recognizing object instances Kristen Grauman UT-Austin Image

Compilers Recognizing Handles Alex Aiken Recognizing Handles Bad News There are no known

Building 3D Scenes With QML Building 3D OpenGL Scenes with Qt 5 and QML Krzysztof Krzewniak

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Behind the scenes of a C64 demo Ninja / The Dreams 28C3 Ninja / The Dreams Behind the scenes of

Lecture 19: Motion Sparse stereo matching Indexing scenes Indexing scenes Tuesday, Nov

Lets examine the abstract: The problem of recognizing objects subject to affine transformation

Live Objects Live Objects Live Objects Live Objects Krzys Ostrowski, Ken Birman, Danny Dolev

Recognizing Uncertainty Eric Bartelsman 1 , Jing Chen 1 and Atanas Kolev 2 1 Vrije Universiteit

Recognizing stances, arguments, viewpoints Ruth Morrison, Julian Chan Somasundaran and Wiebe

Lifestyle Change for Happiness + Health Liana Lianov, MD, MPH, FACPM, FACLM Chair, Positive

CMS Tracker Performance Francesco Palmonari (INFN Pisa) on behalf of the CMS Collaboration 8th

Plans and the (Predicate Argument) Structure of Behavior Mark Steedman 19th April 2015 2nd

2PI in expanding backgrounds ...bits and bobs and maybe topological defects Anders Tranberg,

Modeling approaches for switching converters by Giorgio Spiazzi University of Padova ITALY

Learning-based Matching Costs Finalist of the Depth Estimation Challenge at LF4CV & Submitted

Tutorial: TF-Ranking for sparse features Tutorial: TF-Ranking for sparse features This tutorial

Multiple Linear Regression Recall: a regression model describes how a dependent variable (or

Sambuz

Useful Links

Newsletter

Mail Us

Objects and scenes Objects and scenes: Recognizing Multiple Object - PowerPoint PPT Presentation

Reconnaissance dobjets et vision artificielle 2010 Objects and scenes Objects and scenes: Recognizing Multiple Object Classes Josef Sivic and Ivan Laptev http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire

Recognizing objects and actions in Finding boundaries images and video Recognizing

Mosaics of Scenes with Moving Objects James Davis Computer Science Department Stanford

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Challenges in Recognizing Challenges in Recognizing NFL with DY NFL with DY Accessibility

Overview of the Recognizing Inference in TExt (RITE-2) at Recognizing Inference in

Recognizing object instances 3. Recognizing object instances Kristen Grauman UT-Austin Image

Compilers Recognizing Handles Alex Aiken Recognizing Handles Bad News There are no known

Building 3D Scenes With QML Building 3D OpenGL Scenes with Qt 5 and QML Krzysztof Krzewniak

Objects &amp; Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Behind the scenes of a C64 demo Ninja / The Dreams 28C3 Ninja / The Dreams Behind the scenes of

Lecture 19: Motion Sparse stereo matching Indexing scenes Indexing scenes Tuesday, Nov

Lets examine the abstract: The problem of recognizing objects subject to affine transformation

Live Objects Live Objects Live Objects Live Objects Krzys Ostrowski, Ken Birman, Danny Dolev

Recognizing Uncertainty Eric Bartelsman 1 , Jing Chen 1 and Atanas Kolev 2 1 Vrije Universiteit

Recognizing stances, arguments, viewpoints Ruth Morrison, Julian Chan Somasundaran and Wiebe

Lifestyle Change for Happiness + Health Liana Lianov, MD, MPH, FACPM, FACLM Chair, Positive

CMS Tracker Performance Francesco Palmonari (INFN Pisa) on behalf of the CMS Collaboration 8th

Plans and the (Predicate Argument) Structure of Behavior Mark Steedman 19th April 2015 2nd

2PI in expanding backgrounds ...bits and bobs and maybe topological defects Anders Tranberg,

Modeling approaches for switching converters by Giorgio Spiazzi University of Padova ITALY

Learning-based Matching Costs Finalist of the Depth Estimation Challenge at LF4CV &amp; Submitted

Tutorial: TF-Ranking for sparse features Tutorial: TF-Ranking for sparse features This tutorial

Multiple Linear Regression Recall: a regression model describes how a dependent variable (or

Sambuz

Useful Links

Newsletter

Mail Us

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Learning-based Matching Costs Finalist of the Depth Estimation Challenge at LF4CV & Submitted