
Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection
Andreas Opelt, Axel Pinz and Andrew Zisserman
Irshad Ali (Department of CS, AIT), 09-June-2009

Object class recognition
Object class recognition is a key problem in computer vision. People use shape and/or appearance to categorize objects, and the paper combines both. The alphabet is the basis for a codebook representation of object categories. The main focus of the paper is on the representation and use of shape and geometry rather than appearance.

Boundary-Fragment-Model (BFM)
A BFM is restricted to a codebook of boundary fragments and does not represent appearance at all. The boundary represents the shape of many object classes quite naturally without requiring the appearance (e.g. texture) to be learnt, so models can be learnt from less training data while still generalizing well.

System Overview
Figure: (a) Two alphabet entries (one region, one Boundary-Fragment). (b) Two weak detectors (one region-based, one Boundary-Fragment based).

Training Data
To train the model, the following data are required:
- A training image set with the object delineated by a bounding box.
- A validation image set with counter-examples (images in which the object is not present) and further examples annotated with the object’s centroid (a bounding box is not necessary).

Learning
Learning is performed in two stages.
1. Alphabet entries are added to a codebook. An alphabet entry is either a Boundary-Fragment (BF, a piece of linked edges) or a patch (a salient region and its descriptor). Each entry also casts at least one centroid vote, which is represented as a vector.
2. Weak detectors are formed as pairs of alphabet entries, and Boosting is used to select a strong detector, which consists of many weak detectors. This process selects the weak detectors that fire on the positive validation images (with a good centroid estimate) and do not fire on the negative ones.
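The two stages can be pictured with a small data-structure sketch. This is only an illustration: the class name `AlphabetEntry`, its fields, and the helper `candidate_weak_detectors` are hypothetical names, and the toy values stand in for real edge chains and region descriptors.

```python
from dataclasses import dataclass, field
from itertools import combinations


@dataclass
class AlphabetEntry:
    """One codebook entry (hypothetical layout, not the paper's data format)."""
    kind: str                      # "boundary_fragment" or "patch"
    data: object                   # linked edge points, or a region descriptor
    centroid_votes: list = field(default_factory=list)  # (dx, dy) vectors


def candidate_weak_detectors(alphabet):
    """Stage 2 input: every unordered pair of alphabet entries."""
    return list(combinations(alphabet, 2))


# Stage 1: a toy alphabet with made-up contents.
alphabet = [
    AlphabetEntry("boundary_fragment", [(0, 0), (1, 0), (2, 1)], [(15, -4)]),
    AlphabetEntry("patch", [0.2, 0.7, 0.1], [(-8, 12)]),
    AlphabetEntry("boundary_fragment", [(5, 5), (5, 6)], [(3, 3), (-20, 1)]),
]

pairs = candidate_weak_detectors(alphabet)
print(len(pairs))  # 3 unordered pairs from 3 entries
```

Boosting would then pick a subset of these pairs; that selection step is not shown here.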

Implementation Details
Linked edges are obtained for each image in the training and validation sets using a Canny edge detector. The training images provide the candidate boundary fragments γ_i: random starting points are selected on the edge map of each image, and a boundary fragment is grown along the contour from each such point. Growing starts at a fragment length of L_start and proceeds in steps of L_step pixels until a maximum length L_stop is reached.
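A minimal sketch of the growing step, assuming an edge chain is already available as an ordered list of pixels and that growth proceeds in one direction along the chain (the slides do not say whether growth is one-sided or two-sided). The function name `grow_fragments` and the parameter values are illustrative only.

```python
def grow_fragments(chain, start_index, l_start, l_step, l_stop):
    """Return the candidate fragments grown from one starting point.

    chain is a list of (x, y) edge pixels in linked order. Fragment lengths
    are counted in pixels: l_start, l_start + l_step, ... up to l_stop.
    Growth stops early if the chain runs out of pixels.
    """
    fragments = []
    length = l_start
    while length <= l_stop and start_index + length <= len(chain):
        fragments.append(chain[start_index:start_index + length])
        length += l_step
    return fragments


# Toy edge chain: a straight horizontal run of 30 linked pixels.
chain = [(x, 0) for x in range(30)]
candidates = grow_fragments(chain, start_index=5, l_start=10, l_step=5, l_stop=20)
print([len(f) for f in candidates])  # [10, 15, 20]
```

In practice the starting index would be drawn at random for each edge map, as described above.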

Weak detector
A weak detector h_i is formed by combining k boundary fragments (γ_a and γ_b for k = 2). It fires on an image if:
- the k boundary fragments match image edge chains;
- the fragments agree in their centroid estimates (within an uncertainty of 2r);
- in the case of positive images, the centroid estimate agrees with the true object centroid (O_n) within a distance of d_c.
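The firing conditions can be sketched as a simple check. The sketch assumes that edge-chain matching has already succeeded and produced one centroid estimate per fragment. Two details are assumptions of this example rather than statements from the slides: agreement is tested pairwise between fragment votes, and the combined estimate is taken as the mean of the votes.

```python
import math


def weak_detector_fires(votes, r, true_centroid=None, d_c=None):
    """Check the centroid conditions for one weak detector on one image.

    votes holds one predicted (x, y) centroid per matched fragment.
    true_centroid and d_c are only supplied for positive images.
    """
    # The fragments must agree with each other within 2r.
    for i in range(len(votes)):
        for j in range(i + 1, len(votes)):
            if math.dist(votes[i], votes[j]) > 2 * r:
                return False
    # On positive images the estimate must also lie within d_c of O_n.
    if true_centroid is not None:
        mean = (sum(x for x, _ in votes) / len(votes),
                sum(y for _, y in votes) / len(votes))
        if math.dist(mean, true_centroid) > d_c:
            return False
    return True


fires_positive = weak_detector_fires(
    [(100, 100), (104, 103)], r=5, true_centroid=(101, 100), d_c=15)
fires_scattered = weak_detector_fires([(100, 100), (160, 40)], r=5)
print(fires_positive, fires_scattered)  # True False
```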

Matching weak detectors
The top row shows a weak detector with k = 2 that fires on two positive validation images because its center votes are highly compact and close enough to the true object center (black circle). The last column shows a negative validation image, on which the same weak detector does not fire (the votes do not concur). Bottom row: the same as the top row, with k = 3.

Learning a Strong Detector
Each weak detector h_i consists of k boundary fragments and a threshold th_hi. This threshold is learnt, and a strong detector H is formed from T weak detectors h_i using AdaBoost. First, the distances D(h_i, I_j) are calculated for all combinations of boundary fragments (k elements per combination) on all positive and negative images of the validation set I_1, ..., I_v. Then, in each iteration 1, ..., T, the weak detector that obtains the best detection result under the current image weighting is selected.
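The selection loop can be sketched with textbook discrete AdaBoost over a precomputed distance table. This is not necessarily the exact boosting variant used in the paper. The sketch assumes that a weak detector fires when its distance is at most its threshold, and that candidate thresholds are taken from the observed distances; the names `fires` and `train_strong_detector` and the toy numbers are illustrative.

```python
import math


def fires(distance, threshold):
    """+1 if the weak detector fires (small distance), else -1."""
    return 1 if distance <= threshold else -1


def train_strong_detector(distances, labels, rounds):
    """distances[i][j] = D(h_i, I_j); labels[j] is +1 or -1.

    Returns a list of (detector index, threshold, weight) triples.
    """
    n = len(labels)
    weights = [1.0 / n] * n
    strong = []
    for _ in range(rounds):
        best = None
        # Search all candidates and thresholds for the lowest weighted error.
        for i, row in enumerate(distances):
            for th in sorted(set(row)):
                err = sum(w for w, d, y in zip(weights, row, labels)
                          if fires(d, th) != y)
                if best is None or err < best[0]:
                    best = (err, i, th)
        err, i, th = best
        err = min(max(err, 1e-10), 1 - 1e-10)  # keep the log finite
        alpha = 0.5 * math.log((1 - err) / err)
        strong.append((i, th, alpha))
        # Re-weight the images: misclassified images gain weight.
        weights = [w * math.exp(-alpha * y * fires(d, th))
                   for w, d, y in zip(weights, distances[i], labels)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return strong


# Three candidate weak detectors, four validation images (two positive).
distances = [
    [0.1, 0.2, 0.8, 0.9],
    [0.5, 0.1, 0.3, 0.9],
    [0.9, 0.8, 0.2, 0.1],
]
labels = [1, 1, -1, -1]
strong = train_strong_detector(distances, labels, rounds=2)
print(strong[0][0], strong[0][1])  # 0 0.2
```

The first candidate separates the toy data perfectly at threshold 0.2, so it is selected in the first round.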

Detection and Segmentation
First, the edges are detected. The boundary fragments of the weak detectors are then matched to this edge image. In order to detect one or more instances of the object (instead of classifying the whole image), each weak detector h_i votes with a weight w_hi in a Hough voting space. Votes are accumulated as follows: for all candidate points x_n found by the strong detector in the test image I_T, the (probabilistic) votes of the weak detectors h_i are summed in a 2D Hough voting space.
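A simplified sketch of the vote accumulation. It replaces the probabilistic vote by a hard circular window: a vote counts towards a candidate point if it lands within a fixed radius of it. The window, the `radius` parameter, and the function name `score_candidates` are assumptions of this example.

```python
import math


def score_candidates(candidates, votes, radius):
    """Sum the weighted centroid votes that fall near each candidate point.

    candidates: list of (x, y) points x_n.
    votes: list of (x, y, w) tuples, one per firing weak detector.
    """
    scores = {}
    for c in candidates:
        scores[c] = sum(w for x, y, w in votes
                        if math.dist(c, (x, y)) <= radius)
    return scores


votes = [(50, 50, 0.5), (52, 49, 0.25), (120, 30, 0.125)]
candidates = [(51, 50), (120, 30)]
scores = score_candidates(candidates, votes, radius=5)
best = max(scores, key=scores.get)
print(best, scores[best])  # (51, 50) 0.75
```

Keeping every candidate whose score exceeds a detection threshold, rather than only the maximum, would allow several object instances per image.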

The BFM for Multiple Categories
Building the alphabet of shape for many categories is based on the process for the one-class BFM. In addition, the other categories are searched to see whether a boundary fragment can be shared. Three cases arise:
1. The boundary fragment matches on many positive validation images of another category and gives a roughly correct prediction of the object centroid. In this case the alphabet entry is simply updated with the new costs for this category, and sharing is possible.
2. The boundary fragment matches well on many positive validation images, but the prediction of the object centroid is not correct, though the predictions for each match are often consistent with each other. In this case a new centroid vector is added to the alphabet entry.
3. The boundary fragment matches arbitrarily in the validation images of a category. In this case high costs emerge and sharing is not possible.
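The three sharing cases amount to a small decision rule. The sketch below is one possible reading: all three thresholds and their default values are hypothetical, and the slides do not say what happens when a fragment matches well but its centroid predictions are neither correct nor mutually consistent, so that situation is treated as "no sharing" here.

```python
def sharing_case(match_cost, centroid_error, vote_spread,
                 max_cost=0.3, max_error=20.0, max_spread=10.0):
    """Decide how a boundary fragment is shared with another category.

    match_cost: matching cost on that category's positive validation images.
    centroid_error: distance between predicted and true centroid.
    vote_spread: how much the predictions differ from match to match.
    """
    if match_cost > max_cost:
        return "no sharing"                  # case 3: arbitrary matches
    if centroid_error <= max_error:
        return "share: update costs"         # case 1
    if vote_spread <= max_spread:
        return "share: add centroid vector"  # case 2
    return "no sharing"                      # assumption: not in the slides


print(sharing_case(0.1, 8.0, 3.0))   # share: update costs
print(sharing_case(0.1, 60.0, 4.0))  # share: add centroid vector
print(sharing_case(0.9, 60.0, 40.0)) # no sharing
```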

Results
The first ten weak detectors learnt in the UM for the categories: Cars-side (UIUC), Cars-rear, Airplanes, Motorbikes and Faces (Caltech).

Conclusion and Discussion
- Fewer false positives.
- Less training data.
- Processing time?
- Scaling and rotation.
