Neural Codes for Image Retrieval
David Stutz
July 22, 2015
Table of Contents

1 Introduction
2 Image Retrieval
  ◮ Bag of Visual Words
  ◮ Vector of Locally Aggregated Descriptors
  ◮ Sparse-Coded Features
  ◮ Compression and Nearest-Neighbor Search
3 Convolutional Neural Networks
  ◮ Multi-layer Perceptrons
  ◮ Convolutional Neural Networks
  ◮ Architectures
  ◮ Training
4 Neural Codes for Image Retrieval
5 Experiments
6 Summary
1. Introduction

Image retrieval:

Problem. Given a large database of images and a query image, find images showing the same object or scene.

Originally:
◮ Text-based retrieval systems based on manual annotations (advantage: supports activities, emotions, ...);
◮ impractical for large collections of images.

Today, content-based image retrieval:
◮ Techniques based on the Bag of Visual Words [SZ03] model.
2. Image Retrieval

Formalization of content-based image retrieval:

Problem. Find the $K$ nearest neighbors of a query $z_0$ in a (large) database $X = \{x_1, \dots, x_N\}$ of image representations.

[Figure: query $z_0$ and its $K = 2$ nearest neighbors, first among $N = 7$ points, then for large $N$.]

Important: the image representation.

Examples for image representations from the "Computer Vision" lecture:
◮ Histograms;
◮ Bag of Visual Words [SZ03].
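To make the formal problem concrete, the following is a minimal sketch of exact $K$-nearest-neighbor retrieval over a database of representation vectors (NumPy assumed; the representations themselves come from the methods discussed next, and all sizes are toy values):

```python
import numpy as np

def knn_retrieve(z0, X, K=2):
    """Return the indices of the K nearest neighbors of query z0 in X.

    z0: query representation, shape (D,)
    X:  database of N image representations, shape (N, D)
    """
    # Euclidean distances between the query and all database entries.
    distances = np.linalg.norm(X - z0, axis=1)
    # Indices of the K smallest distances (unsorted partition, then sort).
    nearest = np.argpartition(distances, K)[:K]
    return nearest[np.argsort(distances[nearest])]

# Toy usage: N = 7 random 4-dimensional representations, K = 2.
X = np.random.rand(7, 4)
z0 = np.random.rand(4)
print(knn_retrieve(z0, X, K=2))
```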
2.1. Bag of Visual Words

Intuition: assign local descriptors $y_{l,n}$ of image $x_n$ to visual words $\hat{y}_1, \dots, \hat{y}_M$ previously obtained using clustering.

[Figure: local descriptor $y_{l,n}$ assigned to its nearest visual word $\hat{y}_m$.]
2.1. Bag of Visual Words

1. Extract local descriptors $Y_n$ for each image $x_n$.
2. Cluster all local descriptors $Y = \bigcup_{n=1}^{N} Y_n$ to obtain visual words $\hat{Y} = \{\hat{y}_1, \dots, \hat{y}_M\}$.
3. Assign each $y_{l,n} \in Y_n$ to its nearest visual word (embedding step):
   $f(y_{l,n}) = \big( \delta(\mathrm{NN}_{\hat{Y}}(y_{l,n}) = \hat{y}_1), \dots \big)$.
4. Count visual word occurrences (aggregation step):
   $F(Y_n) = \sum_{l=1}^{L} f(y_{l,n})$.
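A minimal sketch of these steps, assuming the local descriptors have already been extracted (scikit-learn's KMeans stands in for the clustering; descriptor extraction, e.g. SIFT, is omitted, and all sizes are illustrative):

```python
import numpy as np
from sklearn.cluster import KMeans

# Y_all: stacked local descriptors of all images, shape (num_descriptors, D).
Y_all = np.random.rand(1000, 64)
M = 100  # number of visual words

# Step 2: cluster all descriptors; the cluster centers are the visual words.
kmeans = KMeans(n_clusters=M, n_init=10).fit(Y_all)

def bovw(Y_n):
    """Bag of Visual Words representation of one image.

    Y_n: local descriptors of image x_n, shape (L, D).
    """
    # Embedding: assign each descriptor to its nearest visual word.
    assignments = kmeans.predict(Y_n)
    # Aggregation: count how often each visual word occurs.
    return np.bincount(assignments, minlength=M).astype(float)

Y_n = np.random.rand(50, 64)   # descriptors of one image
print(bovw(Y_n).shape)         # (100,)
```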
2.2. Vector of Locally Aggregated Descriptors

Intuition: consider the residuals $y_{l,n} - \hat{y}_m$ instead of counting visual words.

[Figure: residual between local descriptor $y_{l,n}$ and its nearest visual word $\hat{y}_m$.]
2.2. Vector of Locally Aggregated Descriptors

1. Extract and cluster local descriptors.
2. Compute residuals of the local descriptors to their nearest visual words (embedding step):
   $f(y_{l,n}) = \big( \delta(\mathrm{NN}_{\hat{Y}}(y_{l,n}) = \hat{y}_1)\,(y_{l,n} - \hat{y}_1), \dots \big)$.
3. Aggregate the residuals (aggregation step):
   $F(Y_n) = \sum_{l=1}^{L} f(y_{l,n})$.
4. $L_2$-normalize $F(Y_n)$.
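A minimal sketch of the VLAD embedding and aggregation under the same assumptions as above (the `centers` array stands in for the clustered visual words; this is an illustrative sketch, not a reference implementation):

```python
import numpy as np

def vlad(Y_n, centers):
    """VLAD representation of one image.

    Y_n:     local descriptors, shape (L, D).
    centers: visual words, shape (M, D).
    """
    M, D = centers.shape
    # Nearest visual word for each descriptor.
    distances = np.linalg.norm(Y_n[:, None, :] - centers[None, :, :], axis=2)
    nearest = np.argmin(distances, axis=1)

    # Embedding/aggregation: accumulate residuals y_{l,n} - y_hat_m per word.
    F = np.zeros((M, D))
    for y, m in zip(Y_n, nearest):
        F[m] += y - centers[m]

    # Concatenate and L2-normalize.
    F = F.reshape(-1)
    norm = np.linalg.norm(F)
    return F / norm if norm > 0 else F

Y_n = np.random.rand(50, 64)
centers = np.random.rand(100, 64)
print(vlad(Y_n, centers).shape)   # (6400,)
```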
2.3. Sparse-Coded Features

Intuition: soft-assign local descriptors to visual words.

[Figure: local descriptor $y_{l,n}$ soft-assigned to several visual words $\hat{y}_m$, $\hat{y}_{m'}$.]
2.3. Sparse-Coded Features

1. Extract and cluster local descriptors.
2. Compute sparse codes (embedding step):
   $f(y_{l,n}) = \operatorname*{argmin}_{r_l} \; \|y_{l,n} - \hat{Y} r_l\|_2^2 + \lambda \|r_l\|_1$,
   where $\hat{Y}$ contains the visual words $\hat{y}_m$ as columns.
3. Pool the sparse codes (aggregation step):
   $F(Y_n) = \big( \max_{1 \le l \le L} \{ f_1(y_{l,n}) \}, \dots \big)$,
   where $f_1(y_{l,n})$ denotes the first component of $f(y_{l,n})$.
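A minimal sketch of this embedding with max-pooling, using scikit-learn's SparseCoder as the $L_1$-regularized solver (the dictionary rows are the visual words, i.e. the transpose of the $\hat{Y}$ matrix above; `transform_alpha` plays the role of $\lambda$; all values are chosen only for illustration):

```python
import numpy as np
from sklearn.decomposition import SparseCoder

# Visual words as dictionary atoms, one atom per row: M = 100 words, D = 64.
centers = np.random.rand(100, 64)
coder = SparseCoder(dictionary=centers,
                    transform_algorithm='lasso_lars',
                    transform_alpha=0.1)   # corresponds to lambda

def sparse_coded(Y_n):
    """Sparse-coded representation of one image via max-pooling.

    Y_n: local descriptors, shape (L, D).
    """
    # Embedding: one sparse code r_l per local descriptor, shape (L, M).
    codes = coder.transform(Y_n)
    # Aggregation: component-wise maximum over all codes (max-pooling).
    return codes.max(axis=0)

Y_n = np.random.rand(50, 64)
print(sparse_coded(Y_n).shape)   # (100,)
```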
2.4. Compression, Nearest-Neighbor Search

Until now: the image representation. Additional aspects of image retrieval:
◮ compression of the image representations;
◮ efficient indexing and nearest-neighbor search [JDS11];
◮ query expansion [CPS+07] and spatial verification [PCI+07].

For example, compression can be accomplished using:
◮ unsupervised methods, e.g. Principal Component Analysis (PCA);
◮ or discriminative methods, e.g. Joint Subspace and Classifier Learning [GRPV12] or Large Margin Dimensionality Reduction [SPVZ13] (discussed later).

A PCA-based sketch of the unsupervised route follows below.
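As an illustration of compressing high-dimensional representations with PCA (scikit-learn assumed; the target dimensionality of 128 and all sizes are just examples):

```python
import numpy as np
from sklearn.decomposition import PCA

# X: database of N high-dimensional image representations, shape (N, D).
X = np.random.rand(1000, 6400)

# Fit PCA on the database and compress to, e.g., 128 dimensions.
pca = PCA(n_components=128)
X_compressed = pca.fit_transform(X)

# A query is projected with the same transformation before the
# nearest-neighbor search.
z0 = np.random.rand(1, 6400)
z0_compressed = pca.transform(z0)
print(X_compressed.shape, z0_compressed.shape)   # (1000, 128) (1, 128)
```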