A Consistent Density-Based Clustering Algorithm and its Application - - PowerPoint PPT Presentation

a consistent density based clustering algorithm and its
SMART_READER_LITE
LIVE PREVIEW

A Consistent Density-Based Clustering Algorithm and its Application - - PowerPoint PPT Presentation

A Consistent Density-Based Clustering Algorithm and its Application to Microstructure Image Segmentation Marilyn Vazquez Landrove Institute for Computational and Experimental Research in Mathematics Providence, RI Computational Imaging Vazquez


slide-1
SLIDE 1

A Consistent Density-Based Clustering Algorithm and its Application to Microstructure Image Segmentation

Marilyn Vazquez Landrove

Institute for Computational and Experimental Research in Mathematics Providence, RI

Computational Imaging

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 1 / 32

slide-2
SLIDE 2

Motivation Image Segmentation

Acknowledgements

National Institute of Standards and Technology Sponsor: Steve Langer Mentor: Gunay Dogan Data: Sheng Yen Li Other Collaborator: Andrew Reid Research supported by Industrial Immersion Program (IIP) at GMU

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 2 / 32

slide-3
SLIDE 3

Motivation Image Segmentation

Material Images: Steel

Pearlite: A mixture of cementite and ferrite Simulations that let them measure the strength of their material.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 3 / 32

slide-4
SLIDE 4

Motivation Image Segmentation

Material Images: Steel

Information needed for simulations: Fraction of pearlite (stripe region) in image Fraction of ferrite (white stripes) in pearlite Space between ferrite stripes etc. First step to achieve these goals: Image Segmentation!

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 4 / 32

slide-5
SLIDE 5

Motivation Image Segmentation

Challenges

(1) Texture is difficult to represent and (2) want the least amount of human involvement

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 5 / 32

slide-6
SLIDE 6

Motivation Image Segmentation

Create Features to Capture Patterns

Edge detection: Use contrast to detect edges and let enclosed areas be the regions of interest. Ex: Canny Edge detector (1987) Match filters: Build filter bank that represents the desired pattern and convolve with image to measure “response.” Ex: Frangi Filter (1998) Region based: Grow/shrink input regions according to partial differential equation. Ex: Level set method by Osher and Sethian (1988)

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 6 / 32

slide-7
SLIDE 7

Motivation Image Segmentation

Image Segmentation as a Clustering Problem

Our idea: Introduce manifold learning to achieve non-parametric image segmentation

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 7 / 32

slide-8
SLIDE 8

Motivation Density Based Clustering

Density Based Clustering

How would you cluster points with the following density f ? Challenges: We do not have access to the real density f , but an estimation ˆ f .

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 8 / 32

slide-9
SLIDE 9

Motivation Density Based Clustering

Cut: Super-level Set

Suppose we knew f , then a solution can be found looking at super-level sets, i.e. {x ∈ XN : f (x) ≥ λ}

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 9 / 32

slide-10
SLIDE 10

Motivation Density Based Clustering

Cluster: Connected Components

Connected components of {x ∈ XN : f (x) ≥ λ}

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 10 / 32

slide-11
SLIDE 11

Motivation Density Based Clustering

Classify

How do we label the rest of the points? Using a classifier!

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 11 / 32

slide-12
SLIDE 12

Consistency of clustering

Section 2 Consistency of clustering

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 12 / 32

slide-13
SLIDE 13

Consistency of clustering Intuition

Consistency

Definition Let T be the true clustering of M and {Tn} be a sequence of random clusterings of M. The sequence {Tn} is consistent if for every sufficiently small δ, γ > 0, the following conditions hold for sufficiently large n:

1 (Separation) Tn ⊂ B(T, δ) 2 (Coverage) T ⊂ B(Tn, δ) 3 (Cohesiveness) The inclusions in (1) and (2) define a one-to-one

correspondence between the connected components of T and Tn with probability 1 − γ.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 13 / 32

slide-14
SLIDE 14

Consistency of clustering Consistency Definition

Separation: Tn ⊂ B(T, δ)

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 14 / 32

slide-15
SLIDE 15

Consistency of clustering Consistency Definition

Coverage: T ⊂ B(Tn, δ)

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 15 / 32

slide-16
SLIDE 16

Consistency of clustering Consistency Definition

Consistency of Density Based Clustering

Let Xn be a data cloud of n points sampled from M ⊆ Rd using probability density function q Let qn be a consistent density estimator of q. For density based clustering, clearly T = Tλ(q) i.e. the superlevel set

  • f density q at λ.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 16 / 32

slide-17
SLIDE 17

Consistency of clustering Consistency Definition

Consistency of Density Based Clustering: Theorem

Theorem Let q : Rd → R be a C d probability density function such that q(x) → 0 as ||x|| → ∞. Also, assume that qn is a consistent density estimator of q with variance σ2

  • n. Suppose that qn has a Lipschitz constant Ln such that

Ld

nσ2 n → 0 as n → ∞. Denote the superlevel set of a function as

Tλ(f ) = {x ∈ Xn : f (x) > λ}. Then for almost every λ > 0 and for small enough δ > 0, B(Tλ(qn), δ) form a consistent clustering of Tλ(q).

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 17 / 32

slide-18
SLIDE 18

Image Segmentation

Section 3 Image Segmentation

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 18 / 32

slide-19
SLIDE 19

Image Segmentation Cut-Cluster-Classify Method

Data Driven Feature Space

Each 4 × 4 patch is a point in a 16 dimensional data cloud, whose PCA representation is on the left.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 19 / 32

slide-20
SLIDE 20

Image Segmentation Cut-Cluster-Classify Method

Patch Space

Original image PCA projection of Patches on top in false colors 4 by 4 patches

  • f their PCA projection

Sample 4 by 4 patches

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 20 / 32

slide-21
SLIDE 21

Image Segmentation Cut-Cluster-Classify Method

Cut

Step 1: Estimate sample density Step 2: Threshold, or cut, according to density

Patch density indexed PCA projection Sorted density with by top left corner pixel colored by density threshold drawn in red

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 21 / 32

slide-22
SLIDE 22

Image Segmentation Cut-Cluster-Classify Method

Cluster

Step 3: Cluster points that passed the threshold

Original image Sampled data clustered Patches on top of in false colors the clustered projection

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 22 / 32

slide-23
SLIDE 23

Image Segmentation Cut-Cluster-Classify Method

No Patch Left Behind

Step 4: Classify remaining points

⇓ ⇓

Cluster 1 Cluster 2 Sample patches being classified. PCA projection of all 4 by 4 patches classified.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 23 / 32

slide-24
SLIDE 24

Image Segmentation Cut-Cluster-Classify Method

From patches to Pixels

Step 5: Label pixels using patch labels

To decide which cluster pixel (3,1) belongs to, we look at all the patches that it appears

  • in. As illustrated in the top images, it only appears in 3 patches. The actual patches,

shown on the bottom, were classified to cluster 1, 2, and 2, respectively. This means that cluster 2 gets 2 votes, and cluster 1 gets 1 vote. Therefore, this pixel gets placed in cluster 2.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 24 / 32

slide-25
SLIDE 25

Image Segmentation Results

Real Grayscale Images

Original image. Segmentation from 30 × 30 patches.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 25 / 32

slide-26
SLIDE 26

Image Segmentation Results

Real Grayscale Images

Original image. Segmentation from 8 × 8 patches.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 26 / 32

slide-27
SLIDE 27

Image Segmentation Results

Real Grayscale Images

Original image. Segmentation from 20 × 20 patches.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 27 / 32

slide-28
SLIDE 28

Image Segmentation Results

Real Grayscale Images

Original image. Segmentation from 4 × 4 patches.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 28 / 32

slide-29
SLIDE 29

Image Segmentation Results

Color Images

Original color image. Our segmentation.

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 29 / 32

slide-30
SLIDE 30

Concluding Remarks

Concluding Remarks

In this research, we have: Created a new mathematical definition for consistency of clustering Shown that B(Tλ(qn), δ) is a consistent clustering of Tλ(q) Developed a clustering method, the Cut-Cluster-Classify, for smooth probability density functions Used our density based clustering to do image segmentation

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 30 / 32

slide-31
SLIDE 31

Concluding Remarks

Future Work

Work that still needs to be done: Develop the multi-scale image segmentation by considering different patch sizes Generalize the theoretical framework How do we find the practical δ using persistence

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 31 / 32

slide-32
SLIDE 32

Concluding Remarks

Thank You!

Vazquez (ICERM) Consistency of Density Based Clustering Computational Imaging 32 / 32