Hierarchical Clustering
Lecture 15
David Sontag, New York University
Agglomerative Clustering
• Agglomerative clustering:
  – First merge very similar instances
  – Incrementally build larger clusters out of smaller clusters
• Algorithm:
  – Maintain a set of clusters
  – Initially, each instance is in its own cluster
  – Repeat:
    • Pick the two closest clusters
    • Merge them into a new cluster
    • Stop when there is only one cluster left
• Produces not one clustering, but a family of clusterings, represented by a dendrogram
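The algorithm above can be sketched in a few lines. This is a minimal illustration, not the lecture's code: it assumes 1-D points and single-link distance for concreteness, and records each merge so the full family of clusterings (the dendrogram's merge order) is returned.

```python
from itertools import combinations

def cluster_dist(a, b):
    # Single-link distance: the closest pair of points across the two clusters.
    return min(abs(x - y) for x in a for y in b)

def agglomerate(points):
    # Initially, each instance is in its own cluster.
    clusters = [[p] for p in points]
    merges = []  # merge order: the family of clusterings in the dendrogram
    while len(clusters) > 1:
        # Pick the two closest clusters...
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: cluster_dist(clusters[ij[0]], clusters[ij[1]]))
        # ...and merge them into a new cluster.
        merged = clusters[i] + clusters[j]
        merges.append(merged)
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges

print(agglomerate([1.0, 1.1, 5.0, 5.2]))
# → [[1.0, 1.1], [5.0, 5.2], [1.0, 1.1, 5.0, 5.2]]
```

The quadratic all-pairs search per merge is for clarity only; practical implementations maintain a priority queue of inter-cluster distances.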
Agglomerative Clustering
• How should we define “closest” for clusters with multiple elements?
• Many options:
  – Closest pair (single-link clustering)
  – Farthest pair (complete-link clustering)
  – Average of all pairs
• Different choices create different clustering behaviors

[Figure: the same eight points grouped differently by closest pair (single-link clustering) and farthest pair (complete-link clustering). Pictures from Thorsten Joachims]
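The three linkage options can be written side by side. A hedged sketch on hypothetical 1-D clusters, just to show how much the definitions can disagree when one cluster contains an outlying point:

```python
def single_link(a, b):
    # Closest pair across the two clusters.
    return min(abs(x - y) for x in a for y in b)

def complete_link(a, b):
    # Farthest pair across the two clusters.
    return max(abs(x - y) for x in a for y in b)

def average_link(a, b):
    # Average over all cross-cluster pairs.
    return sum(abs(x - y) for x in a for y in b) / (len(a) * len(b))

A, B = [0.0, 1.0], [2.0, 10.0]
print(single_link(A, B), complete_link(A, B), average_link(A, B))
# → 1.0 10.0 5.5
```

Single-link sees A and B as nearly touching, complete-link sees them as far apart; this is exactly why single-link tends to produce long chains while complete-link favors compact clusters.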
Clustering Behavior

[Figure: dendrograms under average, farthest (complete-link), and nearest (single-link) linkage. Mouse tumor data from Hastie et al.]
Agglomerative Clustering
• When can this be expected to work?
• Strong separation property: all points are more similar to points in their own cluster than to any points in any other cluster
• If the strong separation property holds, then the true clustering corresponds to some pruning of the tree obtained by single-link clustering!
• Slightly weaker (stability) conditions are solved by average-link clustering (Balcan et al., 2008)

[Figure: single-link clustering tree over the eight example points]
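The pruning claim can be checked empirically. A sketch on hypothetical 1-D data with two strongly separated groups: every true cluster should show up as a subtree of the single-link merge tree.

```python
from itertools import combinations

def single_link(a, b):
    # Closest pair across the two clusters.
    return min(abs(x - y) for x in a for y in b)

def single_link_subtrees(points):
    # Return every subtree (as a sorted tuple) of the single-link dendrogram.
    clusters = [[p] for p in points]
    subtrees = [tuple(c) for c in clusters]
    while len(clusters) > 1:
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: single_link(clusters[ij[0]], clusters[ij[1]]))
        merged = sorted(clusters[i] + clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
        subtrees.append(tuple(merged))
    return subtrees

# Two groups satisfying strong separation: within-group gaps are all
# smaller than any between-group gap.
true_clusters = [(1.0, 1.2, 1.5), (9.0, 9.3, 9.4)]
points = [p for c in true_clusters for p in c]
trees = single_link_subtrees(points)
print(all(c in trees for c in true_clusters))
# → True
```

So some pruning of this tree (here, cutting just below the root) recovers the true clustering, as the slide states.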