Multiple co-clustering and its application


  1. Multiple co-clustering and its application Tomoki Tokuda, Okinawa Institute of Science and Technology Graduate University 1 / 13

  2. Outline 1. Introduction 2. Method for multiple co-clustering 3. Application to depression data 4. Conclusion 2 / 13

  3. Introduction 3 / 13

  4. What is multiple clustering? A conventional clustering method yields one clustering solution. 4 / 13

  5. A multiple clustering method yields multiple clustering solutions. 5 / 13

  6. Method for multiple co-clustering 6 / 13

  7. Multiple clustering in a data matrix. Multiple clustering solutions: partition the features into non-overlapping groups (views), then cluster the objects within each view. Figure 1: Original data → multiple clustering solutions. This reveals associations between features and object clustering. 7 / 13
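To make the view structure concrete, here is a toy sketch (our own construction, not the authors' method) in which two disjoint feature groups of one data matrix support two different, equally valid partitions of the same objects; running k-means separately on each feature group recovers each partition.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
n = 100
# View 1 (features 0-4): first half vs. second half of the objects.
view1 = rng.normal(0.0, 1.0, (n, 5))
view1[:50] += 3.0
# View 2 (features 5-9): even- vs. odd-indexed objects.
view2 = rng.normal(0.0, 1.0, (n, 5))
view2[::2] += 3.0
X = np.hstack([view1, view2])

labels_a = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X[:, :5])
labels_b = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X[:, 5:])
# The two feature groups yield two different object partitions.
print((labels_a[:50] == labels_a[0]).all())   # True: view 1 splits by halves
print((labels_b[::2] == labels_b[0]).all())   # True: view 2 splits by parity
```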

  8. Idea of the algorithm.
  ◮ Clustering objects → fitting a certain distribution family (in an iterative manner).
  [Diagram: clustering objects (local) alternates with partitioning features (global).]
  ◮ Iteratively optimize the objective function (i.e., the likelihood).
  8 / 13
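A minimal sketch of this alternation for the Gaussian case, using point estimates and a fixed number of views V and clusters K (the paper instead uses a Bayesian nonparametric formulation; all function names here are ours). Each pass clusters objects within the current views (the local step), then reassigns every feature to the view whose partition fits it best (the global step):

```python
import numpy as np
from sklearn.cluster import KMeans

def gaussian_score(x, z, K):
    """Log-likelihood of one feature x under object partition z (MLE fit per cluster)."""
    ll = 0.0
    for k in range(K):
        xk = x[z == k]
        if xk.size > 1:
            ll += -0.5 * xk.size * (np.log(2 * np.pi * (xk.var() + 1e-9)) + 1.0)
    return ll

def alternate(X, V=2, K=2, n_iter=10, seed=0):
    rng = np.random.default_rng(seed)
    n, d = X.shape
    view = rng.integers(0, V, d)                      # global: feature -> view
    for _ in range(n_iter):
        Z = []                                        # local: object clusters per view
        for v in range(V):
            cols = X[:, view == v]
            if cols.shape[1] == 0:
                Z.append(rng.integers(0, K, n))       # keep empty views alive
            else:
                Z.append(KMeans(K, n_init=5, random_state=seed).fit_predict(cols))
        # reassign each feature to the view whose partition fits it best
        view = np.array([np.argmax([gaussian_score(X[:, j], Z[v], K)
                                    for v in range(V)]) for j in range(d)])
    return view, Z
```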

  9. Challenges in multiple clustering for high-dimensional data.
  ◮ No information on the number of views or object clusters → Dirichlet process (an unbounded number of views and clusters; see the stick-breaking sketch below).
  ◮ Missing values → integrate them out (Bayesian framework).
  We work on the following further challenges:
  ◮ Possible over-fitting to the data: typically, the number of samples is much smaller than the number of features.
  ◮ Mixing of several types of data: we want to analyze data combining numerical and categorical features!
  9 / 13
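The Dirichlet process admits an unbounded number of components; its weights can be generated by the standard stick-breaking construction. A generic truncated sketch (not the paper's exact prior):

```python
import numpy as np

def stick_breaking(alpha, T, rng):
    """Truncated stick-breaking: Dirichlet-process weights, decaying on average."""
    betas = rng.beta(1.0, alpha, size=T)
    stick_left = np.concatenate(([1.0], np.cumprod(1.0 - betas)[:-1]))
    return betas * stick_left

rng = np.random.default_rng(0)
w = stick_breaking(alpha=1.0, T=25, rng=rng)
print(np.round(w[:5], 3), w.sum())  # weights decay; the sum approaches 1 as T grows
```

A small alpha concentrates mass on a few components, so the effective number of views or clusters is inferred from the data rather than fixed in advance.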

  10. Our proposed model. Ingredients:
  ◮ Similar features are fitted by the same univariate distribution (a feature cluster; hence, co-clustering).
  ◮ Allowing a mix of different distribution types (Gaussian, Poisson, multinomial), as sketched below.
  Byproducts:
  ◮ Easy interpretation of similar features.
  ◮ Computationally efficient: O(nd) per iteration.
  These modifications broaden the scope of application. 10 / 13
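A sketch of the mixed-type ingredient: each feature is scored by a univariate model of its declared type, so one pass over an n × d matrix costs O(nd). Plug-in MLEs are used here for brevity; the paper places conjugate priors on the parameters instead:

```python
import numpy as np
from scipy.stats import norm, poisson

def feature_loglik(x, kind):
    """Score one feature with a univariate model of its declared type (MLE plug-in)."""
    if kind == "gaussian":
        return norm.logpdf(x, loc=x.mean(), scale=x.std() + 1e-9).sum()
    if kind == "poisson":
        return poisson.logpmf(x, mu=x.mean() + 1e-9).sum()
    if kind == "multinomial":
        _, counts = np.unique(x, return_counts=True)
        return (counts * np.log(counts / x.size)).sum()
    raise ValueError(kind)

rng = np.random.default_rng(0)
print(feature_loglik(rng.normal(size=50), "gaussian"))
print(feature_loglik(rng.poisson(3.0, size=50), "poisson"))
print(feature_loglik(rng.integers(0, 3, size=50), "multinomial"))
```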

  11. Model. Likelihood:
  log p(X | Y, Z, Θ) = Σ_{m,v,g,k,j,i} I(Y^{(m)}_{j,v,g} = 1) I(Z_{i,v,k} = 1) log p(X^{(m)}_{i,j} | θ^{(m)}_{v,g,k})
  where m is the type of distribution (pre-specified), Y_{j,v,g} indicates that feature j belongs to view v and feature cluster g, and Z_{i,v,k} indicates that object i belongs to object cluster k in view v.
  Prior for the distribution parameters: conjugate priors for the Gaussian, Poisson, and multinomial families. 11 / 13
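Under the one-hot indicators, the sum simply scores each (view, feature-cluster, object-cluster) block under its own parameter. A Gaussian-only sketch with our own toy encoding of Y, Z, and Θ:

```python
import numpy as np
from scipy.stats import norm

def complete_loglik(X, y, z, theta):
    """Gaussian-only version of the likelihood above.
    y[j] = (view, feature cluster) of feature j; z[v][i] = object cluster of
    object i in view v; theta[(v, g, k)] = (mean, sd) of that block."""
    ll = 0.0
    for (v, g, k), (mu, sd) in theta.items():
        cols = [j for j, vg in enumerate(y) if vg == (v, g)]
        rows = np.flatnonzero(z[v] == k)
        ll += norm.logpdf(X[np.ix_(rows, cols)], mu, sd).sum()
    return ll

X = np.array([[0.1, 5.0], [-0.2, 4.8], [3.1, 5.2]])
y = [(0, 0), (0, 1)]                      # both features in view 0, clusters 0 and 1
z = {0: np.array([0, 0, 1])}              # view 0: objects {0, 1} vs. {2}
theta = {(0, 0, 0): (0.0, 1.0), (0, 0, 1): (3.0, 1.0),
         (0, 1, 0): (5.0, 1.0), (0, 1, 1): (5.0, 1.0)}
print(complete_loglik(X, y, z, theta))
```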

  12. Essence of the algorithm: variational Bayesian method.
  ◮ We want to know the posterior p(φ | X) → analytically intractable.
  ◮ So we consider an approximation. By Jensen's inequality,
    log p(X) ≥ ∫ q(φ) log [ p(X, φ) / q(φ) ] dφ,  (1)
  where q(φ) is arbitrary; equality holds when q(φ) = p(φ | X).
  ◮ Assume the factorization q(φ) = Π_i q_i(φ_i).
  ◮ We optimize q(φ) to maximize the right-hand side of Eq. (1).
  ◮ A (conditionally) optimal distribution is given by
    q_i(φ_i) ∝ exp{ E_{−q_i(φ_i)} [log p(X, φ)] },
  where E_{−q_i(φ_i)} denotes averaging over all parameters except φ_i. 12 / 13
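To see the coordinate update q_i(φ_i) ∝ exp{E_{−q_i}[log p(X, φ)]} in action, here is the textbook example of a Gaussian with unknown mean μ and precision τ under the factorization q(μ) q(τ) (cf. Bishop, PRML §10.1.3); a generic illustration, not this paper's model:

```python
import numpy as np

# Data: Gaussian with unknown mean mu and precision tau.
rng = np.random.default_rng(0)
x = rng.normal(2.0, 0.5, size=200)
n, xbar = x.size, x.mean()

# Priors: mu ~ N(mu0, (lam0 * tau)^-1), tau ~ Gamma(a0, b0).
mu0, lam0, a0, b0 = 0.0, 1.0, 1.0, 1.0
E_tau = a0 / b0
for _ in range(50):
    # q(mu) = N(mu_n, 1/lam_n): expectation of log p(X, phi) over q(tau)
    mu_n = (lam0 * mu0 + n * xbar) / (lam0 + n)
    lam_n = (lam0 + n) * E_tau
    # q(tau) = Gamma(a_n, b_n): expectation of log p(X, phi) over q(mu)
    a_n = a0 + (n + 1) / 2
    E_sq = ((x - mu_n) ** 2).sum() + n / lam_n
    b_n = b0 + 0.5 * (E_sq + lam0 * ((mu_n - mu0) ** 2 + 1 / lam_n))
    E_tau = a_n / b_n

print(mu_n, (1 / E_tau) ** 0.5)  # close to the true mean 2.0 and sd 0.5
```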

  13. 4. Conclusion
  ◮ A novel method of multiple clustering for high-dimensional data.
  ◮ The co-clustering structure within each view enables efficient computation and easy interpretation of features.
  ◮ In the application to depression data, one subject-clustering solution was found that is relevant to treatment effect.
  ◮ This model may enable prediction of treatment effect based on stress experiences in childhood and functional connectivity in the brain.
  13 / 13
