  1. Extracting Relevant Information from Samples
     Naftali Tishby
     School of Computer Science and Engineering
     Interdisciplinary Center for Neural Computation
     The Hebrew University of Jerusalem, Israel
     ISAIM 2008

  2. Outline
     1. Mathematics of relevance
        Motivating examples
        Sufficient Statistics
        Relevance and Information
     2. The Information Bottleneck Method
        Relations to learning theory
        Finite sample bounds
        Consistency and optimality
     3. Further work and Conclusions
        The Perception Action Cycle
        Temporary conclusions

  3. Examples: Co-occurrence data (words-topics, genes-tissues, etc.)

  4. Example: Objects and pixels

  5. Example: Neural codes (e.g. de-Ruyter and Bialek)

  6. Neural codes (Fly H1 cell recording, with Rob de-Ruyter and Bill Bialek)

  7. Sufficient statistics
     What captures the relevant properties in a sample about a parameter?
     Given an i.i.d. sample x^(n) ~ p(x|θ).

     Definition (Sufficient statistic). T(x^(n)) is a function of the sample such that
       p(x^(n) | T(x^(n)) = t, θ) = p(x^(n) | T(x^(n)) = t).

     Theorem (Fisher-Neyman factorization). T(x^(n)) is sufficient for θ in p(x|θ)
     if and only if there exist h(x^(n)) and g(T, θ) such that
       p(x^(n) | θ) = h(x^(n)) g(T(x^(n)), θ).
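The factorization can be checked numerically for a concrete case. A minimal sketch for a Bernoulli sample, where T = Σ_k x_k (the Bernoulli example and the function names are illustrative, not from the slides):

```python
import numpy as np

def bernoulli_likelihood(x, theta):
    # Full sample likelihood p(x^(n) | theta) for an i.i.d. Bernoulli(theta) sample.
    x = np.asarray(x)
    return float(np.prod(theta ** x * (1 - theta) ** (1 - x)))

def factorized_likelihood(x, theta):
    # Fisher-Neyman form h(x^(n)) * g(T, theta) with T = sum(x) and h = 1.
    t, n = int(np.sum(x)), len(x)
    return theta ** t * (1 - theta) ** (n - t)

# Two different samples with the same statistic T = 2 have identical
# likelihoods for every theta: the sample matters only through T.
x1, x2 = [1, 1, 0, 0], [0, 1, 0, 1]
for theta in (0.2, 0.5, 0.9):
    assert np.isclose(bernoulli_likelihood(x1, theta), factorized_likelihood(x1, theta))
    assert np.isclose(bernoulli_likelihood(x1, theta), bernoulli_likelihood(x2, theta))
```

Here h(x^(n)) = 1 and g(T, θ) = θ^T (1 − θ)^(n − T), so the conditional distribution of the sample given T carries no information about θ.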

  10. Minimal sufficient statistics
      There are always trivial (complex) sufficient statistics, e.g. the sample itself.

      Definition (Minimal sufficient statistic). S(x^(n)) is a minimal sufficient
      statistic for θ in p(x|θ) if it is a function of every other sufficient
      statistic T(x^(n)).

      S(X^n) gives the coarsest sufficient partition of the n-sample space.
      S is unique (up to a 1-1 map).
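The coarsest sufficient partition can be made concrete with the standard likelihood-ratio criterion (Lehmann-Scheffe): two samples lie in the same cell of the minimal sufficient partition exactly when their likelihood ratio does not depend on θ. A sketch for length-3 Bernoulli samples, where the cells turn out to be the level sets of S = Σ_k x_k (the example and function names are illustrative, not from the slides):

```python
from itertools import product
import numpy as np

def likelihood(x, theta):
    # p(x^(n) | theta) for an i.i.d. Bernoulli(theta) sample.
    x = np.asarray(x)
    return float(np.prod(theta ** x * (1 - theta) ** (1 - x)))

def same_cell(x, y, thetas=(0.1, 0.3, 0.7)):
    # x and y lie in the same cell of the minimal sufficient partition iff
    # the ratio p(x|theta)/p(y|theta) is constant in theta; here the
    # constancy is checked on a few representative theta values.
    ratios = [likelihood(x, t) / likelihood(y, t) for t in thetas]
    return bool(np.allclose(ratios, ratios[0]))

# The equivalence classes coincide with the level sets of S = sum(x):
# the coarsest partition that still determines the likelihood.
samples = list(product([0, 1], repeat=3))
for x in samples:
    for y in samples:
        assert same_cell(x, y) == (sum(x) == sum(y))
```

Any finer partition (e.g. the sample itself) is also sufficient but wastes resolution; S merges every sample that induces the same likelihood function of θ.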

  14. Sufficient statistics and exponential forms
      What distributions have sufficient statistics?

      Theorem (Pitman, Koopman, Darmois). Among families of parametric
      distributions whose domain does not vary with the parameter, only
      exponential families,
        p(x|θ) = h(x) exp( Σ_r η_r(θ) A_r(x) − A_0(θ) ),
      have sufficient statistics for θ of bounded dimensionality:
        T_r(x^(n)) = Σ_{k=1}^n A_r(x_k)   (additive for i.i.d. samples).
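The Gaussian is a familiar instance: N(μ, σ²) is an exponential family with A_1(x) = x and A_2(x) = x² (and natural parameters η_1 = μ/σ², η_2 = −1/(2σ²)), so the additive sums T_1, T_2 summarize any i.i.d. sample. A sketch (the Gaussian example is ours, not from the slide):

```python
import numpy as np

# Draw an i.i.d. Gaussian sample; for the Gaussian, A_1(x) = x and
# A_2(x) = x^2, so the sufficient statistics are the additive sums below.
rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=1.5, size=1000)
n = len(x)

T1 = np.sum(x)        # T_1 = sum_k A_1(x_k)
T2 = np.sum(x ** 2)   # T_2 = sum_k A_2(x_k)

# The maximum-likelihood estimates of (mu, sigma^2) depend on the sample
# only through (T_1, T_2), with fixed dimension 2 regardless of n.
mu_hat = T1 / n
var_hat = T2 / n - mu_hat ** 2
assert np.isclose(mu_hat, np.mean(x))
assert np.isclose(var_hat, np.var(x))
```

The point of the theorem is the converse: outside exponential families (with fixed support), no such fixed-dimension compression of the sample is sufficient.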

  16. Sufficiency and Information
      Definition (Mutual Information). For any two random variables X and Y with
      joint pdf P(X = x, Y = y) = p(x, y), Shannon's mutual information I(X;Y)
      is defined as
        I(X;Y) = E_{p(x,y)} log [ p(x,y) / (p(x) p(y)) ].

      I(X;Y) = H(X) − H(X|Y) = H(Y) − H(Y|X) ≥ 0
      I(X;Y) = D_KL[ p(x,y) ‖ p(x) p(y) ]

      I(X;Y) is the maximal number (on average) of independent bits about Y
      that can be revealed from measurements on X.
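The definition and the entropy identity can be checked on a small discrete joint distribution. A minimal sketch (the joint table values are illustrative):

```python
import numpy as np

def entropy(p):
    # Shannon entropy in bits; zero-probability cells contribute nothing.
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

# A small joint distribution p(x, y) (rows index X, columns index Y).
pxy = np.array([[0.25, 0.25],
                [0.00, 0.50]])
px, py = pxy.sum(axis=1), pxy.sum(axis=0)

# Definition: I(X;Y) = E_{p(x,y)} log p(x,y) / (p(x) p(y)),
# i.e. the KL divergence between the joint and the product of marginals.
mask = pxy > 0
I_def = np.sum(pxy[mask] * np.log2(pxy[mask] / np.outer(px, py)[mask]))

# Identity: I(X;Y) = H(X) - H(X|Y) = H(X) + H(Y) - H(X,Y).
I_ent = entropy(px) + entropy(py) - entropy(pxy.flatten())

assert np.isclose(I_def, I_ent)
assert I_def >= 0
```

Both routes give the same number of bits; independence (pxy equal to the outer product of the marginals) is exactly the case I(X;Y) = 0.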
