The Minimum Description Length Principle Peter Grnwald CWI - PowerPoint PPT Presentation

Feb 22, 2023 •1.5k likes •1.69k views

The Minimum Description Length Principle Peter Grnwald CWI Amsterdam www.grunwald.nl (slides edited by Tim van Erven) Machine Learning Course, Vrije Universiteit Amsterdam December 5 th 2007 Minimum Description Length Principle Rissanen

The Minimum Description Length Principle Peter Grünwald CWI Amsterdam www.grunwald.nl (slides edited by Tim van Erven) Machine Learning Course, Vrije Universiteit Amsterdam December 5 th 2007
Minimum Description Length Principle Rissanen 1978, 1987, 1996, Barron, Rissanen and Yu 1998 • ‘MDL’ is a method for inductive inference… – machine learning – pattern recognition – statistics • …based on ideas from data compression (information theory) • In contrast to most other methods, MDL automatically deals with overfitting, arguably the central problem in machine learning and statistics
Minimum Description Length Principle • MDL is based on the correspondence between ‘regularity’ and ‘compression’: – The more you are able to compress a sequence of data, the more regularity you have detected in the data – Example: 001 0010 0100 1001 0010 0100 1001 ::::0 01 010 1101 1100 1001 1101 0001 0101 ::::0 10
Minimum Description Length Principle • MDL is based on the correspondence between ‘regularity’ and ‘compression’: – The more you are able to compress a sequence of data, the more regularity you have detected in the data… – …and thus the more you have learned from the data: • ‘inductive inference’ as trying to find regularities in data (and using those to make predictions of future data)
Model Selection/Overfitting Given data D and hypothesis spaces/models , which model best explains M 1 , M 2, M 3 ,  the data ? – Need to take into account • Complexity of models • Error (minus Goodness-of-fit) – Example: • Selecting the degree of a polynomial in regression • Sum of squared errors
Example: Regression
Example: Regression
Example: Regression
Example: Regression
Example: Regression

Recommend

Minimum Description Length Bono Nonchev Principle in Model Selection Information Theory The

Minimum Description Length Principle in Model Selection Minimum Description Length Bono Nonchev Principle in Model Selection Information Theory The MDL Principle Bono Nonchev Model Selection Faculty of Mathematics and Informatics,

736 views • 18 slides

DATA MINING LECTURE 10 Minimum Description Length Information Theory Co-Clustering MINIMUM

DATA MINING LECTURE 10 Minimum Description Length Information Theory Co-Clustering MINIMUM DESCRIPTION LENGTH Occams razor Most data mining tasks can be described as creating a model for the data E.g., the EM algorithm models the

515 views • 38 slides

On the minimum rank of a graph Jisu Jeong June 21, 2013 Jisu Jeong On the minimum rank of a

Minimum rank The minimum rank of a random graph over the binary field An algorithm to decide the minimum rank for fixed k Future work On the minimum rank of a graph Jisu Jeong June 21, 2013 Jisu Jeong On the minimum rank of a graph Minimum

865 views • 27 slides

Govt. of Gujarat Gujarat Coastline Zone Accretion Erosion length Stable length Total length

Coastal Protection works 15 th Meeting of CPDAC Committee Narmada, Water Resources, Water Supply and Kalpsar Department Govt. of Gujarat Gujarat Coastline Zone Accretion Erosion length Stable length Total length length (km) (km) (km) (km)

877 views • 31 slides

Class 14 Slides SLIDE what is the designing principle how does designing principle

Class 14 Slides SLIDE what is the designing principle how does designing principle differ from premise how do you find the designing principle examples of designing principle examples of designing principle from

87 views • 8 slides

Verification of Security Protocols with Lists: from Length One to Unbounded Length Miriam Paiola

Introduction Protocols with lists Generalized Horn Clauses From any length to length one An approximation algorithm Conclusion Verification of Security Protocols with Lists: from Length One to Unbounded Length Miriam Paiola Bruno Blanchet {

1.02k views • 44 slides

For Friday Read Chapter 10, sections 1 and 2 Prolog Handout 4 Length of a List

For Friday Read Chapter 10, sections 1 and 2 Prolog Handout 4 Length of a List Definition of length/2 length([], 0). length([_ | Tail], N) :- length(Tail, N1), N is 1 + N1. Note: all loops must be implemented via recursion

629 views • 26 slides

Fibre Delay Line: Fibre Delay Line: FDL Principle drawing Principle drawing The length of

Fibre Delay Line: Fibre Delay Line: FDL Principle drawing Principle drawing The length of the jumper is factory set up Th l th f th j i f t t thanks to interferometric method 1ps delay precision at 1550nm Free space beam

60 views • 5 slides

February 2017 European Minimum Income Network - Introduction The European Minimum Income Network

Minimum income policies as a tool to tackle poverty Presentation of the European Minimum Income Network (EMIN) February 2017 European Minimum Income Network - Introduction The European Minimum Income Network (EMIN) is an informal Network of

366 views • 13 slides

Outline and Reading Minimum Spanning Trees ( 12.7.3) Minimum Spanning Tree n Definitions n A

Minimum Spanning Tree 7/8/03 12:53 Outline and Reading Minimum Spanning Trees ( 12.7.3) Minimum Spanning Tree n Definitions n A crucial fact Prim-Jarniks Algorithm ( 12.7.3.2) Kruskals Algorithm ( 12.7.3.1) 7/8/03 12:53 Minimum

238 views • 5 slides

THE MINIMUM WAGE Background I Established in US in 1938 THE MINIMUM WAGE Background I

THE MINIMUM WAGE Background I Established in US in 1938 THE MINIMUM WAGE Background I Established in US in 1938 I It is illegal for rms in covered sectors to pay less than the minimum wage (historic graph) THE MINIMUM WAGE Background I

713 views • 42 slides

Reducing Extraneous Processing Modality Principle Jan L. Plass, ECT Coherence Principle

3/29/10 Reducing Extraneous Processing Modality Principle Jan L. Plass, ECT Coherence Principle Contiguity Principle Signaling Principle Club Marian Maid Marian Extraneous Processing (60 min.) Working in groups of

203 views • 4 slides

End-to-End principle End-to-end Principle Broad networking principle First implementation

End-to-End principle End-to-end Principle Broad networking principle First implementation in French CYCLADES network (after ARPA) (1970) Articulated in its most recognizable form by Saltzer, Reed, Clark (1981) [paper] Guidance on

123 views • 9 slides

2/2/2015 FUNDAMENTAL LEGAL PRINCIPLES Principle of Indemnity Principle of Insurable

2/2/2015 FUNDAMENTAL LEGAL PRINCIPLES Principle of Indemnity Principle of Insurable Interest Principle of Subrogation Principle of Utmost Good Faith Requirements of an Insurance Contract Distinct Legal Characteristics of

277 views • 5 slides

Bayesian Learning Bayes Theorem MAP, ML hypotheses MAP learners Minimum description

Bayesian Learning Bayes Theorem MAP, ML hypotheses MAP learners Minimum description length principle Bayes optimal classifier Naive Bayes learner Example: Learning over text data Bayesian belief networks

1.11k views • 51 slides

Lecture 6A LENSES, FOCAL LENGTH, & PORTRAITS LOUDEN 1 What is focal Length? Very

Lecture 6A LENSES, FOCAL LENGTH, & PORTRAITS LOUDEN 1 What is focal Length? Very simply, it is the distance from the lens to the film , when focused on a subject at infinity. In other words, focal length equals image distance for a

1.06k views • 63 slides

Data Governance, Ethics and the law Ethics and law: why does it matter to computer scientists?

Data Governance, Ethics and the law Ethics and law: why does it matter to computer scientists? Costs: fines and reputation No Buy-in by needed users Loss of trust Fines (under GDPR: a fine up to 10,000,000 EUR or up to 2% of the annual

466 views • 36 slides

shipping industry An exploratory study of adoption likelihood and scenario- based opportunities

Blockchain adoption in the shipping industry An exploratory study of adoption likelihood and scenario- based opportunities and risks for IT service providers Programme: MSc. in International Business Authors: Riccardo Di Gregorio & Stian

408 views • 13 slides

I&E Study Project 2018 8th Session - Nov 29th, 2018 olli-pekka.mutanen (at) aalto.fi

I&E Study Project 2018 8th Session - Nov 29th, 2018 olli-pekka.mutanen (at) aalto.fi saara.brax (at) lut.fi Agenda Nov 29th, 2018 1. Dress Rehearsal by the teams (12 min/each) 2. Deadlines and Final presentation 3. Reporting instructions

490 views • 26 slides

First Quarter Fiscal 2018 Conference Call February 1, 2018 Preliminary Statements Forward

First Quarter Fiscal 2018 Conference Call February 1, 2018 Preliminary Statements Forward Looking Statements This document contains certain forward-looking statements. These statements are based on the companys current expectations as to the

605 views • 25 slides

Better to LOOK stupid, than to BE stupid Fred Henry Williams Agile Prague, 2018 Never

Better to LOOK stupid, than to BE stupid Fred Henry Williams Agile Prague, 2018 Never underestimate the power of human stupidity. Robert A. Heinlein (P.s. Many highly intelligent people are actually stupid) Epistemology the

344 views • 16 slides

1-genericity and the finite intersection principle Peter Cholak, Rod Downey, Gregory Igusa*

1-genericity and the finite intersection principle Peter Cholak, Rod Downey, Gregory Igusa* University of Notre Dame 25 June, 2014 Peter Cholak, Rod Downey, Gregory Igusa* 1-genericity and the finite intersection principle Background In

960 views • 58 slides

Permutons and Pattern Densities Peter Winkler (Dartmouth) with Rick Kenyon (Brown), Dan

Permutation Patterns, Reykjavik 6/17 This image cannot currently be displayed. Permutons and Pattern Densities Peter Winkler (Dartmouth) with Rick Kenyon (Brown), Dan Krl (Warwick) & Charles Radin (Texas) work begun at ICERM,

734 views • 29 slides

Mean Field Games on Unbounded Networks and the Graphon MFG Equations Peter E. Caines McGill

Mean Field Games on Unbounded Networks and the Graphon MFG Equations Peter E. Caines McGill University Work with Shuang Gao and Minyi Huang CROWDS models and control CIRM, Marseille, France, June, 2019 Work supported by NSERC and ARL 1 / 55

1.01k views • 55 slides