Compression: Prefix Codes
Greg Plaxton
Theory in Programming Practice, Spring 2004
Department of Computer Science, University of Texas at Austin

(Binary, Static) Code
• Maps each symbol a of a given finite alphabet A to a codeword w(a) in {0, 1}* (i.e., a binary codeword)
  – The mapping is static, i.e., a is always encoded as w(a), regardless of the surrounding context
  – So the mapping determines the encoder
  – But decoding can be problematic (why?)
Theory in Programming Practice, Plaxton, Spring 2004

Uniquely Decodable Code
• A code is uniquely decodable if the associated encoder maps distinct input strings to distinct encoded strings
  – Necessary and sufficient for lossless decoding
  – Example of a code that is uniquely decodable? One that is not?
• Let ℓ(a) denote the length of w(a)
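
One way to explore the question above is a brute-force search for two distinct symbol strings that encode to the same bit string. The sketch below is only illustrative: the function names and the two example codes are assumptions, not part of the lecture, and finding no collision up to a bounded length does not prove unique decodability.

```python
from itertools import product


def encode(code, symbols):
    """Static encoder: concatenate the codeword w(a) of each symbol a."""
    return "".join(code[s] for s in symbols)


def find_collision(code, max_len=4):
    """Look for two distinct symbol strings (length <= max_len) with equal encodings.

    Returns the colliding pair, or None if no collision is found.
    A None result is NOT a proof of unique decodability.
    """
    seen = {}
    for n in range(1, max_len + 1):
        for symbols in product(sorted(code), repeat=n):
            bits = encode(code, symbols)
            if bits in seen and seen[bits] != symbols:
                return seen[bits], symbols
            seen[bits] = symbols
    return None


# Hypothetical example codes (not from the slides).
ambiguous = {"a": "0", "b": "01", "c": "10"}
fixed_len = {"a": "00", "b": "01", "c": "10"}

print(find_collision(ambiguous))  # "ac" and "ba" both encode to 010
print(find_collision(fixed_len))  # fixed-length codes never collide
```

The fixed-length code is uniquely decodable because the encoded string can be cut into blocks of two bits, each of which identifies exactly one symbol.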

Optimal Code
• Suppose we are given a frequency f(a) for each symbol a in A
  – Let p(a) denote $f(a) / \sum_{b \in A} f(b)$
  – Note that p(a) may be viewed as a probability
• We define the weight of a code as $\sum_{a \in A} p(a) \cdot \ell(a)$
• A code is optimal (for a given alphabet and associated probability distribution) if it has minimum weight over all uniquely decodable codes
  – Remark: Keep in mind that we are only talking about optimality with respect to the set of binary static codes; we will revisit this issue later
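
The definitions of p(a) and of the weight translate directly into a few lines of code. This is a minimal sketch; the frequency table and the code below are made-up examples, and the function names are hypothetical.

```python
def probabilities(freq):
    """p(a) = f(a) divided by the sum of f(b) over all b in A."""
    total = sum(freq.values())
    return {a: f / total for a, f in freq.items()}


def weight(code, freq):
    """Weight of a code: sum over a in A of p(a) * l(a), with l(a) = len(w(a))."""
    p = probabilities(freq)
    return sum(p[a] * len(code[a]) for a in code)


# Hypothetical frequencies and code (not from the slides).
freq = {"a": 5, "b": 2, "c": 1}
code = {"a": "0", "b": "10", "c": "11"}

print(weight(code, freq))  # 5/8 * 1 + 2/8 * 2 + 1/8 * 2 = 1.375
```

The weight is the expected number of encoded bits per input symbol, so a lower weight means better compression under the given distribution.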

An Entropy-Based Lower Bound on Code Weight
• Let H denote the entropy of the probability distribution associated with alphabet A, i.e., $H = -\sum_{a \in A} p(a) \log p(a)$ (logarithms are base 2)
• Theorem: The weight of any uniquely decodable code for A is at least H
• Hint: Use the two inequalities given on the next slide and the fact that the logarithm function is concave over the positive reals
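
A quick numerical check can make the theorem concrete. The sketch below computes H with base-2 logarithms and compares it with the weight of one prefix code under two made-up distributions; all names and numbers are illustrative assumptions.

```python
import math


def entropy(p):
    """H = -sum of p(a) * log2 p(a); symbols with p(a) = 0 contribute nothing."""
    return -sum(q * math.log2(q) for q in p.values() if q > 0)


def weight(code, p):
    """Weight of a code under the probability distribution p."""
    return sum(p[a] * len(code[a]) for a in code)


code = {"a": "0", "b": "10", "c": "11"}      # hypothetical prefix code
dyadic = {"a": 0.5, "b": 0.25, "c": 0.25}    # every p(a) is a power of 1/2
skewed = {"a": 0.7, "b": 0.2, "c": 0.1}      # hypothetical non-dyadic case

for p in (dyadic, skewed):
    print(entropy(p), weight(code, p))
```

For the dyadic distribution the weight equals H exactly, so the lower bound is tight; for the skewed one the weight is strictly larger than H.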

Two Inequalities
• McMillan: Any uniquely decodable code satisfies $\sum_{a \in A} 2^{-\ell(a)} \le 1$
• Jensen: If $\lambda_1, \ldots, \lambda_n$ are nonnegative reals summing to 1 and f is a concave function over an interval containing the reals $x_1, \ldots, x_n$, then $\sum_i \lambda_i \cdot f(x_i) \le f\left(\sum_i \lambda_i \cdot x_i\right)$
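
One possible way to combine the two inequalities into the entropy lower bound is sketched below. It assumes p(a) > 0 for every a in A and base-2 logarithms, and writes W for the weight of the code; it is a sketch of one route to the result, not necessarily the proof intended in the lecture.

```latex
\begin{align*}
H - W &= \sum_{a \in A} p(a) \log \frac{1}{p(a)} - \sum_{a \in A} p(a) \cdot \ell(a) \\
      &= \sum_{a \in A} p(a) \log \frac{2^{-\ell(a)}}{p(a)} \\
      &\le \log \left( \sum_{a \in A} p(a) \cdot \frac{2^{-\ell(a)}}{p(a)} \right)
         && \text{(Jensen, with } \lambda_a = p(a),\ x_a = 2^{-\ell(a)} / p(a),\ f = \log\text{)} \\
      &= \log \left( \sum_{a \in A} 2^{-\ell(a)} \right) \\
      &\le \log 1 = 0
         && \text{(McMillan, and } \log \text{ is increasing)}
\end{align*}
```

Hence W is at least H for every uniquely decodable code.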

Prefix Code
• A prefix code is a code in which no codeword is the prefix of another
  – Uniquely decodable
  – Easy to decode
• Exercise: Give an example of a code that is uniquely decodable but is not a prefix code
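
The two properties claimed for prefix codes can be illustrated with a short sketch: a check of the prefix condition, and a greedy left-to-right decoder that emits a symbol as soon as the bits read so far match a codeword. The function names and example codes are hypothetical.

```python
def is_prefix_code(code):
    """True if no codeword is a prefix of another.

    After sorting, a codeword that is a prefix of some other codeword
    is also a prefix of its immediate successor, so adjacent pairs suffice.
    """
    words = sorted(code.values())
    return all(not nxt.startswith(cur) for cur, nxt in zip(words, words[1:]))


def decode(code, bits):
    """Greedy decoder; assumes `code` is a prefix code."""
    inverse = {w: a for a, w in code.items()}
    out, cur = [], ""
    for b in bits:
        cur += b
        if cur in inverse:  # unambiguous because no codeword extends another
            out.append(inverse[cur])
            cur = ""
    if cur:
        raise ValueError("trailing bits do not form a codeword")
    return out


prefix = {"a": "0", "b": "10", "c": "11"}      # hypothetical prefix code
not_prefix = {"a": "0", "b": "01", "c": "10"}  # "0" is a prefix of "01"

print(is_prefix_code(prefix), is_prefix_code(not_prefix))
print(decode(prefix, "01011"))
```

The decoder never needs to look ahead or backtrack, which is what makes prefix codes easy to decode.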

Kraft-McMillan Inequality
• Kraft: For any sequence of integers $\ell_1, \ldots, \ell_{|A|}$ such that $\sum_{1 \le i \le |A|} 2^{-\ell_i} \le 1$, there is a prefix code for A with codeword lengths $\ell_1, \ldots, \ell_{|A|}$
• Since every uniquely decodable code satisfies McMillan’s inequality, we can restrict our attention to prefix codes in searching for an optimal code
• McMillan’s inequality and the above result are often stated together (in two parts) and referred to as the Kraft-McMillan inequality
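
Kraft's result is constructive. One standard construction (often called a canonical code) processes the lengths in nondecreasing order and assigns each codeword the next unused binary value at that length. The sketch below assumes all lengths are positive integers; the function names are hypothetical, and this is one possible construction rather than the one used in the lecture.

```python
from fractions import Fraction


def kraft_sum(lengths):
    """Exact value of the sum of 2^(-l) over the given positive lengths."""
    return sum(Fraction(1, 2 ** l) for l in lengths)


def prefix_code_from_lengths(lengths):
    """Return codewords, in input order, forming a prefix code with these lengths."""
    if any(l < 1 for l in lengths):
        raise ValueError("this sketch assumes positive integer lengths")
    if kraft_sum(lengths) > 1:
        raise ValueError("lengths violate the Kraft inequality")
    order = sorted(range(len(lengths)), key=lambda i: lengths[i])
    words = [None] * len(lengths)
    value, prev_len = 0, 0
    for i in order:
        value <<= lengths[i] - prev_len        # pad with zeros up to the new length
        words[i] = f"{value:0{lengths[i]}b}"   # fixed-width binary string
        value += 1                             # next unused value at this length
        prev_len = lengths[i]
    return words


print(prefix_code_from_lengths([1, 2, 3, 3]))
print(prefix_code_from_lengths([2, 1, 2]))
```

The Kraft condition is what guarantees that the counter never runs past the largest value representable at the current length.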

An Entropy-Based Upper Bound on the Weight of an Optimal Code
• Theorem: There is an optimal (prefix) code for A with weight less than H + 1
• Hint: First use the Kraft-McMillan inequality to establish the existence of a prefix code for A where $\ell(a) = \lceil \log \frac{1}{p(a)} \rceil$ for all a in A
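
The hint can be checked numerically: the lengths ⌈log 1/p(a)⌉ satisfy the Kraft condition, and the resulting weight falls in [H, H + 1). The sketch below uses base-2 logarithms and a made-up distribution with 0 < p(a) < 1; the names are hypothetical.

```python
import math


def shannon_lengths(p):
    """l(a) = ceil(log2(1 / p(a))); assumes 0 < p(a) < 1 for every a."""
    return {a: math.ceil(-math.log2(q)) for a, q in p.items()}


def entropy(p):
    """Base-2 entropy of the distribution p."""
    return -sum(q * math.log2(q) for q in p.values() if q > 0)


p = {"a": 0.7, "b": 0.2, "c": 0.1}  # hypothetical distribution

lengths = shannon_lengths(p)
kraft = sum(2.0 ** -l for l in lengths.values())
W = sum(p[a] * lengths[a] for a in p)
H = entropy(p)

print(lengths)   # codeword lengths 1, 3 and 4
print(kraft)     # 1/2 + 1/8 + 1/16 = 0.6875, so a prefix code exists
print(H, W)      # H <= W < H + 1
```

Because 2^(-ℓ(a)) is at most p(a) for each symbol, the Kraft sum is at most 1; because ℓ(a) is less than log(1/p(a)) + 1, the weight is less than H + 1. These lengths are generally not optimal, but an optimal code can only do better.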

Summary and Discussion of Entropy-Based Bounds
• The weight of an optimal prefix code lies in the interval [H, H + 1)
• If H is high, then an optimal prefix code is guaranteed to achieve close to the best possible compression ratio achievable with any coding technique (static or not)
  – Here we are appealing to Shannon’s entropy bound
• If H is close to zero, then an optimal prefix code might achieve a compression ratio that is dramatically worse than the best possible
  – Example?
  – Other compression techniques may be applied in such situations in order to achieve near-optimal performance (e.g., arithmetic coding or run-length coding)
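
One possible example for the low-entropy case is a two-symbol alphabet with a heavily skewed distribution. Any prefix code must spend at least one bit per symbol, since an empty codeword would be a prefix of the other codeword, so the optimal weight is 1 even though H is far smaller. The numbers below are made up for illustration.

```python
import math

# Hypothetical two-symbol source, heavily skewed toward "a".
p = {"a": 0.99, "b": 0.01}
code = {"a": "0", "b": "1"}  # an optimal prefix code for two symbols

H = -sum(q * math.log2(q) for q in p.values())
W = sum(p[s] * len(code[s]) for s in p)

print(H)      # roughly 0.08 bits per symbol
print(W)      # 1 bit per symbol
print(W / H)  # the prefix code uses more than 12 times the entropy
```

This gap is what techniques such as arithmetic coding or run-length coding can close, because they are not forced to emit a whole number of bits for every input symbol.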

Computing an Optimal Prefix Code
• Huffman’s algorithm will be presented in the next lecture
