15-150 Fall 2020 Lecture 8 Stephen Brookes trees vs. lists - PowerPoint PPT Presentation

15-150 Fall 2020 Lecture 8 Stephen Brookes

trees vs. lists • Representing a collection as a tree may enable a parallel speed-up • Using a sorted tree may enable faster code, e.g. for searching • With lists, even sorted lists, there’s less potential for parallelism • But badly balanced trees are no better than lists, and balance may be hard to achieve!

the plan • First, a quick review • We’ll discuss how to search in lists and trees - under various assumptions (sorted, balanced) • Then we’ll implement an algorithm for sorting a tree - and prove its correctness - and analyze its work and span

but first… • Someone asked about naming conventions • I prefer T for trees, t for types (and tea to drink) • I often use capitalized names for datatype constructors like Node, Empty, SOME, NONE • Not required by ML, but you must be consistent datatype ’a tree = Empty | Node of ’a tree * ’a * ’a tree; fun size empty = 0 | size (Node(A, _, B)) = 1 + size A + size B What happens?

balanced trees We can build a balanced tree from a list… … and (if we do it right) get the same list back by in-order traversal 1 list2tree [4,1,2] 4 2 inord

recall datatype ’a tree = Empty | Node of ’a tree * ’a * ’a tree fun size Empty = 0 | size (Node(T1, x, T2)) = 1 + (size T1) + (size T2) fun depth Empty = 0 | depth (Node(T1, x, T2)) = 1 + Int.max(depth T1, depth T2) • size T = number of nodes • depth T = length of longest path from root to leaf • A full binary tree of depth d has 2 d - 1 nodes • depth T is O(log (size T)) for a balanced tree, depth T is O(size T) otherwise(!)

recall fun inord Empty = [ ] | inord (Node(T1, x, T2)) = (inord T1) @ x :: (inord T2) fun list2tree [ ] = Empty | list2tree [x] = Node(Empty, x, Empty) | list2tree L = let val n = length L val (A, x::B) = takedrop (n div 2, L) in Node(list2tree A, x, list2tree B) end • inord T = inorder traversal list of T • length(inord T) = size T

question • Would it have been OK to omit the [x] clause? fun list2tree [ ] = Empty | list2tree L = let val n = length L val (A, x::B) = takedrop (n div 2, L) in Node(list2tree A, x, list2tree B) end list2tree [4] = ???

answer • Would it have been OK to omit the [x] clause? fun list2tree [ ] = Empty | list2tree L = let val n = length L val (A, x::B) = takedrop (n div 2, L) in Node(list2tree A, x, list2tree B) end YES Correctness proof still works!

precision (or lack thereof) • There may be MANY balanced trees with the same inorder traversal list • list2tree L builds a balanced tree with inorder traversal list L • We don’t need to (or care to) say which one! Go back and see if/where/why we used imprecise specs before!

balanced • Empty is balanced • Node(A, x, B) is balanced iff |size(A) - size(B)| ≤ 1 and A, B are balanced A structurally inductive definition

balanced • Empty is balanced • Node(A, x, B) is balanced iff |size(A) - size(B)| ≤ 1 and A, B are balanced A structurally inductive definition • If T is balanced, every node of T is balanced • If T is balanced, its children each have about half the data

balanced • Empty is balanced • Node(A, x, B) is balanced iff |size(A) - size(B)| ≤ 1 and A, B are balanced A structurally inductive definition • If T is balanced, every node of T is balanced (by definition + an easy structural induction) • If T is balanced, its children each have about half the data

balanced • Empty is balanced • Node(A, x, B) is balanced iff |size(A) - size(B)| ≤ 1 and A, B are balanced A structurally inductive definition • If T is balanced, every node of T is balanced (by definition + an easy structural induction) • If T is balanced, its children each have about half the data (how could you prove this?)

sorted lists nil is sorted x::R is sorted iff x is ≤ every integer in R and R is sorted also a structurally inductive definition

sorted trees Empty is sorted Node(A, x, B) is sorted iff every integer in A is ≤ x, every integer in B is ≥ x, and A and B are sorted

sorted trees Empty is sorted Theorem Node(A, x, B) is sorted iff T is a sorted tree iff every integer in A is ≤ x, inord T is a sorted list every integer in B is ≥ x, and A and B are sorted

sorted trees Empty is sorted Theorem Node(A, x, B) is sorted iff T is a sorted tree iff every integer in A is ≤ x, inord T is a sorted list every integer in B is ≥ x, prove by structural induction and A and B are sorted

sorted trees Empty is sorted Node(A, x, B) is sorted iff every integer in A is ≤ x, every integer in B is ≥ x, prove by structural induction and A and B are sorted

sorted trees Empty is sorted Node(A, x, B) is sorted iff every integer in A is ≤ x, every integer in B is ≥ x, and A and B are sorted

sorted trees Empty is sorted Node(A, x, B) is sorted iff every integer in A is ≤ x, every integer in B is ≥ x, and A and B are sorted . 42 . . 42 81 . . 57 . 99 . 3 14

sorted trees Empty is sorted Node(A, x, B) is sorted iff every integer in A is ≤ x, every integer in B is ≥ x, and A and B are sorted . . 42 42 . . . . 14 81 42 81 . . 57 . 99 . . . 57 . 99 . 3 42 3 14

all all : (int -> bool) * int tree -> bool fun all (p, Empty) = true | all (p, Node(A, x, B)) = (p x) andalso all (p, A) andalso all (p, B) REQUIRES p is total ENSURES all (p, T) = true iff every integer in T satisfies p

all all : (int -> bool) * int tree -> bool fun all (p, Empty) = true | all (p, Node(A, x, B)) = (p x) andalso all (p, A) andalso all (p, B) REQUIRES p is total ——————— p x terminates, for all x in T ENSURES all (p, T) = true iff every integer in T satisfies p

sorted fun sorted (T : int tree) : bool = case T of Empty => true | Node(A, x, B) => all ( fn y => y <= x, A) andalso all ( fn y => y >= x, B) andalso sorted A andalso sorted B sorted T = true iff T is a sorted tree

sorted fun sorted Empty = true | sorted (Node(A, x, B)) = all ( fn y => y <= x, A) andalso all ( fn y => y >= x, B) andalso sorted A andalso sorted B sorted T = true iff T is a sorted tree Useful in specs, never used in code!

motivation Sorted data may be easier to deal with… • That’s why dictionaries are in lexicographic order! Let’s look at functions for searching data contained in • lists (unsorted, sorted) • trees (unsorted, sorted) • We’ll contrast the work and span.

searching an unsorted list mem : int * int list -> bool fun mem (x, [ ]) = false fun mem (x, [ ]) = false | mem (x, y::L) = (x = y) orelse mem (x, L) | mem (x, y::L) = (x = y) orelse mem (x, L) REQUIRES true ENSURES mem (x, L) = true iff x is in L W mem (x, L) is O(length L) S mem (x, L) is also O(length L)

searching a sorted list mem : int * int list -> bool fun mem (x, [ ]) = false | mem (x, y::L) = case Int.compare(x, y) of LESS => false | EQUAL => true | GREATER => mem (x, L) REQUIRES L is a sorted list ENSURES mem (x, L) = true iff x is in L W mem (x, L) is O(length L) S mem (x, L) is also O(length L)

searching an unsorted tree mem : int * int tree -> bool fun mem (x, Empty) = false fun mem (x, Empty) = false | mem (x, Node(A, y, B)) = | mem (x, Node(A, y, B)) = (x = y) orelse mem (x, A) orelse mem (x, B) (x = y) orelse mem (x, A) orelse mem (x, B) (* not designed for parallel evaluation *) REQUIRES T is a tree ENSURES mem (x, T) = true iff x is in T W mem (x, T) is O(size T) S mem (x, T) is also O(size T)

searching an unsorted tree mem : int * int tree -> bool fun mem (x, Empty) = false | mem (x, Node(A, y, B)) = (x = y) orelse let val (a, b) = (mem (x, A), mem (x, B)) in (* designed for parallel evaluation *) a orelse b end W mem (x, T) is O(size T) S mem (x, T) is O(depth T) … let’s see why

searching an unsorted tree fun mem (x, Empty) = false | mem (x, Node(A, y, B)) = (x = y) orelse let val (a, b) = (mem (x, A), mem (x, B)) in a orelse b end S(mem(x, Empty)) = 1 S(mem(x, Node(A, y, B))) = 1 + max(S(mem(x,A)), S( mem (x, B)))

searching an unsorted tree fun mem (x, Empty) = false | mem (x, Node(A, y, B)) = (x = y) orelse let val (a, b) = (mem (x, A), mem (x, B)) in a orelse b end S(mem(x, Empty)) = 1 S(mem(x, Node(A, y, B))) = 1 + max(S(mem(x,A)), S( mem (x, B))) S( mem (x, T)) is O(depth T)

searching an unsorted tree fun mem (x, Empty) = false | mem (x, Node(A, y, B)) = (x = y) orelse let val (a, b) = (mem (x, A), mem (x, B)) in a orelse b end S(mem(x, Empty)) = 1 S(mem(x, Node(A, y, B))) = 1 + max(S(mem(x,A)), S( mem (x, B))) S( mem (x, T)) is O(depth T) Let S mem (d) be span for mem(x,T) with T of depth d

15-150 Fall 2020 Lecture 8 Stephen Brookes trees vs. lists - PowerPoint PPT Presentation

15-150 Fall 2020 Lecture 8 Stephen Brookes trees vs. lists Representing a collection as a tree may enable a parallel speed-up Using a sorted tree may enable faster code, e.g. for searching With lists, even sorted lists, theres less

MEDP 150 / FILMP 150 MEDP 150 / FILMP 150 Whether you are thinking about a career in filmmaking,

The Problem = $1,500 taxes/yr. Median Value = $150,000 x 1% for City Property Tax + $150 to roads

Leica Sprinter 50 / 150 / 150M / 250M Push the Button Leica Sprinter 50 / 150 Construction

Fall to Fall Enrollment Comparison Fall to Fall Enrollment Comparison Student FTE, Fall 2000

Seasonal Outreach Fall Fall Outreach Campaign Fall Outreach Campaign Fall Outreach Fall

14 C 4 CFR PAR ART 150 150 N NOISE AN AND D LAND USE C SE COM OMPATIBILITY ST STUDY

CPB Approach 0,5 0 2000 2002 2004 2006 2008 2010 2012 2014 -0,5 5 November 2015 Fall 06 Fall

Sampling CS 6965 Fall 2011 Creative Program 3 CS 6965 Fall 2011 2 CS 6965 Fall 2011 3 CS

in the CDHS 150 150 (50 Annually 25 Summer) 9 th Grade High School Students 6 6 High School

US 150 Eastbound (McClugage Bridge) over the Illinois River PRE-BID MEETING

Building on 150 years CSR 1855-2005 CELEBRATING 150 YEARS CSR Limited Results Presentation Half

150 Proportion of Users 100 50 0 0 1000 2000 3000 4000 Duration of User Session 150

! e Picha Project ENJOY MEALS , EMPOWER LIVES picha from Myanmar 150,000 registered refugees

FALL PLANNING BELLEVUE SCHOOL DISTRICT Fall Planning 2020 STEERING COMMITTEE July 29, 2020

EQUATION OF FREE FALL Chapter 2 = Free Fall v = u - gt Chapter 2 = Free Fall v = u - gt

CS 251 Fall 2019 CS 251 Fall 2019 CS 251 Fall 2019 CS 251 Fall 2019 Principles of

AVL Trees All keys in left subtree smaller than nodes key 2 6 10 12 All keys in

Some Key Questions on the Nature of Time Antony Galton Department of Computer Science,

Generalized Effective Reducibility Merlin Carl Europa-Universit at Flensburg Generalized

Far beyond Goodmans Theorem? Michael Rathjen University of Leeds Proof Theory Virtual Seminar

Chris Wyatt Electrical and Computer Engineering Virginia Tech The average complexity (number of

Week 15 -Wednesday What did we talk about last time? Review up to Exam 1 Lab hours

Digital Logic Design: a rigorous approach c Chapter 12: Trees Guy Even Moti Medina School

Advanced Data Structures Lecturer: Shi Li Department of Computer Science and Engineering

Sambuz

Useful Links

Newsletter

Mail Us