Hash Tables, Dictionaries, and the Art of O(1) Lookup n. a - PowerPoint PPT Presentation

Hash Tables, Dictionaries, and the Art of O(1) Lookup n. a presentation by Matt Zhang for Algorithm Group 1

Dictionary: (n) an unordered and mutable collection of items composed of (key, value) pairs. These slides are shamelessly ripped off from https://just-taking-a- ride.com/inside_python_dict/chapter1.html. Take a look, it's interactive! 2

A Python dictionary is a keyword- based data organization method. Bella = {"species":"dog", "age":1, "breed":"pit_bull", "weight":46} Keywords can be used to reference, add, remove, or retrieve data. Bella["species"] ➞ "dog" Bella["n_legs"] = 4 Bella["n_legs"] ➞ 4 Bella.pop("breed") 3

How do we make a database that is rapidly searchable via keyword? If you were stupid like me, this is how you would have done it: keys = ["species", "age", "breed", "weight"] values = ["dog", 1, "pit_bull", 46] O(n) search! def find(my_key): for i, key in enumerate(keys): if key == my_key: return values[i] 4

Hash Tables 5

A hash function maps data of any arbitrary size onto data of a fixed size. The value produced by a hash function is called the checksum. A hash function is not one-to-one. You may have "collisions", but the chances of two arbitrary pieces of data colliding when hashed is very low. Can be used for quickly comparing two pieces of data. 6

The Luhn algorithm is an example hash function for determining the validity of a credit card number. \ 7

Instead of looking through a list to find a key, we can convert the key to an index via hashing. Example: list_length = 10 key = "breed" hash(key) = -8837423875198100574 hash(key) % list_length = 6 8

Probing Functions 10

When we have a collision, we need to find an empty space in the list via a probing function. 11

Separate chaining is another commonly used probing method. 12

When performing linear probing, at each probe we need to check if the existing key == the new key. == ? 13

To prevent too much unnecessary probing, Python generally allocates a list with size 2x the number of keys when the hash table is created. 14

Since NONE is both hashable and a valid key, we need to create a special object to act as a null key. 15

Removing Items 16

Q: If we want to remove a key, can we just find its position in the hash table and set it to EMPTY? A: No way! Any key that probed past it could not be found if we did this! 17

13 44 If key 18, 13, or 59 were deleted, we would not be able to find key 44 again! 18

To safely remove a key, replace it with a DUMMY object. 44 DUMMY 19

Resizing the Table 20

After a while, the dictionary can start getting pretty full. When the "load factor" reaches 66%, Python creates a new, larger table and enters the keys and values from the old table. DUMMY keys can be discarded when this happens. 21

To prevent resizing too often, we make the new table have 2x as much space as necessary. 22

More Tricks 23

When inserting a key, if we find that it doesn't already exist in the table, we can "recycle" the first DUMMY key that it passed over. DUMMY 44 not in table, so it can be inserted here. 24

The actual probing function in Python is not linear. Rather, it jumps around in order to prevent repeated lookups from related clusters of keys. 25

If a table is small, Python will quadruple its size when resizing rather than doubling. 26

Example Problem 27

Using a sliding window approach, we can look at the substring from index i to j. asdfqwedsasdwr Deciding whether character j+1 is already in the window makes the problem O(n^2) if we check the naive way. Using a hash table makes this O(n). 29

Hash Tables, Dictionaries, and the Art of O(1) Lookup n. a - PowerPoint PPT Presentation

Hash Tables, Dictionaries, and the Art of O(1) Lookup n. a presentation by Matt Zhang for Algorithm Group 1 Dictionary: (n) an unordered and mutable collection of items composed of (key, value) pairs. These slides are shamelessly ripped off

Hash Functions and Hash Tables (2.5.2) A hash function h maps keys of a given type to

CS1100: Computer Science and Its Applications Table Lookup and Error Processing Created By

CS 758/858: Algorithms http://www.cs.unh.edu/~ruml/cs758 Searching Hash Tables Hash Functions

Working with Hash Tables Daniel Petrolito (ANZ Bank) Working With Hash Tables Daniel SAS

Dictionaries and Hash Tables 0 1 025-612-0001 2 981-101-0002 3 4 451-229-0004

Lecture 8: Dictionaries and Hash Tables Instructor: Saravanan Thirumuruganathan CSE 5311

Datastructures 1 Hash Tables Red Black Trees Week 8 Objectives Hash Tables, Hashing

Hash Tables 1 / 91 Hash Tables Administrivia Assignment 2 has been released. We will be

HISTORY ART Pre- Historic Art Egyptian Art Greek Art Roman Art Byzantine Art Medieval Art

HISTORY ART Pre- Historic Art Egyptian Art Greek Art Roman Art Byzantine Art Medieval Art

Hash Functions in Action Hash Functions in Action Lecture 12 Hash Functions Hash Functions

Hash Functions in Action Hash Functions in Action Lecture 11 Hash Functions Hash Functions

Hash tables Hash functions Open addressing March 09, 2020 Cinda Heeren / Andy Roth / Geoffrey

dictionaries (aka hash tables or hash maps) Genome 559: Introduction to Statistical and

dictionaries (aka hash tables or hash maps) Genome 559: Introduction to Statistical and

Hash Tables Direct-Address Tables Hash Functions Universal Hashing Chaining Open Addressing

trts r r srt trr

Techniques in Cartographic Design SynerGIS Ithaca, NY April 28, 2015 Elaine At least

HAPPY SABBATH ! ...exercise yourself toward godliness. For bodily exercise profits a little,

W HO I S O LD T ESTAMENT N EW T ESTAMENT Gospels Genesis Ecclesiastes Matthew 1 Timothy

Functional Programming is Everywhere Ilya Sergey ilyasergey.net PLMW @ ICFP 2019 2

Static Analysis: Overview, Syntactic Analysis and Abstract Interpretation TDDC90: Software

Sorting with Pop Stacks Stack sorting Pop stack sorting 1-pop-stack sortability 2-pop-stack

PREVENIRE LE EMISSIONI DI METANO DALLA FILIERA DI OIL&GAS Ben Ratner 4 December 2019 XI

Hash Tables, Dictionaries, and the Art of O(1) Lookup n. a - PowerPoint PPT Presentation

Hash Tables, Dictionaries, and the Art of O(1) Lookup n. a presentation by Matt Zhang for Algorithm Group 1 Dictionary: (n) an unordered and mutable collection of items composed of (key, value) pairs. These slides are shamelessly ripped off

Hash Functions and Hash Tables (2.5.2) A hash function h maps keys of a given type to

CS1100: Computer Science and Its Applications Table Lookup and Error Processing Created By

CS 758/858: Algorithms http://www.cs.unh.edu/~ruml/cs758 Searching Hash Tables Hash Functions

Working with Hash Tables Daniel Petrolito (ANZ Bank) Working With Hash Tables Daniel SAS

Dictionaries and Hash Tables 0 1 025-612-0001 2 981-101-0002 3 4 451-229-0004

Lecture 8: Dictionaries and Hash Tables Instructor: Saravanan Thirumuruganathan CSE 5311

Datastructures 1 Hash Tables Red Black Trees Week 8 Objectives Hash Tables, Hashing

Hash Tables 1 / 91 Hash Tables Administrivia Assignment 2 has been released. We will be

HISTORY ART Pre- Historic Art Egyptian Art Greek Art Roman Art Byzantine Art Medieval Art

HISTORY ART Pre- Historic Art Egyptian Art Greek Art Roman Art Byzantine Art Medieval Art

Hash Functions in Action Hash Functions in Action Lecture 12 Hash Functions Hash Functions

Hash Functions in Action Hash Functions in Action Lecture 11 Hash Functions Hash Functions

Hash tables Hash functions Open addressing March 09, 2020 Cinda Heeren / Andy Roth / Geoffrey

dictionaries (aka hash tables or hash maps) Genome 559: Introduction to Statistical and

dictionaries (aka hash tables or hash maps) Genome 559: Introduction to Statistical and

Hash Tables Direct-Address Tables Hash Functions Universal Hashing Chaining Open Addressing

trts r r srt trr

Techniques in Cartographic Design SynerGIS Ithaca, NY April 28, 2015 Elaine At least

HAPPY SABBATH ! ...exercise yourself toward godliness. For bodily exercise profits a little,

W HO I S O LD T ESTAMENT N EW T ESTAMENT Gospels Genesis Ecclesiastes Matthew 1 Timothy

Functional Programming is Everywhere Ilya Sergey ilyasergey.net PLMW @ ICFP 2019 2

Static Analysis: Overview, Syntactic Analysis and Abstract Interpretation TDDC90: Software

Sorting with Pop Stacks Stack sorting Pop stack sorting 1-pop-stack sortability 2-pop-stack

PREVENIRE LE EMISSIONI DI METANO DALLA FILIERA DI OIL&amp;GAS Ben Ratner 4 December 2019 XI

PREVENIRE LE EMISSIONI DI METANO DALLA FILIERA DI OIL&GAS Ben Ratner 4 December 2019 XI