
Algorithms and Data Structures thread
Taught by Kyriakos Kalorkoti (lecture slides)



Inf 2B: Introduction to Algorithms
Lecture 1 of the ADS thread

Taught by Kyriakos Kalorkoti (KK), IF5.26, kk@inf.ed.ac.uk
School of Informatics, University of Edinburgh

Topics:

1. Algorithms, analysing algorithms, asymptotic notation (for talking about running times), sequential data structures, tree data structures, hashing, priority queues, advanced sorting.
2. Algorithms for searching graphs, applications to graph problems.
3. Algorithms for the WWW: indexing, searching.

Textbooks (recommended, not required)

- [GT] Data Structures and Algorithms in Java, by Goodrich & Tamassia (4th or 3rd ed.), Wiley. A gentle textbook, best for this course (it doesn't cover the WWW material). Uses Java.
- [CLRS] Introduction to Algorithms, by Cormen, Leiserson, Rivest & Stein, MIT Press. Lots of algorithms and data structures; technical; no Java (or any other programming language). Course text for the 3rd-year Algorithms and Data Structures course.

If you will not take 3rd-year ADS, choose [GT], but don't rush out to buy a book straight away.

Study advice

1. Education is done with you, not to you.
2. You are here because you want to learn the subject.
3. The course consists of:
   - Lectures.
   - Tutorials.
   - Practical work (2 assignments only, 1 for each thread).
   - Private study.

Deciding not to take an active part in all of these is deciding to underperform at best and fail at worst. It is not possible to coast along and revise just before the exams (unless failure seems like a good idea).

My promise: if you ask for help I will do my utmost to provide it. But please use the channels above first when appropriate.

Questions from you: strongly encouraged, during lectures, after lectures or by email.

Finally:

- Lectures start at 4.10; keep an eye on the clock and wind down any conversation.
- In lectures either I talk or you talk, but not both!
- Laptops, tablets and phones should be put away (unless a medical condition requires the use of an aid).
- If you have any special needs that require my cooperation, please speak to me.

Our ingredients

- Algorithms: a step-by-step procedure (a "recipe") for performing a task.
- Data structures: a systematic way of organising data and making it accessible in certain ways.

We are interested in the design and analysis of "good" algorithms and data structures. Think about very large systems and the need to have them work within acceptable time.

What you have probably seen already

- Data structures: arrays, linked lists, stacks, trees.
- Algorithm design principles: recursive algorithms.
- Searching and sorting algorithms: linear search and binary search; insertion sort, selection sort.

Other prerequisites:

- The ability to reason mathematically and spot a bad argument from a mile off.
- The ability to write down a mathematical argument fluently; it should be a pleasure to read.
- See Note 1 for advice on setting out mathematical reasoning.

Evaluating algorithms

- Correctness.
- Efficiency with respect to:
  - running time,
  - space (= amount of memory used),
  - network traffic,
  - number of times secondary storage is accessed.
- Simplicity.

Measuring running time

The running time of a program depends on a number of factors, such as:

1. The input.
2. The running time of the algorithm.
3. The quality of the implementation and the quality of the code generated by the compiler.
4. The machine used to execute the program.

We will rarely be concerned with the implementation quality, the code quality or the machine.

Example 1: Linear search in Java

    public static int linSearch(int[] A, int k) {
        for (int i = 0; i < A.length; i++)
            if (A[i] == k)
                return i;
        return -1;
    }

This is Java.

- We want to ignore implementation details, so we map this code to pseudocode. (In reality things are the other way round: the algorithm comes first and the Java is one implementation of it.)
- A given algorithm can be implemented by many different programs (indeed in many different languages).

Linear search in pseudocode

Input: integer array A, integer k being searched for.
Output: the least index i such that A[i] = k; otherwise −1.

    Algorithm linSearch(A, k)
    1. for i ← 0 to A.length − 1 do
    2.     if A[i] = k then
    3.         return i
    4. return −1

Note: suppose A = ⟨19, 5, 6, 77, 2, 1, 90, 3, 4, 22, 1, 5, 6⟩ and k = 1. What happens?

Worst-case running time

Assign a size to each possible input.

Definition. The (worst-case) running time of an algorithm A is the function T_A : N → N where T_A(n) is the maximum number of computation steps performed by A on an input of size n.

Example: linSearch.

- Suppose the size is the length of the array A.
- The worst-case running time is then a linear function of the size.
- There is an implicit assumption that array entries are of bounded size.
- Otherwise we could take the sum of the sizes of all array entries (plus the size of k) as the measure of input size.
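To answer the "What happens?" question above: linSearch returns 5, the least index holding the value 1, even though 1 occurs again at index 10. A minimal runnable sketch (the wrapper class and the second test value are mine, not from the slides):

```java
public class LinSearchDemo {
    // Linear search as in the lecture: returns the least index i
    // with A[i] == k, or -1 if k does not occur in A.
    static int linSearch(int[] A, int k) {
        for (int i = 0; i < A.length; i++)
            if (A[i] == k)
                return i;
        return -1;
    }

    public static void main(String[] args) {
        int[] A = {19, 5, 6, 77, 2, 1, 90, 3, 4, 22, 1, 5, 6};
        System.out.println(linSearch(A, 1));   // prints 5: first occurrence of 1
        System.out.println(linSearch(A, 42));  // prints -1: key is absent
    }
}
```

The second call illustrates the worst case of the running-time analysis: when k is absent, the loop inspects all entries before returning −1.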

Average running time

In general the worst case seems overly pessimistic.

Definition. The average running time of an algorithm A is the function AVT_A : N → R where AVT_A(n) is the average number of computation steps performed by A on an input of size n.

Problems with average time:

- What precisely does "average" mean? What counts as an "average" input depends on the application.
- Average-time analysis is mathematically very difficult and often infeasible (it is OK for linSearch).

Analysis of algorithms

A nice approach would be to combine:

    Worst-case analysis + Experiments

We will aim for this, but

- Java's garbage collection hampers the quality of our experiments.

Example 2: Binary search

Input: integer array A in increasing order, integers i1, i2, k.
Output: an index i with i1 ≤ i ≤ i2 and A[i] = k, if such an i exists; −1 otherwise.

    Algorithm binarySearch(A, k, i1, i2)
    1. if i2 < i1 then return −1
    2. else
    3.     j ← ⌊(i1 + i2)/2⌋
    4.     if k = A[j] then
    5.         return j
    6.     else if k < A[j] then
    7.         return binarySearch(A, k, i1, j − 1)
    8.     else
    9.         return binarySearch(A, k, j + 1, i2)

Running time of binary search

Let n = i2 − i1 + 1 (the number of items in the region we search).

- Do at most a constant amount c of work. If k is found we are done; else recurse on a region of size about n/2.
- Do a constant amount c of work. If k is found we are done; else recurse on a region of size about n/2².
- ...
- Do a constant amount c of work. If k is found we are done; else recurse on a region of size about n/2^r.

Base case: n/2^r = 1, i.e., r = lg(n). Then one more call.

Total work done (time) is no more than c(lg(n) + 2).

Better than linSearch?

lg n versus n

Put m = lg n. By definition n = 2^m. Now:

    m → m + 1     n → 2n
    m → m + 5     n → 32n
    m → m + 10    n → 1024n
    m → m + c     n → 2^c · n

Compare T_linSearch(n) = 10n + 10 with T_binarySearch(n) = 1000 · lg(n) + 1000.
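The binarySearch pseudocode above translates almost line for line into Java. This is a sketch (the class name and example array are mine), keeping the convention that the region searched is A[i1..i2]:

```java
public class BinarySearchDemo {
    // Binary search following the lecture pseudocode: returns an index i
    // with i1 <= i <= i2 and A[i] == k if one exists, and -1 otherwise.
    // A must be in increasing order.
    static int binarySearch(int[] A, int k, int i1, int i2) {
        if (i2 < i1)
            return -1;
        int j = (i1 + i2) / 2;                      // floor((i1 + i2) / 2)
        if (k == A[j])
            return j;
        else if (k < A[j])
            return binarySearch(A, k, i1, j - 1);   // recurse on left region
        else
            return binarySearch(A, k, j + 1, i2);   // recurse on right region
    }

    public static void main(String[] args) {
        int[] A = {1, 2, 5, 6, 19, 77, 90};
        System.out.println(binarySearch(A, 19, 0, A.length - 1));  // prints 4
        System.out.println(binarySearch(A, 3, 0, A.length - 1));   // prints -1
    }
}
```

A side note on the midpoint: (i1 + i2) / 2 can overflow int for indices near Integer.MAX_VALUE; i1 + (i2 - i1) / 2 avoids this, but the simple form matches the pseudocode.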

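The illustrative running times T_linSearch(n) = 10n + 10 and T_binarySearch(n) = 1000 · lg(n) + 1000 can be tabulated to see where binary search overtakes linear search despite its much larger constants; the crossover falls between n = 1000 and n = 1500. This sketch uses the lecture's illustrative constants (step counts, not measured times), and the class and method names are mine:

```java
public class GrowthDemo {
    // Illustrative step counts from the lecture: linear search with small
    // constants beats binary search with large constants only for small n.
    static double linSearchTime(double n) { return 10 * n + 10; }

    static double binarySearchTime(double n) {
        return 1000 * (Math.log(n) / Math.log(2)) + 1000;  // 1000 * lg(n) + 1000
    }

    public static void main(String[] args) {
        for (int n : new int[]{100, 1000, 1500, 100000, 1000000}) {
            System.out.printf("n = %8d   linSearch = %10.0f   binarySearch = %7.0f%n",
                              n, linSearchTime(n), binarySearchTime(n));
        }
    }
}
```

At n = 100 linear search is far ahead (1,010 steps versus about 7,644); by n = 1,000,000 the positions are reversed (10,000,010 versus about 20,932).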
Some statistics

Measured in Jan 2008 on a DICE machine, with 200 repetitions for each size ("wc" = worst case, "avc" = average case):

    size      wc linS   avc linS   wc binS   avc binS
    10        ≤ 1 ms    ≤ 1 ms     ≤ 1 ms    ≤ 1 ms
    100       ≤ 1 ms    ≤ 1 ms     ≤ 1 ms    ≤ 1 ms
    1000      ≤ 1 ms    ≤ 1 ms     ≤ 1 ms    ≤ 1 ms
    10000     ≤ 1 ms    ≤ 1 ms     ≤ 1 ms    ≤ 1 ms
    100000    ≤ 1 ms    ≤ 1 ms     ≤ 1 ms    ≤ 1 ms
    200000    ≤ 1 ms    ≤ 1 ms     ≤ 1 ms    ≤ 1 ms
    400000    3 ms      ≤ 1 ms     ≤ 1 ms    ≤ 1 ms
    600000    3 ms      1.3 ms     ≤ 1 ms    ≤ 1 ms
    800000    3 ms      1.5 ms     ≤ 1 ms    ≤ 1 ms
    1000000   5 ms      2.1 ms     ≤ 1 ms    ≤ 1 ms
    2000000   7 ms      3.7 ms     ≤ 1 ms    ≤ 1 ms
    4000000   12 ms     6.9 ms     ≤ 1 ms    ≤ 1 ms
    6000000   24 ms     11.6 ms    ≤ 1 ms    ≤ 1 ms
    8000000   24 ms     15.6 ms    ≤ 1 ms    ≤ 1 ms

Why not just do experiments?

- Consider sorting arrays of the integers 1, 2, ..., 100 held in some order.
- Just take a 1% sample of all possible inputs.
- How many experiments is that? A 1% sample of the 100! orderings is 100!/100 = 99!:

      99! = 93326215443944152681699238856266700490715968264381
            62146859296389521759999322991560894146397615651828
            62536979208272237582511852109168640000000000000000
            000000

- Assume the algorithm can sort 10^50 instances per second(!). How long do we need to wait?

      99! / (60 × 60 × 24 × 366 × 10^50) ≈ 2.951269209 × 10^98 years.

Be seeing you!
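The closing arithmetic can be checked exactly with java.math.BigInteger. This sketch (class and method names are mine) recomputes the number of experiments and the waiting time in years exactly as set up on the slide:

```java
import java.math.BigInteger;

public class ExperimentCount {
    // Number of experiments: a 1% sample of the 100! orderings of
    // 1, 2, ..., 100, i.e. 100!/100 = 99!.
    static BigInteger factorial(int n) {
        BigInteger f = BigInteger.ONE;
        for (int i = 2; i <= n; i++)
            f = f.multiply(BigInteger.valueOf(i));
        return f;
    }

    // Whole years needed at 10^50 sorts per second, using the slide's
    // 60 * 60 * 24 * 366 seconds per (leap) year.
    static BigInteger yearsNeeded() {
        BigInteger sortsPerSecond = BigInteger.TEN.pow(50);
        BigInteger secondsPerYear = BigInteger.valueOf(60L * 60 * 24 * 366);
        return factorial(99).divide(sortsPerSecond).divide(secondsPerYear);
    }

    public static void main(String[] args) {
        String y = yearsNeeded().toString();
        // About 2.951269209 * 10^98 years, as on the slide.
        System.out.printf("about %s.%s * 10^%d years%n",
                          y.substring(0, 1), y.substring(1, 10), y.length() - 1);
    }
}
```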
