symbol comparisons in quicksort quickselect
play

Symbol Comparisons in QuickSort & QuickSelect I. Overview - PowerPoint PPT Presentation

The Number of Symbol Comparisons in QuickSort & QuickSelect I. Overview ~~~ Philippe Flajolet II. Average-Case Analysis ~~~ Brigitte Valle III. Distributions ~~~ Jim Fill Wednesday, June 17, 2009 1 1.


  1. The Number of Symbol Comparisons in QuickSort & QuickSelect I. Overview ~~~ Philippe Flajolet II. Average-Case Analysis ~~~ Brigitte Vallée III. Distributions ~~~ Jim Fill Wednesday, June 17, 2009 1

  2. • 1. Algorithms & analysis • 2. Cost measures • 3. Sources (data model) • 4. Results: average-case & distributional Wednesday, June 17, 2009 2

  3. 1.QuickSort & QuickSelect Wednesday, June 17, 2009 3

  4. P: pivot <P >P k-1 n-k Wednesday, June 17, 2009 4

  5. Analyses of QuickSort • Average-case : recurrences, then generating functions (GFs). Exchanges; Median-of-3, etc. • Variance: multivariate GFs • Distribution : MGFs & moments, Martingales, Contraction Hoare; Knuth; Sedgewick [1960-1975] Hennequin, Régnier, Rösler [1989+] Fill & Janson [2000], Martinez... Wednesday, June 17, 2009 5

  6. m m P: pivot <P >P m < k? m = k? m > k? Wednesday, June 17, 2009 6

  7. Various brands of QuickSelect: Wednesday, June 17, 2009 7

  8. Average-case analyses Knuth et al [ca 1970] Wednesday, June 17, 2009 8

  9. Distributional analyses • Quickselect: e.g., Dickman distribution Mahmoud-Modarres-Smythe, Grübel, Rösler, Hwang-Tsai, et al. perpetuities : 1+U 1 +U 1 U 2 +U 1 U 2 U 3 +... i.i.d. unif. [0,1] (fixed rank; fixed quantile) • Multiple Quickselect, ancestors, &c Lent-Mahmoud, Prodinger, et al. Wednesday, June 17, 2009 9

  10. 2. Cost measures Wednesday, June 17, 2009 10

  11. Sedgewick @ AofA-02(?): “actual complexity matters!” • So far: number of key-comparisons • But... keys are often “non-atomic” records! • And...need common information-theoretic basis, to compare with radix methods, hashing, etc. Wednesday, June 17, 2009 11

  12. Alphabet: Σ • Count all symbol comparisons in algorithms: • comparing u and v has cost 1 + coincidence ( u,v ). a b a b b b... a b a a b a... coincidence=3; #comparisons=4. ( γ ) ( β ) Wednesday, June 17, 2009 12

  13. A Binary Search Tree: symbol comparisons Wednesday, June 17, 2009 13

  14. Under a wide range of classical STRING (WORD) MODELS: It takes O( n .log n ) symbol comparisons to “distinguish” n elements --- in probability, on average With high probability, the common prefix of any two words has length at most O(log n ). Many many people in the audience... Wednesday, June 17, 2009 14

  15. TRIES • Bernoulli, Markov, etc. • Devroye’s density model • Vallée’s dynamic sources... S n =O(K n .log( n )) Upper bounds • Quicksort: O( n .(log n ) 2 ) • Quickselect: O( n .log n ) Wednesday, June 17, 2009 15

  16. Symbol comparisons • QuickSort: [Janson & Fill 2002] binary source + density model . ~C n .log( n ) 2 • QuickSelect: [Fill-Nakama 2007-9] binary source for QuickMin/Max & QuickRand ~C’. n CONSTANTS? (cf also: Panholzer & Prodinger) Wednesday, June 17, 2009 16

  17. 3. Sources “A source models the way data (symbols) are produced.” “La Source” by Ingres @ Musée d’Orsay Wednesday, June 17, 2009 17

  18. Axioms for SOURCES • Totally ordered alphabet (usually finite) ∑ • Fundamental probabilities (p w ) := the probability of starting with w • p w → 0 as |w| → ∞ • Keys are invariably i.i.d. [Later] + “regularity” conditions: tameness Wednesday, June 17, 2009 18

  19. 0 1 a b aa ab ba bb aba abb Property : The Source is parameterized by [0,1]: to an infinite word w, there corresponds α such that M( α )=w. Wednesday, June 17, 2009 19

  20. Notations: a w b w 1 0 p w- p w p w+ Pr(prefix<w) Pr(prefix=w) Pr(prefix>w) Fundamental constants of QuickStuffs will be all expressed in terms of fundamental probabilities Wednesday, June 17, 2009 20

  21. • Standard binary source (uniform: 1/2,1/2); Bernoulli sources such as 1/2, 1/6,1/3. • Density models: Standard binary source with density f(x) or c.d.f F(x). • Markov • Dynamical sources [Devroye 1986] [Vallée 2001; Clément-Fl-Vallée 2001] Wednesday, June 17, 2009 21

  22. Fundamental intervals & triangles 1/2 1/6 1/3 Wednesday, June 17, 2009 22

  23. 4. Results (Le Savant Cosinus) Wednesday, June 17, 2009 23

  24. Average-case 1 2 ➜ ➜ QuickMin, QuickRand QuickVal ☞ ☞ Wednesday, June 17, 2009 24

  25. QUICKVAL( α ): is dual to QuickSelect P: pivot <P >P v<P v=P v>P • QuickVal( n , α ) := rank of element whose parameter [corresponding to value v ] is α . • QuickVal( n , α ) behaves “almost” as QuickSelect( n α ). Wednesday, June 17, 2009 25

  26. Distribution Theorem : Assuming a suitable tameness condition, there exists a limiting distribution of the cost S n /n of QuickQuant( α ), which can be described explicitly Wednesday, June 17, 2009 26

Recommend


More recommend