Theory and Implementation of Dynamic Data Structures for the GPU
John Owens (UC Davis) and Martín Farach-Colton (Rutgers)
NVIDIA OptiX & the BVH
Tero Karras. Maximizing parallelism in the construction of BVHs, octrees, and k-d trees. In High-Performance Graphics, HPG ’12, pages 33–37, June 2012.
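Karras’s method assigns each primitive a Morton code, sorts primitives by that code, and then builds the tree hierarchy fully in parallel. As a point of reference, here is a sketch of the standard 30-bit Morton encoding, written as host-side C++ rather than the paper’s CUDA kernels; `expandBits` and `morton3D` are the conventional names for these helpers, not identifiers from the paper.

```cpp
#include <algorithm>
#include <cstdint>

// Expands a 10-bit integer into 30 bits by inserting two zeros
// between each pair of adjacent bits (bit-interleaving helper).
uint32_t expandBits(uint32_t v) {
    v = (v * 0x00010001u) & 0xFF0000FFu;
    v = (v * 0x00000101u) & 0x0F00F00Fu;
    v = (v * 0x00000011u) & 0xC30C30C3u;
    v = (v * 0x00000005u) & 0x49249249u;
    return v;
}

// 30-bit Morton code for a point in the unit cube [0,1]^3:
// quantize each coordinate to 10 bits, then interleave x, y, z.
uint32_t morton3D(float x, float y, float z) {
    auto quantize = [](float c) {
        return static_cast<uint32_t>(
            std::min(std::max(c * 1024.0f, 0.0f), 1023.0f));
    };
    return (expandBits(quantize(x)) << 2) |
           (expandBits(quantize(y)) << 1) |
            expandBits(quantize(z));
}
```

Sorting primitives by these codes lays them out along a Z-order curve, which is what lets every internal node of the hierarchy be constructed independently in parallel.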
The problem
• Many data structures are built on the CPU and used on the GPU
• Very few data structures can be built on the GPU:
  • Sorted array
  • (Cuckoo) hash table
  • Several application-specific data structures (e.g., BVH tree)
• No data structures can be updated on the GPU
Scale of updates
• Update 1–few items
  • Fall back to serial case: slow, probably don’t care
• Update a very large number of items
  • Rebuild the whole data structure from scratch
• Middle ground: our goal
  • Questions: how and when?
Approach
• Pick data structures useful in the serial case, try to find parallelizations?
• Pick what look like parallel-friendly data structures with parallel-friendly updates?
Log-structured merge tree
• Supports dictionary and range queries
• log n sorted levels, each level 2x the size of the last
• Insert into a filled level results in a merge, possibly cascaded (see the sketch below)
• Operations are coarse (threads cooperate)
[Figure: LSM levels 2, 1, 0, shown at three points during a cascaded merge]
Michael A. Bender, Martin Farach-Colton, Jeremy T. Fineman, Yonatan R. Fogel, Bradley C. Kuszmaul, and Jelani Nelson. 2007. Cache-Oblivious Streaming B-trees. In Proceedings of the Nineteenth Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA ’07), 81–92.
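A minimal serial sketch of the cascaded-merge insert, assuming the batch size B equals the level-0 capacity so level i is either empty or holds exactly B·2^i sorted keys. The names (`LsmSketch`, `insertBatch`) are illustrative, not the authors’ GPU code, and on the GPU each `std::merge` would be a cooperative parallel merge.

```cpp
#include <algorithm>
#include <iterator>
#include <vector>

struct LsmSketch {
    std::size_t B;                          // batch size = capacity of level 0
    std::vector<std::vector<int>> levels;   // levels[i]: sorted, size 0 or B<<i

    // Insert one batch of exactly B keys.
    void insertBatch(std::vector<int> batch) {
        std::sort(batch.begin(), batch.end());
        std::size_t i = 0;
        // Cascade: merge the incoming run with each full level until
        // an empty level can absorb the merged run.
        while (i < levels.size() && !levels[i].empty()) {
            std::vector<int> merged;
            merged.reserve(batch.size() + levels[i].size());
            std::merge(batch.begin(), batch.end(),
                       levels[i].begin(), levels[i].end(),
                       std::back_inserter(merged));
            levels[i].clear();
            batch = std::move(merged);      // run doubles; try the next level
            ++i;
        }
        if (i == levels.size()) levels.emplace_back();
        levels[i] = std::move(batch);
    }
};
```

The doubling invariant is what keeps merges coarse: a run arriving at an empty level i has exactly B·2^i elements, so every merge is between two equal-sized sorted runs.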
LSM results/questions
• Update rate of 225M elements/s
• 13.5x faster than merging with a sorted array
• Lookups: 7.5x/1.75x slower than hash table/sorted array
• Deletes using tombstones (see the sketch below)
• Semantics for parallel insert/delete operations?
• Minimum batch size?
• Atom size for searching?
• Fractional cascading?
Saman Ashkiani, Shengren Li, Martin Farach-Colton, Nina Amenta, and John D. Owens. GPU COLA: A Dynamic Dictionary Data Structure for the GPU. January 2017. Unpublished.
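One plausible reading of the tombstone scheme, as a serial sketch rather than the authors’ code: a delete inserts a tombstone record for the key, and a lookup walks levels from newest to oldest so the most recent insert or delete wins. `Entry` and `lookup` are hypothetical names, and each level is assumed to hold at most one entry per key.

```cpp
#include <algorithm>
#include <cstdint>
#include <optional>
#include <vector>

// A delete is just an insert of a tombstone record.
struct Entry { uint32_t key; uint32_t value; bool tombstone; };

// Levels ordered newest first, each sorted by key. The first level
// containing the key decides the answer; a tombstone means "deleted".
std::optional<uint32_t> lookup(const std::vector<std::vector<Entry>>& levels,
                               uint32_t key) {
    for (const auto& level : levels) {
        // On the GPU this per-level probe would be a cooperative search.
        auto it = std::lower_bound(
            level.begin(), level.end(), key,
            [](const Entry& e, uint32_t k) { return e.key < k; });
        if (it != level.end() && it->key == key)
            return it->tombstone ? std::nullopt
                                 : std::optional<uint32_t>(it->value);
    }
    return std::nullopt;  // key never inserted
}
```

This keeps deletes as cheap as inserts; the tombstones are physically reclaimed only when the levels containing them are merged away.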
Quotient Filter
• Probabilistic membership queries & lookups: false positives are possible
• Comparable to a Bloom filter, but also supports deletes and merges
[Figure: a 10-slot quotient filter; each slot stores a remainder plus is_occupied, is_continuation, and is_shifted metadata bits, which group shifted remainders into runs and clusters (see the lookup sketch below)]
Michael A. Bender, Martin Farach-Colton, Rob Johnson, Russell Kraner, Bradley C. Kuszmaul, Dzejla Medjedovic, Pablo Montes, Pradeep Shetty, Richard P. Spillane, and Erez Zadok. 2012. Don’t Thrash: How to Cache Your Hash on Flash. Proceedings of the VLDB Endowment 5, 11 (Aug. 2012), 1627–1637.
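For concreteness, a single-threaded sketch of the canonical quotient-filter lookup from the Bender et al. paper, not the GPU implementation: split the fingerprint into a quotient (home slot) and remainder, walk left to the start of the cluster, skip earlier runs, then scan this quotient’s run. Table wrap-around is ignored for brevity.

```cpp
#include <cstdint>
#include <vector>

// One slot of the filter: an r-bit remainder plus three metadata bits.
struct Slot {
    uint32_t remainder;
    bool is_occupied;      // some fingerprint's home (quotient) is this slot
    bool is_continuation;  // this slot continues the run started to its left
    bool is_shifted;       // remainder is stored right of its home slot
};

// False positives possible, false negatives not.
bool mayContain(const std::vector<Slot>& t, uint32_t fp, unsigned r) {
    std::size_t fq = fp >> r;                 // quotient: home slot index
    uint32_t    fr = fp & ((1u << r) - 1u);   // remainder: what is stored
    if (!t[fq].is_occupied) return false;     // no run exists for fq

    // Walk left to the start of the cluster (first unshifted slot).
    std::size_t b = fq;
    while (t[b].is_shifted) --b;

    // Advance s one run at a time; each occupied home slot between the
    // cluster start and fq is one earlier run to skip past.
    std::size_t s = b;
    while (b != fq) {
        do { ++s; } while (t[s].is_continuation);  // skip one whole run
        do { ++b; } while (!t[b].is_occupied);     // next quotient with a run
    }
    // s now points at the first remainder of fq's run; scan the run.
    do {
        if (t[s].remainder == fr) return true;
        ++s;
    } while (s < t.size() && t[s].is_continuation);
    return false;
}
```

The metadata bits make the layout self-describing, which is also what makes bulk build amenable to a scan: where each remainder lands depends on how far its predecessors were shifted.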
QF results/questions
• Lookup perf. for point queries: 3.8–4.9x vs. BloomGPU
• Bulk build perf.: 2.4–2.7x vs. BloomGPU
• Insertion is significantly faster for BloomGPU
• Similar memory footprint
• 3 novel implementations of bulk build + 1 of insert
• Bulk build == non-associative scan
• Limited to byte granularity
Afton Geil, Martin Farach-Colton, and John D. Owens. GPU Quotient Filters: Approximate Membership Queries on the GPU. January 2017. Unpublished.
Cross-cutting issues
• Useful models for GPU memory hierarchy
• Independent threads vs. cooperative threads?
  • More broadly, what’s the right work granularity?
• Memory allocation (& impact on hardware)
• Cleanup operations, and programming model implications
• Integration into higher-level programming environments
• Use cases! Chicken & egg problem