peloton
play

PELOTON THE SELF-DRIVING DBMS 2008 5,000 txn/sec H-Store: A - PowerPoint PPT Presentation

PELOTON THE SELF-DRIVING DBMS 2008 5,000 txn/sec H-Store: A High-Performance, Distributed Main Memory Transaction Processing System VLDB 2008 2008 5,000 txn/sec 2010 11,000 txn/sec On Predictive Modeling for Optimizing Transaction


  1. PELOTON THE SELF-DRIVING DBMS

  2. 2008 5,000 txn/sec H-Store: A High-Performance, Distributed Main Memory Transaction Processing System VLDB 2008

  3. 2008 5,000 txn/sec 2010 11,000 txn/sec On Predictive Modeling for Optimizing Transaction Execution in Parallel OLTP Systems VLDB 2011

  4. 2008 5,000 txn/sec 2010 11,000 txn/sec 2012 50,000 txn/sec Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems SIGMOD 2012

  5. 2008 5,000 txn/sec 2010 11,000 txn/sec 2012 50,000 txn/sec 2015 TicToc: Time Traveling Optimistic Concurrency Control 4,000,000 txn/sec SIGMOD 2016

  6. ONLY MAXIMIZING OLTP Throughput Leads to an Unfulfilling life.

  7. AVERAGE SALARY FOR Database ADMINS IN 2015 was $8 $81, 1,710. Source: Bureau of Labor Statistics

  8. Self-Driving A DBMS THAT can configure, tune, and optimize itself without any human intervention.

  9. YES NO Database Design Security & ACL S Data Placement Data Integration Query Optimization UNPLANNED HALTS Knob Configuration Version Control Back-up & Recovery Provisioning

  10. What’s New? Previous EFFORTS are reactive & human-driven. A self-driving Dbms has to be predictive.

  11. Why Now? Recent advancements in hardware and deep neural networks make autonomous operation now possible.

  12. In-MEMORY OLTP+OLAP LLVM EXEC Autonomous

  13. The Brain Integrated Deep Learning FRAMEWORK to model, predict, and optimize HTAP Database workloads. Self-Driving Database Management Systems CIDR 2017

  14. Workload Categorization 2 4 H r s Unsupervised 7 d a y s ... 4 0 d a y s

  15. Workload Workload Forecasting Categorization 2 4 H r s Unsupervised 7 d a y s ... 4 0 d a y s Long short-term Memory

  16. Workload Workload Optimization Forecasting Categorization Planning Unsupervised ...

  17. Workload Workload Optimization Forecasting Categorization Planning Unsupervised ... Catalog Benefit

  18. Evaluation Synthetic workload based on Reddit Traffic Data. Forecast with Tensorflow. Adaptive storage.

  19. Error Rate: 14.7% CPU Training: 25min 1min intervals Probe: 2MS Update: 5ms Size: 2MB Error Rate: 17.9% CPU Training: 18min 1hr intervals Probe: 2ms Update: 5MS Size: 2MB

  20. Adaptive Storage Change the layout of data over time based on how it is accessed. Bridging the Archipelago Between Row-Stores and Column-Stores for Hybrid Workloads SIGMOD 2016

  21. UPDATE myTable SET A = 123, B = 456, A B C D C = 789 WHERE D = “xxx” Hot SELECT AVG (B) FROM myTable WHERE C < “yyy” Cold

  22. UPDATE myTable SET A = 123, B = 456, A B C D A B C D C = 789 WHERE D = “xxx” Hot A B C D SELECT AVG (B) FROM myTable WHERE C < “yyy” Cold

  23. Row Layout Column Layout Adaptive Layout 1600 Execution Time (ms) 1200 800 400 0 Scan Insert Scan Insert Scan Insert Scan Insert Scan Insert Scan Insert Sep-15 Sep-16 Sep-17 Sep-18 Sep-19 Sep-20

  24. Current Status Single-node Only. Looking for real-world Deployments to test. Apache 2.0 License

  25. http:/ /pelotondb.org

  26. END

Recommend


More recommend