beyond the cluster
play

BEYOND THE CLUSTER: WAN DATA REPLICATION WITH GRIDGAIN YAKOV - PowerPoint PPT Presentation

BEYOND THE CLUSTER: WAN DATA REPLICATION WITH GRIDGAIN YAKOV ZHDANOV WHO? Yakov Zhdanov: - GridGains Product Development VP - With GridGain since 2010 - Apache Ignite committer and PMC - Passion for performance & scalability -


  1. BEYOND THE CLUSTER: WAN DATA REPLICATION WITH GRIDGAIN YAKOV ZHDANOV

  2. WHO? Yakov Zhdanov: - GridGain’s Product Development VP - With GridGain since 2010 - Apache Ignite committer and PMC - Passion for performance & scalability - Finding ways to make product better - St. Petersburg, Russia

  3. PLAN 1)Why replicate?

  4. PLAN 1)Why replicate? 2)How do DBs solve this?

  5. PLAN 1)Why replicate? 2)How do DBs solve this? 3)Replication: Monolith vs Distributed 4)

  6. PLAN 1)Why replicate? 2)How do DBs solve this? 3)Replication: Monolith vs Distributed 4)GridGain DR overview – roles, features, process

  7. PLAN 1)Why replicate? 2)How do DBs solve this? 3)Replication: Monolith vs Distributed 4)GridGain DR overview – roles, features, process 5)Future plans – Sync/Async TX replication

  8. WHY REPLICATE YOUR DATA?

  9. WHY REPLICATE?  Data security

  10. WHY REPLICATE?  Data security  Failover

  11. WHY REPLICATE?  Data security  Failover  Data warehousing

  12. WHY REPLICATE?  Data security  Failover  Data warehousing  Load balancing

  13. WHY REPLICATE?  Data security  Failover  Data warehousing  Load balancing  Increasing system capacity

  14. POSTGRESQL REPLICATION  PostgreSQL is an object-relational database management system (ORDBMS)  Pioneered many things and concepts  High maturity level  Opensource and widely used

  15. POSTGRESQL REPLICATION  Shared disk storage https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  16. POSTGRESQL REPLICATION  Shared disk storage  File system replication https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  17. POSTGRESQL REPLICATION  Shared disk storage  File system replication  Write-Ahead Log Shipping https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  18. POSTGRESQL REPLICATION  Shared disk storage  File system replication  Write-Ahead Log Shipping  Logical Replication https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  19. POSTGRESQL REPLICATION  Shared disk storage  File system replication  Write-Ahead Log Shipping  Logical Replication  T rigger-Based Master-Standby Replication https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  20. POSTGRESQL REPLICATION  Shared disk storage  File system replication  Write-Ahead Log Shipping  Logical Replication  T rigger-Based Master-Standby Replication  Statement-Based Replication Middleware https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  21. POSTGRESQL REPLICATION  Shared disk storage  File system replication  Write-Ahead Log Shipping  Logical Replication  T rigger-Based Master-Standby Replication  Statement-Based Replication Middleware  Async Multimaster Replication https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  22. POSTGRESQL REPLICATION  Shared disk storage  File system replication  Write-Ahead Log Shipping  Logical Replication  T rigger-Based Master-Standby Replication  Statement-Based Replication Middleware  Async Multimaster Replication  Sync Multimaster Replication https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  23. REPLICATION IN DISTRIBUTED SYSTEMS Monolith Distributed Data security + ? Failover + + Load balancing + ?/+ Increasing system capacity + ? Data warehousing + ?

  24. REPLICATION IN GRIDGAIN  Introduced in 2012/2013  Completely new feature  Required a lot of engineering efgorts  Required revisiting of existing logic  Async KEY/VALUE mode available  Sync/Async TX replication under development https://docs.gridgain.com/docs/data-center- replication

  25. REPLICATION IN GRIDGAIN: ROLES • Sender cache • Sender hub • Receiver hub • Receiver cache https://docs.gridgain.com/docs/data-center- replication

  26. REPLICATION IN GRIDGAIN: FEATURES • Complex topologies (up to 32 datacenters) • Failover • Pluggable conflict resolution • Filtering • Pause/Resume • Full state transfer https://docs.gridgain.com/docs/data-center- replication

  27. REPLICATION IN GRIDGAIN: HOW IT WORKS

  28. REPLICATION IN GRIDGAIN: COMPLEX TOPOLOGIES

  29. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Node failure in sending topology

  30. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Node failure in sending topology

  31. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Sender hub(s) failure

  32. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Sender hub(s) failure

  33. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Receiver hub(s) failure

  34. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Receiver hub(s) failure

  35. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Node failure in receiving topology

  36. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Node failure in receiving topology

  37. REPLICATION IN GRIDGAIN: WHAT CAN BE BETTER? Batching on per-node basis vs per-partition basis Per-node batching Per-partition batching  More efficient from memory standpoint  No need for additional processing on receiving side  Batches collected quickly  Less contention – honors thread-per-partition model  Probably, higher GC pressure  But still expected to perform better

  38. REPLICATION IN GRIDGAIN: FUTURE PLANS TX Replication

  39. REPLICATION IN GRIDGAIN: FUTURE PLANS TX Replication – over stretched cluster

  40. REPLICATION IN GRIDGAIN: FUTURE PLANS TX replication mechanism

  41. REPLICATION IN GRIDGAIN: FUTURE PLANS TX Replication – sync modes  Strict SYNC mode Main primary node (PN) → Sender → Receiver → Stand-In PN →Receiver →Sender → Main PN  Merciful SYNC mode Main PN → Sender → Receiver → Receiver's WAL → Sender → Main PN  ASYNC mode Main PN->Sender->Sender WAL->Main PN

  42. LESSONS LEARNED  Is replication able to solve your problem?

  43. LESSONS LEARNED  Is replication able to solve your problem?  Pickup proper settings: sync/async, physical vs logical changes.

  44. LESSONS LEARNED  Is replication able to solve your problem?  Pickup proper settings: sync/async, physical vs logical changes.  Be aware of internals – know what makes it work.

  45. LESSONS LEARNED  Is replication able to solve your problem?  Pickup proper settings: sync/async, physical vs logical changes.  Be aware of internals – know what makes it work.  Make sure to test, tune and monitor.

  46. CONTACTS yzhdanov@gridgain.com http://ignite.apache.org dev@ignite.apache.org user@ignite.apache.org

  47. QUESTIONS? ANY QUESTIONS?

Recommend


More recommend