MySQL High Availability Solutions Alex Poritskiy Percona
The Five 9s of Availability
[Chart: each additional nine of availability cuts yearly downtime roughly tenfold: about 35 days for an unmanaged system, 4 days, 8 hours for a well-managed one, 50 minutes, and 5 minutes at 99.999%. The higher tiers are reached by covering power, network, hardware and software failures, maintenance operations and hardware/software upgrades with replication technologies, clustering technologies, geographical redundancy and clustering for disasters. Typical adopters range from mainstream business, ISPs and small data centers through banking and medical to telco and military/defense.]
MySQL High Availability Solutions
● MySQL Replication
● MHA / Orchestrator (by GitHub) and MySQL Replication
● Percona XtraDB Cluster/Galera
● MySQL InnoDB Cluster (Group Replication)
● MySQL NDB Cluster
MySQL Replication: 99.9%
MySQL Replication Topologies
● Master > Slave
● Master > Slaves
● Master > Slave > Slaves
● Masters > Slave (Multi-Source)
● Circular (Multi-Master)
● Master <> Master (Multi-Master)
Inside MySQL Replication
[Diagram: web/app servers send writes and reads to the MySQL master (mysqld), which records changes in its binlogs plus the binlog index. On the MySQL slave, the I/O thread pulls binlog events into the relay log and the SQL thread applies them to the slave's data.]
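A minimal sketch of wiring up the slave in the diagram above, assuming binary logging is already enabled on the master and a replication account named repl exists (host name, credentials and binlog coordinates are placeholders):

    -- On the slave: point at the master's current binlog coordinates.
    CHANGE MASTER TO
        MASTER_HOST = 'master.example.com',
        MASTER_USER = 'repl',
        MASTER_PASSWORD = '***',
        MASTER_LOG_FILE = 'mysql-bin.000001',
        MASTER_LOG_POS  = 4;
    START SLAVE;

    -- Verify that both the I/O and SQL threads are running and watch for errors.
    SHOW SLAVE STATUS\G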
MySQL Replication (Scale-Out)
[Diagram: web/app servers send writes and reads to the master server; reads can also be served by the slave server, which replicates from the master (binlog index & rotation) and can be used for backups.]
Possible slave roles:
● Fail-over server
● Used for performing backups
● Read load balancing
● Additional slaves allow scale-out
MySQL Replication (Fail-Over)
[Diagram: the master server fails; replication stops, and writes and reads from the web/app servers are manually failed over to the slave server, which becomes the new master.]
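A hedged sketch of the manual promotion step, assuming a single healthy slave that has finished applying its relay log:

    -- On the slave being promoted, once SHOW SLAVE STATUS shows the
    -- relay log fully applied:
    STOP SLAVE;
    RESET SLAVE ALL;               -- forget the old master
    SET GLOBAL read_only = OFF;    -- allow application writes
    -- The application or load balancer must then be repointed at this
    -- server; any other slaves would be re-pointed with CHANGE MASTER TO.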
MySQL Replication Pros
● Mature, standard feature of MySQL.
● Compatible with many topologies.
● Supports multi-datacenter “clusters”.
● Easy to configure.
● Continues to gain new features (see the sketch below):
  ▪ Semi-synchronous replication
  ▪ Row-based binary logs
  ▪ Parallel SQL threads
  ▪ Global Transaction ID (GTID)
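A hedged sketch of enabling the features listed above on a recent (5.7/8.0) master; the option names are the stock server variables, the values are illustrative:

    -- Row-based binary logging (master):
    SET GLOBAL binlog_format = 'ROW';

    -- GTIDs: gtid_mode has to be walked up one step at a time.
    SET GLOBAL enforce_gtid_consistency = ON;
    SET GLOBAL gtid_mode = OFF_PERMISSIVE;
    SET GLOBAL gtid_mode = ON_PERMISSIVE;
    SET GLOBAL gtid_mode = ON;

    -- Semi-synchronous replication (master side; slaves load rpl_semi_sync_slave similarly):
    INSTALL PLUGIN rpl_semi_sync_master SONAME 'semisync_master.so';
    SET GLOBAL rpl_semi_sync_master_enabled = ON;

    -- Parallel SQL (applier) threads on the slave:
    SET GLOBAL slave_parallel_type = 'LOGICAL_CLOCK';
    SET GLOBAL slave_parallel_workers = 4;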
MySQL Replication Cons
● Slave lag (see the checks sketched below).
  ▪ Use row-based binary logs and other strategies.
● Slave drift.
  ▪ Use read-only, pt-table-checksum, pt-table-sync.
● No automatic failover.
  ▪ Use haproxy and/or keepalived or other solutions.
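A sketch of the routine checks these mitigations imply; pt-table-checksum itself runs from the shell, so only the server-side pieces are shown, and the percona.checksums table assumes the tool's default --replicate setting:

    -- How far behind is this slave? (NULL means replication is broken.)
    SHOW SLAVE STATUS\G      -- look at Seconds_Behind_Master and Last_SQL_Error

    -- Guard against drift by refusing ordinary writes on slaves:
    SET GLOBAL read_only = ON;
    SET GLOBAL super_read_only = ON;   -- 5.7+: also blocks SUPER users

    -- Inspect pt-table-checksum results for chunks that differ from the master:
    SELECT db, tbl, chunk, this_crc, master_crc
    FROM percona.checksums
    WHERE master_cnt <> this_cnt OR master_crc <> this_crc;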
Topology managers + MySQL replication: 99.99%
MHA toolkit
[Diagram slides.]
MHA toolkit
● Note that MHA never rebuilds an existing multi-master replication link; it always reconfigures the topology as master-slave.
● MHA only takes care of the MySQL failover, but custom code can be added to make the application and any load balancers aware of the change.
MHA toolkit: Pros
● Uses traditional MySQL replication.
● Easy to deploy for an existing replication group.
● Re-points slaves to the correct binlog position.
● Strong command-line support.
● Very good point-in-time recovery via binary logs.
MHA toolkit: Cons
● Disconnects after failover.
● No GUI.
● Not actively updated; long-term stable release.
● Doesn't crawl and auto-discover the topology.
● Cannot manage PXC, co-masters, or active/active setups.
● Failover isn't guaranteed to be instant; scanning the binary logs can cause delays.
● Still asynchronous replication (consistency may be compromised).
● The monitor can be a single point of failure (the monitor is only needed for failover).
Orchestrator by GitHub
● Orchestrator crawls through your replication topology and updates its GUI based on your current topology and health.
● Orchestrator also knows about binary log positions, GTID, and Binlog Servers.
● Refactoring replication topologies can be a matter of dragging and dropping a replica under another master.
Orchestrator by GitHub
[Diagram slide.]
Orchestrator: Pros
● Integrated into PMM.
● GUI in addition to the command line.
● Various options for managing replication (GTID, pseudo-GTID, etc.).
● Built for GTID.
● Can manage backups and other tasks.
● Currently in development with a dedicated team.
● Auto-discovers the topology.
● Works with co-masters.
Orchestrator: Cons
● Does not recognise PXC nodes as part of a single cluster (sees them as separate masters).
● Point-in-time recovery relies on good slaves; it doesn't do binary log discovery.
PXC, MySQL InnoDB Cluster: 99.999%
Percona XtraDB Cluster (PXC)
● Synchronous replication: every node is writable.
Percona XtraDB Cluster (PXC): 5 nodes in 2 datacenters (example)
[Diagram: PXC nodes split between the main data center (gmcast.segment=1) and a secondary data center (gmcast.segment=2), connected over the WAN.]
Percona XtraDB Cluster (PXC): 3 nodes in a single datacenter + async slave
[Diagram: a three-node Percona XtraDB Cluster in the main data center, plus an asynchronous MySQL replication slave in the secondary data center (not part of PXC) fed over the WAN.]
Percona XtraDB Cluster (PXC): Pros
● Writing to any node updates all other nodes synchronously (like semi-sync replication).
● All nodes can be writers.
● Adding a new node, or recovering an offline node, initialises its data automatically.
● Multi-datacenter support (assuming low latency).
● Quorum-based selection of the nodes that can take queries; nodes not in the quorum will shut down and must copy data before they can rejoin the cluster (status checks sketched below).
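A sketch of the status variables usually watched on a PXC/Galera node to confirm it is in the primary component and synced:

    -- Cluster membership and health on a PXC node:
    SHOW GLOBAL STATUS LIKE 'wsrep_cluster_size';          -- how many nodes are joined
    SHOW GLOBAL STATUS LIKE 'wsrep_cluster_status';        -- 'Primary' means this node is in the quorum
    SHOW GLOBAL STATUS LIKE 'wsrep_local_state_comment';   -- 'Synced' when fully caught up
    SHOW GLOBAL STATUS LIKE 'wsrep_flow_control_paused';   -- fraction of time writes were throttled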
Percona XtraDB Cluster (PXC): Cons
● Not suitable for large transactions.
● Not suitable for heavy write traffic, especially rapid writes to a hotspot; don't rely on multiple writers.
● Reinitialising a node can be expensive.
MySQL InnoDB Cluster
● Synchronous replication: every node is writable as well.
MySQL InnoDB Cluster: Pros
● Writing to any node updates all other nodes synchronously (like semi-sync replication).
● All nodes can be writers.
● Single-primary and multi-primary modes available (see the Group Replication sketch below).
● Quorum-based selection of the nodes that can take queries.
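InnoDB Cluster is normally driven through MySQL Shell's AdminAPI, but the underlying Group Replication layer can be sketched in plain SQL (MySQL 8.0; host names and the group UUID are placeholders, and GTIDs plus row-based binlogs are assumed to be configured already):

    INSTALL PLUGIN group_replication SONAME 'group_replication.so';

    SET GLOBAL group_replication_group_name    = 'aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee';
    SET GLOBAL group_replication_local_address = 'node1.example.com:33061';
    SET GLOBAL group_replication_group_seeds   = 'node1.example.com:33061,node2.example.com:33061,node3.example.com:33061';
    SET GLOBAL group_replication_single_primary_mode = ON;   -- OFF gives multi-primary

    -- Credentials for distributed recovery when a node (re)joins:
    CHANGE MASTER TO MASTER_USER = 'gr_user', MASTER_PASSWORD = '***'
        FOR CHANNEL 'group_replication_recovery';

    -- Only on the very first node, to bootstrap the group:
    SET GLOBAL group_replication_bootstrap_group = ON;
    START GROUP_REPLICATION;
    SET GLOBAL group_replication_bootstrap_group = OFF;

    -- Check membership and roles:
    SELECT member_host, member_state, member_role
    FROM performance_schema.replication_group_members;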
MySQL InnoDB Cluster: Cons
● Not suitable for large transactions.
● Not suitable for heavy write traffic, especially rapid writes to a hotspot; don't rely on multiple writers.
● No SST concept: all nodes need to start with the same data (see the clone sketch below).
● A replication error just stops replication to that node.
● Inconsistencies can/should be fixed manually.
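Because there is no SST, a new or broken node has to be seeded another way, typically from a backup or, on MySQL 8.0.17+, with the clone plugin; a hedged sketch (donor host and credentials are placeholders):

    -- On the donor and the recipient:
    INSTALL PLUGIN clone SONAME 'mysql_clone.so';

    -- On the recipient, allow that donor and pull a full copy of its data:
    SET GLOBAL clone_valid_donor_list = 'node1.example.com:3306';
    CLONE INSTANCE FROM 'clone_user'@'node1.example.com':3306 IDENTIFIED BY '***';
    -- The recipient restarts on the cloned data and can then rejoin the
    -- group with START GROUP_REPLICATION.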
Percona XtraDB Cluster and MySQL InnoDB Cluster: similar limitations
● Large and long-running transactions
  ▪ higher chance of failures
  ▪ inefficient replication of large transactions
● Workload hotspots (updating the same set of rows in parallel)
● Often writing to one node is the best solution
MySQL NDB Cluster: 99.9999%
MySQL NDB Cluster
● Shared-nothing architecture
● Synchronous (two-phase commit)
● In-memory store
● Traditionally used by:
  ▪ Telecom operators
  ▪ Gaming companies
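From the SQL nodes, NDB looks like just another storage engine; a minimal sketch using an example table (the name capitals is invented for illustration):

    -- Any SQL node can create and query NDB tables; the data itself lives
    -- in memory on the data nodes, redundantly across node groups.
    CREATE TABLE capitals (
        id         INT NOT NULL PRIMARY KEY,
        capital    VARCHAR(64),
        country    VARCHAR(64),
        utc_offset INT
    ) ENGINE=NDBCLUSTER;

    INSERT INTO capitals VALUES (1, 'Copenhagen', 'Denmark', 2);
    SELECT * FROM capitals;   -- works from any connected SQL node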
MySQL NDB Cluster Basic Architecture
[Diagram: web/app servers use a MySQL Server or the NDB API for all writes and reads. These API nodes connect to four data nodes running the NDB storage engine (memory and disk), while management servers oversee the cluster.]
MySQL NDB Cluster Components
● Data Nodes
  ▪ Store redundant data fragments
  ▪ Combined to form node groups
● Management Nodes
  ▪ Used at startup
  ▪ Re-configuration
● API Nodes
  ▪ Standard SQL interface
  ▪ NDB API (an object-oriented application programming interface for NDB Cluster that implements indexes, scans, transactions, and event handling)
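One way to see the cluster's nodes from any SQL node is the ndbinfo schema; a hedged sketch (ndbinfo.processes exists only in newer NDB releases):

    -- Data node status as seen from a SQL node:
    SELECT node_id, status, start_phase
    FROM ndbinfo.nodes;

    -- All node types (data, management, API) in newer NDB versions:
    SELECT node_id, node_type, process_name
    FROM ndbinfo.processes;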
MySQL NDB Cluster Data Node Architecture
Example table:
  ID  Capital        Country  UTC
   1  Copenhagen     Denmark    2
   2  Berlin         Germany    2
   3  New York City  USA       -5
   4  Tokyo          Japan      9
   5  Athens         Greece     2
   6  Moscow         Russia     4
   7  Oslo           Norway     2
   8  Beijing        China      8
[Diagram: the table is split into four partitions spread over four data nodes in two node groups, with two replicas of each partition. Node group 1 holds Partition 1 (P1-Primary / P1-Secondary) and Partition 2 (P2-Secondary / P2-Primary); node group 2 holds Partition 3 (P3-Primary / P3-Secondary) and Partition 4 (P4-Secondary / P4-Primary). Four data nodes, two replicas, two node groups.]
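Assuming the capitals example table from the earlier sketch, the automatic partitioning can be inspected from a SQL node; a hedged example:

    -- NDB distributes the table into partitions automatically (by primary
    -- key hash unless told otherwise); information_schema shows the result:
    SELECT PARTITION_NAME, TABLE_ROWS
    FROM information_schema.PARTITIONS
    WHERE TABLE_SCHEMA = DATABASE() AND TABLE_NAME = 'capitals';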
MySQL NDB Cluster Data Node Architecture (node failure)
[Diagram: the same four-data-node, two-node-group layout, but with one data node failed in each node group. The data is still available, because the surviving node in each node group holds a replica (primary or secondary) of every partition assigned to that group.]