Cassandra Jonathan Ellis Motivation Scaling reads to a relational - PowerPoint PPT Presentation

Cassandra Jonathan Ellis

Motivation ● Scaling reads to a relational database is hard ● Scaling writes to a relational database is virtually impossible ● … and when you do, it usually isn't relational anymore

The new face of data ● Scale out, not up ● Online load balancing, cluster growth ● Flexible schema ● Key-oriented queries ● CAP-aware

CAP theorem ● Pick two of Consistency, Availability, Partition tolerance

T wo famous papers ● Bigtable: A distributed storage system for structured data , 2006 ● Dynamo: amazon's highly available key- value store , 2007

T wo approaches ● Bigtable: “How can we build a distributed db on top of GFS?” ● Dynamo: “How can we build a distributed hash table appropriate for the data center?”

10,000 ft summary ● Dynamo partitioning and replication ● Log-structured ColumnFamily data model similar to Bigtable's

Cassandra highlights ● High availability ● Incremental scalability ● Eventually consistent ● T unable tradeoffs between consistency and latency ● Minimal administration ● No SPF

Dynamo architecture & Lookup

Architecture details ● O(1) node lookup ● Explicit replication ● Eventually consistent

Architecture layers Messaging service Commit log T ombstones Gossip Memtable Hinted handoff Failure detection SST able Read repair Cluster state Indexes Bootstrap Partitioner Compaction Monitoring Replication Admin tools

Writes ● Any node ● Partitioner ● Commitlog, memtable ● SST able ● Compaction ● Wait for W responses

Memtable / SST able Disk Commit log

SST able format ● Key / data

SST able Indexes ● Bloom filter ● Key ● Column (Similar to Hadoop MapFile / Tfile)

Compaction ● Merge keys ● Combine columns ● Discard tombstones

Remove ● Deletion marker (tombstone) necessary to suppress data in older SST ables, until compaction ● Read repair complicates things a little ● Eventually consistent complicates things more ● Solution: configurable delay before tombstone GC, after which tombstones are not repaired

Cassandra write properties ● No reads ● No seeks ● Fast ● Atomic within ColumnFamily ● Always writable

Read path ● Any node ● Partitioner ● Wait for R responses ● Wait for N – R responses in the background and perform read repair

Cassandra read properties ● Read multiple SST ables ● Slower than writes (but still fast) ● Seeks can be mitigated with more RAM ● Scales to billions of rows

Consistency in a BASE world ● If W + R > N, you will have consistency ● W=1, R=N ● W=N, R=1 ● W=Q, R=Q where Q = N / 2 + 1

vs MySQL with 50GB of data ● MySQL ● ~300ms write ● ~350ms read ● Cassandra ● ~0.12ms write ● ~15ms read ● Achtung!

Data model ● Rows, ColumnFamilies, Columns

ColumnFamilies keyA column1 column2 column3 keyC column1 column7 column11 Column Byte[] Name Byte[] Value I64 timestamp

Super ColumnFamilies keyF Super1 Super2 column column column column column column keyJ Super1 Super5 column column column column column column

T ypes of queries ● Single column ● Slice ● Set of names / range of names ● Simple slice -> columns ● Super slice -> supercolumns ● Key range

Range queries ● Add “master” server ● Implement on top of K/V ● Order-preserving partitioning

Modification ● Insert / update ● Remove ● Single column or batch ● Specify W, number of nodes to wait for

Thrift struct Column { 1: binary name, 2: binary value, 3: i64 timestamp, } struct SuperColumn { 1: binary name, 2: list<Column> columns, } Column get_column(table, key, column_path, block_for=1) list<string> get_key_range(table, column_family, start_with="", stop_at="", max_results=100) void insert(table, key, column_path, value, timestamp, block_for=0) void remove(tablename, key, column_path_or_parent, timestamp)

Honestly, Thrift kinda sucks

Example: a multiuser blog T wo queries - the most recent posts belonging to a given blog, in reverse chronological order - a single post and its comments, in chronological order

First try JBE Cassandra is teh awesome BASE FTW blog post comment comment post comment comment Evan I like kittens And Ruby blog post comment comment post comment comment <ColumnFamily T ype="Super" CompareWith="TimeString" CompareSubcolumnsWith="UUID" Name="Blog"/>

Second try JBE blog Cassandra BASE FTW Cassandr comment comment is teh a is teh awesome awesome Evan blog I like kittens And Ruby Base FTW comment comment I like comment comment kittens And Ruby comment comment <ColumnFamily <ColumnFamily CompareWith="UUIDT ype" CompareWith="UUIDT ype" Name="Blog"/> Name="Comment"/>

Roadmap

Cassandra 0.3 ● Remove support ● OPP / Range queries ● T est suite ● Workarounds for JDK bugs ● Rudimentary multi-datacenter support

Cassandra 0.4 ● Branched May 18 ● Data file format change to support billions of rows per node instead of millions ● API changes (no more colon delimiters) ● Multi-table (keyspace) support ● LRU key cache ● fsync support ● Bootstrap ● Web interface

Cassandra 0.5 ● Bootstrap ● Load balancing ● Closely related to “bootstrap done right” ● Merkle tree repair ● Millions of columns per row ● This will require another data format change ● Multiget ● Callout support

Users Production: facebook, RocketFuel Production RSN: Digg, Rackspace No date yet: IBM Research, T witter Evaluating: 50+ in #cassandra on freenode

More ● Eventual consistency: http://www.allthingsdistributed.com/2008/1 ● Introduction to distributed databases by T odd Lipcon at NoSQL 09: http://www.vimeo.com/5145059 ● Other articles/videos about Cassandra: http://wiki.apache.org/cassandra/ArticlesAn ● #cassandra on irc.freenode.net

Cassandra

Cassandra Jonathan Ellis Motivation Scaling reads to a relational - PowerPoint PPT Presentation

Cassandra Jonathan Ellis Motivation Scaling reads to a relational database is hard Scaling writes to a relational database is virtually impossible and when you do, it usually isn't relational anymore The new face of data

Apache Cassandra STL Java Users Group Cliff Gilmore DataStax Solutions Architect / Engineer

SASI, Cassandra on the full text search ride DuyHai DOAN Apache Cassandra Evangelist 1 5

On Cassandra's evolution Berlin Buzzwords (June 4th 2013) Sylvain Lebresne Apache Cassandra

Cassandra and Apollo By Octavia, Baylee, and Tilah Cassandra was not an oracle.she could not see

Apache Cassandra for Big Data Applications Christof Roduner Java User Group Switzerland COO and

Balens 2017 CPD Event Legal Update Social Media Cassandra Dighton BSG Solicitors Social

Duy Hai DOAN @doanduyhai Who Am I ? Duy Hai DOAN Cassandra technical advocate talks, meetups,

and other platforms Sankalp Sah, Manish Singh MityLytics Inc Why ARM for Cassandra ? RISC

Cassandra on RocksDB Dikang Gu Software Engineer @ Facebook Agenda 1. Motivation 2. Approaches

Lessons Learned with Cassandra & Spark_ Matthias Niehoff Apache: Big Data 2017

Day 4 Lab1: Docker container for Kafka - Spark streaming - Cassandra This Dockerfile sets up

Presented by Fiona Stewart, Cassandra ONeill & Monica Brinkerhoff Leadership for Change

Presented by Fiona Stewart, Cassandra ONeill & Monica Brinkerhoff Leadership for Change

Cassandra: Distributed Access Control Policies with Tunable Expressiveness Moritz Y. Becker and

Cassandra Offline Analytics Dongqian Liu, Yi Liu 2017/05/02 Agenda Introduction Use Case

Cassandra By Example: Data Modelling with CQL3 Berlin Buzzwords June 4, 2013 Eric Evans

Middleware for Gossip Protocols Michael Chow and Robbert van Renesse Cornell University Mo:va:on

Alireza Rezvanian School of Computer Science, Institute for

Orinda Union School District Facilities Master Plan Committee Meeting #2 October 11, 2017 agenda

Enhance CRM with integrated data to drive strategic decisions + Data Integration RigDig CRM

High Performance Actors Kiki, principal enterprise architect @ Lightbend What we mean by

Project General Presentation This project hasreceived funding from the European Unions Horizon

Black Market Botnets Black Market Botnets Nathan Friess Friess Nathan John Aycock Aycock

Parent Ambassadors Marketing March 31, 2016 Kurt Lewis Director of Enrollment Marketing

Cassandra Jonathan Ellis Motivation Scaling reads to a relational - PowerPoint PPT Presentation

Cassandra Jonathan Ellis Motivation Scaling reads to a relational database is hard Scaling writes to a relational database is virtually impossible and when you do, it usually isn't relational anymore The new face of data

Apache Cassandra STL Java Users Group Cliff Gilmore DataStax Solutions Architect / Engineer

SASI, Cassandra on the full text search ride DuyHai DOAN Apache Cassandra Evangelist 1 5

On Cassandra's evolution Berlin Buzzwords (June 4th 2013) Sylvain Lebresne Apache Cassandra

Cassandra and Apollo By Octavia, Baylee, and Tilah Cassandra was not an oracle.she could not see

Apache Cassandra for Big Data Applications Christof Roduner Java User Group Switzerland COO and

Balens 2017 CPD Event Legal Update Social Media Cassandra Dighton BSG Solicitors Social

Duy Hai DOAN @doanduyhai Who Am I ? Duy Hai DOAN Cassandra technical advocate talks, meetups,

and other platforms Sankalp Sah, Manish Singh MityLytics Inc Why ARM for Cassandra ? RISC

Cassandra on RocksDB Dikang Gu Software Engineer @ Facebook Agenda 1. Motivation 2. Approaches

Lessons Learned with Cassandra &amp; Spark_ Matthias Niehoff Apache: Big Data 2017

Day 4 Lab1: Docker container for Kafka - Spark streaming - Cassandra This Dockerfile sets up

Presented by Fiona Stewart, Cassandra ONeill &amp; Monica Brinkerhoff Leadership for Change

Presented by Fiona Stewart, Cassandra ONeill &amp; Monica Brinkerhoff Leadership for Change

Cassandra: Distributed Access Control Policies with Tunable Expressiveness Moritz Y. Becker and

Cassandra Offline Analytics Dongqian Liu, Yi Liu 2017/05/02 Agenda Introduction Use Case

Cassandra By Example: Data Modelling with CQL3 Berlin Buzzwords June 4, 2013 Eric Evans

Middleware for Gossip Protocols Michael Chow and Robbert van Renesse Cornell University Mo:va:on

Alireza Rezvanian School of Computer Science, Institute for

Orinda Union School District Facilities Master Plan Committee Meeting #2 October 11, 2017 agenda

Enhance CRM with integrated data to drive strategic decisions + Data Integration RigDig CRM

High Performance Actors Kiki, principal enterprise architect @ Lightbend What we mean by

Project General Presentation This project hasreceived funding from the European Unions Horizon

Black Market Botnets Black Market Botnets Nathan Friess Friess Nathan John Aycock Aycock

Parent Ambassadors Marketing March 31, 2016 Kurt Lewis Director of Enrollment Marketing

Lessons Learned with Cassandra & Spark_ Matthias Niehoff Apache: Big Data 2017

Presented by Fiona Stewart, Cassandra ONeill & Monica Brinkerhoff Leadership for Change

Presented by Fiona Stewart, Cassandra ONeill & Monica Brinkerhoff Leadership for Change