Cassandra A Decentralized Structured Storage System

Sep 18, 2023


Cassandra A Decentralized Structured Storage System

Motivation
• Facebook Inbox search:
  – Billions of writes per day
  – Geographical distribution of servers and users

Data Model
• A table is a distributed multi-dimensional map indexed by a key
• Columns are grouped together into sets called column families

API
• insert(table, key, rowMutation)
• get(table, key, columnName)
• delete(table, key, columnName)
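
As a rough illustration of these three operations, the sketch below models one table as a nested in-memory dictionary (key, then column family, then column). The class name `Table`, the shape of `row_mutation`, and the `"family:column"` naming convention are assumptions made for this example only; they are not the system's actual interface.

```python
from collections import defaultdict


class Table:
    """Toy single-process stand-in for a table: key -> family -> column -> value."""

    def __init__(self):
        self.rows = defaultdict(lambda: defaultdict(dict))

    def insert(self, key, row_mutation):
        # Assumption: row_mutation has the shape {family: {column: value}}.
        for family, columns in row_mutation.items():
            self.rows[key][family].update(columns)

    def get(self, key, column_name):
        # Assumption: column_name is written as "family:column".
        family, column = column_name.split(":", 1)
        return self.rows.get(key, {}).get(family, {}).get(column)

    def delete(self, key, column_name):
        family, column = column_name.split(":", 1)
        self.rows.get(key, {}).get(family, {}).pop(column, None)


inbox = Table()
inbox.insert("user42", {"terms": {"hello": "msg-1"}})
print(inbox.get("user42", "terms:hello"))
inbox.delete("user42", "terms:hello")
print(inbox.get("user42", "terms:hello"))
```

The nested dictionary mirrors the "multi-dimensional map indexed by a key" wording of the Data Model slide; distribution across nodes is deliberately left out.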

System Architecture: Partitioning
• Data is partitioned across the cluster using consistent hashing
• Each node in the system is assigned a random value (its position) on the ring space
• A data item belongs to the first node whose position is larger than the item's position
• When a node joins or leaves, only its direct neighbours are affected
• An incoming node can be placed so that it alleviates a heavily loaded node
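
The ring lookup described above can be sketched as follows. This is a minimal illustration, not the system's code: the ring size, the use of MD5 to place keys, the fixed node tokens, and the names `Ring`, `position`, and `node_for` are all assumptions chosen for the example.

```python
import bisect
import hashlib

RING_SIZE = 2 ** 32  # assumed size of the ring space


def position(name):
    """Map a string to a ring position (MD5 is an arbitrary choice here)."""
    digest = hashlib.md5(name.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % RING_SIZE


class Ring:
    def __init__(self):
        self.tokens = []  # sorted node positions
        self.owner = {}   # position -> node name

    def add_node(self, node, token):
        bisect.insort(self.tokens, token)
        self.owner[token] = node

    def node_for(self, key):
        # First node whose position is larger than the key's position,
        # wrapping around to the smallest token at the end of the ring.
        idx = bisect.bisect_right(self.tokens, position(key))
        return self.owner[self.tokens[idx % len(self.tokens)]]


ring = Ring()
for name, token in [("A", 1_000_000_000), ("B", 2_000_000_000), ("C", 3_000_000_000)]:
    ring.add_node(name, token)

keys = [f"key-{i}" for i in range(200)]
before = {k: ring.node_for(k) for k in keys}
ring.add_node("D", 2_500_000_000)  # D lands between B and C
after = {k: ring.node_for(k) for k in keys}

# Only keys previously owned by D's successor (C) change owner.
moved = [k for k in keys if before[k] != after[k]]
print({(before[k], after[k]) for k in moved})
```

Every key that changes owner moves from C to D, which is the "only direct neighbours are affected" property in miniature.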

System Architecture: Replication
• Each data item is replicated at N hosts
• The coordinator node is in charge of replicating the data items that fall within its range
• “Rack Unaware”: replicas are placed on the N-1 successors of the coordinator on the ring
• “Rack Aware” or “Data Centre Aware”: nodes elect a leader, who assigns a replica range to every node
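
The “Rack Unaware” placement can be sketched as a walk along the ring: the coordinator is the first node past the item's position, and the remaining replicas are its N-1 successors. The function name `replicas` and the small integer tokens are hypothetical, used only to keep the example readable.

```python
import bisect


def replicas(owner, key_position, n):
    """Return the coordinator followed by its N-1 successors on the ring.

    owner maps each node's ring position (token) to the node's name.
    """
    tokens = sorted(owner)
    start = bisect.bisect_right(tokens, key_position) % len(tokens)
    count = min(n, len(tokens))  # cannot place more replicas than nodes
    return [owner[tokens[(start + i) % len(tokens)]] for i in range(count)]


ring = {10: "A", 20: "B", 30: "C", 40: "D"}
print(replicas(ring, 25, 3))  # ['C', 'D', 'A']
print(replicas(ring, 45, 3))  # ['A', 'B', 'C']
```

The rack-aware and data-centre-aware strategies are not modelled here, since they depend on the leader's assignment rather than on ring order alone.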

System Architecture: Membership
• Membership is based on Scuttlebutt, an anti-entropy, gossip-based mechanism
• Failure detection is used to avoid attempts to communicate with unreachable nodes
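
To give a feel for anti-entropy gossip, the sketch below reconciles two nodes' membership views by keeping the higher-versioned entry for each node. It is a deliberately simplified illustration and not Scuttlebutt itself, which exchanges digests and deltas rather than full views; the `(version, state)` tuples and the function names are assumptions.

```python
def merge(local, remote):
    """Fold a peer's view into ours, keeping the higher-versioned entry per node."""
    for node, (version, state) in remote.items():
        if node not in local or local[node][0] < version:
            local[node] = (version, state)


def gossip_round(a, b):
    """One push-pull exchange between two views; afterwards both agree."""
    merge(a, b)
    merge(b, a)


view_a = {"n1": (3, "up"), "n2": (1, "up")}
view_b = {"n2": (2, "down"), "n3": (1, "up")}
gossip_round(view_a, view_b)
print(view_a == view_b)  # True
print(view_a["n2"])      # (2, 'down')
```

Repeating such exchanges between randomly chosen pairs is what lets information such as a newly chosen token spread through the cluster.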

System Architecture: Bootstrapping
• When a node starts for the first time, it chooses a random token for its position in the ring
• This information is then gossiped around the cluster
• When a node needs to join the cluster, it reads its configuration file, which contains a few contact points within the cluster

System Architecture: Scaling
• When a new node is added, it is assigned a token such that it can alleviate a heavily loaded node
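
One plausible way to pick such a token is to split the range of the most heavily loaded node in half. The slide does not say how the token is chosen, so the midpoint rule, the `load` mapping, the ring size, and the function name below are all assumptions made for illustration.

```python
RING_SIZE = 2 ** 32  # assumed size of the ring space


def token_for_new_node(tokens, load):
    """Pick a token that halves the range of the most loaded node.

    tokens: sorted ring positions of the existing nodes.
    load:   mapping from token to some load measure for that node.
    """
    heavy = max(tokens, key=lambda t: load[t])
    prev = tokens[tokens.index(heavy) - 1]  # predecessor; index -1 wraps around
    # The heavy node owns the range (prev, heavy], which may wrap past zero.
    span = (heavy - prev) % RING_SIZE or RING_SIZE
    return (prev + span // 2) % RING_SIZE


print(token_for_new_node([100, 200, 300], {100: 5, 200: 50, 300: 7}))  # 150
```

A node placed at the returned token takes over the first half of the heavy node's range, so only that one neighbour hands off data.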

System Architecture: Local Persistence
• Write:
  – Uses an in-memory data structure
  – The write to the in-memory structure is performed only after a successful write to a commit log
  – When the in-memory data structure grows past a threshold, it dumps itself to disk
• Read:
  – First look at the in-memory data
  – Then, for each file on disk, check its bloom filter to see whether the file could contain the key
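
The write and read paths above can be sketched with a toy store. Everything here is a stand-in chosen for the example: the commit log and the flushed files are plain Python lists rather than disk files, the flush threshold counts keys, and the `BloomFilter` and `Store` classes are hypothetical. Checking the newest file first is also an assumption of this sketch.

```python
import hashlib


class BloomFilter:
    def __init__(self, bits=1024, hashes=3):
        self.bits, self.hashes, self.field = bits, hashes, 0

    def _positions(self, key):
        for i in range(self.hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.bits

    def add(self, key):
        for p in self._positions(key):
            self.field |= 1 << p

    def might_contain(self, key):
        # False means "definitely absent"; True means "possibly present".
        return all((self.field >> p) & 1 for p in self._positions(key))


class Store:
    def __init__(self, threshold=2):
        self.commit_log = []  # stand-in for the on-disk commit log
        self.memtable = {}    # the in-memory data structure
        self.files = []       # flushed (bloom_filter, data) pairs, oldest first
        self.threshold = threshold

    def write(self, key, value):
        self.commit_log.append((key, value))  # log first ...
        self.memtable[key] = value            # ... then memory
        if len(self.memtable) >= self.threshold:
            self._flush()

    def _flush(self):
        bloom = BloomFilter()
        for key in self.memtable:
            bloom.add(key)
        self.files.append((bloom, dict(self.memtable)))
        self.memtable = {}

    def read(self, key):
        if key in self.memtable:
            return self.memtable[key]
        for bloom, data in reversed(self.files):  # newest file first
            if bloom.might_contain(key) and key in data:
                return data[key]
        return None


store = Store(threshold=2)
store.write("a", 1)
store.write("b", 2)  # reaches the threshold, so the memtable is flushed
store.write("a", 3)  # newer value lives in memory only
print(store.read("a"), store.read("b"), store.read("z"))
```

The bloom filter lets a read skip any file that definitely does not hold the key, which is why it is consulted before the file's contents.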
