On Brewing Fresh Espresso: LinkedIn’s Distributed Data Serving Platform

Mar 01, 2023


On Brewing Fresh Espresso: LinkedIn’s Distributed Data Serving Platform Thomas Marshall

Motivation
● Better performance and horizontal scalability than traditional RDBMS.
● Better consistency, transactions, and schema support than NoSQL.
● Integration into LinkedIn’s data ecosystem.

Data Model
● Nested entities and independent entities.
● Relational
  ○ Documents - the equivalent of rows.
● Hierarchical
  ○ Document groups - share the same partitioning key, span tables, and are the largest unit of transactions.
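
The document-group idea can be sketched in a few lines of Python. This is only an illustration, not Espresso's schema or API: the `DocKey` type, the table names, and the sample documents are all hypothetical. It shows documents from different tables falling into one group because they share a partitioning key.

```python
from collections import defaultdict
from typing import NamedTuple


class DocKey(NamedTuple):
    """Hypothetical document key, invented for this sketch."""
    table: str
    partition_key: str      # shared by every document in the group
    sub_keys: tuple = ()    # identifies a nested entity within the group


def group_documents(docs):
    """Bucket documents by partition key, regardless of table."""
    groups = defaultdict(dict)
    for key, body in docs.items():
        groups[key.partition_key][key] = body
    return dict(groups)


# Hypothetical sample data: two tables, two partition keys.
docs = {
    DocKey("Mailbox", "alice"): {"unread": 2},
    DocKey("Message", "alice", ("m1",)): {"subject": "hi"},
    DocKey("Message", "alice", ("m2",)): {"subject": "re: hi"},
    DocKey("Mailbox", "bob"): {"unread": 0},
}
groups = group_documents(docs)
```

Under this assumption, everything in `groups["alice"]` lives in one partition, which is why a document group can serve as the largest unit of a transaction.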

Secondary Indexes
● Allow for efficient lookup based on values other than the primary key.
● Local secondary indexes - apply to one document group.
● Global secondary indexes - apply across doc groups, implemented as derived tables.

Secondary Indexes
● Lucene
  ○ Inverted index.
  ○ Log structured.
● Prefix
  ○ Inverted index, prefixed by the partition key.
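
A minimal sketch of the prefix idea, assuming whitespace tokenization and an in-memory dictionary (neither is from the talk, and the class name `PrefixIndex` is made up). Each posting list is keyed by the partition key plus the term, so a lookup only touches entries for one document group.

```python
from collections import defaultdict


class PrefixIndex:
    """Toy inverted index whose keys are prefixed by the partition key."""

    def __init__(self):
        # (partition_key, term) -> set of document ids
        self._postings = defaultdict(set)

    def add(self, partition_key, doc_id, text):
        for term in text.lower().split():
            self._postings[(partition_key, term)].add(doc_id)

    def lookup(self, partition_key, term):
        # .get avoids creating empty posting lists on a miss
        hits = self._postings.get((partition_key, term.lower()), set())
        return sorted(hits)


index = PrefixIndex()
index.add("alice", "m1", "lunch on friday")
index.add("alice", "m2", "friday deadline")
index.add("bob", "m9", "friday plans")
alice_friday = index.lookup("alice", "Friday")
```

Because the partition key leads every index key, the lookup for `alice` never sees `bob`'s documents, which matches the local (per document group) scope described above.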

Architecture
● Client - submits requests via a REST API.
● Router - sends each request to the appropriate node based on the partitioning protocol.
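
The router's job can be sketched as two lookups: key to partition, then partition to node. The sketch below assumes hash-based partitioning with a fixed partition count; the hash function, the constant `NUM_PARTITIONS`, and the node names are illustrative choices, not details from the talk.

```python
import hashlib

NUM_PARTITIONS = 8  # assumed value for the sketch


def partition_for(partition_key: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Map a partition key to a partition id with a stable hash."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


def route(partition_key: str, routing_table: dict) -> str:
    """Return the node currently serving the key's partition."""
    return routing_table[partition_for(partition_key)]


# Hypothetical routing table: partition id -> node name.
routing_table = {p: f"node-{p % 3}" for p in range(NUM_PARTITIONS)}
target = route("alice", routing_table)
```

A stable hash such as MD5 is used here instead of Python's built-in `hash`, which is randomized per process for strings and so would not give every router the same answer.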

Architecture
● Helix
  ○ Cluster management system.
  ○ Assigns partitions.

Architecture
● Fault tolerance
  ○ When a master partition fails, a slave is promoted by Helix.
  ○ ZooKeeper heartbeats and performance metrics determine failure.
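
The promotion step can be sketched as a pure function over a partition-to-replica map. This is not Helix's API: the function name, the data layout, and the rule for choosing which slave to promote (lowest node name) are assumptions made to keep the example deterministic.

```python
def fail_over(assignment, failed_node):
    """Drop a failed node and promote a slave where a master was lost.

    assignment: {partition_id: {node_name: "MASTER" | "SLAVE"}}
    Returns a new assignment; the input is left unchanged.
    """
    new_assignment = {}
    for partition, replicas in assignment.items():
        survivors = {n: s for n, s in replicas.items() if n != failed_node}
        if survivors and "MASTER" not in survivors.values():
            promoted = min(survivors)  # arbitrary but deterministic choice
            survivors[promoted] = "MASTER"
        new_assignment[partition] = survivors
    return new_assignment


before = {
    0: {"a": "MASTER", "b": "SLAVE"},
    1: {"a": "SLAVE", "b": "MASTER"},
}
after = fail_over(before, "a")
```

Partition 0 loses its master and gets `b` promoted; partition 1 only loses a slave, so its master is untouched.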

Overpartitioning
● Shard data into many more partitions than there are nodes.
● Eases failover and cluster expansion.
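
Why overpartitioning eases expansion can be shown with a small sketch. It assumes round-robin initial placement and a simple "take from the most loaded node" rebalancing rule; both are illustrative, and the function names are hypothetical. The point is that a new node takes over a few whole partitions while the rest stay where they are.

```python
def assign(num_partitions, nodes):
    """Round-robin initial placement: partition id -> node name."""
    return {p: nodes[p % len(nodes)] for p in range(num_partitions)}


def add_node(assignment, new_node):
    """Move whole partitions to new_node until it holds its fair share."""
    result = dict(assignment)
    nodes = set(result.values()) | {new_node}
    target = len(result) // len(nodes)

    def load(node):
        return sum(1 for owner in result.values() if owner == node)

    moved = []
    while load(new_node) < target:
        # Take the lowest-numbered partition from the most loaded node.
        donor = max(sorted(nodes - {new_node}), key=load)
        partition = min(p for p, owner in result.items() if owner == donor)
        result[partition] = new_node
        moved.append(partition)
    return result, moved


before = assign(12, ["n0", "n1", "n2"])
after, moved = add_node(before, "n3")
```

With 12 partitions on 3 nodes, adding a fourth node moves 3 partitions and leaves the other 9 in place. No partition has to be split or re-hashed.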

Architecture
● Storage node
  ○ Stores partitions.
  ○ Performs queries.
  ○ Maintains log.
  ○ Performs background tasks.

Architecture
● Databus
  ○ Achieves replication via pub/sub.
  ○ Ensures timeline consistency.
  ○ Replicated for fault tolerance.
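
Timeline consistency means a replica applies changes in the same order the master committed them, so its state is always some earlier point on the master's timeline. A minimal sketch follows, assuming each change carries a gap-free sequence number; the `Replica` class and the event layout are hypothetical, not Databus's interface.

```python
class Replica:
    """Toy subscriber that applies a change stream in commit order."""

    def __init__(self):
        self.data = {}
        self.last_seq = 0  # sequence number of the last change applied

    def apply(self, events):
        """events: iterable of (sequence_number, key, value) tuples."""
        for seq, key, value in sorted(events, key=lambda e: e[0]):
            if seq <= self.last_seq:
                continue  # already applied, so replays are harmless
            if seq != self.last_seq + 1:
                break     # gap: stop until the missing change arrives
            self.data[key] = value
            self.last_seq = seq


log = [(1, "a", 1), (2, "b", 2), (3, "a", 3)]

replica = Replica()
replica.apply([log[1], log[0]])  # delivered out of order
replica.apply(log)               # full replay, including duplicates

lagging = Replica()
lagging.apply([log[2]])          # change 3 alone cannot be applied yet
```

The replica may lag behind the master, but it never applies change 3 before changes 1 and 2, so it never exposes a state the master did not pass through.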

Future Work
● Transactions across document groups.
● OLAP workloads.
● Multiple data center deployment.

Conclusion
● Espresso aims for a middle ground between traditional RDBMS and NoSQL.
● LinkedIn particularly emphasized operability: ease of schema changes, horizontal scalability, etc.
