Infrastructure Technologies for Large- Scale Service-Oriented - PowerPoint PPT Presentation

Apr 02, 2023 •252 likes •366 views

Infrastructure Technologies for Large- Scale Service-Oriented Systems Kostas Magoutis magoutis@csd.uoc.gr http://www.csd.uoc.gr/~magoutis Kafka Data logged User activity (logins, page views, clicks, likes, sharing, comments, search

Infrastructure Technologies for Large- Scale Service-Oriented Systems Kostas Magoutis magoutis@csd.uoc.gr http://www.csd.uoc.gr/~magoutis
Kafka • Data logged – User activity (logins, page views, clicks, likes, sharing, comments, search queries) – Operational metrics (call latency, errors, system metrics) • Uses – Search relevance – Recommendations driven by item popularity or co- occurrence in activity stream – Ad targeting and reporting – Security applications – Newsfeed of user status for friends / connections to read
Challenges • High event rates – Search, recommendations, and advertising require computing granular click-through rates – China Mobile 5-7TB of phone call records / day – Facebook gathers ~6TB of various user activity events / day • Traditional enterprise messaging systems too strict – Unnecessarily rich set of delivery guarantees • IBM WebSphere MQ: allow atomic inserts into multiple queues • JMS spec: ack each individual message after consumption – Performance issues: No API to batch messages (JMS) – No easy way to partition and store msgs on many machines – Assuming near-immediate consumption of messages
Kafka architecture
Kafka log • Each partition of a topic corresponds to a logical log • Flush to disk after configurable number of published messages
Efficiency of single partition • Simple storage – Consumer acknowledges message offsets – Under the cover, consumer issues async pull requests – Broker locates segment file, sends data back to consumer • Efficient transfer – No user-space caching by brokers, reduces JVM GC costs – Direct transfer from files to network sockets • Stateless broker – Does not know whether all subscribers have consumed msg – Automatic message deletions after 7 days – Subscribers can rewind and replay messages
Consumer groups • One or more consumers that jointly consume a set of subscribed topics – Each message delivered to only one consumer within CG • No coordination needed across CGs • Goal is to divide messages stored in brokers evenly among consumers • All messages from one partition consumed by single consumer in a CG – Multiple consumers of a partition would need to coordinate – To balance load, multiple partitions per consumer
Coordination service: ZooKeeper • Simple file-like API on znodes • Can register watcher on a path, get notified • Ephemeral vs. persistent paths • Highly available service Image courtesy of https://zookeeper.apache.org
Offset of last consumed message per partition Offset registry Kafka data structures in ZooKeeper offset [partitionId] / [topic] consumers brokers offsets [groupId] topics ids owners [topic] ids [topic] [brokerId] partitions [consumerId] [partitionId] [partitionId] state consumerId Consumer registry Broker registry CG consumer belongs to, Ownership set of topics it subscribes to Broker hostname/port, set of topics/partitions it stores registry Partition-to-consumer mapping
Rebalancing partitions • Detect the addition or removal of brokers or consumers • Trigger a re-balance process when that happens
Typical Kafka deployment

Recommend

E Evolution of NTCIR: l Infrastructure of Large-Scale Infrastructure of Large Scale

E Evolution of NTCIR: l Infrastructure of Large-Scale Infrastructure of Large Scale Information Access Technologies Evaluation and Testing Evaluation and Testing Noriko Kando Noriko Kando National Institute of Informatics, Japan

632 views • 61 slides

Adaptive System Infrastructure for Adaptive System Infrastructure for Ultra- -Large Large-

Adaptive System Infrastructure for Adaptive System Infrastructure for Ultra- -Large Large- -Scale Systems Scale Systems Ultra SMART Conference, Thursday, March 6 th , 2008 SMART Conference, Thursday, March 6 th , 2008 Dr. Douglas C.

432 views • 14 slides

GLAST Large Area Telescope: GLAST Large Area Telescope: Gamma- -ray Large ray Large Gamma

GLAST LAT Project GSFC Monthly, 2 March 2006 GLAST Large Area Telescope: GLAST Large Area Telescope: Gamma- -ray Large ray Large Gamma Area Space Area Space Telescope Telescope LAT System Engineering LAT Quality Assurance Pat Hascall

614 views • 15 slides

Technologies : Retour sur le Futur ? Technologies : Retour sur le Futur ? Technologies : Retour

Technologies : Retour sur le Futur ? Technologies : Retour sur le Futur ? Technologies : Retour sur le Futur ? Technologies : Retour sur le Futur ? FOURPOINTS Funds Info Tech November 4. 2014 Fund Managers: Benot Flamant Twitter:

642 views • 47 slides

BBC Technologies: Our LATAM Experience Who are BBC Technologies? BBC Technologies Where we are

BBC Technologies: Our LATAM Experience Who are BBC Technologies? BBC Technologies Where we are today Technology World leaders in Advance Processing Technologies for small fruits and vegetables. R&D Strong focus on research

383 views • 17 slides

ZEBRA TECHNOLOGIES ZEBRA TECHNOLOGIES DevTalk - Enterprise Browser 2.5 Darryn Campbell SW

ZEBRA TECHNOLOGIES ZEBRA TECHNOLOGIES DevTalk - Enterprise Browser 2.5 Darryn Campbell SW Architect, Zebra Technologies May 20 th 2020 ZEBRA TECHNOLOGIES DevTalk Enterprise Browser 2.5 ZEBRA TECHNOLOGIES DevTalk Enterprise Browser

367 views • 33 slides

Medical Infrastructure in Medical Infrastructure in Medical Infrastructure in Medical

19-03-2018 Medical Infrastructure in Medical Infrastructure in Medical Infrastructure in Medical Infrastructure in Gujarat Gujarat Gujarat Gujarat Dr N B Dholakia Additional Director, Medical Services Department of Health and Family

657 views • 23 slides

Cyber- -Science Infrastructure: Science Infrastructure: Cyber Cyber-Science Infrastructure:

Cyber- -Science Infrastructure: Science Infrastructure: Cyber Cyber-Science Infrastructure: the next- -generation national academic generation national academic the next the next-generation national academic information infrastructure for

412 views • 20 slides

What can Infrastructure do for you today? Daniel Humbedooh Gruno Infrastructure Architect,

What can Infrastructure do for you today? Daniel Humbedooh Gruno Infrastructure Architect, The Apache Software Foundation What is infrastructure? What is infrastructure? The Apache Infrastructure Committee (henceforth

482 views • 23 slides

INFRASTRUCTURE 2110414 Large Scale Computing Systems Natawut Nupairoj, Ph.D. Outline 2

2110414 - Large Scale Computing Systems 1 LARGE SCALE INFRASTRUCTURE 2110414 Large Scale Computing Systems Natawut Nupairoj, Ph.D. Outline 2 Overview Hardware Virtualization Storage Technology 2110414 - Large Scale Computing

512 views • 29 slides

RT Large Model Launch August 2010 Copeland Hermetic Reciprocating Products Large RT Model

R*T Large Model Launch August 2010 Copeland Hermetic Reciprocating Products Large R*T Model Launch R*T Small Models R*T Large Models Launched Sept07 Launched June 10 CS/CF Small R Large R Small R*T Large R*T 6400 BTU A*T

327 views • 9 slides

A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org

A large-scale International IPv6 Network A large-scale International IPv6 Network A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org www.6net.org A large-scale International IPv6 Network A

174 views • 15 slides

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large Scale Solar Lead April 2017 CONTENTS 1. Introduction to CEFC 2. Investment trends 3. The future of large scale solar 4. Pathway to sustainable

664 views • 21 slides

THANK YOU FOR YOUR INTEREST IN POKE YOKE TECHNOLOGIES POKE YOKE TECHNOLOGIES AT A GLANCE POKE

THANK YOU FOR YOUR INTEREST IN POKE YOKE TECHNOLOGIES POKE YOKE TECHNOLOGIES AT A GLANCE POKE YOKE TECHNOLOGIES PRIVATE LIMITED POKE YOKE TECHNOLOGIES IS A MANAGEMENT CONSULTING COMPANY WITH ENHANCING PERFORMANCE AT ITS CORE FOUNDED IN 2014

682 views • 10 slides

ZEBRA TECHNOLOGIES ZEBRA TECHNOLOGIES DevTalk Whats new for Zebra developers in Android 10

ZEBRA TECHNOLOGIES ZEBRA TECHNOLOGIES DevTalk Whats new for Zebra developers in Android 10 Darryn Campbell SW Architect, Zebra Technologies 15 th July 2020 ZEBRA TECHNOLOGIES What's new for Zebra developers in Android 10 Android 10

471 views • 24 slides

Selecting Least Cost Green Infrastructure James W. Ridgway, PE October 14, 2015 Integrated

Selecting Least Cost Green Infrastructure James W. Ridgway, PE October 14, 2015 Integrated Water Management?? IS GREEN INFRASTRUCTURE LESS COSTLY THEN GRAY INFRASTRUCTURE? Cost of Green Infrastructure vs. Gray Infrastructure Storm Water

525 views • 34 slides

The Illusion of Free Will Let us then understand free will as the capacity unique to persons

The Illusion of Free Will Let us then understand free will as the capacity unique to persons that allows them to control their actions. - Internet Encyclopedia of Philosophy A belief that there is a component to biological behaviour

263 views • 22 slides

lets build our technologies around the needs of digitally CONTEXT: FUTURE SCENARIOS?

Are digital Stuart Allan Director of Online Learning technologies To support equality of access, Edinburgh Business School complicit in acts of @OpenPlanStuart exclusion and marginalisation? lets build our technologies around the needs

94 views • 8 slides

Chenhao Tan 1 Can machines think? (Turing, 1950) 2 3 4 Atari game (Bonus: try Google

Chenhao Tan 1 Can machines think? (Turing, 1950) 2 3 4 Atari game (Bonus: try Google image search atari breakout) 5 Minh et al. 2013 https://www.youtube.com/watch?v=V1e YniJ0Rnk 6 Ghazvininejad, Shi, Choi, and

348 views • 34 slides

Identifying Changes in the Cybersecurity Threat Landscape using the LDA-Web Topic Modelling Data

Identifying Changes in the Cybersecurity Threat Landscape using the LDA-Web Topic Modelling Data Search Engine Thursday 13 th July 2017 Multidisciplinary approaches to Cloud Crime HCII 2017, Vancouver Canada Noura Al Moubayed, David Wall, and

544 views • 9 slides

Objectives Computer Science is Complexity Science BI: Facebook Apr 5, 2019 Sprenkle -

Objectives Computer Science is Complexity Science BI: Facebook Apr 5, 2019 Sprenkle - CSCI111 1 Review What are common constructs in programming languages? What are some differences between programming languages? Apr 5, 2019

555 views • 13 slides

Mobile Communication Special Topics in Mobile Systems (FC5260) Instructor: Venkat Padmanabhan

Mobile Communication Special Topics in Mobile Systems (FC5260) Instructor: Venkat Padmanabhan Note: includes slides generously made available by the authors of the papers being discussed 1 This Lecture: Mobile Communication Papers to be

896 views • 56 slides

How To Sharpen Your Focus for Increased Speed Presenter: Paul Nowak Founder | Iris Reading

3/13/15% Ask Yourself: Why am I reading this? Decide on Your Purpose How To Sharpen Your Focus for Increased Speed Presenter: Paul Nowak Founder | Iris Reading www.irisreading.com ! Material Needed for this Class: Your own reading

338 views • 8 slides

Nodal Analysis Background M. A. Hameed ECSE Department Rensselaer Polytechnic Institute Intro

Nodal Analysis Background M. A. Hameed ECSE Department Rensselaer Polytechnic Institute Intro to ECSE Kirchhoff's Current Law (KCL) Definition: The algebraic sum of all currents at a node is zero. Other words: The sum of currents

149 views • 4 slides