A snapshot, a stream, and a bunch of deltas Applying Lambda - PowerPoint PPT Presentation

A snapshot, a stream, and a bunch of deltas Applying Lambda Architectures in a post-Microservice World Q-Con London March 6th 2018 Ade Trenaman, SVP Engineering, Raconteur, HBC Tech t: @adrian_trenaman http://tech.hbc.com t: @hbcdigital fa: @hbcdigital in: hbc_digital

~$3.5Bn annual e-commerce revenue

00’s of Stores

What this talk is about Solving the problem of microservice dependencies with lambda architectures: > performance, scalability, reliability Lambda architecture examples: > product catalog, search, real-time inventory, third-party integration Lessons learnt: > It’s not all rainbows and unicorns > Kinesis vs. Kafka

λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ λ 2010 2012 2016 2018+ 2007 Service µ-Services Rise of Multi-banner Monolith Oriented Serverless λ Multi-tenant Multi-region λ architectures Some context: a minimalist abstraction of our Streams GraphQL In the architectural evolution seams

Slam on the breaks! Dublin Microservices Meetup, Feb 2015

Part 0 In which we briefly describe lambda architecture, and the Hollywood Principle

Batch processing ‘the Preserve the Provide low-latency, baseline’ integrity and high-throughput, purpose of the reliable, convenient source of truth. access to the data. Some kind of A view of the data data-source Stream: an append-only, Stream processing immutable log store of ‘real-time’ interesting events. Lambda architecture: making batch processing sexy again.

T_1: foo T_1: foo T_2: bar T_2: bar T_3: pepe T_3: pepe T_4: pipi T_4: pipi Batch processing ‘the T_5: lala T_5: lala baseline’ T_6: dipsy Some kind of A view of the data data-source T_6: T_7: T_8: Need to rebuild the ... dipsy po tinky view? Take latest snapshot, and replay all events with a Stream processing greater timestamp. ‘real-time’ Lambda architecture: making batch processing sexy again.

“Don’t call us, we’ll call you.”

Preserve the Provide low-latency, integrity and high-throughput, purpose of the reliable, convenient source of truth. access to the data. Some kind of A view of the data data-source Inversion of control: previously, we ask for data when we need it.

Preserve the Provide low-latency, integrity and high-throughput, purpose of the reliable, convenient source of truth. access to the data. Some kind of A view of the data data-source Inversion of control: now, when the data changes, we are informed.

Part I In which we learn the perils of caching in a microservices architecture, and how lambda architecture helped us out.

Gilt: we source luxury brands...

… we shoot the product in our studios

… we receive

… we sell every day at noon

… stampede!

The Gilt Problem Massive pulse of traffic, every day. => serve fast Low inventory quantities of high value merchandise, changing rapidly => can’t cache Individually personalised landing experiences => can’t cache

Caching “Just say no.” “Until you have to say yes.” “Then, just say maybe .”

consumer (e.g. web-pdp) Hmm, engineer adds a local brand cache to reduce network calls.. brand cache … and then later, another cache for product information. product cache Leads to (1) arbitrary caching policies, & commons <<lib>> (2) duplicated cache information. product-service inventory-service price-service A stateless, cache-free library, busted .

consumer (e.g. web-pdp) We changed the commons library to cache products with a consistent, timed refresh (20m). brand cache Worked well, until the business changed product cache its mind about one small thing: let’s commons <<lib>> make everything in the warehouse sellable. Orders of magnitude more SKUs: * JSON from product service > 1Gb * Startup time > 10m product-service inventory-service price-service * JVM garbage collection every 20m on cache clear * ~1hr to propagate a change. * m4.xlarge, w/ 14Gb JVM Heap A caching library. Worked well initially, but...

commons admin Calatrava λ Kinesis � s product L1 -service Source of Truth - PG Elasticache web-pdp S3 Brands, products, * Startup time ~1s sales, channels, ... * No more stop-the-world GC * ~seconds to propagate a change. * c4.xlarge (CPU!!!), w/ 6Gb JVM Heap Next: replace JSON marshalling with binary Near real-time caching at scale OTW format (e.g. AVRO)

https://github.com/gilt/calatrava - soon to be public

Part 2 In which we learn how we’ve used Lambda architecture to implement a near real-time search index, but needed an additional relational ‘view of truth’.

admin product search -service -indexer Source of Truth - PG Problem: polling a polling service means changes to product data are not reflected in realtime.

admin Calatrava � VOT Kinesis � svc-search -feed s Source of Truth - PG S3 Brands, products, View of Truth - PG sales, channels, ... * Changes are propagated in real-time to Solr * Rebuild of index (s + � *) with zero down time * Same logic for batch & stream (thank you akka-streams) * V.O.T.: “We needed a relational DB to solve a relational problem”

Part 3 In which we use a lambda architecture to facade an unscalable unreliable system as a reliable R+W API… and benefit from always using the same flow .

Inventory SOT ? OMS internet warehouse stores Real-time inventory: bridging bricks’n’clicks

R+W λ Inventory SOT RTAM * Every sku inventory level every 24hrs OMS * Threshold (O, LWM, HWM) inventory events. Elasticache REST API warehouse * APIBuilder.io stores Real-time inventory: bridging bricks’n’clicks * Absolute inventory values

3. 2. R+W λ Inventory SOT RTAM 5. 7. 4. 1. OMS λ 6. Elasticache REST API warehouse stores 1. Is inventory >0 ? 2. Attempt a reservation with OMS. IF it fails, generate a random reservation ID. 3. Put the change on the RTAM stream 4. Update the cache (and stream, not shown) Making a web reservation 5, 6, 7. Trigger a best effort to true-up inventory with ATP (available to purchase)

THERE IS ONLY ONE PATH X

Part 4 In which we learn that the paradigm generalises across third-party boundaries.

International E-Commerce: Taxes, Shipping & Duty is HARD. Performance is critical!

Product Listing Intl Pricing Cache Product Details pricing Third Party service Shipping Partner Checkout Typical solution: cache for PDP & PA, go direct at checkout. Asymmetric, with chance of sticker-shock.

Product Listing Product Details Pricing Checkout Elasticache Service Stream driven solution with flow.io

Part 5 In which we consider Kafka vs. Kinesis

∞ Stream: an immutable, append-only log. Except it isn’t. Which makes us use snapshots, and complicates our architecture.

LOG COMPACTION

(1, janes bond) (2, dr. who) (1, james bond) (3, fr. ted) (1, janes bond) Source of Truth (2, dr. who) (1, james bond) (3, fr. ted) “Log compaction”: always remember the latest version of the same object.

TABLE STREAM DUALITY

KTable & Kafka Streams Library

K-Table & Kafka Streams...

(0) Apply lambda arch to create scalable, reliable offline systems. (1) Replicate and transform the one source of truth (2) It’s not all unicorns and rainbows: complex VOT, snapshots (3) Kinesis is the gateway drug; Kafka is the destination. #thanks @adrian_trenaman @gilttech @hbcdigital

A snapshot, a stream, and a bunch of deltas Applying Lambda - PowerPoint PPT Presentation

A snapshot, a stream, and a bunch of deltas Applying Lambda Architectures in a post-Microservice World Q-Con London March 6th 2018 Ade Trenaman, SVP Engineering, Raconteur, HBC Tech t: @adrian_trenaman http://tech.hbc.com t: @hbcdigital

Corporate Presentation June, 2018 Group Snapshot July 00, 2017 Group Snapshot Group Snapshot -

DELTAS project 2014-2016 Development of a Global Delta Vulnerability Index Fabrice Renaud

1 Agenda Topics I. Welcome / Introductions / News II. General Ledger Deltas III. Billing

Deltas Social Action Plan 2018 Moving Forward on Deltas Social Profile What is a

? sync ref chosen as sync source by Listener Stream B: Presentation Stream C: timestamps

Differentiating tide-influenced and tide-dominated deltas: Devonian Baltic Basin Piret

MODERN WEB APPLICATION DEVELOPMENT WORKFLOW FIRST, LETS LOOK AT THE PAST THROW A BUNCH OF

Stream Ciphers Stream Ciphers 1 Stream Ciphers Generalization of one-time pad Trade

Assessing stream and riparian conditions Stream Habitat Assessment Conducted yearly

Phase III Stream Assessment Study: Potential Stream Restoration Projects Strawberry Run and

UPLOAD VIDEOS TO MICROSOFT STREAM VIA ACCESSUH To upload a video on Microsoft Stream, go to

Fresh water stream ecosystem Gr ov p 2 The description of stream lives Quadrat 1: Hong Kong Newt

CS162: Introduction to Computer Science II Streams 1 Streams A stream is a flow of data

Stream Processing Marco Serafini COMPSCI 532 Lecture 5 Stream vs. Batch Processing Batch

Stream Switching Control draft-gentric-mmusic-stream-switching-00.txt Philippe Gentric

B.e) Stream Ciphers W. Schindler: Cryptography, B-IT, winter 2006 / 2007 2 B.125 Stream Ciphers

South Asian Network Operators Group (SANOG) I 22-28 January, 2003 Kathmandu, Nepal Gaurab Raj

Cache Logistics Trust Extraordinary General Meeting Presentation 19 December 2014 Enduring.

Alpha Presentation Ford Augmented Reality Owners Manual The Capstone Experience Team Ford

Evolving a data system Patterns & Antipatterns Simon Metson Any views or opinions expressed

How I Started Using VoiceThread VISUAL THINKING CLASS Exposure to new technologies for learning

E NGINEERING C OMPUTING S ERVICES (785) 532-4643 support@engg.ksu.edu

Smart Schools Investment Plan Central Square Submission #2 Presentation to the Board of Education

Dam, data collection/display and shoreline protection discussion issues Rob Montgomery

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

A snapshot, a stream, and a bunch of deltas Applying Lambda - PowerPoint PPT Presentation

A snapshot, a stream, and a bunch of deltas Applying Lambda Architectures in a post-Microservice World Q-Con London March 6th 2018 Ade Trenaman, SVP Engineering, Raconteur, HBC Tech t: @adrian_trenaman http://tech.hbc.com t: @hbcdigital

Corporate Presentation June, 2018 Group Snapshot July 00, 2017 Group Snapshot Group Snapshot -

DELTAS project 2014-2016 Development of a Global Delta Vulnerability Index Fabrice Renaud

1 Agenda Topics I. Welcome / Introductions / News II. General Ledger Deltas III. Billing

Deltas Social Action Plan 2018 Moving Forward on Deltas Social Profile What is a

? sync ref chosen as sync source by Listener Stream B: Presentation Stream C: timestamps

Differentiating tide-influenced and tide-dominated deltas: Devonian Baltic Basin Piret

MODERN WEB APPLICATION DEVELOPMENT WORKFLOW FIRST, LETS LOOK AT THE PAST THROW A BUNCH OF

Stream Ciphers Stream Ciphers 1 Stream Ciphers Generalization of one-time pad Trade

Assessing stream and riparian conditions Stream Habitat Assessment Conducted yearly

Phase III Stream Assessment Study: Potential Stream Restoration Projects Strawberry Run and

UPLOAD VIDEOS TO MICROSOFT STREAM VIA ACCESSUH To upload a video on Microsoft Stream, go to

Fresh water stream ecosystem Gr ov p 2 The description of stream lives Quadrat 1: Hong Kong Newt

CS162: Introduction to Computer Science II Streams 1 Streams A stream is a flow of data

Stream Processing Marco Serafini COMPSCI 532 Lecture 5 Stream vs. Batch Processing Batch

Stream Switching Control draft-gentric-mmusic-stream-switching-00.txt Philippe Gentric

B.e) Stream Ciphers W. Schindler: Cryptography, B-IT, winter 2006 / 2007 2 B.125 Stream Ciphers

South Asian Network Operators Group (SANOG) I 22-28 January, 2003 Kathmandu, Nepal Gaurab Raj

Cache Logistics Trust Extraordinary General Meeting Presentation 19 December 2014 Enduring.

Alpha Presentation Ford Augmented Reality Owners Manual The Capstone Experience Team Ford

Evolving a data system Patterns &amp; Antipatterns Simon Metson Any views or opinions expressed

How I Started Using VoiceThread VISUAL THINKING CLASS Exposure to new technologies for learning

E NGINEERING C OMPUTING S ERVICES (785) 532-4643 support@engg.ksu.edu

Smart Schools Investment Plan Central Square Submission #2 Presentation to the Board of Education

Dam, data collection/display and shoreline protection discussion issues Rob Montgomery

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Evolving a data system Patterns & Antipatterns Simon Metson Any views or opinions expressed