From check-ins to recommendations
Jon Hoffman (@hoffrocket)
QCon NYC – June 11, 2014
About Foursquare
Scaling in two parts
• Part one: data storage
• Part two: application complexity
Part 1: Data Storage 2009
Table splits
• Diagram: a single DB holding Venues, Checkins, Users, and Friends is split into DB.A (Venues, Checkins) and DB.B (Users, Friends)
Replication
• Diagram: one read-write (RW) master replicating to two read-only (RO) slaves
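A minimal sketch of the read/write split this topology implies: writes always go to the master, reads fan out across the read-only slaves. The JDBC URLs, database name, and round-robin policy are illustrative assumptions, not Foursquare's actual code.

  import java.sql.{Connection, DriverManager}

  // Illustrative routing for a master + read-only slaves setup.
  object ReplicaRouter {
    private val master = "jdbc:postgresql://db-master/foursquare"       // assumed hostnames
    private val slaves = Vector(
      "jdbc:postgresql://db-slave-1/foursquare",
      "jdbc:postgresql://db-slave-2/foursquare"
    )
    private var next = 0

    // Writes always go to the read-write master.
    def writeConnection(): Connection = DriverManager.getConnection(master)

    // Reads rotate across the read-only slaves.
    def readConnection(): Connection = synchronized {
      val url = slaves(next % slaves.size)
      next += 1
      DriverManager.getConnection(url)
    }
  }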
Outgrowing our hardware
• Not enough RAM for indexes and working data set
• 100 writes/second/disk
Sharding
• Manage it ourselves in application code on top of Postgres?
• Use something called Cassandra?
• Use something called HBase?
• Use something called Mongo?
Besides Mongo
• Memcache
• Elasticsearch
  – nearby venue search
  – user search
• Custom data services
  – read-only key-value server
  – in-memory cache with business logic
HFile Service: Read-only KV Store
• Diagram: Hadoop MR jobs build HFiles (hfile_0, hfile_1) in HDFS; HFile servers load the sharded copies (hfile_0_a, hfile_0_b, hfile_1_a, hfile_1_b) and serve reads from application servers
• Zookeeper holds:
  – data type to machine mapping
  – key hash to shard mapping
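A sketch of the client-side read path this implies: hash the key to pick a shard, then pick a host that serves that shard. The routing case class, shard counts, and lookup helper are assumptions for illustration; in the real system these mappings live in Zookeeper.

  // Illustrative routing data, as it might look once read out of Zookeeper.
  case class HFileRouting(
      shardsPerDataType: Map[String, Int],           // e.g. "hfile_0" -> 2 shards
      shardToHosts: Map[(String, Int), Seq[String]]  // (dataType, shard) -> replica hosts
  )

  object HFileClient {
    // Pick the shard by hashing the key, then return the hosts serving it.
    def hostsFor(routing: HFileRouting, dataType: String, key: Array[Byte]): Seq[String] = {
      val numShards = routing.shardsPerDataType(dataType)
      val shard = Math.floorMod(java.util.Arrays.hashCode(key), numShards)
      routing.shardToHosts((dataType, shard))
    }
  }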
Caching Services
• Diagram: an oplog tailer follows Mongo's oplog into Kafka; Kafka consumers update Redis cache servers, which application servers query over RPC, e.g.:

  getUserVenueCounts(
    1: list<i64> userIds
    2: list<ObjectId> venues)
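A sketch of the cache-maintenance logic behind that RPC: a Kafka consumer sees check-in inserts from the oplog and bumps per-(user, venue) counters that the service then reads. The oplog-entry shape, collection name, and in-memory map are assumptions; the real backing store is Redis.

  import scala.collection.concurrent.TrieMap

  // Assumed, simplified shape of a decoded oplog entry coming off Kafka.
  case class OplogEntry(op: String, collection: String, userId: Long, venueId: String)

  class UserVenueCountCache {
    private val counts = TrieMap.empty[(Long, String), Long]

    // A Kafka consumer would call this for every oplog entry it reads.
    // Read-modify-write here is not atomic; Redis INCR would be in practice.
    def apply(entry: OplogEntry): Unit =
      if (entry.op == "i" && entry.collection == "checkins")
        counts((entry.userId, entry.venueId)) =
          counts.getOrElse((entry.userId, entry.venueId), 0L) + 1L

    // Serves the getUserVenueCounts(userIds, venues) call shown above.
    def get(userIds: Seq[Long], venueIds: Seq[String]): Map[(Long, String), Long] =
      (for (u <- userIds; v <- venueIds)
        yield (u, v) -> counts.getOrElse((u, v), 0L)).toMap
  }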
Part 2: application complexity 2009
RPC Tracing
Throttles
Remember the goats?
Monolithic problems
• Compiling all the code, all the time
• Deploying all the code, all the time
• Hard to isolate the cause of performance regressions and resource leaks
SOA Infancy
• Single codebase, multiple builds: API, Web, Offline
Finagle Era
• Twitter's Scala-based RPC library

  service Geocoder {
    GeocodeResponse geocode(
      1: GeocodeRequest r
    )
  }
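A minimal sketch of wiring that IDL up with Finagle 6.x-era APIs. It assumes Scrooge has generated Geocoder.FutureIface plus the GeocodeRequest/GeocodeResponse types; the port and the stubbed implementation are made up for illustration.

  import com.twitter.finagle.Thrift
  import com.twitter.util.{Await, Future}

  // Server-side implementation of the Scrooge-generated interface.
  class GeocoderImpl extends Geocoder.FutureIface {
    def geocode(r: GeocodeRequest): Future[GeocodeResponse] =
      Future.exception(new UnsupportedOperationException("geocoding logic goes here"))
  }

  object GeocoderMain {
    def main(args: Array[String]): Unit = {
      val server = Thrift.serveIface("localhost:9090", new GeocoderImpl)  // expose the service
      val client = Thrift.newIface[Geocoder.FutureIface]("localhost:9090")
      // client.geocode(request) now goes over the wire and returns a Future[GeocodeResponse]
      Await.ready(server)
    }
  }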
Benefits
• Independent compile targets
• Fine-grained control over releases and bug fixes
• Functional isolation
Problems
• Duplication in packaging and deployment efforts
• Hard to trace execution problems
• Hard to define/change where things live
• Networks aren't reliable
Builds and deploys
• single service definition file
• consistent build packaging
• simple deployment of canary & fleet

  ./service_releaser -j service_name
Monitoring
• healthcheck endpoint over http
• consistent metric names
• dashboard for every service
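For illustration, a healthcheck endpoint can be as small as the sketch below, here using only the JDK's built-in HTTP server so it stays self-contained. The path, port, and "ok" body are assumptions about the convention, not the actual Foursquare setup.

  import com.sun.net.httpserver.{HttpExchange, HttpHandler, HttpServer}
  import java.net.InetSocketAddress

  object Healthcheck {
    def main(args: Array[String]): Unit = {
      val server = HttpServer.create(new InetSocketAddress(8080), 0)
      server.createContext("/healthcheck", new HttpHandler {
        def handle(exchange: HttpExchange): Unit = {
          // A 200 with a small body tells the monitor this instance is healthy.
          val body = "ok".getBytes("UTF-8")
          exchange.sendResponseHeaders(200, body.length)
          exchange.getResponseBody.write(body)
          exchange.close()
        }
      })
      server.start()  // every service exposes the same path so monitoring can poll it
    }
  }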
Distributed Tracing
Exception Aggregation
Application Discovery
• Finagle ServerSets + ZooKeeper
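A sketch of what client-side discovery looks like with finagle-serversets: instead of a fixed host:port, the client is pointed at a ZooKeeper path that live servers register under, and the resolver keeps the host list up to date. The ZK host and path are illustrative assumptions, and the Geocoder.FutureIface type is the same Scrooge-generated interface assumed earlier.

  import com.twitter.finagle.Thrift

  object DiscoveredGeocoder {
    // "zk!<zk host>!<serverset path>" destinations come from finagle-serversets.
    val client = Thrift.newIface[Geocoder.FutureIface](
      "zk!zk-1.prod:2181!/services/geocoder"
    )
  }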
Circuit Breaking
• Fast failing RPC calls after some error rate threshold
• Loosely based on Netflix's Hystrix
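To make the idea concrete, a simplified circuit breaker in the same spirit: after enough failures the circuit opens and calls fail fast until a reset window passes. This sketch trips on consecutive failures rather than an error rate, and the thresholds and wrapping style are assumptions, not the Foursquare or Hystrix implementation.

  // Minimal, illustrative circuit breaker.
  class CircuitBreaker(maxFailures: Int, resetAfterMs: Long) {
    private val failures = new java.util.concurrent.atomic.AtomicInteger(0)
    @volatile private var openedAt = 0L

    def call[T](op: => T): T = {
      val open = failures.get >= maxFailures &&
        (System.currentTimeMillis - openedAt) < resetAfterMs
      if (open) throw new RuntimeException("circuit open: failing fast")
      try {
        val result = op
        failures.set(0)          // a success closes the circuit again
        result
      } catch {
        case e: Throwable =>
          if (failures.incrementAndGet() >= maxFailures) openedAt = System.currentTimeMillis
          throw e                // propagate, but subsequent calls may fail fast
      }
    }
  }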
SOA Problem Recap
• Duplication in packaging and deployment efforts
  – build and deploy automation
• Hard to trace execution problems
  – monitoring consistency
  – distributed tracing
  – error aggregation
• Hard to define/change where things live
  – application discovery with ZooKeeper
• Networks aren't reliable
  – circuit breaking
Organization
• Smaller teams owning front-to-back implementation of features
• Desire to have quick deploy cycles on new API endpoints
Remote Endpoints
Wouldn't it be cool if a developer could expose a new API endpoint without redeploying our still-monolithic API server?
Remote Endpoint Benefits
• Very easy to experiment with new endpoints
• Tight contract for service interaction
  – JSON responses
  – all http params passed along
• Clear path to breaking off more chunks from the API monolith
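A sketch of how the monolith could proxy such an endpoint under that contract: every HTTP parameter is passed through unchanged and the remote service's JSON body is returned as-is. The registry map, endpoint name, service URL, and use of HttpURLConnection are all assumptions for illustration.

  import java.net.{HttpURLConnection, URL, URLEncoder}
  import scala.io.Source

  object RemoteEndpointProxy {
    // Hypothetical mapping from endpoint name to the owning service.
    val registry = Map(
      "venues/explore-beta" -> "http://explore-svc.prod:8080/venues/explore-beta"
    )

    def forward(endpoint: String, params: Map[String, String]): String = {
      // Pass every incoming HTTP param along to the remote service.
      val query = params
        .map { case (k, v) => s"${URLEncoder.encode(k, "UTF-8")}=${URLEncoder.encode(v, "UTF-8")}" }
        .mkString("&")
      val conn = new URL(s"${registry(endpoint)}?$query")
        .openConnection().asInstanceOf[HttpURLConnection]
      // Return the JSON response body unmodified.
      try Source.fromInputStream(conn.getInputStream, "UTF-8").mkString
      finally conn.disconnect()
    }
  }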
Future work: Part 3?
• Further isolating services with independent storage layers?
• Completely automated continuous deployment
• Hybrid immutable/mutable data storage
  – Mongo & HFile & cache service
Thanks!
• Want to build these things? https://foursquare.com/jobs
• jon@foursquare.com