Scaling Social Nets
Pablo Rodriguez, Telefonica Research, Barcelona
5/23/2011

Social Networks: Rapid Growth
▪ Facebook grew from 100M to 200M users in less than 8 months.
▪ Radio took 38 years to reach 50M users, TV 13 years, the Internet 4, the iPod 3, and Facebook 2!
▪ Ashton Kutcher, the first Twitter celebrity, has more than 1M followers.
▪ Nielsen research says social networks are more popular than email.
▪ Facebook is replacing Google as a tool to find content: my friends filter the Web for me.
Towards Transparent Scalability
Elastic resource allocation for the Presentation and Logic layers: add more machines when load increases.
▪ Presentation and Logic components are stateless, therefore independent and duplicable on demand.
▪ Database components are interdependent, therefore not duplicable on demand.
(A sketch of this contrast follows.)
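A minimal sketch (not from the talk) of why stateless tiers scale elastically: any copy of a stateless handler can serve any request, so growing the tier is just adding copies; the database cannot be cloned the same way because its components share interdependent state.

```python
import random

def handle(request):
    """Stateless: the response depends only on the request (and the DB)."""
    return f"page for {request}"

# Three interchangeable app servers; elastic growth is just adding a copy.
web_tier = [handle] * 3
web_tier.append(handle)

# Any copy gives the same answer, so a load balancer can pick freely.
print(random.choice(web_tier)("user/42"))
```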
Scalability is a Pain: the Designers' Dilemma
If data is local to a machine:
▪ a centralized programming paradigm can be used
▪ all read queries are resolved locally, without requests to other machines
▪ management is simple
If data is not local (distributed):
▪ distributed programming is required, and it is not trivial!
▪ management costs increase
▪ performance degrades

Why Sharding Is Not Enough for OSNs
Shards in an OSN can never be disjoint, because operations on user i require access to the data of other users, at least those one hop away. From graph theory: if the network is a single connected component, there is no partition in which every node and all of its neighbors land in the same partition. Data locality cannot be maintained by partitioning a social network! (A small sketch of this fact follows.)
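A minimal sketch (hypothetical toy graph, not from the talk) of the graph-theory claim: on a connected friendship graph, every two-way split leaves at least one user whose friend lives on the other server.

```python
from itertools import combinations

# Hypothetical friendship edges forming a single connected component.
edges = [(1, 2), (2, 3), (3, 4), (4, 5), (5, 1), (2, 5)]
users = {u for e in edges for u in e}

def has_cross_edge(shard_a):
    """True if some friendship crosses the shard boundary."""
    return any((u in shard_a) != (v in shard_a) for u, v in edges)

# Enumerate every non-trivial split of users into two shards.
for size in range(1, len(users)):
    for shard_a in combinations(users, size):
        assert has_cross_edge(set(shard_a))  # locality is always broken

print("Every 2-way partition of a connected graph cuts at least one edge.")
```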
OSN Operations 101
Typical operations require selects and joins across multiple shards of the database.
1) Relational databases are possible, but performance is poor (e.g. MySQL Cluster, Oracle RAC).
2) Key-value stores (DHTs) are more efficient than relational databases: multi-get primitives transparently fetch data from multiple servers.
But they are not a silver bullet:
▪ you lose the SQL query language => programmatic queries
▪ you lose the abstraction from data operations
▪ they suffer from high traffic, eventually affecting performance:
  • the incast issue
  • the multi-get hole
  • latency dominated by the worst-performing server
(A sketch of the multi-get fan-out follows.)
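A minimal sketch (hypothetical API, not a real client library) of the fan-out behind a "fetch my friends' data" multi-get on a randomly partitioned key-value store: one logical read touches many servers, so end-to-end latency is set by the slowest one.

```python
from concurrent.futures import ThreadPoolExecutor

NUM_SERVERS = 4

def server_for(key):
    """Random-style placement: hash each key to a server."""
    return hash(key) % NUM_SERVERS

def fetch_from_server(server, keys):
    """Stand-in for one network round trip to one storage server."""
    return {k: f"profile:{k}" for k in keys}  # pretend every key hits

def multi_get(friend_ids):
    # Group keys by owning server, then query all servers in parallel.
    by_server = {}
    for fid in friend_ids:
        by_server.setdefault(server_for(fid), []).append(fid)
    results = {}
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(fetch_from_server, s, ks)
                   for s, ks in by_server.items()]
        for f in futures:               # the call completes only when the
            results.update(f.result())  # *slowest* server has answered
    return results

print(multi_get(["alice", "bob", "carol", "dave"]))
```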
Maintaining Data Locality Semantics
Sketch of a social network to be split across two servers, under four strategies:
▪ Full replication (typical RDBMS)
▪ Random partition (typical key-value store)
▪ Random partition + replication
▪ SPAR: social-based partition + replication
Performance Problems
▪ Network bandwidth is not an issue, but network I/O is, and CPU I/O too.
▪ Network latency increases:
  • the worst-performing server produces delays
  • the multi-get hole
▪ The memory hit ratio decreases: random partitioning destroys correlations.
SPAR Algorithm from 10,000 Feet
1) Online (incremental)
  a) handles dynamics of the social network (add/remove node or edge)
  b) handles dynamics of the system (add/remove server)
2) Fast (and simple)
  a) local information only
  b) hill-climbing heuristic
  c) load balancing via back-pressure
3) Stable and fair
  a) avoids cascades
  b) fair allocation of load
4) Effective
  a) optimizes for MIN_REPLICA (which is NP-hard)
  b) while maintaining a fixed number of replicas per user for redundancy (e.g. K = 2)
(A simplified sketch of the greedy step appears below.)
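A simplified sketch of SPAR's greedy edge-creation step, based on my reading of the heuristic described above (not the authors' code). When a new friendship (u, v) arrives with masters on different servers, it compares three local configurations: leave both masters in place and add replicas, move u's master, or move v's master, and keeps the cheapest in extra replicas, subject to a crude load-balance check.

```python
masters = {}                 # user -> server holding the master copy
neighbors = {}               # user -> set of friends
load = {0: 0, 1: 0, 2: 0}    # number of masters per server (3 servers)

def replicas_needed(user, server):
    """Replicas `server` would need so that all of user's friends are local."""
    return sum(1 for n in neighbors.get(user, ()) if masters[n] != server)

def add_edge(u, v):
    neighbors.setdefault(u, set()).add(v)
    neighbors.setdefault(v, set()).add(u)
    su, sv = masters[u], masters[v]
    if su == sv:
        return                        # already co-located, nothing to do
    stay   = 2                        # replicate u on sv and v on su
    move_u = replicas_needed(u, sv)   # cost of moving u's master to sv
    move_v = replicas_needed(v, su)   # cost of moving v's master to su
    best = min(stay, move_u, move_v)
    if best == move_u and load[sv] <= load[su]:
        load[su] -= 1; load[sv] += 1; masters[u] = sv
    elif best == move_v and load[su] <= load[sv]:
        load[sv] -= 1; load[su] += 1; masters[v] = su
    # else: keep both masters in place; replicas provide locality

# Tiny demo: alice and bob sit on different servers and become friends.
masters.update(alice=0, bob=1); load[0] += 1; load[1] += 1
add_edge("alice", "bob")
print(masters)  # the cheaper option co-locates the pair on one server
```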
Graph/Social Partitioning Algorithms Fall Short
SPAR differs from classical graph partitioning:
1) SPAR optimizes a different goal: replicas rather than cut edges (the MIN_REPLICA problem)
2) SPAR is incremental (online), stable (minimizes migrations), and simple (and fast)

SPAR Online from 10,000 Feet
MIN_REPLICA is NP-hard, so SPAR uses a heuristic:
▪ greedy optimization
▪ local information
▪ load balance constraint
[Figure: worked example of six add/remove events on a small graph; in the status quo, a replica of node 1 sits in M3 and a replica of node 6 in M1.]
SPAR in the Wild
Twitter clone (Laconica, now StatusNet):
▪ centralized architecture, PHP + MySQL/Postgres
Twitter data:
▪ Twitter as of end of 2008: 2.4M users, 12M tweets over 15 days
The little engines:
▪ 16 commodity desktops: Pentium Duo at 2.33 GHz, 2 GB RAM, connected by a Gigabit Ethernet switch
Test SPAR on top of:
▪ MySQL (v5.5)
▪ Cassandra (results below)

SPAR – System Overview
[Architecture figure]
Trying Out Various Partitioning Algorithms
Real OSN data:
▪ Twitter (as of Dec '08): 2.4M users, 48M edges, 12M tweets over 15 days
▪ Orkut (MPI): 3M users, 223M edges
▪ Facebook (MPI): 60K users, 500K edges
Partition algorithms:
▪ Random (DHT)
▪ METIS
▪ Spectral clustering
▪ MO+ (modularity optimization + recursive)
▪ SPAR online

How Many Replicas Are Generated?
              Twitter, 16 partitions      Twitter, 128 partitions
          Rep. overhead  % over K=2    Rep. overhead  % over K=2
SPAR           2.44         +22%            3.69         +84%
MO+            3.04         +52%            5.14        +157%
METIS          3.44         +72%            8.03        +302%
Random         5.68        +184%           10.75        +434%
Reading the table: SPAR needs 2.44 replicas per user on average, that is, 2 to guarantee redundancy + 0.44 to guarantee data locality (+22%). Replication with random partitioning is too costly!
Other social-based partitioning algorithms have a higher replication overhead than SPAR, and the replication overhead grows sub-linearly with the number of partitions. (The overhead arithmetic is checked below.)
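A quick sanity check of the table's "% over K=2" column: with K = 2 replicas per user as the redundancy floor, overhead_% = (avg_replicas / K - 1) * 100. The values below are the 16-partition Twitter column.

```python
K = 2  # redundancy baseline: two replicas per user
for name, avg in [("SPAR", 2.44), ("MO+", 3.04),
                  ("METIS", 3.44), ("Random", 5.68)]:
    print(f"{name:7s} {avg:.2f} replicas/user -> "
          f"+{(avg / K - 1) * 100:.0f}% over K={K}")
# SPAR's +22% is exactly the 0.44 extra replicas paid for data locality.
```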
SPAR on Top of Cassandra
Maximum read request rate (application level) keeping the 99th-percentile response time under 100 ms:
▪ Vanilla Cassandra: 200 req/s
▪ SPAR + Cassandra: 800 req/s
Why: network bandwidth is not an issue, but network I/O and CPU I/O are. With SPAR:
▪ network delays are reduced (no waiting on the worst-performing server)
▪ the memory hit ratio increases (random partitioning destroys correlations; SPAR preserves them)

Conclusions
SPAR provides the means to achieve transparent scalability:
1) for applications using an RDBMS (not necessarily limited to OSNs)
2) and a performance boost for key-value stores, due to the reduction of network I/O
Questions? Thanks.