Pregel: A System for Large-Scale Graph Processing Grzegorz - PowerPoint PPT Presentation

Jun 09, 2023 •11 likes •131 views

Pregel: A System for Large-Scale Graph Processing Pregel: A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski Bogdan-Alexandru

Pregel: A System for Large-Scale Graph Processing Pregel: A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski Bogdan-Alexandru Matican University of Cambridge February 26, 2013
Pregel: A System for Large-Scale Graph Processing Table of contents 1 Research questions 2 Design Programming Model Usability Architecture 3 Experiments 4 Conclusion
Pregel: A System for Large-Scale Graph Processing Research questions Main considerations Typical Google system’s paper. Cross-research influences: MapReduce, Chubby, GFS, BigTable. Scalability process graphs of billions of vertexes Usability paradigm, API, features Architecture Master-Slave, network aggregation, data locality Transparency fault tolerance, commodity machines Performance resources, speed, scale
Pregel: A System for Large-Scale Graph Processing Design Programming Model Vertex local action: vertex and outgoing edges message passing communication independent state change: synchronicity
Pregel: A System for Large-Scale Graph Processing Design Programming Model System supersteps (BSP model) message based state alterations aggregation performance optimizations fault tolerance (check-pointing)
Pregel: A System for Large-Scale Graph Processing Design Usability API Design simple interface for users to understand usage pattern driven: Combiner, Aggregator, Http IO format variable for interoperability fault tolerance transparent data partitioning
Pregel: A System for Large-Scale Graph Processing Design Architecture Components and Mechanics data sharding (graph partitioning) Master (ids, sharding, sync, pings) Workers (supersteps, state, buffering) fault tolerance (check-pointing, confined recovery) performance considerations
Pregel: A System for Large-Scale Graph Processing Experiments Scalability Figure : Binary tree topology for 800 workers, 300 machines. Linear scaling of runtime for binary fan-out, high vertex count.
Pregel: A System for Large-Scale Graph Processing Experiments Scalability Figure : Social graph topology for 800 workers, 300 machines. Linear scaling of runtime for relatively sparse graphs with instances of high density.
Pregel: A System for Large-Scale Graph Processing Experiments Notes naive implementation of SSSP no input pre-processing or special sharding comparable results with state-of-the-art systems scalable considerably past points shown in paper
Pregel: A System for Large-Scale Graph Processing Conclusion Contributions programming model design simplicity concurency avoidance fault tolerance performance optimizations
Pregel: A System for Large-Scale Graph Processing Conclusion Critique and questions master failover mechanism? evaluation: good enough for us evaluation: how much faster?

Recommend

Pregel: A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, Aart

Pregel: A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski Google, Inc. 2010 What is Pregel? A System for Large-Scale

377 views • 16 slides

Pregel Large-Scale Graph Processing William Jones Analysing large graphs is hard. We are

Pregel Large-Scale Graph Processing William Jones Analysing large graphs is hard. We are keenly interested in analysing certain very large graphs. (e.g. the Web graph) These graphs are now too large to store and process on one

236 views • 11 slides

Optimising Graph Algorithms on Pregel-Like Systems S. Salihoglu, J. Widom Stanford University

Optimising Graph Algorithms on Pregel-Like Systems S. Salihoglu, J. Widom Stanford University Philip Leonard November 24 th , 2014 November 24 th , 2014 Philip Leonard (University of Cambridge) Pregel Optimisation 1 / 14 Pregel Reminder Bulk

514 views • 14 slides

Graph Processing Connor Gramazio Spiros Boosalis Pregel why not MapReduce? semantics: awkward

Graph Processing Connor Gramazio Spiros Boosalis Pregel why not MapReduce? semantics: awkward to write graph algorithms efficiency: mapreduces serializes state (e.g. all nodes and edges) while pregel keeps state local (e.g. nodes stay on

992 views • 64 slides

PREGEL: A SYSTEM FOR LARGE-SCALE GRAPH PROCESSING Grzegorz Malewicz, Matthew H. Austern, Aart J.

PREGEL: A SYSTEM FOR LARGE-SCALE GRAPH PROCESSING Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski -2010 Presented by K.M.D.Muthumali Karunarathna 27 th October 2015

218 views • 18 slides

Pregel: A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, Aart J.

269 views • 24 slides

Pregel A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, et. al.

Pregel A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, et. al. Google, Inc. 2010 ACM SIGMOD Conference Presented By: Ezequiel Aguilar Gonzalez Computer Science and Engineering The University of Texas at

1.13k views • 44 slides

Pregel: A System for Large- Scale Graph Processing Written by G. Malewicz et al. at SIGMOD 2010

Pregel: A System for Large- Scale Graph Processing Written by G. Malewicz et al. at SIGMOD 2010 Presented by Chris Bunch Tuesday, October 12, 2010 1 Wednesday, October 13, 2010 Graphs are hard Poor locality of memory access Very

656 views • 31 slides

Efficient Large-Scale Graph Processing on Hybrid CPU and GPU Systems A. Gharaibeh, E.

Efficient Large-Scale Graph Processing on Hybrid CPU and GPU Systems A. Gharaibeh, E. Santos-Neto, L. Costa, M. Ripeanu. IEEE TPC, 2014 Sami (sa894) - R244: Large-scale data processing and optimization Efficient Large-Scale Graph Processing on

381 views • 25 slides

Think Like a {Vertex, Column, Parallel Collection} David Konerding, Google Inc. Pregel: a system

Think Like a {Vertex, Column, Parallel Collection} David Konerding, Google Inc. Pregel: a system for large-scale graph processing Grzegorz Malewicz, Matthew H. Austern, Aart J.C. Bik , James C. Dehnert, Ilan Horn, Naty Leiser, Grzegorz Czajkowski

681 views • 29 slides

Graph Mining - PageRank Mert Terzihan-Zhixiong Chen Content 1. Web as a Graph 2. Why is

Graph Mining - PageRank Mert Terzihan-Zhixiong Chen Content 1. Web as a Graph 2. Why is PageRank important? 3. Markov Chains 4. PageRank Computation 5. Hadoop Review 6. Hadoop PageRank Implementation 7. Pregel Review 8. Pregel PageRank

533 views • 29 slides

A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org

A large-scale International IPv6 Network A large-scale International IPv6 Network A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org www.6net.org A large-scale International IPv6 Network A

174 views • 15 slides

Granula: Toward Fine-grained Performance Analysis of Large-scale Graph Processing Platforms Wing

Granula: Toward Fine-grained Performance Analysis of Large-scale Graph Processing Platforms Wing Lung Ngai, Tim Hegeman, Stijn Heldens, and Alexandru Iosup @Large Research Massivizing Computer Systems Large-scale Graph Processing 2 OpenG

266 views • 13 slides

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large Scale Solar Lead April 2017 CONTENTS 1. Introduction to CEFC 2. Investment trends 3. The future of large scale solar 4. Pathway to sustainable

664 views • 21 slides

HelP: High-level Primitives for Large- Scale Graph Processing Semih Salihoglu Stanford

HelP: High-level Primitives for Large- Scale Graph Processing Semih Salihoglu Stanford University Jennifer Widom Stanford University 1 Large-scale Graph Processing 10s or 100s billion vertices and edges Distributed Shared-Nothing

658 views • 16 slides

GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan*

Graph Mining and Graph Kernels GRAPH MINING AND GRAPH KERNELS Part I: Graph Mining Karsten Borgwardt^ and Xifeng Yan* ^University of Cambridge *IBM T. J. Watson Research Center August 24, 2008 | ACM SIG KDD, Las Vegas Graph Mining and Graph

1.28k views • 60 slides

_________________________ Solutions for solar energy Rev.1 Contents Few Basics of PV Module

_________________________ Solutions for solar energy Rev.1 Contents Few Basics of PV Module String Sizing Power Ratio Optimisation Different kind of I nverters Rev.1 Rev.1 Rev.1 Rev.1 Rev.1 Rev.1 Rev.1 Rev.1 Rev.1 Example sizing

562 views • 33 slides

Outline Evolutionary Computations (EC) Parallel and Distributed EC Master-slave

Distributed BEAGLE: An Environment for Parallel and Distributed Evolutionary Computations Christian Gagn, Marc Parizeau, and Marc Dubreuil Dpartement de gnie lectrique et de gnie informatique Qubec (Qubec), Canada Outline

795 views • 17 slides

A Mathematical Solution to Power Optimal Pipeline Design Power Optimal Pipeline Design by

A Mathematical Solution to Power Optimal Pipeline Design Power Optimal Pipeline Design by Utilizing Soft Edge Flip Flops M. Ghasemazar, B. Amelifard, M. Pedram University of Southern California Department of Electrical Engineering August 11,

320 views • 19 slides

Securing Australias Future Energy with Storage Anand I. Bhatt | Research Team Leader Australian

Securing Australias Future Energy with Storage Anand I. Bhatt | Research Team Leader Australian Energy Storage Alliance Networking Evening Thursday 26 th March 2015 ENERGY FLAGSHIP Whats happening in Australias grid? Cost of

476 views • 12 slides

E XTINGUISHING SYSTEM FOR SCS RACK Mateusz Tabua Marek Kowaluk Warsaw University of Technology

Slow Control System JINR Dubna 2016 E XTINGUISHING SYSTEM FOR SCS RACK Mateusz Tabua Marek Kowaluk Warsaw University of Technology Veksler and Baldin Laboratory of High Energy Physics Supervisor : M.Sc. Eng. Marek Peryt Slow Control System

298 views • 8 slides

CBCR AND MASTER FILE Narendra Kumar J Jain, NNMS Legal Chambers Narendra@nnms.in 1 BACKGROUND

CBCR AND MASTER FILE Narendra Kumar J Jain, NNMS Legal Chambers Narendra@nnms.in 1 BACKGROUND As part of Organization for Economic Co-operation and Development (OECD) BEPS Project, it came up with BEPS Action Plan reports containing 15

909 views • 37 slides

New Pharm acovigilance Legislation and I m plem enting Measures Minim um Requirem ents for

New Pharm acovigilance Legislation and I m plem enting Measures Minim um Requirem ents for Quality System s ( MAH, EMA, NCA) , m inim um requirem ents for Pharm acovigilance System Master File Stakeholder Meeting, 17 June 2011, EMA,

508 views • 22 slides

Pharmacovigilance System Master File Discussion of the need to revise GVP guidance 8th industry

Pharmacovigilance System Master File Discussion of the need to revise GVP guidance 8th industry stakeholder platform operation of EU PV legislation Presented by Sophia Mylona on 1 July 2016 Compliance and inspections department An agency of

280 views • 5 slides