Shuffle Phase Executed only in the case of one or more reducers - PowerPoint PPT Presentation

Feb 18, 2024

Shuffle Phase (01/23/2018)
Executed only when the job has one or more reducers.
Transfers data between the mappers and the reducers.
Groups records by their keys so that each group can be processed locally in the reduce phase.
Shuffle Phase
[Figure: map tasks Map 1 … Map M connected to reduce tasks Reduce 1 … Reduce N]
Shuffle Phase (Map-side)
[Figure: Map i reads its input split, applies map, and partitions the resulting key-value pairs into partitions 0 … N-1, one partition for each of Reduce 1 … Reduce N]
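The map-side step in the figure can be sketched in plain Python. This is a minimal illustration, not Hadoop code: the names `partition_for` and `map_side_shuffle` are hypothetical, the CRC32-based hash is just one deterministic choice of partition function, and sorting each partition by key is an assumption about what the mapper does before handing data over.

```python
import zlib


def partition_for(key: str, num_reducers: int) -> int:
    """Map a key to a partition number in the range 0 .. num_reducers - 1.

    A deterministic hash (CRC32 here, chosen only for illustration) ensures
    that every mapper sends a given key to the same reducer.
    """
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def map_side_shuffle(pairs, num_reducers):
    """Split one mapper's (key, value) output into num_reducers partitions."""
    partitions = [[] for _ in range(num_reducers)]
    for key, value in pairs:
        partitions[partition_for(key, num_reducers)].append((key, value))
    # Assumption: each partition is sorted by key on the map side.
    return [sorted(p, key=lambda kv: kv[0]) for p in partitions]


if __name__ == "__main__":
    output = [("apple", 1), ("zebra", 1), ("apple", 2), ("mango", 1)]
    for number, part in enumerate(map_side_shuffle(output, 3)):
        print(number, part)
```

Because the partition number depends only on the key, all values for one key end up at a single reducer, which is what makes local processing in the reduce phase possible.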
Shuffle Phase (Reduce-side)
[Figure: Reduce j copies its partition (part 1 … part M) from each of Map 1 … Map M, sorts the copied key-value pairs, and then applies reduce]
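The copy and sort steps on the reduce side can be sketched as follows. This is a simplified in-memory illustration: `reduce_side_shuffle` is a hypothetical name, each mapper's output is modeled as a list of partitions, and the sketch assumes every partition arrives already sorted by key, so the sort reduces to a merge of sorted runs.

```python
import heapq


def reduce_side_shuffle(map_outputs, j):
    """Collect and order the input of reducer j.

    map_outputs[m][j] is the partition that mapper m produced for reducer j,
    assumed to be sorted by key already.
    """
    # Copy: fetch partition j from every mapper.
    parts = [output[j] for output in map_outputs]
    # Sort: merge the sorted runs into one key-ordered stream.
    return list(heapq.merge(*parts, key=lambda kv: kv[0]))


if __name__ == "__main__":
    map_outputs = [
        [[("a", 1), ("c", 1)], [("b", 1)]],
        [[("a", 2)], [("b", 2), ("d", 1)]],
        [[("c", 3)], []],
    ]
    print(reduce_side_shuffle(map_outputs, 0))
    print(reduce_side_shuffle(map_outputs, 1))
```

After the merge, records with the same key sit next to each other, so the reducer can process one group at a time without holding the whole input in memory.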
Reduce Phase
Apply the reduce function to each group of records that share the same key.
[Figure: records grouped by key k1 … kN; each group is passed to its own reduce call, and the results are written to the output]
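A minimal sketch of the reduce phase, assuming the reducer's input is already sorted by key. The name `run_reduce` and the `reduce_fn(key, values)` signature are hypothetical and chosen only for this illustration.

```python
from itertools import groupby
from operator import itemgetter


def run_reduce(sorted_records, reduce_fn):
    """Call reduce_fn once per key on key-sorted (key, value) records."""
    output = []
    # groupby only merges adjacent records, so the input must be sorted by key.
    for key, group in groupby(sorted_records, key=itemgetter(0)):
        values = [value for _, value in group]
        output.append((key, reduce_fn(key, values)))
    return output


if __name__ == "__main__":
    records = [("k1", 1), ("k1", 2), ("k2", 5)]
    print(run_reduce(records, lambda key, values: sum(values)))
```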
Output Writing
Materializes the final output to disk.
All results from one process (mapper or reducer) are stored in a subdirectory.
An OutputFormat is used to:
  create any files in the output directory;
  write the output records one by one to the output;
  merge the results from all the tasks (if needed).
While the output writing runs in parallel, the final commit step runs on a single machine.
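The write-then-commit pattern described above can be imitated on a local file system. This is a simplified sketch, not the Hadoop OutputFormat API: the functions `write_task_output` and `commit_job`, the tab-separated record layout, and the `task_<id>` directory names are assumptions made for the example. The `_temporary` directory and `_SUCCESS` marker mirror names used by Hadoop's file-based output, but the logic here is only an approximation.

```python
import os
import tempfile


def write_task_output(output_dir, task_id, records):
    """Write one task's records into its own temporary subdirectory."""
    task_dir = os.path.join(output_dir, "_temporary", f"task_{task_id}")
    os.makedirs(task_dir, exist_ok=True)
    path = os.path.join(task_dir, f"part-{task_id:05d}")
    with open(path, "w", encoding="utf-8") as handle:
        for key, value in records:  # records are written one by one
            handle.write(f"{key}\t{value}\n")
    return path


def commit_job(output_dir):
    """Single-process commit: move every task file into the output directory."""
    temp_root = os.path.join(output_dir, "_temporary")
    for task_name in sorted(os.listdir(temp_root)):
        task_dir = os.path.join(temp_root, task_name)
        for file_name in os.listdir(task_dir):
            os.replace(os.path.join(task_dir, file_name),
                       os.path.join(output_dir, file_name))
        os.rmdir(task_dir)
    os.rmdir(temp_root)
    # An empty marker file signals that the whole job finished.
    open(os.path.join(output_dir, "_SUCCESS"), "w").close()
    return sorted(os.listdir(output_dir))


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as out:
        write_task_output(out, 0, [("a", 3)])  # tasks could run in parallel
        write_task_output(out, 1, [("b", 5)])
        print(commit_job(out))
```

Keeping each task's files in a private subdirectory until the commit means a failed or duplicated task never leaves partial files in the final output.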
MapReduce Examples
Input: a log file
Filter
Aggregation
Conversion
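The three example job types can be sketched with a tiny in-memory driver. Everything here is illustrative: the log format (`date level message`), the sample lines, and the names `run_job`, `filter_errors`, `count_by_level`, and `to_csv` are assumptions, not taken from the slides. Filter and conversion are modeled as map-only jobs, so no shuffle runs; aggregation needs a reducer and therefore a shuffle.

```python
from collections import defaultdict

# Hypothetical log lines in the form "date level message".
LOG = [
    "2018-01-23 INFO start",
    "2018-01-23 ERROR disk full",
    "2018-01-24 INFO done",
    "2018-01-24 ERROR timeout",
]


def run_job(lines, map_fn, reduce_fn=None):
    """Run a toy MapReduce job; the shuffle only happens if reduce_fn is given."""
    mapped = [pair for line in lines for pair in map_fn(line)]
    if reduce_fn is None:  # map-only job: no shuffle, no reduce
        return mapped
    groups = defaultdict(list)  # shuffle: group values by key
    for key, value in mapped:
        groups[key].append(value)
    return [(key, reduce_fn(key, groups[key])) for key in sorted(groups)]


def filter_errors(line):
    """Filter: keep only the ERROR lines."""
    if " ERROR " in line:
        yield (line, None)


def count_by_level(line):
    """Aggregation (map part): emit (level, 1) for every line."""
    _, level, _ = line.split(" ", 2)
    yield (level, 1)


def to_csv(line):
    """Conversion: rewrite each line as a comma-separated record."""
    date, level, message = line.split(" ", 2)
    yield (f"{date},{level},{message}", None)


if __name__ == "__main__":
    print(run_job(LOG, filter_errors))
    print(run_job(LOG, count_by_level, lambda key, values: sum(values)))
    print(run_job(LOG, to_csv))
```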
Advanced Issues
Map failures
Reduce failures
Straggler problem
Custom keys and values
Efficient sorting on serialized data
Pipeline MapReduce jobs
