Just tired of endless loops! or parallel : Stata module for parallel - PowerPoint PPT Presentation

Just tired of endless loops! or parallel : Stata module for parallel computing George G. Vega Yon 1 Brian Quistorff 2 1 University of Southern California vegayon@usc.edu 2 Microsoft AI and Research Brian.Quistorff@microsoft.com Stata Conference Baltimore July 27–28, 2017 Thanks to Stata users worldwide for their valuable contributions. The usual disclaimers applies.

Agenda Motivation What is it and how does it work Benchmarks Syntax and Usage Concluding Remarks

Motivation ◮ Both computation power and size of data are ever increasing

Motivation ◮ Both computation power and size of data are ever increasing ◮ Often our work is easily broken down into independent chunks

Motivation ◮ Both computation power and size of data are ever increasing ◮ Often our work is easily broken down into independent chunks ◮ Implementing parallel computing, even for these “embarrassingly parallel” problems, however, is not easy.

Motivation ◮ Both computation power and size of data are ever increasing ◮ Often our work is easily broken down into independent chunks ◮ Implementing parallel computing, even for these “embarrassingly parallel” problems, however, is not easy. ◮ Stata/MP exists, but only parallelizes a limited set of internal commands, not user commands.

Motivation ◮ Both computation power and size of data are ever increasing ◮ Often our work is easily broken down into independent chunks ◮ Implementing parallel computing, even for these “embarrassingly parallel” problems, however, is not easy. ◮ Stata/MP exists, but only parallelizes a limited set of internal commands, not user commands. ◮ parallel aims to make this more convenient.

Motivation What is it and how does it work Benchmarks Syntax and Usage Concluding Remarks

What is it and how does it work What is it? ◮ Inspired by the R package “snow” (several other examples exists: HTCondor, Matlab’s Parallel Toolbox, etc.)

What is it and how does it work What is it? ◮ Inspired by the R package “snow” (several other examples exists: HTCondor, Matlab’s Parallel Toolbox, etc.) ◮ Launches “child” batch-mode Stata processes across multiple processors (e.g. simultaneous multi-threading, multiple cores, sockets, cluster nodes).

What is it and how does it work What is it? ◮ Inspired by the R package “snow” (several other examples exists: HTCondor, Matlab’s Parallel Toolbox, etc.) ◮ Launches “child” batch-mode Stata processes across multiple processors (e.g. simultaneous multi-threading, multiple cores, sockets, cluster nodes). ◮ Depending on the task, can reach near linear speedups proportional to the number of processors.

What is it and how does it work What is it? ◮ Inspired by the R package “snow” (several other examples exists: HTCondor, Matlab’s Parallel Toolbox, etc.) ◮ Launches “child” batch-mode Stata processes across multiple processors (e.g. simultaneous multi-threading, multiple cores, sockets, cluster nodes). ◮ Depending on the task, can reach near linear speedups proportional to the number of processors. ◮ Thus having a quad-core computer can lead to a 400% speedup.

Simple usage Serial: ◮ gen v2 = v*v ◮ do byobs calc.do ◮ bs, reps(5000): reg price foreign rep

Simple usage Serial: Parallel: ◮ gen v2 = v*v ◮ parallel: gen v2 = v*v ◮ do byobs calc.do ◮ parallel do byobs calc.do ◮ bs, reps(5000): reg price foreign ◮ parallel bs, reps(5000): reg price rep foreign rep

What is it and how does it work How does it work? ◮ Method is split-apply-combine like MapReduce.

What is it and how does it work How does it work? programs globals Starting (current) stata instance loaded with Data data plus user defined globals , programs , mata mata mata objects and mata programs objects programs Splitting the data set Cluster 1 Cluster 2 Cluster 3 Cluster n ... Passing A new stata instance (batch-mode) for every objects data-clusters. Programs, globals and mata objects/programs are passed to them. Task ( stata batch-mode ) The same algorithm (task) is simultaneously ap- plied over the data-clusters. After every instance stops, the data-clusters are appended into one. Cluster Cluster Cluster Cluster ... 1’ 2’ 3’ n ’ Appending the data set Ending (resulting) stata instance loaded with the globals programs new data. Data’ mata mata User defined globals , programs , mata objects objects programs and mata programs remind unchanged.

What is it and how does it work How does it work? ◮ Method is split-apply-combine like MapReduce. Very flexible!

What is it and how does it work How does it work? ◮ Method is split-apply-combine like MapReduce. Very flexible! ◮ Straightforward usage when there is observation- or group-level work

What is it and how does it work How does it work? ◮ Method is split-apply-combine like MapReduce. Very flexible! ◮ Straightforward usage when there is observation- or group-level work ◮ If each iteration needs the entire dataset, then use procedure to split the tasks and load the data separately. Examples:

What is it and how does it work How does it work? ◮ Method is split-apply-combine like MapReduce. Very flexible! ◮ Straightforward usage when there is observation- or group-level work ◮ If each iteration needs the entire dataset, then use procedure to split the tasks and load the data separately. Examples: ◮ Table of seeds for each bootstrap resampling

What is it and how does it work How does it work? ◮ Method is split-apply-combine like MapReduce. Very flexible! ◮ Straightforward usage when there is observation- or group-level work ◮ If each iteration needs the entire dataset, then use procedure to split the tasks and load the data separately. Examples: ◮ Table of seeds for each bootstrap resampling ◮ Table of parameter values for simulations

What is it and how does it work How does it work? ◮ Method is split-apply-combine like MapReduce. Very flexible! ◮ Straightforward usage when there is observation- or group-level work ◮ If each iteration needs the entire dataset, then use procedure to split the tasks and load the data separately. Examples: ◮ Table of seeds for each bootstrap resampling ◮ Table of parameter values for simulations ◮ If the list of tasks is data-dependent then the “nodata” alternative mechanism allows for more flexibility.

Implementation Some details ◮ Uses shell on Linux/MacOS. On Windows we have a compiled plugging allowing:

Implementation Some details ◮ Uses shell on Linux/MacOS. On Windows we have a compiled plugging allowing: ◮ Functionality when the parent Stata is in batch-mode

Implementation Some details ◮ Uses shell on Linux/MacOS. On Windows we have a compiled plugging allowing: ◮ Functionality when the parent Stata is in batch-mode ◮ Seamless user experience by launching the child programs in a hidden desktop (otherwise GUI for each steals focus)

Implementation Some details ◮ Uses shell on Linux/MacOS. On Windows we have a compiled plugging allowing: ◮ Functionality when the parent Stata is in batch-mode ◮ Seamless user experience by launching the child programs in a hidden desktop (otherwise GUI for each steals focus) ◮ For a Linux/MacOS cluster with a shared filesystem (e.g. NFS) and ssh-like commands, can distribute across nodes.

Implementation Some details ◮ Uses shell on Linux/MacOS. On Windows we have a compiled plugging allowing: ◮ Functionality when the parent Stata is in batch-mode ◮ Seamless user experience by launching the child programs in a hidden desktop (otherwise GUI for each steals focus) ◮ For a Linux/MacOS cluster with a shared filesystem (e.g. NFS) and ssh-like commands, can distribute across nodes. ◮ New feature so we’d appreciate help from the community to extend to other cluster settings (e.g. PBS)

Implementation Some details ◮ Uses shell on Linux/MacOS. On Windows we have a compiled plugging allowing: ◮ Functionality when the parent Stata is in batch-mode ◮ Seamless user experience by launching the child programs in a hidden desktop (otherwise GUI for each steals focus) ◮ For a Linux/MacOS cluster with a shared filesystem (e.g. NFS) and ssh-like commands, can distribute across nodes. ◮ New feature so we’d appreciate help from the community to extend to other cluster settings (e.g. PBS) ◮ Make sure that child tempnames or tempvars don’t clash with those coming from parent.

Implementation Some details ◮ Uses shell on Linux/MacOS. On Windows we have a compiled plugging allowing: ◮ Functionality when the parent Stata is in batch-mode ◮ Seamless user experience by launching the child programs in a hidden desktop (otherwise GUI for each steals focus) ◮ For a Linux/MacOS cluster with a shared filesystem (e.g. NFS) and ssh-like commands, can distribute across nodes. ◮ New feature so we’d appreciate help from the community to extend to other cluster settings (e.g. PBS) ◮ Make sure that child tempnames or tempvars don’t clash with those coming from parent. ◮ Passes through programs, macros and mata objects, but NOT Stata matrices or scalars. No state but datasets are returned to parent.

Just tired of endless loops! or parallel : Stata module for parallel - PowerPoint PPT Presentation

Just tired of endless loops! or parallel : Stata module for parallel computing George G. Vega Yon 1 Brian Quistorff 2 1 University of Southern California vegayon@usc.edu 2 Microsoft AI and Research Brian.Quistorff@microsoft.com Stata Conference

LOOPS Loops Loops Loops! How can we repeat a piece of code without having to write it out over

Endless LLP Access to Finance Event Richard Harrison 18 January 2017 1 Introduction to Endless

Tutorial 3 Loops Side Effects 1 CS 136 Spring 2020 Tutorial 3 Loops: for loops &

Loops! Flow of Control: Loops (Savitch, Chapter 4) TOPICS while Loops do while

Loops! Loops! Loops! Lecture 10 COP 3014 Spring 2017 January 31, 2017 Repetition Statements

Loops! Loops! Loops! Lecture 5 COP 3014 Fall 2020 September 17, 2020 Repetition Statements

Clean. Clean. Simple Simple. . Smar Smart. t. refr fresh esh your w y y our workplace

PerfectBeam Endless possibilities to shape light PerfectBeam Endless possibilities to shape

Building Java Programs Chapter 5 Lecture 5-1: while Loops, Fencepost Loops, and Sentinel Loops

Repetition with for loops Topic 5 for loops and nested loops So far, repeating a statement is

Types of loops Topic 15 definite loop : A loop that executes a known number of Indefinite

Building Java Programs Chapter 5 Lecture 10: while Loops, Fencepost Loops, and Sentinel Loops

ARM Assembler Structure / Loops Structure / Loops p. 1/12 Loops Four parts to any loop

Loops Simone Campanoni simonec@eecs.northwestern.edu Outline Loops Identify loops

Building Java Programs Chapter 5 Lecture 5-1: while Loops, Fencepost Loops, and Sentinel Loops

Building Java Programs Chapter 5 Lecture 11: while Loops, Fencepost Loops, and Sentinel Loops

Echoicity and contrast in Spanish conditionals Elena Castroviejo and Laia Mayol Ikerbasque and

Copy raising and perception: A fine-grained semantics for raising and control Ash Asudeh &

MySQL Replication and HA at Facebook Part-II Jeff Jiang Production Engineer Facebook, Inc

Tips for Obtaining a Security Clearance Information obtained from Partnership for Public Service

Local search algorithms CS271P, Winter 2018 Introduction to Artificial Intelligence Prof.

A Command-Line Driver Generator or What I did when I got tired of writing command-line

Population Protocols and Predicates Pierre Ganty IMDEA Software Institute The computer science

A Fixed Point Theorem for Non-Monotonic Functions Esik 1 and P. Rondogiannis 2 an Zolt 1

Just tired of endless loops! or parallel : Stata module for parallel - PowerPoint PPT Presentation

Just tired of endless loops! or parallel : Stata module for parallel computing George G. Vega Yon 1 Brian Quistorff 2 1 University of Southern California vegayon@usc.edu 2 Microsoft AI and Research Brian.Quistorff@microsoft.com Stata Conference

LOOPS Loops Loops Loops! How can we repeat a piece of code without having to write it out over

Endless LLP Access to Finance Event Richard Harrison 18 January 2017 1 Introduction to Endless

Tutorial 3 Loops Side Effects 1 CS 136 Spring 2020 Tutorial 3 Loops: for loops &amp;

Loops! Flow of Control: Loops (Savitch, Chapter 4) TOPICS while Loops do while

Loops! Loops! Loops! Lecture 10 COP 3014 Spring 2017 January 31, 2017 Repetition Statements

Loops! Loops! Loops! Lecture 5 COP 3014 Fall 2020 September 17, 2020 Repetition Statements

Clean. Clean. Simple Simple. . Smar Smart. t. refr fresh esh your w y y our workplace

PerfectBeam Endless possibilities to shape light PerfectBeam Endless possibilities to shape

Building Java Programs Chapter 5 Lecture 5-1: while Loops, Fencepost Loops, and Sentinel Loops

Repetition with for loops Topic 5 for loops and nested loops So far, repeating a statement is

Types of loops Topic 15 definite loop : A loop that executes a known number of Indefinite

Building Java Programs Chapter 5 Lecture 10: while Loops, Fencepost Loops, and Sentinel Loops

ARM Assembler Structure / Loops Structure / Loops p. 1/12 Loops Four parts to any loop

Loops Simone Campanoni simonec@eecs.northwestern.edu Outline Loops Identify loops

Building Java Programs Chapter 5 Lecture 5-1: while Loops, Fencepost Loops, and Sentinel Loops

Building Java Programs Chapter 5 Lecture 11: while Loops, Fencepost Loops, and Sentinel Loops

Echoicity and contrast in Spanish conditionals Elena Castroviejo and Laia Mayol Ikerbasque and

Copy raising and perception: A fine-grained semantics for raising and control Ash Asudeh &amp;

MySQL Replication and HA at Facebook Part-II Jeff Jiang Production Engineer Facebook, Inc

Tips for Obtaining a Security Clearance Information obtained from Partnership for Public Service

Local search algorithms CS271P, Winter 2018 Introduction to Artificial Intelligence Prof.

A Command-Line Driver Generator or What I did when I got tired of writing command-line

Population Protocols and Predicates Pierre Ganty IMDEA Software Institute The computer science

A Fixed Point Theorem for Non-Monotonic Functions Esik 1 and P. Rondogiannis 2 an Zolt 1

Tutorial 3 Loops Side Effects 1 CS 136 Spring 2020 Tutorial 3 Loops: for loops &

Copy raising and perception: A fine-grained semantics for raising and control Ash Asudeh &