Lessons Learned from 10k Experiments to Compare Virtual and Physical Testbeds
Jonathan Crussell, Thomas M. Kroeger, David Kavaler, Aaron Brown, Cynthia Phillips
August 12th, 2019
Supported by the Laboratory Directed Research and Development program at Sandia National Laboratories, a multimission laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC., a wholly owned subsidiary of Honeywell International, Inc., for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-NA-0003525.
Goals
- Discover where and how virtual and physical testbeds differ
- “Virtualization artifacts”
Methodology
- Run representative workloads on physical and virtual testbeds
- Collect, compare, and contrast metrics: application-, OS-, and network-level
Lessons Learned
Methodology and experimental results presented in previous work:
Jonathan Crussell, Thomas M. Kroeger, Aaron Brown, and Cynthia Phillips. Virtually the same: Comparing physical and virtual testbeds. In 2019 International Conference on Computing, Networking and Communications (ICNC), pages 847–853, Feb 2019.
This work focuses on lessons learned in four areas:
- Tool Validation and Development
- Instrumentation
- Data Collection and Aggregation
- Data Analysis
Methodology & Results
- ApacheBench fetching fixed pages from an HTTP server
- Over 10,000 experiments across three clusters
- Over 500 TB of data (without full packet captures)
- Varied payload size, network drivers, network bandwidth
- Large variations in network driver offloading behavior
- Near-identical sequences of system calls
- Leverage the minimega toolset (see http://minimega.org for details)
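As a rough illustration of the client side of this workload, a minimal sketch follows; the exact ApacheBench invocation is an assumption, with the request count, concurrency, and URL taken from the sweep parameters shown on the next slide.

# Client side of the workload (assumed invocation)
bash$ ab -n 100000 -c 1 http://10.0.0.1/ > ab.out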
Tool Validation and Development
Lesson: Using a testbed toolset for experimentation requires substantial effort and consideration to put tools together in a workable and validated fashion.
Tool Validation and Development

bash$ bash run.bash
USAGE: run.bash DIR ITER DURATION CONCURRENT VMTYPE DRIVER NCPUS OFFLOAD RATE NWORKERS URL NREQUESTS INSTRUMENT
bash$ bash sweep.bash /scratch/output params.bash > sweep-params.bash
bash$ head -n 1 sweep-params.bash
bash /root/run.bash /scratch/output/ 1 360 1 kvm e1000 1 on 1000 1 http://10.0.0.1/ 100000 true
bash$ sort -R sweep-params.bash | parallel -j1 --eta -S $(python igor.py --heads jc[0-9])
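The contents of sweep.bash are not shown; a minimal sketch of what such a parameter-sweep generator might look like, assuming params.bash defines one bash array per parameter and varying only two parameters for brevity, is:

# sweep.bash (hypothetical sketch): expand parameter arrays from params.bash
# into one run.bash invocation per combination
OUTDIR=$1
source "$2"   # assumed to define arrays such as DRIVERS=(e1000 virtio) and RATES=(100 1000)
ITER=1
for driver in "${DRIVERS[@]}"; do
  for rate in "${RATES[@]}"; do
    echo "bash /root/run.bash $OUTDIR $ITER 360 1 kvm $driver 1 on $rate 1 http://10.0.0.1/ 100000 true"
    ITER=$((ITER + 1))
  done
done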
Tool Validation and Development
Running thousands of repetitions:
- Handle all edge cases (rare bug in minimega's capture API)
- Clean up all state (failed to unmount container filesystem)

bash$ mount | grep megamount | head -n 5
megamount_5562 on /tmp/minimega/5562/fs type overlay
megamount_5566 on /tmp/minimega/5566/fs type overlay
megamount_5611 on /tmp/minimega/5611/fs type overlay
megamount_5752 on /tmp/minimega/5752/fs type overlay
megamount_5774 on /tmp/minimega/5774/fs type overlay
bash$ mount | grep megamount | wc -l
96
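A minimal cleanup sketch for the stale overlay mounts shown above, assuming the corresponding containers have already exited, might be:

# Unmount leftover minimega container overlays (field 3 of the mount output is the mount point)
bash$ mount | grep megamount | awk '{print $3}' | xargs -r -n1 umount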
Tool Validation and Development
Toolset improvements:
- Add snaplen option to capture API
- Add standalone C2 server

# On VMs
minimega -e cc exec mkdir /que
minimega -e cc background protonuke -serve -http
minimega -e cc recv /miniccc.log

# On physical nodes
rond -e exec mkdir /que
rond -e bg protonuke -serve -http
rond -e recv /miniccc.log
Instrumentation
Lesson: Instrumentation is invaluable, but it is often manually added, expensive, and experiment-specific. Integrating more forms of instrumentation into the toolset could help researchers develop experiments more rapidly.
Instrumentation
Two forms of instrumentation:
- Verify functionality of the environment
- Understand and evaluate experiments
[Figure: capture locations along the path SRC – A – B – C – D – DST]
Integrating the former into the toolset could simplify experiments
Instrumentation
- Mismatch between capture locations
- Dropped events for containers

bash$ tcpdump -i eth0 -w foo.pcap
tcpdump: listening on eth0, link-type EN10MB (Ethernet), capture size 262144 bytes
9001 packets captured
9000 packets received by filter
1 packets dropped by kernel
bash$ sysdig -w foo.scap
<no indication of dropped events>
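One mitigation for kernel drops, an assumption here but consistent with the snaplen option added to minimega's capture API, is to shrink the per-packet snaplen and enlarge the kernel capture buffer:

# Capture only headers (96-byte snaplen) and use a larger kernel buffer (in KiB) to reduce drops
bash$ tcpdump -i eth0 -s 96 -B 8192 -w foo.pcap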
Instrumentation
Many ways to instrument an experiment at many levels, mostly by hand:
- Application-level: RPS, jitter, ...
- OS-level: system calls, utilization, perf stats, ...
- Network-level: flow statistics, bandwidth, ...
Used to understand anomalies:
- e1000 stalls
- Performance increase
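Illustrative by-hand commands for the OS and network levels (assumptions, not from the slides; protonuke is the HTTP server started earlier):

# OS-level: per-syscall counts for the running HTTP server
bash$ strace -c -f -p "$(pidof protonuke)"
# OS-level: system-wide hardware/software counters over the run duration
bash$ perf stat -a -o perf.txt -- sleep 360
# Network-level: per-flow statistics from a packet capture
bash$ tcptrace -l foo.pcap > flows.txt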
Data Collection and Aggregation
Lesson: Testbeds can provide a wealth of data to researchers but should do more to streamline the process of collecting and aggregating it into a usable form.
Data Collection and Aggregation
How to extract instrumentation data from VMs?
- C2 has limits on file size
- VMs write to qcow2, host extracts
- Future: mount guest filesystem?
How to aggregate data?
- Dump per-iteration data into a SQLite database
- Combine SQLite databases after all iterations complete
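A minimal sketch of the combine step, assuming each iteration produced a database with an identically shaped table (the file glob and the "results" table name are hypothetical):

# Merge per-iteration SQLite databases into one
for db in /scratch/output/iter-*.db; do
  sqlite3 combined.db "ATTACH '$db' AS src;
    CREATE TABLE IF NOT EXISTS results AS SELECT * FROM src.results WHERE 0;
    INSERT INTO results SELECT * FROM src.results;"
done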
Data Analysis
How to reduce storage?
[Slide shows raw tcptrace per-connection output: SYN/FIN pkts sent, ack pkts sent, pure acks sent, unique bytes sent, actual/rexmt/pushed data pkts, out-of-order pkts, sack/dsack pkts sent, zwnd probes, truncated packets and data, data xmit time, idletime max, throughput, ...]
Data Analysis
How to reduce storage?
- tcptrace produces 78 statistics per flow
- Compute summary statistics over all flows for each iteration
- Compare mean of means across iterations and parameters
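A sketch of the mean-of-means reduction, assuming the combined database has a flows table with iteration and throughput columns (all three names are hypothetical; throughput stands in for any of the 78 per-flow statistics):

# Per-iteration mean throughput, then the mean of those means
bash$ sqlite3 combined.db "
  SELECT AVG(iter_mean) FROM (
    SELECT iteration, AVG(throughput) AS iter_mean
    FROM flows
    GROUP BY iteration);"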
Data Analysis
Lesson: Testbeds allow for many repetitions, but care should be used when analyzing the data, especially to avoid conflating statistical significance with practical importance.
Data Analysis
Hypothesis testing
- Everything seems significant with many iterations
- But is it practically important? (e.g., a 0.1% decrease in latency)
Multiple comparisons
- Comparing many metrics can result in significance by chance (a standard correction is sketched below)
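One conservative guard against significance by chance, added here as an illustration rather than taken from the slides, is a Bonferroni correction of the per-test significance level:

\alpha_{\text{per-test}} = \frac{\alpha}{m}, \qquad \text{e.g. } \frac{0.05}{78} \approx 6.4 \times 10^{-4} \text{ for the 78 per-flow statistics}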
What's next?
Experiments with contention:
- Run N client/server pairs on the same hardware
- Generates N× the data
- Surprising performance improvement when N is small (N < 12)
Questions/Comments?
Lessons:
- Using a testbed toolset for experimentation requires substantial effort and consideration to put tools together in a workable and validated fashion.
- Instrumentation is invaluable, but it is often manually added, expensive, and experiment-specific. Integrating more forms of instrumentation into the toolset could help researchers develop experiments more rapidly.
- Testbeds can provide a wealth of data to researchers but should do more to streamline the process of collecting and aggregating it into a usable form.
- Testbeds allow for many repetitions, but care should be used when analyzing the data, especially to avoid conflating statistical significance with practical importance.
Presenter: Jonathan Crussell, jcrusse@sandia.gov