Selecting points of interest in traces using patterns of events - PowerPoint PPT Presentation

Selecting points of interest in traces using patterns of events François Trahay , Elisabeth Brunet, Mohamed Mosli Bouksiaa, Jianwei Liao

Context  Hardware is more and more complex • NUMA, hierarchical caches, GPU, ...  Software is more and more complex • Hybrid MPI+OpenMP, MPI+CUDA, …  Achieving good performance is hard  Understanding the performance of an application is difficult → Need for performance analysis tools 2

Performance analysis Tracing tools  Tracing applications (#82) 5292 Enter: function 14, process 7, source 0 (#83) 5387 Leave: function 1, process 7, source 0 (#84) 5540 Enter: function 14, process 3, source 0 • Run the application once (#85) 5631 Leave: function 1, process 3, source 0 (#86) 5767 Enter: function 14, process 5, source 0 • Capture interesting events (eg. MPI functions) (#87) 5801 Leave: function 1, process 5, source 0 (#88) 5995 Counter: process 8, counter 1, value 14829 • Generate an execution trace (#89) 6062 Counter: process 8, counter 1, value 14573 (#90) 6747 Enter: function 14, process 9, source 0 - Visualize & understand the behavior of the (#91) 6764 Counter: process 6, counter 1, value 14829 (#92) 6796 Leave: function 1, process 9, source 0 application (#93) 6806 Counter: process 6, counter 1, value 14573 - Find problematic parts of the execution  Examples • VampirTrace • ScalaTrace • Intel Trace Analyzer and Collector • EZTrace • … 3

Visualizing large trace files  Visualizing a large trace is difficult • Millions of events  How to detect the interesting part of the trace ? NPB CG class A 16 MPI Processes – 426 000 events 4

Visualizing large trace files  Visualizing a large trace is difficult • Millions of events  How to detect the interesting part of the trace ? NPB CG class A 16 MPI Processes – 426 000 events 5

Visualizing large trace files repeating patterns  A trace is usually structured • Loops • Functions  Lots of similar information NPB CG class A 16 MPI Processes – 426 000 events 6

Proposal: pointing what users should examinate  Detect similarities in a trace • Application phases that repeat 100 x { MPI_SEND (src=0 dest=1 len=16 tag=0) MPI_RECV (src=1 dest=0 len=16 tag=0) } MPI_Barrier 10000 x { MPI_SEND (src=0 dest=1 len=16 tag=0) MPI_RECV (src=1 dest=0 len=16 tag=0) } MPI_Barrier  Select « points of interests » of the trace • Parts that users should examine first • Where useful information is 7

Detecting similarities 8

Representation of a trace  A trace can be represented as an event list  Goal: detect patterns in this list • Can be viewed as a factorization 9

Factorization algorithm First step: find small patterns  Find a couple of events (e1, e2) that appears several times → 2-event patterns  Browse the event list and search for duplicated sequences 10

Factorization algorithm First step: find small patterns  Find a couple of events (e1, e2) that appears several times → 2-event patterns  Browse the event list and search for duplicated sequences 11

Factorization algorithm Second step: find loops in patterns  A loop is a sequence of events that repeats • Each iteration has been detected as a pattern  Browse the patterns lists and search for consecutive sequences 12

Factorization algorithm Third step: try to expand patterns  Is this 2-event pattern a 3-event pattern ? 16

Factorization algorithm Third step: try to expand patterns  Is this 2-event pattern a 3-event pattern ?  Case 1 : pattern #1 is always followed by event C 17

Factorization algorithm Third step: try to expand patterns  Is this 2-event pattern a 3-event pattern ?  Case 1 : pattern #1 is always followed by event C → pattern #1 is a 3-event pattern 18

Factorization algorithm third step: try to expand patterns  Is this 2-event pattern a 3-event pattern ?  Case 2: pattern #1 is not always followed by event C, but it sometimes is → create pattern #2 that integrates Pattern #1 19

Factorization algorithm third step: try to expand patterns  Is this 2-event pattern a 3-event pattern ?  Case 2: pattern #1 is not always followed by event C, but it sometimes is → create pattern #2 that integrates Pattern #1 20

Factorization algorithm third step: try to expand patterns  Is this 2-event pattern a 3-event pattern ?  Case 3: pattern #1 is followed by event C only once → do nothing 21

Factorization algorithm Limitations  Only valid for 1 thread/process • Based on temporal order → the algorithm needs to run for each thread → can be done in parallel  Complexity : O(n 2 ) • Worst case complexity (when there is no pattern) • In real life : it depends on the size of patterns 22

Evaluation  Implemented in EZTrace • Post mortem analysis • Parallelized with OpenMP Kernel Pattern # of events # of patterns  Stark cluster detection (ms) • 4 nodes CG 178 284 000 160 • Quad-core Xeon MG 186 118 000 2 728 SP 596 557 000 174  Results BT 951 400 000 112 LU 4 564 4 568 000 210 • Detects patterns NPB Class A, Procs=16 • Detects the applications iterations • Cheap compared to data mining techniques 23

Selecting points of interest 24

Searching for duplicated information  Select representative occurrences • Instead of examining 1000 occurrences • Select 1 occurrence per class #327 #549 #871 25

Selecting representative occurrences  Classify occurrences according to their duration  Search for 'peaks' in the distribution 26

Filtering traces  Select one occurrence per peak • Filter out 'similar' occurrences 27

Experimental results NPB class A, 16 procs Kernel # events #events after filtering EP 3 090 2 873 FT 10 256 6 704 IS 18 552 15 948 MG 118 688 41 031 CG 284 754 11 724 BT 399 944 24 338 SP 557 318 68 287 LU 4 568 002 42 881 28

Experimental results NPB class A, 16 procs Kernel # events #events after filtering EP 3 090 2 873 FT 10 256 6 704 IS 18 552 15 948 MG 118 688 41 031 CG 284 754 11 724 BT 399 944 24 338 SP 557 318 68 287 LU 4 568 002 42 881 29

Conclusion 30

Conclusion  Manually detecting the interesting parts of a trace is difficult  Proposal: automate the detection of problems • Detect repeating patterns of events • Compare similar patterns whose duration differ significantly • Filter out redundant information  Future work • Analyze patterns • Integrate to the stable version of EZTrace 31

Question ? François Trahay http://eztrace.gforge.inria.fr/ 32

Backtracking irregularities 33

Selecting points of interest in traces using patterns of events - PowerPoint PPT Presentation

Selecting points of interest in traces using patterns of events Franois Trahay , Elisabeth Brunet, Mohamed Mosli Bouksiaa, Jianwei Liao Context Hardware is more and more complex NUMA, hierarchical caches, GPU, ... Software is more

Traces Exist (Hypothetically)! Carl Pollard Structure and Evidence in Linguistics Workshop in

Dotmetrics Exclusive Users Selecting basic dimensions (country, devices) Selecting timeframe

Factory Patterns: Factory Method and Abstract Factory Design Patterns In Java Bob Tarr

Principles and Patterns 26 February, 2020 Recap Principles Patterns Inheritance Anti-patterns

LN-10 Searching for traces of Norwegian economic thought Intro When trying to find the traces of

The evaluation of evidence relating to traces of cocaine on banknotes Amy Wilson September 2015

Capturing Traffic Traces with Ground- Capturing Traffic Traces with Ground- Truth Information

Modeling interaction traces of an online panel From raw interaction traces to actionable

On The Fidelity of 802.11 Packet Traces Aaron Schulman, Dave Levin, Neil Spring University of

Discovering Bits of Place Histories from People's Activity Traces from People s Activity Traces

Two 2-traces Simon Willerton University of Sheffield f Tr ( f ) := V

ELLIPSOID : traces on the coordinate planes are ellipses 2 2 2 x 2 y 2 z 2 = 1 a b c

Eligibility Traces Chapter 12 Eligibility traces are Another way of interpolating between MC and

Design Patterns Applications Programming What is design patterns? The design patterns are

Design Patterns 1 What are Design Patterns? Design patterns describe common (and successful)

Software, Faster Patterns of Effective Delivery Dan North @tastapod Patterns of Effective

MIT 6.837 - Ray Tracing Ray Tracing MIT EECS 6.837 Most slides are taken from Frdo Durand and

Content-aware Trace Collection and I/O Deduplication for Smartphones Bo Mao 1 , Suzhen Wu 1 , Hong

Tracing and Profiling MySQL Mostly with Linux tools: from strace and gdb to ftrace, bpftrace,

Tracing While Loops 7 January 2019 OSU CSE 1 Consider... int i = 4, j = 1, n = 0; while (i >

ANALYSIS ETI2506 Sunday, 23 October 2016 1 WHERE ARE WE IN THE CURRICULUM? 2 GOAL OF TRAFFIC

VISUALISING REAL TIME TRAFFIC DATA USING ELASTICSEARCH AND C3JS @ jettroCoenradie Trifork

Quasi-Dynamic Network Model Contents Partition Method for Accelerating Parallel Network

11 Introduction Introduction M/M/1 Queueing delay (revisited) R=link bandwidth (bps)

Selecting points of interest in traces using patterns of events - PowerPoint PPT Presentation

Selecting points of interest in traces using patterns of events Franois Trahay , Elisabeth Brunet, Mohamed Mosli Bouksiaa, Jianwei Liao Context Hardware is more and more complex NUMA, hierarchical caches, GPU, ... Software is more

Traces Exist (Hypothetically)! Carl Pollard Structure and Evidence in Linguistics Workshop in

Dotmetrics Exclusive Users Selecting basic dimensions (country, devices) Selecting timeframe

Factory Patterns: Factory Method and Abstract Factory Design Patterns In Java Bob Tarr

Principles and Patterns 26 February, 2020 Recap Principles Patterns Inheritance Anti-patterns

LN-10 Searching for traces of Norwegian economic thought Intro When trying to find the traces of

The evaluation of evidence relating to traces of cocaine on banknotes Amy Wilson September 2015

Capturing Traffic Traces with Ground- Capturing Traffic Traces with Ground- Truth Information

Modeling interaction traces of an online panel From raw interaction traces to actionable

On The Fidelity of 802.11 Packet Traces Aaron Schulman, Dave Levin, Neil Spring University of

Discovering Bits of Place Histories from People's Activity Traces from People s Activity Traces

Two 2-traces Simon Willerton University of Sheffield f Tr ( f ) := V

ELLIPSOID : traces on the coordinate planes are ellipses 2 2 2 x 2 y 2 z 2 = 1 a b c

Eligibility Traces Chapter 12 Eligibility traces are Another way of interpolating between MC and

Design Patterns Applications Programming What is design patterns? The design patterns are

Design Patterns 1 What are Design Patterns? Design patterns describe common (and successful)

Software, Faster Patterns of Effective Delivery Dan North @tastapod Patterns of Effective

MIT 6.837 - Ray Tracing Ray Tracing MIT EECS 6.837 Most slides are taken from Frdo Durand and

Content-aware Trace Collection and I/O Deduplication for Smartphones Bo Mao 1 , Suzhen Wu 1 , Hong

Tracing and Profiling MySQL Mostly with Linux tools: from strace and gdb to ftrace, bpftrace,

Tracing While Loops 7 January 2019 OSU CSE 1 Consider... int i = 4, j = 1, n = 0; while (i &gt;

ANALYSIS ETI2506 Sunday, 23 October 2016 1 WHERE ARE WE IN THE CURRICULUM? 2 GOAL OF TRAFFIC

VISUALISING REAL TIME TRAFFIC DATA USING ELASTICSEARCH AND C3JS @ jettroCoenradie Trifork

Quasi-Dynamic Network Model Contents Partition Method for Accelerating Parallel Network

11 Introduction Introduction M/M/1 Queueing delay (revisited) R=link bandwidth (bps)

Tracing While Loops 7 January 2019 OSU CSE 1 Consider... int i = 4, j = 1, n = 0; while (i >