LinuxCon 2010 Tracing Mini-Summit A new unified Lockless Ring - PowerPoint PPT Presentation

LinuxCon 2010 Tracing Mini-Summit A new unified Lockless Ring Buffer library for efficient kernel tracing Presentation at: http://www.efficios.com/linuxcon2010-tracingsummit E-mail: mathieu.desnoyers@efficios.com Mathieu Desnoyers August 11th, 2010 1

> Presenter ● Mathieu Desnoyers ● EfficiOS Inc. ● http://www.efficios.com ● Author/Maintainer of ● LTTng, LTTV, Userspace RCU ● Ph.D. in computer engineering ● Low-Impact Operating System Tracing Mathieu Desnoyers August 11th, 2010 2

> Plan ● History ● Mandate ● Genericity and Flexibility ● Speed and Compactness ● Reliability ● Working together Mathieu Desnoyers August 11th, 2010 3

> History ● May 2005: LTTng implements its ring buffer from scratch – Learns lessons from K42, RelayFS and LTT. ● October 2005: LTTng becomes lock-less – LTTng gets increasingly used by the industry and shipped with many embedded and RT Linux distributions since then. ● 2008: Ftrace (lock-less in 2009) ● 2010: Perf Mathieu Desnoyers August 11th, 2010 4

> Mandate ● Wish from Linus expressed at the Kernel Summit 2008 to have a common tracer infrastructure in the kernel ● Asked by Steven Rostedt to come up with a unified solution Mathieu Desnoyers August 11th, 2010 5

> Generic Ring Buffer Library ● Input – Data received as parameter from ring buffer library clients ● Output – Data available through a global or per-CPU file descriptor with splice, mmap or read. – Or data available internally to the ring buffer client for reading Mathieu Desnoyers August 11th, 2010 6

> Generic Ring Buffer Library ● Derived from the LTTng ring buffer – Exists since 2005 ● Goals – Generic and flexible – Clean API – Fast and compact – Reliable Mathieu Desnoyers August 11th, 2010 7

> Genericity and Flexibility ● Target Perf, Ftrace, LTTng and drivers ● Not only tracer-specific – Ring buffer sits in /lib ● Achieve genericity without hurting performance – Ring buffer clients – Instantiate client-specific configurations – Express configuration into a constant client structure passed as parameter to inline functions Mathieu Desnoyers August 11th, 2010 8

> API: pre-cooked (simple) APIs ● Create/destroy a channel – Global buffer – Per-CPU buffers ● In-kernel write() ● Read a file descriptor – Global iterator ● The library does fusion merge of per-CPU buffer events based on a heap and quiescent states – Per-CPU iterator Mathieu Desnoyers August 11th, 2010 9

> API: pre-cooked APIs ● Mode – Overwrite – Discard ● Channels – Global – Per-CPU ● Global iterators ● Per-CPU iterators Mathieu Desnoyers August 11th, 2010 10

> Advanced API ● Client configuration ● Client-provided callbacks Mathieu Desnoyers August 11th, 2010 11

> Configuration ● Buffers per-CPU or global ● Overwrite or discard mode ● Natural or packed alignment ● Output – splice(), mmap(), read(), iterator, client-specific ● Memory allocation backend – page, vmap, static ● OOPS consistency, IPI barrier, wakeup Mathieu Desnoyers August 11th, 2010 12

> Client-provided callbacks ● Clock read ● Event and sub-buffer header size ● Sub-buffer begin/end ● Buffer create/finalize ● Record get – For iterators Mathieu Desnoyers August 11th, 2010 13

> Speed and Compactness ● Fast paths – Constant configuration structure – Compiler removes unused code ● Slow paths – Configuration dynamically tested – Same code shared amongst all clients Mathieu Desnoyers August 11th, 2010 14

> Performance ● Throughput ● Scalability Mathieu Desnoyers August 11th, 2010 15

> Throughput (overwrite mode) ● Generic Ring Buffer Library – 83-199 ns/entry (depending on configuration) ● Ftrace – 103-187 ns/entry ● Perf – Mode unavailable Mathieu Desnoyers August 11th, 2010 16

> Throughput (discard mode) ● Generic Ring Buffer Library – 257 ns/entry written ● Perf – 423 ns/entry written ● (approximation from Perf output) ● Getting accurate results is hard, influenced by discarded events Mathieu Desnoyers August 11th, 2010 17

> Scalability Mathieu Desnoyers August 11th, 2010 18

> Reliability ● LTTng – Formal verification of the ring buffer algorithm at the architecture level (modeling execution on superscalar processors) – Testing on large user-base Mathieu Desnoyers August 11th, 2010 19

> Working together ● Ever had the feeling you were trying to fit something square-shaped into a circle ? Mathieu Desnoyers August 11th, 2010 20

> Working together ● Need to polish off the rough spots Mathieu Desnoyers August 11th, 2010 21

> Working together ● Trying to come up with a clean and flexible API ● Nevertheless, does not always map the current Ftrace and Perf APIs ● Trying very hard not to bloat the API Mathieu Desnoyers August 11th, 2010 22

> Working with Ftrace ● Steven has been very helpful ● I'm about 80% done working on Ftrace transition to the generic ring buffer library Mathieu Desnoyers August 11th, 2010 23

> Ftrace odd-fitting pieces ● Ftrace iteration code – Huge set of API functions for iterating on stopped trace buffers without consuming data. – Used for: ● Dumping same output with "cat" many times ● Peek next item to place brackets in function graph tracer output – Could be replaced by "rewind" ability and by modifying the function graph tracer plugin Mathieu Desnoyers August 11th, 2010 24

> Perf ● mmap()-based ABI between kernel and userspace for consuming data. ● No kernel callback invoked when the consumer finishes reading data. – Severely limits design choices ● Does not support (and developers don't consider as valid use-case) reading data while writing into a buffer in flight recorder mode. Mathieu Desnoyers August 11th, 2010 25

> Perf ● Does not use padding between sub-buffers – No concept of sub-buffers – All events are physically contiguous ● Cannot create efficient chunks of data for splice() without copy ● Cannot efficiently index trace without reading all events (increases delay before a large trace can be analyzed) ● Basic data encapsulation principles Mathieu Desnoyers August 11th, 2010 26

> Perf ● Why do they hate sub-buffers so much ? – Claim of simplicity ● False. The fast path ends up being both larger and slower than the generic ring buffer. ● Why is this important ? – Shows how low-level Perf design choices prevent contributors from fulfilling end-user basic use-cases. – Shows Perf developers unwillingness to support use-cases other than kernel developers own needs. Mathieu Desnoyers August 11th, 2010 27

> Funding ● Thanks to Ericsson for funding parts of this work. Mathieu Desnoyers August 11th, 2010 28

> Questions ? ? – http://www.efficios.com ● LTTng Information – http://lttng.org – ltt-dev@lists.casi.polymtl.ca Mathieu Desnoyers August 11th, 2010 29

> API (per-CPU discard) extern struct channel * ring_buffer_percpu_discard_create(size_t buf_size); extern void ring_buffer_percpu_discard_destroy(struct channel *chan); extern int ring_buffer_percpu_discard_write(struct channel *chan, const void *src, size_t len); And map file operation "channel_payload_file_operations" from iterator.h to file descriptor. Mathieu Desnoyers August 11th, 2010 30

LinuxCon 2010 Tracing Mini-Summit A new unified Lockless Ring - PowerPoint PPT Presentation

LinuxCon 2010 Tracing Mini-Summit A new unified Lockless Ring Buffer library for efficient kernel tracing Presentation at: http://www.efficios.com/linuxcon2010-tracingsummit E-mail: mathieu.desnoyers@efficios.com Mathieu Desnoyers August

Advanced Ray Tracing 1 2/8/2006 Distributed Ray Tracing Distributed ray tracing is an

Virgin America and Open Source Software Presented at LinuxCon 2010 Presented at LinuxCon 2010

Computer Graphics - Ray-Tracing II - Hendrik Lensch Computer Graphics WS07/08 Ray Tracing II

1 minute Path tracing Bidirectional path tracing Progressive photon mapping 1 minute

MIT 6.837 - Ray Tracing Ray Tracing MIT EECS 6.837 Most slides are taken from Frdo Durand and

Advanced Ray Tracing Stochastic ray tracing: distribute rays stochastically across pixel

61A Extra Lecture 9 Announcements Pixels (Demo) Ray Tracing Ray Tracing A technique for

MINI OPENDRIVE 1 MINI MINI OPENDRIVE EXP OPENDRIVE EXP Experience, eXpertise, Performance The

LinuxCon Europe UEFI Mini-Summit 7 October 2015 Session 3 LUV Shack: An Automated Linux

LinuxCon Europe UEFI Mini-Summit 7 October 2015 Session 1 UEFI Forum Update and Open

Computer Graphics - Ray Tracing I - Hendrik Lensch Computer Graphics WS07/08 Ray Tracing I

Knowledge Tracing Machines: Factorization Machines for Knowledge Tracing Jill-Jnn Vie Hisashi

Introduction to Path Tracing Marc Sunet Table of contents From Ray Tracing to Path Tracing The

Ray Tracing 1 Ray Tracing Ray Tracing kills two birds with one stone: Solves the Hidden

Tracing with Perf tools Namhyung Kim 2013-11-13 Wed Namhyung Kim Tracing with Perf tools

LinuxCon 2010 Efficient Trace Format for System-Wide Tracing Presentation at:

Imperfec.ons John OBrien Outline Large feedback control

LHO box from top LHO Box from back and top 500 Gallon Liquid Oxygen tank Compressed oxygen

NIM-HV-PSU PRESENTATION Vihtori Virta 12/16/2014 Content Display Board High Voltage

EE3C11: Structured Electronic Design My First Voltage Ampli fi er Design example EE3C11 47u 100n

Electromagnetic Counterparts to Gravitational Waves Mansi M. Kasliwal California Institute of

Observation of sound focusing and defocusing due to propagating nonlinear internal waves J.

MONETARY POLICY COMMITTEE (MEETING OF 30 SEPTEMBER 2013) Ministry of Finance and Economic

Separating & Quantifying the Distinct Impacts of El Ni no and Stratospheric Sudden Warmings

LinuxCon 2010 Tracing Mini-Summit A new unified Lockless Ring - PowerPoint PPT Presentation

LinuxCon 2010 Tracing Mini-Summit A new unified Lockless Ring Buffer library for efficient kernel tracing Presentation at: http://www.efficios.com/linuxcon2010-tracingsummit E-mail: mathieu.desnoyers@efficios.com Mathieu Desnoyers August

Advanced Ray Tracing 1 2/8/2006 Distributed Ray Tracing Distributed ray tracing is an

Virgin America and Open Source Software Presented at LinuxCon 2010 Presented at LinuxCon 2010

Computer Graphics - Ray-Tracing II - Hendrik Lensch Computer Graphics WS07/08 Ray Tracing II

1 minute Path tracing Bidirectional path tracing Progressive photon mapping 1 minute

MIT 6.837 - Ray Tracing Ray Tracing MIT EECS 6.837 Most slides are taken from Frdo Durand and

Advanced Ray Tracing Stochastic ray tracing: distribute rays stochastically across pixel

61A Extra Lecture 9 Announcements Pixels (Demo) Ray Tracing Ray Tracing A technique for

MINI OPENDRIVE 1 MINI MINI OPENDRIVE EXP OPENDRIVE EXP Experience, eXpertise, Performance The

LinuxCon Europe UEFI Mini-Summit 7 October 2015 Session 3 LUV Shack: An Automated Linux

LinuxCon Europe UEFI Mini-Summit 7 October 2015 Session 1 UEFI Forum Update and Open

Computer Graphics - Ray Tracing I - Hendrik Lensch Computer Graphics WS07/08 Ray Tracing I

Knowledge Tracing Machines: Factorization Machines for Knowledge Tracing Jill-Jnn Vie Hisashi

Introduction to Path Tracing Marc Sunet Table of contents From Ray Tracing to Path Tracing The

Ray Tracing 1 Ray Tracing Ray Tracing kills two birds with one stone: Solves the Hidden

Tracing with Perf tools Namhyung Kim 2013-11-13 Wed Namhyung Kim Tracing with Perf tools

LinuxCon 2010 Efficient Trace Format for System-Wide Tracing Presentation at:

Imperfec.ons John OBrien Outline Large feedback control

LHO box from top LHO Box from back and top 500 Gallon Liquid Oxygen tank Compressed oxygen

NIM-HV-PSU PRESENTATION Vihtori Virta 12/16/2014 Content Display Board High Voltage

EE3C11: Structured Electronic Design My First Voltage Ampli fi er Design example EE3C11 47u 100n

Electromagnetic Counterparts to Gravitational Waves Mansi M. Kasliwal California Institute of

Observation of sound focusing and defocusing due to propagating nonlinear internal waves J.

MONETARY POLICY COMMITTEE (MEETING OF 30 SEPTEMBER 2013) Ministry of Finance and Economic

Separating &amp; Quantifying the Distinct Impacts of El Ni no and Stratospheric Sudden Warmings

Separating & Quantifying the Distinct Impacts of El Ni no and Stratospheric Sudden Warmings