PARALLEL SILENCE CODING ALGORITHMS ON GPUS John Cheng and Nanxun - PowerPoint PPT Presentation

Aug 25, 2022 •230 likes •445 views

April 4-7, 2016 | Silicon Valley PARALLEL SILENCE CODING ALGORITHMS ON GPUS John Cheng and Nanxun Dai BGP International Inc, R&D Center April 5, 2016 1 Silence Encoding 2 Algorithm Conversion from Serial CONTENTS to Parallel

April 4-7, 2016 | Silicon Valley PARALLEL SILENCE CODING ALGORITHMS ON GPUS John Cheng and Nanxun Dai BGP International Inc, R&D Center April 5, 2016
1 Silence Encoding 2 Algorithm Conversion from Serial CONTENTS to Parallel Implementation on GPUS with CUB 3 2
SEISMIC DATA COMPRESSION ALGORITHM Wavelet Transformation  Quantization  Prefix Encoding  Silence Encoding  Huffman Encoding  3
A typical data in wave propagation 4
AN ILLUSTRATION OF SILENCE ENCODING a pair data: ( zero, its length ) 5
HOW TO MAKE IT RUN IN PARALLEL  Which thread has a right to write  Where should the thread write to  What should the thread write 6
FOR A NON-ZERO-THREAD  How may zero elements before it  How may zero segments before it 7
FOR A ZERO-THREAD  How may zero elements before it  How may zero segments before it  How may zero elements in its own zero segment 8
PREFIX SCAN Prefix Scan might be considered as a key  primitive in parallel computation All the information we need can be  calculated in parallel by using Prefix-Sum Therefore, we can convert the algorithm  from serial to parallel 9
THE ILLUSTRATION OF PREFIX SCAN partial sum till index 5 10
CALCULATING PRECEDING ZERO ELEMENTS Auxiliary variable Inclusive Prefix-Sum 11
CALCULATING PRECEDING ZERO SEGMENTS Auxiliary variable Exclusive Prefix-Sum 12
PARALLEL SILENCE ENCODING ALGORITHM Step 1: Read global data to shared memory Step 2: Calculate preceding zero elements with inclusive prefix-sum Step 3: Calculate preceding zero segments with exclusive prefix-sum Step 4: Calculate write positions for each thread Step 5: Write the encoded string to shared memory Step 6: Write the encoded string to global memory 13
IMPLEMENTATION WITH CUB Warp-wide primitives  Block-wide primitives  Device-wide primitives  More info: https://nvlabs.github.io/cub/ 14
HOW TO WRAP CUB PRIMITIVES __device__ __forceinline__ void cub_prefix_sum_exclusive (char in, char& out, char& aggregate) { typedef BlockScan <char, DIM, BLOCK_SCAN_RAKING> BlockScanT; typename BlockScanT::TempStorage __shared__ iscan; char data[1]; data[0] = in; __syncthreads(); BlockScanT(iscan).ExclusiveSum (data, data, aggregate); __syncthreads(); out = data[0]; } 15
PERFORMANCE OF DIFFERENT KERNELS Naive Kernel Parallel Kernel 2500 2000 Elapsed Time in ms 1500 1000 500 0 1024x1024x1 1024x1024x100 1024x1024x1000 Data Size 16
DIFFERENT CUB ALGORITHMS 1024x1024x1000 BLOCK_SCAN_WARP_SCANS 352.57 BLOCK_SCAN_RAKING_MEMOIZE 303.05 LOCK_SCAN_RAKING 323.81 270 280 290 300 310 320 330 340 350 360 17
CONCLUSION Prefix-sum is an efficient way to convert  serial computations to parallel computations It is convenient to integrate CUB parallel  primitives into your implementation 18
Each subject in the book is treated with a profile- driven approach 19
April 4-7, 2016 | Silicon Valley THANK YOU John Cheng and Nanxun Dai BGP International Inc, R&D Center 10630 Haddington Dr., Houston, Texas 77043 rwcheng@bgprdc.com

Recommend

Formal Modeling in Cognitive Science 1 Coding Theorems Lecture 28: Kraft Inequality; Source Coding

Coding Theorems Coding Theorems Huffman Coding Huffman Coding Formal Modeling in Cognitive Science 1 Coding Theorems Lecture 28: Kraft Inequality; Source Coding Theorem; Kraft Inequality Huffman Coding Shannon Information Source Coding

341 views • 5 slides

Image and Video Coding: Video Coding Extensions Screen Content Coding Screen Content Coding

Image and Video Coding: Video Coding Extensions Screen Content Coding Screen Content Coding sensor-captured video content screen content video Screen Content Video Increasingly becoming important for a number of applications (e.g., online

573 views • 34 slides

ADVANCED MULTIMEDIA ADVANCED MULTIMEDIA CODING CODING Fernando Pereira Instituto Superior

ADVANCED MULTIMEDIA ADVANCED MULTIMEDIA CODING CODING Fernando Pereira Instituto Superior Tcnico Audiovisual Communications, Fernando Pereira, 2011 Video Coding in MPEG Video Coding in MPEG-4 Video Coding in MPEG Video Coding in MPEG-4

1.23k views • 104 slides

Dynamical systems Expanding maps on the circle. Coding Jana Rodriguez Hertz ICTP 2018 coding

coding the shift transformation Dynamical systems Expanding maps on the circle. Coding Jana Rodriguez Hertz ICTP 2018 coding the shift transformation coding Index coding 1 coding the space + 2 the shift transformation 2

1.46k views • 107 slides

Risk-Based Coding and Reimbursement What is Risk-Based Coding? Risk-Based Coding Overview A

Risk-Based Coding and Reimbursement What is Risk-Based Coding? Risk-Based Coding Overview A diagnosis coding methodology utilized in risk adjustment models to adjust cost for all patients within a health plan or group Risk

385 views • 23 slides

Entropy Coding Definition of Entropy Three Entropy coding techniques: (taken from the

Outline Entropy Coding Definition of Entropy Three Entropy coding techniques: (taken from the Technion) Huffman coding Arithmetic coding Lempel-Ziv coding 2 Entropy Definitions Alphabet : A finite set containing at least

403 views • 10 slides

Coding and Applications in Sensor Networks Coding and Applications in Sensor Networks Why coding?

Coding and Applications in Sensor Networks Coding and Applications in Sensor Networks Why coding? Why coding? Information compression Robustness to errors (error correction codes) Two categories: Two categories: Source

505 views • 23 slides

Applications of Random Coding and Algebraic Coding Theories to Universal Lossless Source Coding

Universal Lossless Coding Performance Bounds 1 Applications of Random Coding and Algebraic Coding Theories to Universal Lossless Source Coding Performance Bounds Gil I. Shamir Department of Electrical & Computer Engineering

370 views • 21 slides

Coding and Applications in Sensor Networks Why coding? Information compression

Coding and Applications in Sensor Networks Why coding? Information compression Robustness to errors (error correction codes) Two categories: Source coding Channel coding Source coding Compression. What is the

917 views • 72 slides

6/4/2019 POWER OF THE PAUSE: LEARNING OUTCOMES THERAPEUTIC BENEFITS OF SILENCE 1. Identify

6/4/2019 POWER OF THE PAUSE: LEARNING OUTCOMES THERAPEUTIC BENEFITS OF SILENCE 1. Identify three ways silence benefits our brain health. 2. Articulate one practical strategy that incorporates the use of therapeutic silence within TR practice.

415 views • 8 slides

CODING: ICD-10 CODING & UB-04 CODING FOR PDPM NELIA ADACI RN, BSN CDONA, DNS-CT, RAC-CTA

7/22/2019 GREATER NY HEALTHCARE FACILITIES ASSOCIATION THE PROOF IS IN THE pudding CODING: ICD-10 CODING & UB-04 CODING FOR PDPM NELIA ADACI RN, BSN CDONA, DNS-CT, RAC-CTA Vice President The CHARTS Group CMSs MESSAGE: If you do

836 views • 42 slides

Lecture 5 Lossless Coding (II) May 20, 2009 Shujun LI ( ): INF-10845-20091 Multimedia

Shujun LI ( ): INF-10845-20091 Multimedia Coding Lecture 5 Lossless Coding (II) May 20, 2009 Shujun LI ( ): INF-10845-20091 Multimedia Coding Outline Review Arithmetic Coding Dictionary Coding Run-Length

845 views • 51 slides

Lecture 11 Vector Linear Network Coding Vector Linear Network Coding Outline Fundamentals for

Introduction to Network Coding Tuvi Etzion Lecture 11 Vector Linear Network Coding Vector Linear Network Coding Outline Fundamentals for vector network coding The combination network Vector network code vs. scalar network code 2 Multicast

657 views • 32 slides

Speech & Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jrgen

Speech & Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jrgen Ahlberg Outline Part I - Speech Speech History of speech synthesis & coding Speech coding methods Part II Audio

598 views • 32 slides

Image and Video Coding: Hybrid Video Coding s n 1 [ x , y ] s n [ x , y ] m k = ( m x , m

Image and Video Coding: Hybrid Video Coding s n 1 [ x , y ] s n [ x , y ] m k = ( m x , m y ) k x y Hybrid Video Coding Last Lectures: Block-Based Image Coding s k [ x , y ] u k [ x , y ] t k [ x , y ] q k [ x , y ] bitstream scalar

958 views • 42 slides

VIDEO SIGNALS Lossless coding g LOSSLESS CODING LOSSLESS CODING The goal of lossless image

VIDEO SIGNALS Lossless coding g LOSSLESS CODING LOSSLESS CODING The goal of lossless image compression is to The goal of lossless image compression is to represent an image signal with the smallest possible number of bits without loss

783 views • 60 slides

From Serial to Parallel A simple training using the Martix-Vector multiplication algorithm Petros

From Serial to Parallel A simple training using the Martix-Vector multiplication algorithm Petros Anastasiadis National Technical University of Athens 1 From Serial to Parallel www.prace-ri.eu The problem: Dense Matrix-Vector Multiplication

781 views • 26 slides

data-driven AI Using data about models to accelerate ML development Ramesh Sridharan

How Captricity built a human-level handwriting recognition engine using data-driven AI Using data about models to accelerate ML development Ramesh Sridharan @tweetsbyramesh Machine learning has the potential to change industries but ML

380 views • 34 slides

PARALLEL SESSION B SUPPLIES TO MULTIPLE LOCATION ENTITIES 17-18 April 2014 Tokyo, Japan Rob

PARALLEL SESSION B SUPPLIES TO MULTIPLE LOCATION ENTITIES 17-18 April 2014 Tokyo, Japan Rob Dalla Costa Australian Treasury Which approach is for you? The Guidelines contained a range of approaches Direct use, Direct delivery, and the

596 views • 8 slides

While waiting for our session to begin: 1. Make sure you have a DARS report with your intended

While waiting for our session to begin: 1. Make sure you have a DARS report with your intended program (electronic or printed is fine) If you do not have a DARS, please visit the CAE lab now to print one Before we begin the presentation

435 views • 26 slides

Photovoltaic-Thermal Systems (PVT) achieve market relevance Thomas Ramschak, AEE INTEC

Photovoltaic-Thermal Systems (PVT) achieve market relevance Thomas Ramschak, AEE INTEC http://task60.iea-shc.org/ Task Organisation Operating Agent JC Hadorn, Switzerland A B C PVT systems in PVT Performance PVT Systems operation

520 views • 15 slides

Full Network Model: Scheduling and Pricing Scott Harvey Member: California Market Surveillance

Full Network Model: Scheduling and Pricing Scott Harvey Member: California Market Surveillance Committee November 15, 2013 (Corrected November 19, 2013) INTERCHANGE PRICES In LMP markets, interchange prices differ across transactions for two

523 views • 21 slides

The consolidation of the TOTEM DAQ with the Scalable Readout System (SRS) Adrian Fiergolski

SRS Opto-Fec card The OptoRx-Fec setup The consolidation of the TOTEM DAQ with the Scalable Readout System (SRS) Adrian Fiergolski (Warsaw University of Technology, Poland) on behalf of the TOTEM DAQ group CMS GEM Firmware, January 2013

276 views • 14 slides

Financial Institution D&O Litigation: Parallel Proceedings Representing D&Os Faced with

Presenting a live 90 minute webinar with interactive Q&A Financial Institution D&O Litigation: Parallel Proceedings Representing D&Os Faced with Multiple Agency Proceedings, Private Civil Litigation, and Criminal Actions WEDNES

666 views • 32 slides