
Histogram-based I/O Optimization for Visualizing Large-scale Data - PowerPoint PPT Presentation



  1. Histogram-based I/O Optimization for Visualizing Large-scale Data
www.ultravis.org
Yuan Hong, The Ohio State University
Tom Peterka, Argonne National Laboratory
Han-Wei Shen, The Ohio State University
Contact: Tom Peterka, tpeterka@mcs.anl.gov, Mathematics and Computer Science Division

  2. I/O Optimization for Visualization: Motivation
Motivation:
- Parallel I/O is necessary, but not sufficient.
- Performance of parallel visualization is bound by data movement.
- The effect of space-filling curves diminishes as process count increases.
- Visualization techniques resulting in sparse traversal can exacerbate the problem.
Idea: Consider both visibility culling and spatial locality when ordering data. Sample a variety of view directions and construct a histogram of visible blocks, independent of transfer function. Reorder data accordingly to balance load across file servers and produce contiguous access.
SC09 Ultrascale Visualization Workshop, November 16, 2009. Tom Peterka, tpeterka@mcs.anl.gov

  3. Related Literature: Background
Visibility culling:
- Gao et al., Visibility Culling Using Plenoptic Opacity Functions for Large Volume Visualization, Vis '03.
- Zhang et al., Visibility Culling Using Hierarchical Occlusion Maps, SIGGRAPH '97.
Out-of-core methods:
- Pascucci and Frank, Global Static Indexing for Real-Time Exploration of Very Large Regular Grids, SC '01.
- Isenburg and Lindstrom, Streaming Meshes, Vis '05.
Collective I/O:
- Thakur et al., Optimizing Noncontiguous Accesses in MPI-IO, Parallel Computing '02.
- Smirni et al., Algorithmic Influences on I/O Access Patterns and Parallel File System Performance, ICPADS '97.

  4. Algorithm: Overview
The algorithm consists of: partitioning the data into blocks, sampling views on a view sphere, computing a view histogram for each view direction, concatenating the view histograms into per-block feature vectors, grouping similar feature vectors into clusters, and striping the data blocks onto parallel storage according to the clusters.
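The steps above can be sketched in a few lines of Python. This is a toy illustration, not the paper's implementation: the names `sample_views`, `block_features`, and `order_for_storage` are ours, depth binning along the view direction stands in for true visibility culling, and a lexicographic sort stands in for the clustering step.

```python
import math

def sample_views(n_views):
    """Quasi-uniformly sample view directions on the unit sphere
    (Fibonacci spiral; any uniform sampling scheme would do)."""
    golden = math.pi * (3.0 - math.sqrt(5.0))
    views = []
    for i in range(n_views):
        z = 1.0 - 2.0 * (i + 0.5) / n_views
        r = math.sqrt(1.0 - z * z)
        views.append((r * math.cos(golden * i), r * math.sin(golden * i), z))
    return views

def block_features(centers, views, n_bins=8):
    """One bin index per (block, view). Depth along the view direction is a
    toy stand-in for the paper's per-view visibility classification."""
    feats = [[0] * len(views) for _ in centers]
    for j, v in enumerate(views):
        depths = [sum(c * d for c, d in zip(ctr, v)) for ctr in centers]
        lo, span = min(depths), (max(depths) - min(depths)) or 1.0
        for i, dep in enumerate(depths):
            feats[i][j] = min(n_bins - 1, int((dep - lo) / span * n_bins))
    return feats

def order_for_storage(feats, n_servers):
    """Sort blocks so that similar feature vectors land contiguously (a
    stand-in for clustering), then stripe them round-robin over servers."""
    order = sorted(range(len(feats)), key=lambda i: feats[i])
    return order, [rank % n_servers for rank in range(len(order))]
```

Blocks whose feature vectors agree in most views end up adjacent in the storage order, which is what produces the contiguous access pattern the slides describe.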

  5. Compute View Histograms and Feature Vectors: Classify data in all view directions
[Figure: per-block feature vector, 128 bytes]
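One plausible reading of the "128 bytes" figure (our assumption; the slide does not spell it out): with a 1-bit visibility flag per sampled view, the 1024 views mentioned on the next slide pack into exactly 128 bytes per block. A minimal packing sketch:

```python
def pack_visibility(flags):
    """Pack per-view visibility flags (booleans) into a compact byte string,
    least-significant bit first within each byte."""
    out = bytearray((len(flags) + 7) // 8)
    for i, visible in enumerate(flags):
        if visible:
            out[i // 8] |= 1 << (i % 8)
    return bytes(out)
```

With 1024 sampled views, `len(pack_visibility([True] * 1024))` is 128.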

  6. Feature Vector Computational Cost: Scalable parallel implementation
Left: The variance across all histogram bins and all view directions as a function of the number of view directions. The variance changes slowly after 256 sampled views, indicating that more samples are not necessary. Viswoman dataset.
Right: Total preprocessing time for the supernova dataset, from 256 to 2048 cores, on Argonne's BG/P system. The dataset is 276 GB, and 1024 views were sampled in under seven minutes.

  7. Organizing Data in Storage: Layout parameters
Left: A block size of 16^3 has the best I/O performance for the Viswoman dataset, irrespective of process count. The block size is chosen to be a multiple of the read buffer size, 16 KB in our default MPI-IO implementation.
Right: I/O time vs. stripe size for the Viswoman dataset. The optimal stripe size is the average cluster size that results from clustering feature vectors.

  8. End-to-End Performance: Test conditions, datasets, total and component time
Test conditions:
- System: IBM BG/P at Argonne National Laboratory, PVFS file system
- Viswoman dataset: 512x512x1728, 2-byte short ints, 16^3 blocks
- Richtmyer-Meshkov Instability (RMI) dataset: 2048x2048x1920, 1-byte chars, 32^3 blocks
- Supernova dataset: 3456x3456x3456 supersampled, 4-byte floats, 16^3 blocks
Viswoman volume rendering performance with the histogram-optimized method:
# Procs   I/O time (s)   Render time (s)   Composite time (s)   Total time (s)
64        4.37           1.02              1.20                 6.59
128       3.66           0.46              0.80                 4.92
256       3.43           0.33              0.80                 4.56
512       1.77           0.20              0.60                 2.57
1024      0.91           0.12              0.50                 1.53
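A quick way to read the table is to compute relative speedup on total time (values copied from the table above; the helper name is ours):

```python
# Total times (seconds) from the performance table, keyed by process count.
totals = {64: 6.59, 128: 4.92, 256: 4.56, 512: 2.57, 1024: 1.53}

def speedup(base_procs, procs):
    """Relative speedup of `procs` over `base_procs` on total time."""
    return totals[base_procs] / totals[procs]
```

For example, going from 64 to 1024 processes (16x more) reduces total time by roughly a factor of 4.3, with I/O time the dominant component throughout.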

  9. Comparison to Space-Filling Curves: I/O time for three datasets
Top: I/O time for the Viswoman, RMI, and supernova datasets. Bottom: compositing, rendering, and I/O time for the supernova dataset, histogram-optimized vs. Z curve. In all test cases, the histogram-optimized method performs better than canonical organization and space-filling curves.
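For reference, the Z-curve baseline orders blocks by interleaving the bits of their grid coordinates; a minimal sketch (function name ours):

```python
def morton3(x, y, z, bits=10):
    """Z-order (Morton) index of a 3-D block coordinate: interleave the
    bits of x, y, z so that spatially nearby blocks tend to get nearby
    indices in the one-dimensional storage order."""
    code = 0
    for b in range(bits):
        code |= ((x >> b) & 1) << (3 * b)
        code |= ((y >> b) & 1) << (3 * b + 1)
        code |= ((z >> b) & 1) << (3 * b + 2)
    return code
```

Sorting blocks by `morton3` of their coordinates yields the Z-curve layout compared above; it preserves spatial locality but, unlike the histogram-optimized layout, knows nothing about visibility.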

  10. Comparison to Hilbert Curve: Across view directions and time-steps
Left: The standard deviation of I/O time in RMI across 256 random view directions demonstrates consistent performance over a variety of view conditions.
Right: I/O time across 64 time-steps of RMI with 512 processors demonstrates consistent performance over a time-varying dataset.

  11. Independent of Transfer Function: Various opacities, single and multimodal
I/O time for the histogram-optimized method and the Hilbert curve on the supernova dataset, rendered with a variety of transfer functions. The transfer functions were generated synthetically using a nonlinear computation that stochastically produces one or more modes.

  12. Histogram-based I/O Optimization for Visualizing Large-scale Data
Successes:
- Data organization based on visibility culling and spatial locality
- Scalable feature classification time
- Improved volume rendering performance over space-filling curves
- Transfer function independence
Limitations / Future work:
- Scale to a higher number of processes
- Zoom
- Higher-dimension transfer functions
- Other storage and file systems
- Heuristics for usage
www.ultravis.org
Tom Peterka, tpeterka@mcs.anl.gov, Mathematics and Computer Science Division
Acknowledgments: Argonne Leadership Computing Facility; US DOE SciDAC UltraVis Institute
