thr throu ough ghput put biol biolog ogical ical data
play

THR THROU OUGH GHPUT PUT BIOL BIOLOG OGICAL ICAL DATA Leishi - PowerPoint PPT Presentation

VISU VISUALIZA ALIZATION TION FOR FOR HIGH HIGH THR THROU OUGH GHPUT PUT BIOL BIOLOG OGICAL ICAL DATA Leishi ishi Zha Zhang 1 , , Jasn sna Kulj ljis is 2 and Xiao Xiaohui i Liu Liu 2 1 Univ Univer ersity sity of of Kon


  1. VISU VISUALIZA ALIZATION TION FOR FOR HIGH HIGH THR THROU OUGH GHPUT PUT BIOL BIOLOG OGICAL ICAL DATA Leishi ishi Zha Zhang 1 , , Jasn sna Kulj ljis is 2 and Xiao Xiaohui i Liu Liu 2 1 Univ Univer ersity sity of of Kon onsta stanz, nz, Ger German many 2 Br Brun unel el Univ Univer ersity sity, , UK UK

  2. HIGH THROUGHPUT EXPERIMENTS DNA microarrays High throughput sequencing High content screening Small molecule microarrays 2

  3. HT DATA • large • high-dimensionality • heterogeneous How to make sense out of the data? – visual analytics automated data analysis + interactive visualizations 3

  4. VISUAL ANALYTICS FOR HT DATA Visual Analytics combines • automated data analysis (statistics and data mining methods) • interactive visualization (visual parameters, graphical representations, and human computer interactions) In this talk, I will discuss… • existing visualization techniques • open issues 4

  5. VISUALIZATION DESIGN Information seeking paradigm “Overview First, Zoom and Filter, Details on Demand” - Ben Shneiderman, 1996 What is important? • providing overview as well as details • showing patterns and relations • supporting dynamic queries 5

  6. VISUAL PARAMETER DESIGN Various visual channels • color • Shape • size • position • texture • … • Challenge: how to effectively use/combine different visual parameters to show interesting part of the data? ?... 6

  7. VISUAL PARAMETER DESIGN Play with the parameters line? line+ colors? offset? two tone colors?… 7

  8. GRAPHICAL REPRESENTATION DESIGN Challenge: given the large data, how to design graphical representations which: • highlight patterns and relations • show both overview and details 8

  9. GR DESIGN - OVERVIEW (1) Mapping data values – heatmap vs. parallel coordinates easy to see value differences no overlap 9

  10. GR DESIGN - OVERVIEW (2) Mapping distance/similarity between objects to a 2D/3D display as scatterplot or grids: dimension reduction – Projection Pursuit – Principle Component Analysis – Multi Dimensional Scaling – Self Organising Map – ISOMAP – Locally Linear Embedding – Stochastic Neighbourhood Embedding – Generative Topographic Mapping – … Focus: best approximate the structure (pairwise distance, and/or neighborhood info.) of data in the low dimensional visual space 10

  11. GR DESIGN - OVERVIEW (3) Divide & display: small multiples show details of data dimensions ordering is crucial 11

  12. GR DESIGN – DETAILED INFO (1) Data summarization, detailed comparison and correlation analysis • density plot • box-and-whisker plot • radar/spider plot • correlation plot • … provide good support for statistical analysis, and comparison between subsets of data 12

  13. GR DESIGN – DETAILED INFO (2) none – hierarchical relations Links and relations • force-directed • matrix view • treemap • hyperbolic view • dendrogram • … 13 hierarchical relations

  14. HUMAN COMPUTER INTERACTIONS Design interactive user interface • zooming • panning • linking and brushing • … Typically a HT Data Analysis tool integrates multiple visualization panels with linking and brushing and other mouse/keyboard functions to support dynamic query and detail-on-demand visual data analysis 14

  15. OPEN ISSUES • Scalability 15

  16. OPEN ISSUES • Scalability (hardware, software) • Visualizing uncertainties in data • Visualizing evolving changes • Evaluating quality of visual representations 16

  17. Thank you very much for your attention!  17

Recommend


More recommend