information visualization tables
play

Information Visualization Tables Tamara Munzner Department of - PowerPoint PPT Presentation

Information Visualization Tables Tamara Munzner Department of Computer Science University of British Columbia Lect 6/7, 23/28 Jan 2020 https://www.cs.ubc.ca/~tmm/courses/436V-20 Tables 2 Focus on Tables Dataset Types Spatial Net Tables


  1. Information Visualization Tables Tamara Munzner Department of Computer Science University of British Columbia Lect 6/7, 23/28 Jan 2020 https://www.cs.ubc.ca/~tmm/courses/436V-20

  2. Tables 2

  3. Focus on Tables Dataset Types Spatial Net Tables Networks Fields (Continuous) Geometry (Spatial) Attributes (columns) Link Items Grid of positions (rows) Node (item) Cell Position Cell containing value Node em) Attributes (columns) Multidimensional Table Trees Value in cell Value in cell 3

  4. Exercise: Sketch 2 ways to visualize each table Furthest BPM T1 BPM T2 BPM T3 Age Best 100 m Sex Jump Amy 90 130 150 Amy 16 13.2 5.2 F Basil 70 110 109 Basil 18 12.4 4.2 F Clara 60 140 141 Clara 14 14.1 2.5 F Desmond 84 100 108 Desmond 22 10.01 6.3 M Charles 81 110 130 Charles 19 11.3 5.3 M • socrative: answer when done 4

  5. Tackling tables Age Gender Height Bob 25 M 181 • homogeneity Alice 22 F 185 Chris 19 M 175 –same data type? same scales? BPM 1 BPM 2 BPM 3 Bob 65 120 145 Alice 80 135 185 Chris 45 115 135 • need different approaches based on scale –how many attributes? • up to ~50: tractable with direct visual encoding • thousands: need transformations / analytical methods –how many items? • up to 1K: tractable with direct visual encoding • >> 10K: need transformations / analytical methods 5

  6. Analytic Component Analytic component Multidimensional Scaling Scatterplot Matrices [Doerk 2011] [Bostock] Pixel-based visualizations / heat maps Parallel Coordinates [Bostock] [Chuang 2012] no / little analytics strong analytics component 6

  7. Tasks and techniques Magnitude Distribution Deviation Correlation Ranking Part to whole Change over Time https://github.com/ft-interactive/chart-doctor/tree/master/visual-vocabulary 
 7 https://gramener.github.io/visual-vocabulary-vega/#/Magnitude/

  8. Keys and values Tables • key Attributes (columns) –independent attribute Items (rows) –used as unique index to look up items Cell containing value –simple tables: 1 key –multidimensional tables: multiple keys Multidimensional Table • value –dependent attribute, value of cell Value in cell • classify arrangements by key count –0, 1, 2, many... Express Values 1 Key 2 Keys 3 Keys Many Keys List Matrix Volume Recursive Subdivision 8

  9. 0 Keys: Express values (magnitudes) Express Values 1 Key 2 Keys 3 Keys Many Keys List Matrix Volume Recursive Subdivision 9

  10. Idiom: scatterplot Express Values • express values –quantitative attributes • no keys, only values –data • 2 quant attribs –mark: points –channels • horiz + vert position –tasks • find trends, outliers, distribution, correlation, clusters –scalability • hundreds of items [A layered grammar of graphics. Wickham. Journ. Computational and Graphical Statistics 19:1 (2010), 3–28.] 10

  11. Scatterplots: Encoding more channels • additional channels for point marks –color –size (bubbleplots) • square root since area grows quadratically, radius is misleading –shape https://observablehq.com/@d3/scatterplot-with-shapes https://www.d3-graph-gallery.com/graph/bubble_basic.html 11

  12. Scatterplot tasks • correlation https://www.mathsisfun.com/data/scatter-xy-plots.html • clusters/groups, and clusters vs classes https://www.cs.ubc.ca/labs/imager/tr/2014/DRVisTasks/ 12

  13. Some keys Express Values 1 Key 2 Keys 3 Keys Many Keys List Matrix Volume Recursive Subdivision 13

  14. Some keys: Categorical regions Separate Order Align • regions : contiguous bounded areas distinct from each other –using space to separate (proximity) –following expressiveness principle for categorical attributes • use ordered attribute to order and align regions 1 Key 2 Keys 3 Keys Many Keys List Matrix Volume Recursive Subdivision 14

  15. Idiom: bar chart 100 100 • one key, one value 75 75 50 50 –data 25 25 • 1 categ attrib, 1 quant attrib 0 0 –mark: lines –channels Animal Type Animal Type • length to express quant value • spatial regions: one per mark – separated horizontally, aligned vertically – ordered by quant attrib » by label (alphabetical), by length attrib (data-driven) –task • compare, lookup values –scalability • dozens to hundreds of levels for key attrib 15

  16. Separated and Aligned but not Ordered LIMITATION: Hard to know rank. What’s the 4 th most? The 7 th ? [Slide courtesy of Ben Jones]

  17. Separated, Aligned and Ordered [Slide courtesy of Ben Jones]

  18. Separated but not Ordered or Aligned LIMITATION: Hard to make comparisons [Slide courtesy of Ben Jones]

  19. Idiom: stacked bar chart • one more key –data • 2 categ attrib, 1 quant attrib –mark: vertical stack of line marks • glyph : composite object, internal structure from multiple marks –channels • length and color hue • spatial regions: one per glyph – aligned: full glyph, lowest bar component https://www.d3-graph-gallery.com/graph/ – unaligned: other bar components barplot_stacked_basicWide.html –task • part-to-whole relationship –scalability • several to one dozen levels for stacked attrib 19

  20. Idiom: streamgraph • generalized stacked graph – emphasizing horizontal continuity [Stacked Graphs Geometry & Aesthetics. Byron and Wattenberg. IEEE Trans. Visualization and • vs vertical items Computer Graphics (Proc. InfoVis 2008) 14(6): 1245–1252, (2008).] – data • 1 categ key attrib (movies) • 1 ordered key attrib (time) • 1 quant value attrib (counts) – derived data • geometry: layers, where height encodes counts • 1 quant attrib (layer ordering) – scalability • hundreds of time keys • dozens to hundreds of movies keys – more than stacked bars, since most layers https://flowingdata.com/2008/02/25/ebb-and-flow-of-box-office-receipts-over-past-20-years/ don’t extend across whole chart 20

  21. Idiom: dot plot / line chart 20 • one key, one value 15 – data 10 • 2 quant attribs 5 – mark: points 
 AND line connection marks between them 0 – channels • aligned lengths to express quant value Year 20 • separated and ordered by key attrib into horizontal regions 15 – task 10 • find trend 5 – connection marks emphasize ordering of items along key axis by explicitly showing relationship between 0 one item and the next – scalability • hundreds of key levels, hundreds of value levels Year 21

  22. Choosing bar vs line charts 60 60 50 50 • depends on type of key 40 40 30 30 attrib 20 20 10 10 –bar charts if categorical 0 0 Female Male Female Male –line charts if ordered 60 60 50 50 • do not use line charts for 40 40 categorical key attribs 30 30 20 20 10 10 –violates expressiveness 0 0 10-year-olds 12-year-olds 10-year-olds 12-year-olds principle after [Bars and Lines: A Study of Graphic Communication. • implication of trend so strong Zacks and Tversky. Memory and Cognition 27:6 (1999), that it overrides semantics! 1073–1079.] – “The more male a person is, the taller he/she is” 22

  23. Chart axes • labelled axis is critical • avoid cropping y-axis –include 0 at bottom left –or slope misleads 23 http://www.thefunctionalart.com/2015/10/if-you-see-bullshit-say-bullshit.html

  24. Idiom: dual-axis line charts • controversial –acceptable if commensurate –beware, very easy to mislead! 24

  25. Idiom: connected scatterplots • scatterplot with line connection marks –popular in journalism –horiz + vert axes: value attribs –line connection marks: 
 temporal order –alternative to dual-axis charts • horiz: time • vert: two value attribs • empirical study –engaging, but correlation unclear [The Connected Scatterplot for Presenting Paired Time Series. http://steveharoz.com/research/connected_scatterplot/ Haroz, Kosara and Franconeri. IEEE TVCG 22(9):2174-86, 2016.] 25

  26. Choosing line chart aspect ratios • 1: banking to 45 (1980s) –Cleveland perceptual argument: most accurate angle judgement at 45 26 https://github.com/jennybc/r-graph-catalog/tree/master/figures/fig07-01_sunspot-data-aspect-ratio-1 https://github.com/jennybc/r-graph-catalog/tree/master/figures/fig07-02_annual-report-aspect-ratio-2

  27. Choosing line chart aspect ratios • 2: multi scale banking to 45 (2006) – frequency domain analysis to find ratios • FFT the data, convolve with Gaussian to smooth overall – find interesting spikes/ranges in power spectrum • cull nearby regions if similar, ensure overview – create trend curves (red) for each aspect ratio weekly daily [Multi-Scale Banking to 45 Degrees. Heer and Agrawala, Proc InfoVis 2006] 27

  28. Choosing line chart aspect ratios • 3: arc length based aspect ratio (2011) –minimize the arc length of curve 
 while keeping the area of the plot constant –parametrization and scale invariant –symmetry preserving –robust & fast to compute • meta-points from this progression –young field; prescriptive advice changes rapidly –reasonable defaults required deep dive into Arc perception meets math Banking to 45 Multiscale Banking Length [Arc Length-Based Aspect Ratio Selection. Talbot, Gerth, and Hanrahan. Proc InfoVis 2011] 28

Recommend


More recommend