cs171 visualization
play

CS171 Visualization Alexander Lex alex@seas.harvard.edu Tables - PowerPoint PPT Presentation

CS171 Visualization Alexander Lex alex@seas.harvard.edu Tables [xkcd] This Week Reading: VAD, Chapters 6 & 7 Lecture 9: Tables Lecture 10: Graphs Sections: Designing your Visualization Homework 1 Review Score Distribution Average:


  1. CS171 Visualization Alexander Lex alex@seas.harvard.edu Tables [xkcd]

  2. This Week Reading: VAD, Chapters 6 & 7 Lecture 9: Tables Lecture 10: Graphs Sections: Designing your Visualization

  3. Homework 1 Review

  4. Score Distribution Average: 7.8

  5. How Difficult?

  6. How Long? N=81 Average: 33.85 Goal: 20

  7. Which part took longest?

  8. Office Hours Attendance

  9. Are Sections Helpful?

  10. Section Comments “Pertinent and just enough momentum to get you thinking in the right direction. Section presenter delivered an enthusiastic and polished lesson.” “Topics covered were too easy! Homework problems were way harder.”

  11. Design Studio

  12. Design Studio Comments “I felt it was a huge waste of time because I'm still struggling with d3 let alone attempting a creative design. Also, we didn't really do anything in class.” “DESIGN STUDIOS ARE HARD. Wow, it was cool to see our group trying to think of all of the complex things we could draw and just how quickly it all got overly complex. Might be nice to see an example DS after HW2 is submitted.” “A lot of fun!” “nice chance to interact with more people while working”

  13. General Difficulty

  14. General Comments “The learning curve is quite steep for someone who does not do programming regularly” “I think there is a large discrepancy between the contents of lecture and the problem sets that we are given. Generally, I don't understand why most of the lectures focus on visualization theory and do not discuss actual coding itself.” “Theory might need to be a little bit harder. Some of the code, I think is too hard. Really freaking good course though.” “Please teach us some real code and design problems in lecture. It's a disaster for people who learn Javascript first time.”

  15. What you need to know Lecture Reading Theory Discussion Sections D3 reading Design Lecture Self-study Design Studios Office hours Design Skills Coding Skills

  16. Half-Life of Knowledge How useful Fundamentals & your problem solving skills knowledge (University Education) is Knowledge about a specific 
 technology (Tutorials, etc.) Time

  17. Half-Life of Knowledge HW 1 HW 2 HW 3 HW 4 Project How useful Visualization your Principles knowledge and Theory is Your D3/JavaScript Ninja Skills Time

  18. Two Weeks Ago Vis Guidelinies Tasks

  19. Can you spot the differences?

  20. Start Scales at 0? A. Kriebel, VizWiz

  21. Global Warming? The Daily Mail, UK, Jan 2012

  22. Global Warming? Mother Jones

  23. Global Warming - Frame the Data Mother Jones

  24. Which is better? [Bateman et al. 2010]

  25. Tasks Why are we using Visualization?

  26. Domain and Abstract Tasks Infinite numbers of domain tasks Can be broken down into simpler abstract tasks We know how to address the abstract tasks! Identify task - data combination: solutions probably exist

  27. High-level actions: Analyze Analyze Consume Consume discover vs present Discover Present Enjoy classic split: explore vs explain enjoy: casual, social Produce Produce Annotate Record Derive Annotate, record tag Derive: crucial design choice

  28. Example: Derive

  29. Actions: Mid-level search, low- level query Search what does user know? Target known Target unknown Location Lookup Browse target, location known Location Locate Explore unknown how much of the data Query matters? Identify Compare Summarize one, some, all

  30. Example Compare (& Derive)

  31. Why: Targets NETWORK DATA ALL DATA Topology Trends Outliers Features Paths ATTRIBUTES One Many SPATIAL DATA Dependency Correlation Similarity Distribution Shape Extremes

  32. How? A Preview Encode Manipulate Facet Reduce Arrange Change Juxtapose Filter Express Separate Select Partition Aggregate Order Align Use Navigate Superimpose Embed Map from categorical and ordered attributes

  33. Design Critique

  34. CodeSwarm: http://goo.gl/9exsZH http://vis.cs.ucdavis.edu/~ogawa/codeswarm/

  35. Tables & Multi- Dimensional Data

  36. Basic Plots for Basic Tasks Search ALL DATA Target known Target unknown Trends Outliers Features Location Lookup Browse known Location Locate Explore unknown ATTRIBUTES One Many Query Dependency Correlation Similarity Distribution Identify Compare Summarize Extremes

  37. Comparisons

  38. Bar Chart

  39. Direction Nicolas Rapp

  40. Baseline Problem Flowing Data

  41. Baseline Problem Flowing Data

  42. Different Baselines https://eagereyes.org/basics/baselines

  43. Plot Change Instead https://eagereyes.org/basics/baselines

  44. Trends Over Time http://xkcd.com/605/

  45. Line Charts matplotlib gallery

  46. Bars vs. Lines Lines imply connections & 
 sampling from continuous data. Do not use for categorical 
 data. Zacks 1999

  47. Don’t Use bar charts to compare ratings of books… “Visualizing The Wheel of Time: Reader Sentiment for an Epic Fantasy Series”, J. Siddle, Sept 2013

  48. Baseline Problem (again) True Baseline Clipped Baseline Plotting Change https://eagereyes.org/basics/baselines

  49. Linear vs. Logarithmic Scale Linear Scale Log Scale http://xkcd.com/1162/ Apple Stock Price http://finance.yahoo.com/echarts?s=AAPL

  50. Aspect Ratios Rule of Thumb: Banking to 45º (average line 
 slope: 45º) eagereyes.org

  51. Don’t

  52. Correlations

  53. Scatterplots

  54. Trivariate Data class fare age Do NOT use 3D scatterplots!

  55. Trivariate Data Map the third dimension to some other 
 visual attribute

  56. Overplotting alpha = 1/100

  57. Trend Lines

  58. Compositions

  59. Pie Charts http://xkcd.com/197/

  60. Pie vs. Bar Charts

  61. Donut Chart The Economist Daily Chart

  62. Stacked Bar Chart

  63. Stacked Bar Chart vs. VizWiz Blog

  64. Comparison of bar chart types Pie Chart Stacked bar chart Layered 
 Bar 
 Chart Small 
 Multiples Grouped 
 Bar 
 Chart Streit & Gehlenborg, PoV, Nature Methods, 2014

  65. LineUp Video at http://lineup.caleydo.org

  66. Stacked Area Chart http://stackoverflow.com/questions/2225995/how-can-i-create-stacked-line-graph-with-matplotlib

  67. 100% Stacked Area Chart http://stackoverflow.com/questions/16875546/create-a-100-stacked-area-chart-with-matplotlib

  68. Stacked Area vs. Line Graphs leancrew.com & Practically Efficient

  69. VizWiz, A. Kriebel

  70. Distributions

  71. Histogram # passengers #bins hard to predict make interactive! age rule of thumb: #bins = sqrt(n) 10 Bins # passengers age 20 Bins

  72. Density Plots http://web.stanford.edu/~mwaskom/software/seaborn/tutorial/plotting_distributions.html

  73. Heat Maps binning of scatterplots 2D Density Plots

  74. Box(and Whisker) Plots http://xkcd.com/539/

  75. Box Plots aka Box-and-Whisker Plot Wikipedia

  76. Comparison Streit & Gehlenborg, PoV, Nature Methods, 2014

  77. Violin Plot = Box Plot + Probability Density Function http://web.stanford.edu/~mwaskom/software/seaborn/tutorial/plotting_distributions.html

  78. Showing Expected Values & Uncertainty Error Bars Considered Harmful: Exploring Alternate Encodings for Mean and Error Michael Correll, and Michael Gleicher

  79. Table Lens Rao & Card 1994

  80. Bertifier Matrix/Table representation Authoring Interface http://www.aviz.fr/bertifier Charles Perin, Pierre Dragicevic and Jean-Daniel Fekete

  81. Highdimensional Data

  82. What is High-dimensional Data? Age Gender Height Tabular data, containing Bob 25 M 181 rows (items) Alice 22 F 185 Chris 19 M 175 columns (attributes or items) rows >> columns

  83. High-Dimensional Data Visualization Homogeneity Same data type? How many dimensions? Same scales? ~50 – tractable with “just” vis ~1000 – need analytical methods Age Gender Height How many records? Bob 25 M 181 ~ 1000 – “just” vis is fine Alice 22 F 185 Chris 19 M 175 >> 10,000 – need analytical methods BPM 1 BPM 2 BPM 3 Bob 65 120 145 Alice 80 135 185 Chris 45 115 135

  84. Analytic Component Multidimensional Scaling Scatterplot Matrices 
 [Doerk 2011] [Bostock] Pixel-based visualizations / 
 heat maps Parallel Coordinates 
 [Bostock] [Chuang 2012] no / little analytics strong analytics 
 component

  85. More next time …

Recommend


More recommend