# Data Visualization
Data Visualization is the graphical representation of information and data, using visual elements like charts, graphs, and maps to reveal patterns, trends, and insights that would be difficult to detect in raw numbers. The field draws on statistics, [[Cognitive Psychology]], graphic design, and [[Human-Computer Interaction (HCI)]] to create effective visual communications. Pioneered by figures like William Playfair (inventor of bar and pie charts) and refined by [[Edward Tufte]], data visualization has become essential to data science, journalism, and business intelligence.
Effective visualization leverages the human visual system's ability to rapidly process spatial relationships, color differences, and patterns. Key principles include choosing appropriate chart types for data relationships, maintaining visual integrity (not distorting data), and minimizing non-essential elements ([[Edward Tufte]]'s "data-ink ratio"). The rise of interactive visualization tools and dashboards has democratized the field, while the emergence of data journalism has brought sophisticated graphics to mainstream media.
## Chart Types by Data Relationship
| Relationship | Chart Type | Example Use |
|--------------|------------|-------------|
| **Comparison** | Bar, column, lollipop | Sales by region |
| **Distribution** | Histogram, box plot, violin | Age distribution |
| **Composition** | Pie, stacked bar, treemap | Budget breakdown |
| **Trend over time** | Line, area, sparkline | Stock prices |
| **Correlation** | Scatter, bubble, heatmap | Height vs weight |
| **Geospatial** | Choropleth, cartogram | Election maps |
| **Part-to-whole** | Donut, waterfall, sunburst | Revenue sources |
| **Hierarchy** | Treemap, sunburst, dendrogram | File system |
| **Network** | Node-link, adjacency matrix | Social connections |
## Visualization Process
```
1. Define Purpose
└── What question are you answering?
↓
2. Understand Data
└── Types, ranges, quality, relationships
↓
3. Choose Encoding
└── Chart type, visual variables
↓
4. Design & Refine
└── Layout, labels, colors, annotations
↓
5. Test & Iterate
└── Does audience understand the insight?
```
## Visual Variables (Bertin)
| Variable | Best For | Example |
|----------|----------|---------|
| **Position** | Quantitative comparison | Scatterplot x/y |
| **Length** | Quantitative | Bar height |
| **Area** | Rough comparison | Bubble size |
| **Color hue** | Categories | Line colors |
| **Color saturation** | Magnitude | Choropleth intensity |
| **Shape** | Categories | Marker types |
| **Orientation** | Limited categories | Arrow direction |
## Key Figures
| Person | Contribution |
|--------|--------------|
| William Playfair | Invented bar/pie/line charts (1786-1801) |
| [[Edward Tufte]] | Data-ink ratio, small multiples, sparklines |
| [[Otto Neurath]] | ISOTYPE pictorial statistics |
| Jacques Bertin | Semiology of Graphics, visual variables |
| [[Nigel Holmes]] | Explanation graphics at TIME |
| Ben Shneiderman | Treemaps, information visualization |
| Hans Rosling | Animated bubble charts, Gapminder |
## Tools
| Tool | Type | Strengths |
|------|------|-----------|
| **D3.js** | JavaScript library | Flexibility, custom visuals |
| **Tableau** | BI tool | Ease of use, dashboards |
| **Power BI** | BI tool | Microsoft integration |
| **Matplotlib** | Python library | Scientific visualization |
| **ggplot2** | R library | Grammar of graphics |
| **Observable** | Notebook | Interactive, shareable |
| **Datawrapper** | Web tool | Quick, publication-ready |
## Common Mistakes
| Mistake | Problem |
|---------|---------|
| **Pie charts for many categories** | Hard to compare slices |
| **Truncated axes** | Exaggerates differences |
| **3D effects** | Distorts perception |
| **Rainbow color scales** | Not perceptually uniform |
| **Dual y-axes** | Misleading comparisons |
| **Overplotting** | Data points obscure each other |
| **Too much data** | Overwhelms rather than clarifies |
## References
- Tufte, E. (1983). *The Visual Display of Quantitative Information*
- Few, S. (2012). *Show Me the Numbers*
- https://en.wikipedia.org/wiki/Data_visualization
## Related
- [[Information Design]]
- [[Visual Communication]]
- [[Edward Tufte]]
- [[Nigel Holmes]]
- [[Otto Neurath]]
- [[Treemap]]
- [[Sparklines]]
- [[Infographics]]
- [[Statistics]]
- [[Dashboard Design]]