dataplau-blog
dataplau-blog
DataPlau
15 posts
Andy's explorations in data & concept visualization. Most dates are approximations of when figures were originally generated.
Don't wanna be here? Send us removal request.
dataplau-blog · 11 years ago
Video
tumblr
Skillspace Explorer. Explore occupations with related skills in an intuitive path-like representation of multi-dimensional "Skillspace." Clicking on an occupation will highlight its ten nearest occupations, based on 218 skill, knowledge, and work activity measurements from the [O*Net](http://onetonline.org) database. Click on one of these "neighbors" to bring up its neighbors in turn. A "breadcrumb trail" shows your past selections; clicking on a breadcrumb will take you back to that occupation. Created in D3.js, using underlying data analysis from R.
4 notes · View notes
dataplau-blog · 11 years ago
Video
tumblr
The heartbeat of unemployment in Wisconsin. A new visualization, built from the ground up using D3.js. At higher speeds, the seasonal patterns of unemployment in Wisconsin are akin to a heartbeat -- until the recession hits in 2008.
4 notes · View notes
dataplau-blog · 11 years ago
Video
tumblr
Skillspace, animated in D3.js. An animated expansion of my earlier, R-generated graphic displaying skills measures of ~800 occupations on five summary axes. Visualization produced entirely in D3.js
1 note · View note
dataplau-blog · 11 years ago
Photo
Tumblr media
Multi-dimensional gauges, recreated in d3.js. Code and live version on bl.ocks.
0 notes
dataplau-blog · 11 years ago
Photo
Tumblr media
Bubble chart, re-created using d3.js. My first d3.js visualization, after one day of learning. Code and full svg version available on Bl.ocks.org and Gist/GitHub.
0 notes
dataplau-blog · 12 years ago
Photo
Tumblr media Tumblr media
Summarizing high-dimensional data. 218 measurements of job skills associated with >800 occupations are summarized by a Cluster Analysis and a Principal Components Analysis. Fifteen occupational clusters are summarized by their positions on the first five Principal Components (Physical/Intellectual work; Interpersonal/Technical interactions; Abstract/Concrete thought processes; Information-/Interaction- centered; Management/Production orientation). Click for full figure. Produced in R; polished in Adobe Illustrator (primarily to introduce the underlying gradient and optimize the text).
1 note · View note
dataplau-blog · 12 years ago
Photo
Tumblr media Tumblr media Tumblr media
Depicting high-dimensional clusters in Principal Component space. >800 occupations (small points) were placed into clusters ("Groups", large points) and subclusters ("Subgroups", medium points) based on similarity across 218 skills dimensions. Principal Components Analysis was used as a data reduction technique to minimize explanatory variables. These figures show the first two Principal Components. Produced in R and Adobe Illustrator.
0 notes
dataplau-blog · 13 years ago
Photo
Tumblr media Tumblr media
"Bubble table" demo. Employment ranges (dummy data) by occupation cluster and industry. Produced in R; cleaned in Adobe Illustrator.
0 notes
dataplau-blog · 15 years ago
Video
tumblr
Teaching slides: speciation mechanisms. Accumulation of genetic & phenotypic differences are depicted by changing colors. Allopatric speciation refers to speciation in regions separated by space or barriers. Parapatric speciation refers to speciation in a population continuously distributed in space, but usually experiencing an environmental gradient. Sympatric speciation refers to speciation in a population sharing the same space at some scale, though often adapting to two different microenvironments. Produced in Mathematica, Adobe Illustrator, and Keynote
0 notes
dataplau-blog · 17 years ago
Video
tumblr
Teaching slide: How genetic associations evolve. Demonstration of how natural selection can cause associations among genes that ultimately slow down further responses to natural selection. Produced in Mathematica, Adobe Illustrator, and Keynote.
0 notes
dataplau-blog · 18 years ago
Video
tumblr
Teaching slide: stable and unstable evolutionary equilibria (heterozygote advantage & disadvantage). In diploid organisms, when the fitness of individuals carrying two different versions of the same gene (heterozygotes) isn't intermediate between those carrying two copies of each of the two versions (homozygotes), there is an equilibrium point at which both versions will be present. This equilibrium point is stable if the heterozygote is the fittest genotype, but unstable if it's the least fit. This animation provides an explanation. Produced in Mathematica, Adobe Illustrator, and Keynote.
0 notes
dataplau-blog · 19 years ago
Photo
Tumblr media Tumblr media
Cubic splines fit to simulated data, with confidence blobs demarking optima. Mean fitness of host populations coevolving with parasites as a function of parasite virulence (V) and host recombination rate. Fitness surfaces were fitted with cubic splines, with 95% confidence blobs for maxima determined by bootstrapping simulated results. Produced in Mathematica; mildly massaged in Adobe Illustrator.
0 notes
dataplau-blog · 23 years ago
Photo
Tumblr media
Linked evolutionary change over time, alternate version. Static version of the "sprinkler plot", showing only the host and using colored shading to more clearly highlight deviations from the plane. Blue disk shows most recent generation; line traces trajectory over several past generations with wedges emphasizing discrete time points. Produced in Mathematica.
0 notes
dataplau-blog · 23 years ago
Video
tumblr
Linked evolutionary change over time ("Sprinkler plots"). This animation shows a host and a parasite species "co-evolving", with the parasite (red) "tracking" the host (blue) in genotype space. Vectors point to the current genotype frequencies; trailing dots show preceding ten generations. Genotypes in the plane are those expected if two genes (A and B) are assorting independently; vertical deviations (shown by distance between a colored vector and its black "shadow" represent non-indpendent associations between genes ("linkage disequilibrium"). Panels represent different degrees of host recombination (rho) and parasite virulence (V) Produced in Mathematica.
0 notes
dataplau-blog · 28 years ago
Photo
Tumblr media
Early (~1997) 3D scatterplot with fitted surface. Demo of "lollipop graph" showing deviations of (simulated) data from fitted surface. Produced in Mathematica.
0 notes