I collaborated with Scientific American to create a data graphic for the September 2014 issue. The graphic compared the genomes of the Denisovan, bonobo, chimp and gorilla, showing how our own genomes are almost identical to the Denisovan and closer to that of the bonobo and chimp than the gorilla.
Here you'll find Hilbert curve art, a introduction to Hilbertonians, the creatures that live on the curve, an explanation of the Scientific American graphic and downloadable SVG/EPS Hilbert curve files.
Hilbertonians are creatures that live in the depths of the Hilbert curve. They live across three adjacent orders of the curve (e.g. 2, 3, 4). The come in many different personalities and many classes exist.
They are social—they always appear in multiples of 4. This is a consequence of how they are defined. A single Hilbertonian has never been seen.
Their genomes are 20 bases long. They only have 2 different types of bases. Out of a possible 220 = 1,048,576 genomes, only 104,976 (almost exactly 10%) produce living and breathing Hilbertonians, defined as those whose bodies form a contiguous shape. The other 943,600 are unfortunately unviable. The genomes of every Hilbertonian can be downloaded.
I've previously taken a more fine-art approach to cover design, such for those of Nature, Genome Research and Trends in Genetics. I've used microscopy images to create a cover for PNAS—the one that made biology look like astrophysics—and thought that this is kind of material I'd start with for the MCS cover.
A map of the nearby superclusters and voids in the Unvierse.
By "nearby" I mean within 6,000 million light-years.
It was now time to design my first ... pair of socks.
In collaboration with Flux Socks, the design features the colors and relative thicknesses of Rogue olympic weightlifting plates. The first four plates in the stack are the 55, 45, 35, and 25 competition plates. The top 4 plates are the 10, 5, 2.5 and 1.25 lb change plates.
The perceived weight of each sock is 178.75 lb and 357.5 lb for the pair.
The actual weight is much less.
Find patterns behind gene expression and disease.
Expression, correlation and network module membership of 11,000+ genes and 5 psychiatric disorders in about 6" x 7" on a single page.
Design tip: Stay calm.
Gandal M.J. et al. Shared Molecular Neuropathology Across Major Psychiatric Disorders Parallels Polygenic Overlap Science 359 693–697 (2018)
We discuss the many ways in which analysis can be confounded when data has a large number of dimensions (variables). Collectively, these are called the "curses of dimensionality".
Some of these are unintuitive, such as the fact that the volume of the hypersphere increases and then shrinks beyond about 7 dimensions, while the volume of the hypercube always increases. This means that high-dimensional space is "mostly corners" and the distance between points increases greatly with dimension. This has consequences on correlation and classification.