Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca - contact me Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca on Twitter Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca - Lumondo Photography Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca - Pi Art Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca - Hilbertonians - Creatures on the Hilbert Curve
Trance opera—Spente le Stellebe dramaticmore quotes

space: exciting



EMBO Practical Course: Bioinformatics and Genome Analysis, 5–17 June 2017.


visualization + design

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Cover image accompanying our article on mouse vasculature development. Biology turns astrophysical. PNAS 1 May 2012; 109 (18) (zoom, PNAS)

Creating the PNAS Cover

One of my goals in life, which I can now say has been accomplished, is to make biology look like astrophysics. Call it my love for the Torino Impact Hazard Scale.

Recently, I was given an opportunity to attend to this (admittedly vague) goal when Linda Chang from Aly Karsan's group approached me with some microscopy photos of mouse veins. I was asked to do "something" with these images for a cover submission to accompany the manuscript.

When people see my covers, sometimes they ask "How did you do that?" Ok, actually they never ask this. But being a scientist, I'm trained me to produce answers in anticipation of such questions. So, below, I show you how the image was constructed.

The image was published on the cover of PNAS (PNAS 1 May 2012; 109 (18))

Tools

Photoshop CS5, Nik Color Efex Pro 4, Alien Skin Bokeh 2 and a cup of coffee from a Rancilio Silvia.

source images

Below are a few of the images I had the option to work with. These are mouse embryonic blood vessels, with a carotid artery shown in the foreground with endothelial cells in green, vascular smooth muscle cells in red and the nuclei in blue.

Of course, as soon as I saw the images, I realized that there was very little that I needed to do to trigger the viewer's imagination. These photos were great!

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Mouse carotid arteries. (zoom)

memories of star trek

Immediately I thought of two episodes of Star Trek (original series): Doomsday Machine and the Immunity Syndrome, as well as of images from the Hubble Telescope.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Enterprise is about to be consumed by a horror tube: a planet killer! (The Doomsday Machine)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Enterprise heads into a giant amoeba. Who eats whom? I'll let you guess. (The Immunity Syndrome)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Orion nebula (M42) as seen by the Hubble telescope. (zoom)

I though it would be pretty easy to make the artery images look all-outer-spacey. They already looked it.

centerpiece image

And then I saw the image below.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
A particularly spectacular image of a mouse carotid artery. I'm thinking 10 on the Torino scale. (zoom)

constructing the cover

background

The background was created from the two images shown here. The second image was sampled three times, at different rotations.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Images used for background. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Images used for background. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Layer composition for background elements. (zoom)

The channel mixer was used to remove the green channel and leave only red and blue.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Background elements for PNAS cover image. (zoom)

middle ground

The next layer was composed of what looked like ribbons of blue gas. This was created by sampling the oval shapes from the source images. Here the red channel was a great source for cloud shapes, and this was the only channel that was kept. The hue was shifted to blue and a curve adjustment was applied to increase the contrast.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
First set of middle ground elements, before adjustments. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
First set of middle ground elements, after channel adjustments. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Second set of middle ground elements. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Layer composition for middle ground elements. (zoom)

When the foreground and middle ground elements were combined, the result was already 40 parsecs away.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Background and foreground elements for PNAS cover image. (zoom)

foreground

The foreground was created from the spectacular comet-like image of a mouse artery. Very little had to be done to make this element look good. It already looked good.

I applied a little blur using Alien Skin's Bokeh 2 to narrow the apparent depth of field, masked out elements at the bottom of the image and removed some of the green channel. The entire blue channel was removed altogether (this gave the tail of the comet a mottled, flame-like appearance).

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Foreground element, before adjustments. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Foreground element, after channel adjustments. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Layer composition for foreground element. (zoom)

post processing

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Initial composition of background, middle ground and foreground elements. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
40% localized application of Nik's Tonal Contrast (Color Efex 4 plugin) to increase structure in red channel. (zoom)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
50% blend with Nik's Pro Contrast (Color Efex 4 plugin). (zoom)

And here we have the final image.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Final PNAS cover. Spacey! (zoom)
VIEW ALL

news + thoughts

Snowflake simulation

Tue 14-11-2017
Symmetric, beautiful and unique.

Just in time for the season, I've simulated a snow-pile of snowflakes based on the Gravner-Griffeath model.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
A few of the beautiful snowflakes generated by the Gravner-Griffeath model. (explore)

Gravner, J. & Griffeath, D. (2007) Modeling Snow Crystal Growth II: A mesoscopic lattice map with plausible dynamics.

Genes that make us sick

Thu 02-11-2017
Where disease hides in the genome.

My illustration of the location of genes in the human genome that are implicated in disease appears in The Objects that Power the Global Economy, a book by Quartz.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
The location of genes implicated in disease in the human genome, shown here as a spiral. (more...)

Ensemble methods: Bagging and random forests

Mon 16-10-2017
Many heads are better than one.

We introduce two common ensemble methods: bagging and random forests. Both of these methods repeat a statistical analysis on a bootstrap sample to improve the accuracy of the predictor. Our column shows these methods as applied to Classification and Regression Trees.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Nature Methods Points of Significance column: Ensemble methods: Bagging and random forests. (read)

For example, we can sample the space of values more finely when using bagging with regression trees because each sample has potentially different boundaries at which the tree splits.

Random forests generate a large number of trees by not only generating bootstrap samples but also randomly choosing which predictor variables are considered at each split in the tree.

Krzywinski, M. & Altman, N. (2017) Points of Significance: Ensemble methods: bagging and random forests. Nature Methods 14:933–934.

Background reading

Krzywinski, M. & Altman, N. (2017) Points of Significance: Classification and regression trees. Nature Methods 14:757–758.

...more about the Points of Significance column

Classification and regression trees

Mon 16-10-2017
Decision trees are a powerful but simple prediction method.

Decision trees classify data by splitting it along the predictor axes into partitions with homogeneous values of the dependent variable. Unlike logistic or linear regression, CART does not develop a prediction equation. Instead, data are predicted by a series of binary decisions based on the boundaries of the splits. Decision trees are very effective and the resulting rules are readily interpreted.

Trees can be built using different metrics that measure how well the splits divide up the data classes: Gini index, entropy or misclassification error.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Nature Methods Points of Significance column: Classification and decision trees. (read)

When the predictor variable is quantitative and not categorical, regression trees are used. Here, the data are still split but now the predictor variable is estimated by the average within the split boundaries. Tree growth can be controlled using the complexity parameter, a measure of the relative improvement of each new split.

Individual trees can be very sensitive to minor changes in the data and even better prediction can be achieved by exploiting this variability. Using ensemble methods, we can grow multiple trees from the same data.

Krzywinski, M. & Altman, N. (2017) Points of Significance: Classification and regression trees. Nature Methods 14:757–758.

Background reading

Lever, J., Krzywinski, M. & Altman, N. (2016) Points of Significance: Logistic regression. Nature Methods 13:541-542.

Altman, N. & Krzywinski, M. (2015) Points of Significance: Multiple Linear Regression Nature Methods 12:1103-1104.

Lever, J., Krzywinski, M. & Altman, N. (2016) Points of Significance: Classifier evaluation. Nature Methods 13:603-604.

Lever, J., Krzywinski, M. & Altman, N. (2016) Points of Significance: Model Selection and Overfitting. Nature Methods 13:703-704.

Lever, J., Krzywinski, M. & Altman, N. (2016) Points of Significance: Regularization. Nature Methods 13:803-804.

...more about the Points of Significance column