Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca - contact me Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca on Twitter Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca - Lumondo Photography Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca - Pi Art Martin Krzywinski / Genome Sciences Center / mkweb.bcgsc.ca - Hilbertonians - Creatures on the Hilbert Curve
Safe, fallen down this way
I want to be just what I am
Cocteau Twinssafe at lastmore quotes

pi: beautiful


Visualizaiton workshop at UBC B.I.G. Research Day. 11 May 2016


visualization + design

Typography geek? If you like the geometry and mathematics of these posters, you may enjoy something more lettered. Visions of type: Type Peep Show: The Private Curves of Letters posters.

`pi` Day 2014 Art Posters

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Support Ellie Balk's Kickstarter community math mural project in which Brooklyn students learn math and art to visualize `pi`.

Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
2013 `pi` day

Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
2014 `pi` day

Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
2015 `pi` day

Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
2014 `pi` approx day

Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Circular `pi` art

On March 14th celebrate Pi Day. Hug `\pi`—find a way to do it. For those who favour `\tau=2\pi` will have to postpone celebrations until July 26th. Some of these folks will argue that `pi` is wrong. If you're not into details, you may opt to party on July 22nd, which is `pi` approximation day (`\pi` ≈ 22/7).

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
All art posters are available for purchase.
I take custom requests.

For the 2014 `pi` day, two styles of posters are available: folded paths and frequency circles.

The folded paths show `pi` on a path that maximizes adjacent prime digits and were created using a protein-folding algorithm. The frequency circles colourfully depict the ratio of digits in groupings of 3 or 6.

Curious how these were made? Read about the method.


Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca buy artwork
Pi Day 2014 poster | Frequency distribution of digits in Pi for each of 128 6-digit groupings in 10 columns up to the Feynman Point. For each grouping the number of times a digit was seen is proportional to the width of the annulus. (zoom, BUY ARTWORK)


Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca buy artwork
Pi Day 2014 poster | Frequency distribution of digits in Pi for each of 128 3-digit groupings in 12 columns up to the Feynman Point. For each grouping the number of times a digit was seen is proportional to the width of the annulus. (zoom, BUY ARTWORK)


Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca buy artwork
Pi Day 2014 poster | Frequency distribution of digits in Pi for each of 128 3-digit groupings in 16 columns up to the Feynman Point. For each grouping the number of times a digit was seen is proportional to the width of the annulus. This is a very satisfying square layout. (zoom, BUY ARTWORK)


Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca buy artwork
Pi Day 2014 poster | Frequency distribution of digits in Pi for each of 128 3-digit groupings in 16 columns up to the Feynman Point, with the first digit (3) offset to the top left. For each grouping the number of times a digit was seen is proportional to the width of the annulus. This is a very satisfying square layout. (zoom, BUY ARTWORK)


Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca buy artwork
Pi Day 2014 poster | Frequency distribution of digits in Pi for the first 4,988 digits of Pi in groupings of 4. This subset contains the triplets for each digit, the last being 888 at digit 4,985. The layout is 29 columns and 43 rows. The first digit (3) offset to the top left. For each grouping the number of times a digit was seen is proportional to the width of the annulus. The Feynman Point 4(999999)8 is found in the middle of row 7. (zoom, BUY ARTWORK)


Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca buy artwork
Pi Day 2014 poster | Frequency distribution of digits in Pi for the first 4,988 digits of Pi in groupings of 4. This subset contains the triplets for each digit, the last being 888 at digit 4,985. The layout is on an Archimedean spiral, with the the first digit (3) in the center. For each grouping the number of times a digit was seen is proportional to the width of the annulus. (zoom, BUY ARTWORK)


Pi Day 2014 Art Poster - Folding the Number Pi
 / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca buy artwork
Pi Day 2014 poster | Frequency distribution of digits in Pi for the first 4,988 digits of Pi in groupings of 4. This subset contains the triplets for each digit, the last being 888 at digit 4,985. The layout is on an Archimedean spiral. For each grouping the number of times a digit was seen is proportional to the width of the annulus. (zoom, BUY ARTWORK)

VIEW ALL

news + thoughts

Pathways

Mon 04-01-2016

Apply visual grouping principles to add clarity to information flow in pathway diagrams.

We draw on the Gestalt principles of connection, grouping and enclosure to construct practical guidelines for drawing pathways with a clear layout that maintains hierarchy.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Nature Methods Points of View column: Pathways. (read)

We include tips about how to use negative space and align nodes to emphasizxe groups and how to effectively draw curved arrows to clearly show paths.

Hunnicutt, B.J. & Krzywinski, M. (2016) Points of Viev: Pathways. Nature Methods 13:5.

background reading

Wong, B. (2010) Points of Viev: Gestalt principles (part 1). Nature Methods 7:863.

Wong, B. (2010) Points of Viev: Gestalt principles (part 2). Nature Methods 7:941.

...more about the Points of View column

Multiple Linear Regression

Mon 04-01-2016

When multiple variables are associated with a response, the interpretation of a prediction equation is seldom simple.

This month we continue with the topic of regression and expand the discussion of simple linear regression to include more than one variable. As it turns out, although the analysis and presentation of results builds naturally on the case with a single variable, the interpretation of the results is confounded by the presence of correlation between the variables.

By extending the example of the relationship of weight and height—we now include jump height as a second variable that influences weight—we show that the regression coefficient estimates can be very inaccurate and even have the wrong sign when the predictors are correlated and only one is considered in the model.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Nature Methods Points of Significance column: Multiple Linear Regression. (read)

Care must be taken! Accurate prediction of the response is not an indication that regression slopes reflect the true relationship between the predictors and the response.

Altman, N. & Krzywinski, M. (2015) Points of Significance: Multiple Linear Regression Nature Methods 12:1103-1104.

Background reading

Altman, N. & Krzywinski, M. (2015) Points of significance: Simple Linear Regression Nature Methods 12:999-1000.

...more about the Points of Significance column

Circos and Hive Workshop Workshop—Poznan, Poland

Sun 13-12-2015

Taught how Circos and hive plots can be used to show sequence relationships at Biotalent Functional Annotation of Genome Sequences Workshop at the Institute for Plant Genetics in Poznan, Poland.

Students generated images published in Fast Diploidization in Close Mesopolyploid Relatives of Arabidopsis.

Workshop materials: slides, handout, Circos and hive plot files.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Drawing synteny between modern and ancient genomes with Circos.

Students also learned how to use hive plots to show synteny.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Hive plots are great at showing 3-way sequence comparisons. Here three modern species of Australian Brassicaceae (S. nutans, S. lineare, B. antipoda) are compared based on their common relationships to the ancestral karotype.

Mandakova, T. et al. Fast Diploidization in Close Mesopolyploid Relatives of Arabidopsis The Plant Cell, Vol. 22: 2277-2290, July 2010

Play the Bacteria Game

Mon 14-12-2015

Choose your own dust adventure!

Nobody likes dusting but everyone should find dust interesting.

Working with Jeannie Hunnicutt and with Jen Christiansen's art direction, I created this month's Scientific American Graphic Science visualization based on a recent paper The Ecology of microscopic life in household dust.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
An analysis of dust reveals how the presence of men, women, dogs and cats affects the variety of bacteria in a household. Appears on Graphic Science page in December 2015 issue of Scientific American.

We have also written about the making of the graphic, for those interested in how these things come together.

This was my third information graphic for the Graphic Science page. Unlike the previous ones, it's visually simple and ... interactive. Or, at least, as interactive as a printed page can be.

More of my American Scientific Graphic Science designs

Barberan A et al. (2015) The ecology of microscopic life in household dust. Proc. R. Soc. B 282: 20151139.

Names for 5,092 colors

Tue 03-11-2015

A very large list of named colors generated from combining some of the many lists that already exist (X11, Crayola, Raveling, Resene, wikipedia, xkcd, etc).

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Confused? So am I. That's why I made a list.

For each color, coordinates in RGB, HSV, XYZ, Lab and LCH space are given along with the 5 nearest, as measured with ΔE, named neighbours.

I also provide a web service. Simply call this URL with an RGB string.

Simple Linear Regression

Sat 07-11-2015

It is possible to predict the values of unsampled data by using linear regression on correlated sample data.

This month, we begin our column with a quote, shown here in its full context from Box's paper Science and Statistics.

In applying mathematics to subjects such as physics or statistics we make tentative assumptions about the real world which we know are false but which we believe may be useful nonetheless. The physicist knows that particles have mass and yet certain results, approximating what really happens, may be derived from the assumption that they do not. Equally, the statistician knows, for example, that in nature there never was a normal distribution, there never was a straight line, yet with normal and linear assumptions, known to be false, he can often derive results which match, to a useful approximation, those found in the real world.
Box, G. J. Am. Stat. Assoc. 71, 791–799 (1976).

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Nature Methods Points of Significance column: Simple Linear Regression. (read)

This column is our first in the series about regression. We show that regression and correlation are related concepts—they both quantify trends—and that the calculations for simple linear regression are essentially the same as for one-way ANOVA.

While correlation provides a measure of a specific kind of association between variables, regression allows us to fit correlated sample data to a model, which can be used to predict the values of unsampled data.

Altman, N. & Krzywinski, M. (2015) Points of Significance: Simple Linear Regression Nature Methods 12:999-1000.

Background reading

Altman, N. & Krzywinski, M. (2015) Points of significance: Association, correlation and causation Nature Methods 12:899-900.

Krzywinski, M. & Altman, N. (2014) Points of significance: Analysis of variance (ANOVA) and blocking. Nature Methods 11:699-700.

...more about the Points of Significance column