Poetry is just the evidence of life. If your life is burning well, poetry is just the ashburn somethingmore quotes

# art is science is art

EMBO Practical Course: Bioinformatics and Genome Analysis, 5–17 June 2017.

# visualization + design

The 2017 Pi Day art imagines the digits of Pi as a star catalogue with constellations of extinct animals and plants. The work is featured in the article Pi in the Sky at the Scientific American SA Visual blog.

# $\pi$ Day 2015 Art Posters

2017 $\pi$ day
2016 $\pi$ approximation day
2016 $\pi$ day
2015 $\pi$ day
2014 $\pi$ approx day
2014 $\pi$ day
2013 $\pi$ day
Circular $\pi$ art

On March 14th celebrate $\pi$ Day. Hug $\pi$—find a way to do it.

For those who favour $\tau=2\pi$ will have to postpone celebrations until July 26th. That's what you get for thinking that $\pi$ is wrong.

If you're not into details, you may opt to party on July 22nd, which is $\pi$ approximation day ($\pi$ ≈ 22/7). It's 20% more accurate that the official $\pi$ day!

Finally, if you believe that $\pi = 3$, you should read why $\pi$ is not equal to 3.

All art posters are available for purchase.
I take custom requests.

Not a circle in sight in the 2015 $\pi$ day art. Try to figure out how up to 612,330 digits are encoded before reading about the method. $\pi$'s transcendental friends $\phi$ and $e$ are there too—golden and natural. Get it?

This year's $\pi$ day is particularly special. The digits of time specify a precise time if the date is encoded in North American day-month-year convention: 3-14-15 9:26:53.

The art has been featured in Ana Swanson's Wonkblog article at the Washington Post—10 Stunning Images Show The Beauty Hidden in $\pi$.

This year's art has a modern Bauhaus style. Sharp edges, lines and solid colors. Potato farms from space. CPUs from up close. If the pieces look like the art of Piet Mondrian, you'd be right.

buy artwork
3,628 digits of $\pi$ in a 6 level treemap. Uniform line thickness. Bauhaus prime colors. (posters, BUY ARTWORK)

The digits of $pi$ are encoded in something that looks like a treemap. I explain how this is done in the methods section, but before reading it, try to see if you can figure out how it's done.

buy artwork
2,258 digits of $\phi$, 3,855 digits of $e$ and 3,628 digits of $\pi$ in 6 level treemaps. Uniform line thickness. Brewer palette sequential greys. (posters, BUY ARTWORK)

I briefly experimented with the 4-color theorem in trying to apply color to the treemap, but it turned out to lack interesting stucture. Well, at least some graphs were made.

I experimented with different treemap resolutions. For treemaps that use an outline around each rectangle, I decided to stop at 8 levels, at which 111,469 digits of $pi$ can be encoded.

buy artwork
3,628 digits of $\pi$ in a 6 level treemap. Uniform line thickness. Bauhaus prime colors. (posters, BUY ARTWORK)
buy artwork
20,244 digits of $\pi$ in a 7 level treemap. Uniform line thickness. Bauhaus prime colors. (posters, BUY ARTWORK)
buy artwork
111,469 digits of $\pi$ in an 8 level treemap. Uniform line thickness, slightly thinner than for the 7-level map. Bauhaus prime colors. (posters, BUY ARTWORK)

I also made a level 9 treemap without the outlines, which encoded 612,330 digits. When rendered at 20,833 × 20,833 pixels (I needed the image in bitmap form to provide the posters for sale), some regions are essentially a pixel in size, as seen in the 1-1 crop below.

buy artwork
612,330 digits of $\pi$ in an 9 level treemap. Bauhaus prime colors. (posters, BUY ARTWORK)
1-1 crop of 612,330 digits of $\pi$ in an 9 level treemap. Bauhaus prime colors. (posters)

VIEW ALL

# $k$ index: a weightlighting and Crossfit performance measure

Wed 07-06-2017

Similar to the $h$ index in publishing, the $k$ index is a measure of fitness performance.

To achieve a $k$ index for a movement you must perform $k$ unbroken reps at $k$% 1RM.

The expected value for the $k$ index is probably somewhere in the range of $k = 26$ to $k=35$, with higher values progressively more difficult to achieve.

In my $k$ index introduction article I provide detailed explanation, rep scheme table and WOD example.

# Dark Matter of the English Language—the unwords

Wed 07-06-2017

I've applied the char-rnn recurrent neural network to generate new words, names of drugs and countries.

The effect is intriguing and facetious—yes, those are real words.

But these are not: necronology, abobionalism, gabdologist, and nonerify.

These places only exist in the mind: Conchar and Pobacia, Hzuuland, New Kain, Rabibus and Megee Islands, Sentip and Sitina, Sinistan and Urzenia.

And these are the imaginary afflictions of the imagination: ictophobia, myconomascophobia, and talmatomania.

And these, of the body: ophalosis, icabulosis, mediatopathy and bellotalgia.

Want to name your baby? Or someone else's baby? Try Ginavietta Xilly Anganelel or Ferandulde Hommanloco Kictortick.

When taking new therapeutics, never mix salivac and labromine. And don't forget that abadarone is best taken on an empty stomach.

And nothing increases the chance of getting that grant funded than proposing the study of a new –ome! We really need someone to looking into the femome and manome.

# Dark Matter of the Genome—the nullomers

Wed 31-05-2017

An exploration of things that are missing in the human genome. The nullomers.

Julia Herold, Stefan Kurtz and Robert Giegerich. Efficient computation of absent words in genomic sequences. BMC Bioinformatics (2008) 9:167

# Clustering

Wed 31-05-2017
Clustering finds patterns in data—whether they are there or not.

We've already seen how data can be grouped into classes in our series on classifiers. In this column, we look at how data can be grouped by similarity in an unsupervised way.

Nature Methods Points of Significance column: Clustering. (read)

We look at two common clustering approaches: $k$-means and hierarchical clustering. All clustering methods share the same approach: they first calculate similarity and then use it to group objects into clusters. The details of the methods, and outputs, vary widely.

Altman, N. & Krzywinski, M. (2017) Points of Significance: Clustering. Nature Methods 14:545–546.

### Background reading

Lever, J., Krzywinski, M. & Altman, N. (2016) Points of Significance: Logistic regression. Nature Methods 13:541-542.

Lever, J., Krzywinski, M. & Altman, N. (2016) Points of Significance: Classifier evaluation. Nature Methods 13:603-604.

# What's wrong with pie charts?

Thu 25-05-2017

In this redesign of a pie chart figure from a Nature Medicine article [1], I look at how to organize and present a large number of categories.

I first discuss some of the benefits of a pie chart—there are few and specific—and its shortcomings—there are few but fundamental.

I then walk through the redesign process by showing how the tumor categories can be shown more clearly if they are first aggregated into a small number groups.

(bottom left) Figure 2b from Zehir et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. (2017) Nature Medicine doi:10.1038/nm.4333

# Tabular Data

Tue 11-04-2017
Tabulating the number of objects in categories of interest dates back to the earliest records of commerce and population censuses.

After 30 columns, this is our first one without a single figure. Sometimes a table is all you need.

In this column, we discuss nominal categorical data, in which data points are assigned to categories in which there is no implied order. We introduce one-way and two-way tables and the $\chi^2$ and Fisher's exact tests.

Altman, N. & Krzywinski, M. (2017) Points of Significance: Tabular data. Nature Methods 14:329–330.