Thoughts rearrange, familiar now strange.break flowers

# 3.14: exciting

More than Pretty Pictures—Aesthetics of Data Representation, Denmark, April 13–16, 2015

# visualization + design

Typography geek? If you like the geometry and mathematics of these posters, you may enjoy something more lettered. Visions of type: Type Peep Show: The Private Curves of Letters posters.

# Pi Day Art Posters — March 14, 2014

All posters are available for purchase.
I also take custom requests.

Two styles of posters are available: folded paths, which show Pi on a path that maximizes adjacent prime digits, and frequency circles, which colourfully depicts the ratio of digits in groupings of 3 or 6.

buy artwork
Pi Day 2014 path posters (view posters, BUY ARTWORK)
buy artwork
Pi Day 2014 frequency circles posters (view posters, BUY ARTWORK)

## posters — folded paths

Curious how these were made? Read about the method.

buy artwork
Pi Day 2014 poster | 132 paths with E=-23 of 64 digits of Pi, sorted by aspect ratio. Start (3) and end (2) digits are highlighted. (zoom, BUY ARTWORK)

buy artwork
Pi Day 2014 poster | 132 paths with E=-23 of 64 digits of Pi, sorted by aspect ratio. Start (3) and end (2) digits are highlighted. (zoom, BUY ARTWORK)

buy artwork
Pi Day 2014 poster | 132 paths with E=-23 of 64 digits of Pi, sorted by aspect ratio. Prime (2 3 5 7) digits are are highlighted. (zoom, BUY ARTWORK)

buy artwork
Pi Day 2014 poster | 132 paths with E=-23 of 64 digits of Pi, sorted by aspect ratio. Digits are colored by prime/composite status. (zoom, BUY ARTWORK)

buy artwork
Pi Day 2014 poster | 9 paths with E=-223 to -220 of 768 digits of Pi, sorted by aspect ratio. Start (3) and end (999999) digits are highlighted. (zoom, BUY ARTWORK)

buy artwork
Pi Day 2014 poster | 9 paths with E=-223 to -220 of 768 digits of Pi, sorted by aspect ratio.. Start (3) and end (999999) digits are highlighted. (zoom, BUY ARTWORK)

buy artwork
Pi Day 2014 poster | 9 paths with E=-223 to -220 of 768 digits of Pi, sorted by aspect ratio.. Prime (2 3 5 7) digits are are highlighted. (zoom, BUY ARTWORK)

buy artwork
Pi Day 2014 poster | 9 paths with E=-223 to -220 of 768 digits of Pi, sorted by aspect ratio. Digits are colored by prime/composite status. (zoom, BUY ARTWORK)

Pi Day 2014 poster | 20 paths with E=-223 to -209 of 768 digits of Pi, sorted by distance between start and end points (2.2–7.0). Start (3) and end (999999) digits are highlighted. (zoom)

Pi Day 2014 poster | The lowest energy path E=-223 of 768 digits of Pi. Start (3) and end (999999) digits are highlighted, as well as all prime digits. This path took about 1 CPU year to find. (37x51, r=0.725, area=1887, cm=1.9/13.4, dend=24.4) (zoom)

# Two Factor Designs

Tue 09-12-2014

We've previously written about how to analyze the impact of one variable in our ANOVA column. Complex biological systems are rarely so obliging—multiple experimental factors interact and producing effects.

ANOVA is a natural way to analyze multiple factors. It can incorporate the possibility that the factors interact—the effect of one factor depends on the level of another factor. For example, the potency of a drug may depend on the subject's diet.

Nature Methods Points of Significance column: Two Factor Designs. (read)

We can increase the power of the analysis by allowing for interaction, as well as by blocking.

Krzywinski, M., Altman, (2014) Points of Significance: Two Factor Designs Nature Methods 11:1187-1188.

### Background reading

Blainey, P., Krzywinski, M. & Altman, N. (2014) Points of Significance: Replication Nature Methods 11:879-880.

Krzywinski, M. & Altman, N. (2014) Points of Significance: Analysis of variance (ANOVA) and blocking Nature Methods 11:699-700.

Krzywinski, M. & Altman, N. (2014) Points of Significance: Designing Comparative Experiments Nature Methods 11:597-598.

# Nested Designs—Assessing Sources of Noise

Mon 29-09-2014

Sources of noise in experiments can be mitigated and assessed by nested designs. This kind of experimental design naturally models replication, which was the topic of last month's column.

Nature Methods Points of Significance column: Nested designs. (read)

Nested designs are appropriate when we want to use the data derived from experimental subjects to make general statements about populations. In this case, the subjects are random factors in the experiment, in contrast to fixed factors, such as we've seen previously.

In ANOVA analysis, random factors provide information about the amount of noise contributed by each factor. This is different from inferences made about fixed factors, which typically deal with a change in mean. Using the F-test, we can determine whether each layer of replication (e.g. animal, tissue, cell) contributes additional variation to the overall measurement.

Krzywinski, M., Altman, N. & Blainey, P. (2014) Points of Significance: Nested designs Nature Methods 11:977-978.

### Background reading

Blainey, P., Krzywinski, M. & Altman, N. (2014) Points of Significance: Replication Nature Methods 11:879-880.

Krzywinski, M. & Altman, N. (2014) Points of Significance: Analysis of variance (ANOVA) and blocking Nature Methods 11:699-700.

Krzywinski, M. & Altman, N. (2014) Points of Significance: Designing Comparative Experiments Nature Methods 11:597-598.

# Replication—Quality over Quantity

Tue 02-09-2014

It's fitting that the column published just before Labor day weekend is all about how to best allocate labor.

Replication is used to decrease the impact of variability from parts of the experiment that contribute noise. For example, we might measure data from more than one mouse to attempt to generalize over all mice.

Nature Methods Points of Significance column: Replication. (read)

It's important to distinguish technical replicates, which attempt to capture the noise in our measuring apparatus, from biological replicates, which capture biological variation. The former give us no information about biological variation and cannot be used to directly make biological inferences. To do so is to commit pseudoreplication. Technical replicates are useful to reduce the noise so that we have a better chance to detect a biologically meaningful signal.

Blainey, P., Krzywinski, M. & Altman, N. (2014) Points of Significance: Replication Nature Methods 11:879-880.

### Background reading

Krzywinski, M. & Altman, N. (2014) Points of Significance: Analysis of variance (ANOVA) and blocking Nature Methods 11:699-700.

Krzywinski, M. & Altman, N. (2014) Points of Significance: Designing Comparative Experiments Nature Methods 11:597-598.

# Monkeys on a Hilbert Curve—Scientific American Graphic

Tue 19-08-2014

I was commissioned by Scientific American to create an information graphic that showed how our genomes are more similar to those of the chimp and bonobo than to the gorilla.

I had about 5 x 5 inches of print space to work with. For 4 genomes? No problem. Bring out the Hilbert curve!

Our genomes are much more similar to the chimp and bonobo than to the gorilla. And, we're practically still Denisovans. (details)

To accompany the piece, I will be posting to the Scientific American blog about the process of creating the figure. And to emphasize that the genome is not a blueprint!

As part of this project, I created some Hilbert curve art pieces. And while exploring, found thousands of Hilbertonians!

# Happy Pi Approximation Day— π, roughly speaking 10,000 times

Wed 13-08-2014

Celebrate Pi Approximation Day (July 22nd) with the art of arm waving. This year I take the first 10,000 most accurate approximations (m/n, m=1..10,000) and look at their accuracy.

Accuracy of the first 10,000 m/n approximations of Pi. (details)

I turned to the spiral again after applying it to stack stacked ring plots of frequency distributions in Pi for the 2014 Pi Day.

Frequency distribution of digits of Pi in groups of 4 up to digit 4,988. (details)

# Analysis of Variance (ANOVA) and Blocking—Accounting for Variability in Multi-factor Experiments

Mon 07-07-2014

Our 10th Points of Significance column! Continuing with our previous discussion about comparative experiments, we introduce ANOVA and blocking. Although this column appears to introduce two new concepts (ANOVA and blocking), you've seen both before, though under a different guise.

Nature Methods Points of Significance column: Analysis of variance (ANOVA) and blocking. (read)

If you know the t-test you've already applied analysis of variance (ANOVA), though you probably didn't realize it. In ANOVA we ask whether the variation within our samples is compatible with the variation between our samples (sample means). If the samples don't all have the same mean then we expect the latter to be larger. The ANOVA test statistic (F) assigns significance to the ratio of these two quantities. When we only have two-samples and apply the t-test, t2 = F.

ANOVA naturally incorporates and partitions sources of variation—the effects of variables on the system are determined based on the amount of variation they contribute to the total variation in the data. If this contribution is large, we say that the variation can be "explained" by the variable and infer an effect.

We discuss how data collection can be organized using a randomized complete block design to account for sources of uncertainty in the experiment. This process is called blocking because we are blocking the variation from a known source of uncertainty from interfering with our measurements. You've already seen blocking in the paired t-test example, in which the subject (or experimental unit) was the block.

We've worked hard to bring you 20 pages of statistics primers (though it feels more like 200!). The column is taking a month off in August, as we shrink our error bars.

Krzywinski, M. & Altman, N. (2014) Points of Significance: Analysis of Variance (ANOVA) and Blocking Nature Methods 11:699-700.

### Background reading

Krzywinski, M. & Altman, N. (2014) Points of Significance: Designing Comparative Experiments Nature Methods 11:597-598.

Krzywinski, M. & Altman, N. (2014) Points of Significance: Comparing Samples — Part I — t-tests Nature Methods 11:215-216.

Krzywinski, M. & Altman, N. (2013) Points of Significance: Significance, P values and t-tests Nature Methods 10:1041-1042.