Art is science in love.
— E.F. Weisslitz
In genomics, insights can hinge on a difference of one. One cellular mutation to go from healthy to diseased. One cell migration from tumor to metastasis. Even subtle differences in gene expression between healthy cells shapes their form and function.
In Data in New Dimensions, we’ve created an immersive data art experience celebrating the individuality and often underestimated influence of the single cell—captured by high-throughput single cell analysis.
Using the rich data from the very tools and instruments in this room, we’ve transformed data points back into cells and, informed by their differences, allowed those cells to once again rejoin the world of the viewer in the third dimension.
How do these canvases make you think about the difference of one in your work?
This piece contrasts two different blood cell states, diseased versus healthy, in such a way that the differences manifest as depth. Cells on the base plane (the closest to the wall) represent healthy control cells, while diseased cells ascend increasingly closer to the viewer based on how different they are from their healthy counterpart.
This piece paints a picture of the diversity of disease, showing how the cells of a tumor and its metastasis vary in expression patterns. These differences are manifested in the piece through each cell’s position in the third dimension. Cells from the primary tumor exist on the base layer (closest to the wall). Cells from the metastatic site project into the room based on the degree of difference from the nearest primary tumor cell in their cluster.
This piece explores the expression differences that help determine a healthy cell’s role within an organism. Each cluster corresponds to a different cell type along the renal tubule, with that cluster’s depth mapping to its position along the tubule. Blood enters the tubule through the cells on the base layer (closest to the wall) and is filtered by the cells in the successively ascending layers. The remaining waste exits past the cells in the layer nearest to the viewer.
Outliers can degrade the fit of linear regression models when the estimation is performed using the ordinary least squares. The impact of outliers can be mitigated with methods that provide robust inference and greater reliability in the presence of anomalous values.
We discuss MM-estimation and show how it can be used to keep your fitting sane and reliable.
Greco, L., Luta, G., Krzywinski, M. & Altman, N. (2019) Points of significance: Analyzing outliers: Robust methods to the rescue. Nature Methods 16:275–276.
Altman, N. & Krzywinski, M. (2016) Points of significance: Analyzing outliers: Influential or nuisance. Nature Methods 13:281–282.
Two-level factorial experiments, in which all combinations of multiple factor levels are used, efficiently estimate factor effects and detect interactions—desirable statistical qualities that can provide deep insight into a system.
They offer two benefits over the widely used one-factor-at-a-time (OFAT) experiments: efficiency and ability to detect interactions.
Since the number of factor combinations can quickly increase, one approach is to model only some of the factorial effects using empirically-validated assumptions of effect sparsity and effect hierarchy. Effect sparsity tells us that in factorial experiments most of the factorial terms are likely to be unimportant. Effect hierarchy tells us that low-order terms (e.g. main effects) tend to be larger than higher-order terms (e.g. two-factor or three-factor interactions).
Smucker, B., Krzywinski, M. & Altman, N. (2019) Points of significance: Two-level factorial experiments Nature Methods 16:211–212.
Krzywinski, M. & Altman, N. (2014) Points of significance: Designing comparative experiments.. Nature Methods 11:597–598.
Celebrate `\pi` Day (March 14th) and set out on an exploration explore accents unknown (to you)!
This year is purely typographical, with something for everyone. Hundreds of digits and hundreds of languages.
A special kids' edition merges math with color and fat fonts.
One moment you're
:) and the next you're
Make sense of it all with my Tree of Emotional life—a hierarchical account of how we feel.
One of my color tools, the
colorsnap application snaps colors in an image to a set of reference colors and reports their proportion.
Below is Times Square rendered using the colors of the MTA subway lines.
Drugs could be more effective if taken when the genetic proteins they target are most active.
Design tip: rediscover CMYK primaries.
Ruben et al. A database of tissue-specific rhythmically expressed human genes has potential applications in circadian medicine Science Translational Medicine 10 Issue 458, eaat8806.