visualization – the evolving ultrasaurus

not all data is interesting

sarah — Wed, 31 Dec 2008 19:43:19 +0000

Mathematician Martin Wattenberg interview by boston.com about his exhibit at the Boston Museum of Modern art. (via ACM TechNews)

“The traditional approach to visualization in science and business is to create something transparent and neutral – a telescope with clear glass,” he said as he roamed through the ICA exhibit. “But for an emotional approach, or an artistic approach, you want to bring a point of view. Not all data is interesting. The art is pointing the telescope at the right set of data.”

You can see his work online at bewitched.com. I had seen Market Map, Name Voyager, and others before, but History Flow was new to me which visualized the editing history of wikipedia articles:

ABCD book

sarah — Fri, 26 Dec 2008 02:26:48 +0000

[Update: so excited to have gotten this for Xmas! Even cooler to experience live.]

Lovely video of this upcoming pop-up book…

via information aesthetics

data visualization with cartograms

sarah — Sun, 30 Nov 2008 19:50:57 +0000

Jon Udell writes about visual numeracy, introducing a new graph to continue the conversation about where the US gets its oil.

Here’s his original map of US oil imports by region:

Here’s his latest graph about where the global oil reserves are:

Comments on Udell’s latest post point out that this kind of graph is called a cartogram, which is a map where areas are scaled to indicate the scalar value of the statistic that is being displayed on the map.

My favorite (via Tony Hirst) is from a series on 2008 election results:

Popular vote results displayed on a map of the US.

Update: further reading in the comments reveals an interesting insight by Tim:

“The thing that irritates me about the first visualization is that by visualizing some data you imply that the data is important. But visualizations, like statistics can lie.

‘In fact, where the US gets its oil is almost completely irrelevant. The oil market is a world market. The transportation costs are startlingly small but-all things being equal-it’s easier to get oil from countries that are near you (and hard to get it from countries you don’t trade with).

“The ‘dangerous disconnect’ between American perception–the US is dependent on Middle Eastern oil–and the reality–we get a lot of our oil from Canada–is actually misunderstanding that is *healthier* than the misunderstanding the chart might lead one to take.

“Reading the chart one might imagine that, if the Middle East were plunged into a giant multi-state war, we’d be basically fine. Of course, unless the United States invaded Canada-and Africa, yay!-and commandeered all the oil for US consumption, we’d be in the grip of massive undersupply and a resulting rise in the price of oil.
It’s also worth saying that reserve numbers have all sorts of spin on them…

Personally, I think the graphs help illustrate the conversation. Your mind doesn’t have to do a double-step between a list of numbers and the narrative that discusses them. Graphing statistics helps tell a story, but as Tim says it is only part of the story. We need to fill in the causal effects and the consequence, and in doing so, draw conclusions and learn.

data mining for oil sources

sarah — Mon, 17 Nov 2008 13:08:43 +0000

Jon Udell writes about where the US gets its oil (the majority comes from Canada, not the middle east). He follows the data with the story of how he discovered the answer and presented the data using DabbleDB.

I agree with Jon when he says “we’re really close to the point where non-specialists will be able to find data online, ask questions of it, produce answers that bear on public policy issues, and share those answers online for review and discussion.” It’ll likely need another generation of tools before we work out the glitches and hiccups in the data flow and it’ll require the generation who grew up with the web to expect such tools at their fingertips and put them to use. I look forward to seeing what comes next.

animated sorting algorithms

sarah — Sun, 16 Nov 2008 18:57:32 +0000

David Martin, an assistant professor at Boston College, has published a fabulous sorting visualization (via HMK).

These should be a must-see for every computer science student. He includes some good notes about what we should be looking for and why we might care (excerpted below). Often people publish marvelous visualizations, but people who are new to the subject matter can miss key aspects of what they might learn from the detailed visual cues. It is great to see the combination here.

These visualizations are intended to:

* Show how each algorithm operates.
* Show that there is no best sorting algorithm.
* Show the advantages and disadvantages of each algorithm.
* Show that worse-case asymptotic behavior is not the deciding factor in choosing an algorithm.
* Show that the initial condition (input order and key distribution) affects performance as much as the algorithm choice.

The ideal sorting algorithm would have the following properties:

* Stable: Equal keys aren’t reordered.
* Operates in place, requiring O(1) extra space.
* Worst-case O(n·lg(n)) key comparisons.
* Worst-case O(n) swaps.
* Adaptive: Speeds up to O(n) when data is nearly sorted or when there are few unique keys.

There is no algorithm that has all of these properties, and so the choice of sorting algorithm depends on the application.

google flu trends

sarah — Wed, 12 Nov 2008 01:53:29 +0000

Can search query trends provide an accurate, reliable model of real-world phenomena? Some folks at Google have been tracking how often people search for flu-related terms and to what extent it relates to CDC data about how many people see their doctor with flu-like symptoms.

They have put together a compelling visualization along with a great article about the process. Here’s an excerpt:

“It turns out that traditional flu surveillance systems take 1-2 weeks to collect and release surveillance data, but Google search queries can be automatically counted very quickly. By making our flu estimates available each day, Google Flu Trends may provide an early-warning system for outbreaks of influenza.

“For epidemiologists, this is an exciting development, because early detection of a disease outbreak can reduce the number of people affected. If a new strain of influenza virus emerges under certain conditions, a pandemic could emerge and cause millions of deaths (as happened, for example, in 1918). Our up-to-date influenza estimates may enable public health officials and health professionals to better respond to seasonal epidemics and — though we hope never to find out — pandemics.”

The Google folk, Jeremy Ginsberg, Matthew Mohebbi, Rajan Patel, and Mark Smolinski, and Larry Brilliant have teamed up with Lynnette Brammer from the CDC and have written an article that has been accepted into the scientific journal Nature. Fascinating, yet somehow spooky.

interactive visualizations of election results

sarah — Wed, 05 Nov 2008 15:56:53 +0000

Like many folks in America, I watched the election results stream in through television and Internet sources. The television cnn experience was fabulous in a cheesy, over-the-top kind of way that we’ve come to expect of our tv news. I thought the hourglass display of exit poll survey data was very nice and I wish they had posted it on the web. On the Web, I liked the dailykos interactive map for pure usability (easy to read graph of electoral votes and the ability to zoom into each state to see results per county). However, the visual design of the NYTimes was, as usual, quite lovely: readable with clear navigation (also with the zoom in county results feature).

Each of the sites I looked at was well done. The production values were high and I can imagine what kind of prep went in making these interactive visualizations happen. I was dismayed to see no credits to the individuals who created these pieces. Every photo in the New York Times has a credit, why not the information graphics and interactive charts? If anyone knows the stories behind these, please let me know. I’m very interested in the people who made them and the stories behind the work, whether it be the design process or the data feeds. With any luck, some of these folk are bloggers.

Daily Kos (click for full size images):

clicking on a state would zoom into election results per county:

NYTimes

CNN

CBS News

kinetic sculpture by theo jansen

sarah — Tue, 07 Oct 2008 12:31:39 +0000

“The boundaries between art and engineering exist only in our minds.” I love the fluid feel of Theo Jansen’s work. (It’s worth looking up some of his other creations on YouTube.) This one feels like some kind of mystical wind creature. Most impressive, though, is the way that this extraordinary act of engineering is made… Continue reading → Continue reading →

tagging brand associations

sarah — Tue, 02 Sep 2008 20:03:27 +0000

Jon Udell writes about brandtags.net, an experiment in identifying brand associations. As much as marketing folks work hard to create a brand, at the end of the day, the brand is defined by your associations. This site attempts to capture those from its audience.

Most often a brand is defined by its flagship product. Campbells is soup. Google and Yahoo are search. Adobe is photoshop. However, Microsoft is evil.

The cloud tag visualization is nice, but the top words are way too large when displayed in “orderly view.” They should lose the word scaling if ordering by popularity, I think… or else figure out how to adjust the scale so the biggest font is not quite so big.

tangible time tracking with legos

sarah — Mon, 01 Sep 2008 16:48:41 +0000

Micheal Hunger writes about using Legos to track time (via infosthetics)

This sounds like a fabulous, tangible visualization. I love the idea of creating a lego template of how I plan to spend my day and then building my actual day and comparing the two.

I just bought some lego bricks of my own to try it out!