Skip to main content

Happy New Year: 2011

My New Year's gift to all y'all is this recommendation:

RUN DON'T WALK to download Google Refine. (No affiliation, just a satisfied customer.) Refine is an app that expedites data cleaning, thereby eliminating hours and hours of tedium and letting me get to the fun parts of the project sooner.

Google Refine

I will certainly be putting it through its paces this year.  If you do check it out, let me know what you think.

Comments

Popular posts from this blog

Recommended: a new review "zoo"

"A Tour Through the Visualization Zoo" is a fantastic introduction to some attractive and sophisticated new visualization formats. The article and illos were put together by Stanford's Jeffrey Heer, Michael Bostock, and Vadim Ogievetsky. Heer is an HCI/visualization genius whose journal articles I've been following with interest; Bostock is the whiz behind the D3 archive of javascript code for visualization.

Run, don't walk. It's great.

Blast from the past: a 1974 data treatise by Edward Tufte

Back in 1974, Yale poli-sci professor Edward Tufte published a slim volume called Data Analysis for Politics and Policy (Prentice-Hall, $3.95). The book in its entirety is available for free download (PDFs) at Tufte's website, accompanied by a contemporary review from the Journal of the American Statistical Association. More than 30 years later, the review amuses me with its restrained praise of the perspective that would eventually make Tufte a Major Figure (and a minor fortune):
Tufte puts residual plots to good use to gain understanding of a data set, and he shows how finding outliers gives the analyst hints about the inadequacy of a statistical model... The discussion of graphical techniques in general is quite good... A brief but compelling discussion of the "value of data as evidence," with regard to the interpretation of nonrandom samples, is presented. If you happen to have a spare 48MB lying about, DAPP's worth a download.

[via Sofa Papa]