Wednesday, 22 December 2010

Is it online, digital or electronic information? mine Google data to find out

I love playing around with new Web tools. This week I had a look at the new Google Books Ngram Viewer which is a tool for comparative analysis of word usage. Google Books Ngram Viewer lets you enter words and phrases and then it does an analysis and displays a graph showing how often those words phrases have occurred. The data set being interrogated is a particular corpus of books on Google Books generated in July 2009.

We often have discussions about the terms: "online information", "electronic information" and "digital information". When you do a comparative analysis of those three phrases on Google Books Ngram Viewer from 1940 onwards, you get the graphs above, showing that "electronic information" (red line), is the most commonly used term, followed by digital information" (green line), while "online information" (blue line) has definitely plateaued.

Here are a few blog posts explaining more about the new tool: The Google Books Ngram Viewer from +Datavisualization.CH and Charting the Epic Battle of “Geek” vs. “Nerd”

Will this sort of analysis will be a boon for linguistics' scholars, or will it just remain a fun tool to explore?

Thursday, 2 December 2010

ANDS, data management and libraries

Slides from my presentation this week to library staff on ANDS (the Australian National Data Service) and data management and libraries.