Because good research needs good data

National Statistics no joke

Chris Rusbridge | 01 April 2008

Today the UK's new independent National Statistics Authority began work, replacing the Office for National Statistics. This is clearly an important agency; in the Today Programme this morning they were claiming that they would make a clear distinction between the statistics and political interpretation. I thought it was worth a look at their web site.Two things initially discouraging: first, a Google search for National Statistics Agency [sic] produces a web page for the old ONS at http://www.statistics.gov.uk/default.asp; this is ALMOST but not quite the same URL as the new authority at http://www.statistics.gov.uk/... but what about the apparently similar but possibly different http://www.statisticsauthority.gov.uk/? Well, teething troubles no doubt. Second there's a prominent link on the (first of the above) home page that "ONS independence comes into effect on 1 April", and the link is broken. More teething...A quick explore led me to the UK snapshot, with lots of interesting web pages summarising data. As an example, there is a page Headed "Acid Rain" under the "Environment" section; you get a graph and a few paragraphs of text, eg "Emissions of chemicals that can cause acid rain fell by 53.8 per cent between 1990 and 2005, from 6.9 million tonnes to 3.2 million tonnes." In fact, the page doesn't tell us whether rain became less acidic during this period, but again that's a quibble.But I was looking for data, not summaries. I found some under "Time Series data", but had to go through a complicated sequence of selections to find an actual dataset. I selected Share Ownership, then "Total market value by sector of beneficial owner: end-2006", then "DEYQ, SRS: Ungrossed: Total Market Value:Individuals" before I got a download button. The 3 download options were:

  • View on Screen
  • Download CSV
  • Download Navidata

Where Navidata is a tool they make available. View on screen gives me a nicely formatted web page displaying a two-column table. Downloading the CSV gave me a cryptically named CSV file, which looks as follows (Crown Copyright, from the national Statistics Authority, Reproduced under the terms of the Click-Use Licence):

,"DEYQ"," 1998",154.8," 1999",163.3," 2000",181.0," 2001",148.4," 2002",104.0," 2003",136.0," 2004",122.3," 2005",..," 2006",155.8,"DEYQ","SRS: Ungrossed: Total Market Value:Individuals","Not seasonally adjusted","Updated on 8/ 6/2007"

All fine, I think... except I could not find any way to do this automatically. Maybe they have an API I haven't found, maybe they have plans not yet come to fruition. Anyway, some good stuff here, but perhaps room for improvement?