Looks like some of us are using less pesticides, purchasing a few more energy efficient and water conservation devices, composting only very slightly more than before, and it seems we dunno what to do with our toxic waste, we still throw out medicines and electronics in the regular curb pick up and we still commute to work one person per car which is too bad since

Passenger transportation accounts for about 12 per cent of Canada’s greenhouse gas emissions and efforts to improve efficiency are a high-profile part of the global warming debate.

Also, sadly we drink way more bottled water than is necessary in a country with an excellent drinking water infrastructure.

It would be great to get a hold of the raw data and play with it. It could be mapped and studied with other variables like income, city versus rural, ethnicity, mother tongue, population density, etc. This type of analysis could help target campaigns in certain under-performing areas and study why others are doing better.

Sources:

Putting Canadian “Piracy” in Perspective, a video from Geist and Albahary is a great way to present an argument. In Geist’s words

over the past year, Canadians have faced a barrage of claims painting Canada as a “piracy haven.” This video – the second in my collaboration with Daniel Albahary – moves beyond the headlines to demonstrate how the claims do not tell the whole story.

The video also uses quite a bit of public and private sector data to support its argument. This to me is what public data are for and this is what democracy looks like – when civil society has access to the data it requires to keep its government accountable, can keep citizens informed and can temper industry desires with public interest!

One of the cultural issues that has become pervasive as of late is the proliferation of policies and decisions being based on assumptions and not on facts, and in the case of the very powerful lobby against Canada on IP in the cultural sector – really biased reports that are not based on facts but on an industry’s desires and self interests. Look for the sources of the data and the methodology in all reports. Even in this great video! Geist and Albahary do a great job in this to show what is being said and repeated (memes) about the cultural industry in Canada and reality.

It is interesting that the video ends with a slide acknowledging the photos used, the music heard, the creators of the video and the license but not all the data sources in the charts! Some of the data references are in some of the bar charts while most statements are referenced with their source at the bottom of the slide. I always look for data references, else how can I go back and verify what was purported!

The data in the charts were:

  • Hollywood Studio Revenue Growth – Data Source unknown
  • Top Hollywood International Markets – Data Source unknown
  • Canadian Music Releases – Statistics Canada
  • Canadian Artist Share of Sales – Canadian Heritage Music Industry Profile
  • Digital Music Download Sales Growth – Data Source unknown
  • Private Copying Revenues 2000-2005 – Data Source unknown
  • RCMP Crime Data – Data Source unknown but assume the RCMP

*************************************
NOTE: See the comments of this post, the references to the data, quotes and reports that were not listed in the credits or with the information in the film are now fully described on Michael Geist’s Blog here.

Datalibre.ca received and excellent comment on the DLI post about access to some of the Statistics Canada data in schools and public libraries. Today I am looking at E-STAT online and am quite impressed – but alas I have not yet gone to a public library to check out what is actually there and what I can do. Nor do I know the limitations of CANSIM data. I did however speak on the phone with a fine librarian at the Main Ottawa Public Library this morning and look forward to digging for data later on today or tomorrow.

E-STAT is:

Statistics Canada’s interactive learning tool designed with the needs and interests of the education community in mind. E-STAT offers an enormous warehouse of reliable and timely statistics about Canada and its ever-changing people.

Using approximately 2,600 tables from CANSIM*, track trends in virtually every aspect of the lives of Canadians. Updated once a year during the summer, CANSIM contains more than 36 million time series.

Hundreds of schools across the country and Depository Service Program Libraries make these data accessible if you go in person to access them. You can get access to these data online only if you are registered with one of these institutions.

The E-STAT license on the data are quite restrictive.

The Government of Canada (Statistics Canada) is the owner or authorized licensee of all intellectual property rights (including copyright) in the data product referred to as E-STAT. Statistics Canada grants the educational institution a non-exclusive, non-assignable and non-transferable licence to use the data product subject to the terms below.

The data product supplied under this agreement shall at all times remain under the control of the institution. It may not be sold, rented, leased, lent, sub-licensed or transferred to any other institution or organization, and may not be traded or exchanged for any other product or service. The data product may not be used for the personal or commercial gain of any authorized user, nor to develop or derive for sale any other data product that incorporates or uses any part of this data product.

The data that are made available are Yearly updated Canadian Socio-economic Information Management System (CANSIM) data, the daily updates are sold for commercial purposes. I am also not sure how fine the geography is for E-STAT data, for instance if the data are available by Dissemination Blocks, Dissemination Area or, Census Tract, or Urban Areas (Note the cost associated with these and other maps). These make a difference, since DB is the finest granularity, DA is a larger neighbourhood level while CT covers a larger areas, while UAs are larger still. Each scale is for a different level of analysis and the boundaries if you aggregate any of these do not necessarily line up. Additionally, DB and DA are only for the 2006 Census while CT and UA are for others. I am guessing E-STAT is CT Scale data and larger.

E-STAT also has some census data, agricultural data, aboriginal survey data, some environmental data and health behaviour data for school aged children. Clearly not all the data are available and certainly not the specialized surveys such as business, waste management, household spending surveys, health, the survey of particular sectors etc. The data come with explanations, and teachers and users guides.

Lets see what we can get once I make a visit!

Another great American project, Fedspending.org is:

a free, searchable database of federal government spending…. With over $14 trillion in federal spending, this more open and accessible tool for citizens to find out where federal money goes and who gets it is long overdue. We believe this website is a good first step toward providing that access.

The project is run by OBM Watch, a “a nonprofit government watchdog organization located in Washington, DC. Our mission is to promote open government, accountability and citizen participation.” Funded by the very busy Sunlight Foundation.

United Nations Common Database (UNCDB) … “provides selected series from numerous specialized international data sources for all available countries and areas.”

Even better:

As of 1 May 2007, use of the Common Database will be FREE OF CHARGE. No subscription will be necessary after that date, and any user can enjoy the full range of data, metadata and various search tools without restriction.

Does anyone know of any exciting applications of these datasets?

Jon Udell has been writing about public data a fair bit of late (and he’s agreed to do an interview with us, coming sometime soon). In his latest post, he puts into practice an interesting theory, that good data presented in the right way is a kind of performance art. He demonstrates with a recent hobby horse of his, crime data from his hometown of Keene, which he runs through in a screencast with narration.

Jon’s inspiration for this style of presenting data is Hans Rosling, whose past two TED Talks made data sexy for many who never thought they might consider sexy and data in the same universe.

What Rosling and Udell are illustrating is the sort of thing that governments don’t seem to have time or interest in doing: presenting data in a way that average people can grasp. By doing that, our communities will necessarily become much better at making sensible decisions, for instance about how and where to spend money. There is no reason why governments can’t be doing this too … but more importantly, there is no reason why taxayers should not get access to this kind of data. With the data, citizen can find new and innovative ways of displaying and using the data (meaning the government doesn’t have to), which, if one has faith in data, people and democracy, should translate to better decision-making in the community.

I tripped over this yesterday while looking for some arguments for and against cost recovery. The arguments are quite good and comprehensive. If any of you can think of more send them to the civicacces.ca list or leave comments here.

This texte I believe was put together by Jo Walsh and colleagues as they were preparing positions for the INSPIRE Directive that became official May 7, 2007. Public Geo Data put together a great campaign, an online petition, a discussion list and superb material to lobby EUROGI for Free and Open Access to Geo Data. At the time the UK was pushing heavily for the Ordnance Survey‘s extreme cost recovery model for the EU while other European nations were working towards more open and free access models. You can read more about it by going through the archive of their mailing list.

Here is the full text for Why Should Government Spatial Data be Free?

We’re planning to do some email interviews about citizen access to government data and related projects with academics, hackers, web project instigators, statisticians, activists, politicians, bureaucrats, writers and the like.

Our first interview is with Rami Tabello, of IllegalSigns.ca – Tracking Toronto’s Outdoor Advertising Industry, a Toronto-based, grassroots project set up to fight illegal billboards. Says the about:

Our Streets are where civic capital is created. Illegal billboards monetize our civic capital, under no colour of right, by treating citizens as consumers first. Illegal billboards commodify what is unique about our neighbourhoods by turning our Streets into pages of a mass-market magazine, without regard to the law. Join us as we fight to legalize and democratize Toronto’s visual environment. Join us as we fight to Reclaim the Streets.

And here is the interview:

1. What do you think of the state of democracy in Canada?
I donÂ’’t much think about it. I think the British Parliamentary system tends towards stability.

2. What do you think of the state of democracy in Toronto?
The problem in Toronto is not lack of democracy, it’Â’s a public service that doesnÂ’’t work and doesn’t hire the right people.

3. How do you think the mechanisms of democracy can be improved?
I have no idea. David Meslin is working on a project to bring instant runoff voting to Toronto: Whorunsthistown.to.

4. Are you optimistic? Why?
IÂ’’m optimistic because city councilors want to do something about illegal billboards.

5. Why did you start illegalsigns.ca?
More of a challenge than anything. An easy way to make a big difference to the visual environment.

6. What tools do you use in illegalsigns.ca?
Our main tools are freedom of information inquiries. We have been barred from that process. Please see: The City Clerk Tries to Shut Down Our Research Team [ed: well worth a read * see below]. We are currently appealing this to the Office of the Information and Privacy Commissioner.

7. What has been the public reaction to your project?
Nothing but positive feedback from the public, the media and city councilors.

8. What has been the reaction from the City?
On the other hand, the bureaucracy has reacted negatively, mainly because we are a source of criticism. See above.

9. What other similar projects would you like to see in Toronto or Canada?
WeÂ’’d like to see a site that tracks illegal parking lots. Illegalparkinglots.ca. half the lots in Toronto are illegal.

*Illegalsigns.ca has filed hundreds of freedom of information claims to get the information on hundreds of signs they claim, and indeed have proved to be illegal. The reaction of Toronto’s City Clerk, is to ban them from the process (see review here, and the Clerk’s letter here-pdf). The Clerk claims the requests are “frivolous and/or vexations,” which means the City might not have to respond. Here’s the key summary about dealings with Toronto’s City Clerk:

The Clerk claims: “the high volume of your requests appears to be for the sole purpose of revisiting enforcement policy matters that City enforcement staff have already addressed with you.” In fact, the high volume of our requests is due to the high volume of properties in Toronto that have illegal billboards on them; this high volume was created because the bureaucracy was operating without scrutiny. The Clerk’s decision, if upheld, would shut down that scrutiny.

Indeed.

Fixmystreet is a neat little project out of the UK, made to:

help people report, view, or discuss local problems they’ve found to their local council by simply locating them on a map. It launched in beta early February 2007.

You enter a postal code, are shown a map, click on the map, and add your comments about problems (graffiti, overflowing drains, broken lights, etc). An email is then sent to the local municipal council. As of today, 171 reports have been made in the past week, 381 problems have been fixed in the past month, and 2462 reports have been updated.

The project comes to you from MySociety, which:

builds websites which give people simple, tangible benefits in the civic and community aspects of their lives. For more info on our aims, click here.

I’m going to feature some Canadian data access projects and people working with data in Canada that I find interesting and important on datalibre.ca . Here is my first go at it. Hope you like it! It is about a great program called the Data Liberation Initiative (DLI) that was formally instituted in 1996. I greatly benefited from the DLI as an undergraduate student studying Geomatics at Carleton University.

Tracey

************
Did you know that until the latter half of the 1990s students and faculty in Canadian Universities had to pay for Canadian Demographic Data that were collected with the use of their own tax dollars? Well it’s true! If students and faculty wanted access to Statistics Canada data to conduct any kind of demographic analysis, to study the labour market, or issues related to income and poverty, explore provincial migration patterns etc. they had to pay exorbitant amounts. What was the effect? Canadian students became US experts since their data were free and worse policy decisions for Canadians were based on US data! The real knowledge and social cost of Data Cost Recovery policies can never be recovered!

Why access to Canadian public data?

I think Professor Paul Bernard, Chair, Advisory Committee on Social Conditions (Statistics Canada) and member of the National Statistics Council said it well back in 1991:

…the genuine exercise of democracy increasingly requires that citizens get access to complex information and have the skills required to understand it.” While he realizes there are pressures on Statistics Canada to reduce costs and increase income, he feels the outcome has been the restriction of “…access to information only to groups that have the solid ability to pay.” Bernard feels that this may “…hamper the participation in public debates of groups whose contribution is not backed up by much money” as well as “those who have no prospect of turning a profit or reaping some tangible and relatively immediate benefit from using it.” This, he states, is “…likely to lead, in the long run, to suboptimal development and less than full-blown democracy.” (see Watkins).

Interestingly, since 1927 the Government of Canada did have a program to share Government information via the Depository Services Program (DSP) which is

an arrangement with some 680 public and academic libraries to house, catalogue and provide reference services for the federal government publications they acquire under the Program. These depositories must make their DSP collections available to all Canadians and for interlibrary loans. DSP also includes depositories such as Parliamentarians, central libraries of the federal government departments and press libraries.

The DSP however does not include the dissemination of public data files or databases collected and managed by the Government of Canada. Data users were and still are considered a special interest group. Odd! Numerate Canadian citizens a special interest group? Imagine literate Canadian citizens being considered a special interest group! Anyway, this meant that independent analysis on a variety of topics important to Canadians was left unquestioned, unstudied, ignored and unknown. Not the best scenario for a democracy or a knowledge based economy let alone for the promotion and growth of a numerate workforce and citizenry.

Fortunately, in 1993 we see the early formation of the Data Liberation Initiative (DLI). An early working group consisting of researchers, data librarians and representatives from Canadian Association of Research Libraries (CARL) and Canadian Association of Public Data Users (CAPDU) , Statistics Canada and the DSP as well as members of the Social Science Federation of Canada (SSFC) got together and held a series of meetings. In 1995 Statistics Canada gave the DLI its formal blessing and the DLI received Treasury Board approval in1996.

What is the Data Liberation Initiative?

The DLI a data purchasing consortium between Canadian Universities and Statistics Canada. Large universities pay $12,000 per year and smaller universities pay $3,000. The Treasury Board of Canada, Industry Canada, Health Canada, Human Resources Development Canada, Social Sciences and Humanities Research Council of Canada, Medical Research Council of Canada and Statistics Canada also financially contribute. These institutions subscribe to the service.

The DLI provides

affordable and equitable access to the standard data products listed in the Statistics Canada Catalogue through an annual subscription fee. The terms of agreement specified in the DLI license place conditions on the use of products disseminated through this program. These restrictions are directed at stopping the redistribution of data received through this channel and protecting against the loss of sales to non-educational markets for Statistics Canada, which is known within Statistics Canada as “leakage”. The license allows the use of DLI data for non-profit, academic research and instruction. Access to statistical information through DLI does require student or staff affiliation with a DLI member institution. While students and staff do not have to pay directly for access, DLI does require mediated services to disseminate statistical and data products on local campuses.

How does it works:

Students and Faculty go to their respective data libraries , consult with the data librarian, sign a use agreement in plain english a DLI Data Use License, access the data via a dedicated computer and download what they need.

The Infrastructure:

An elaborate organizational structure with very dedicated members is in place with a data delivery technical infrastructure that includes a web site, an FTP service, CDRom data delivery service and a special order process. In addition each participating university institutes a ‘data service’ which assumes responsibility for DLI at their site. The project is also glued together with two listserves. The data files are delivered in ASCII formats with associated metadata discoverable using StatCan Software at dedicated workstations in the Library.

Critical Note:

The DLI was and is the best possible reaction and compromise to the very restrictive data cost recovery policies initiated in 80s that remain alive and well with us today. It is important to repeat that these public data have already been paid for by taxation, they are re-paid for with tuition and DLI data access is restricted only to Canadians who are university students and faculty. The DLI solved one very important Canadian knowledge creation and dissemination issue in academic institutions but not the broader issue of access to data by Canadian citizens. They did set a precedent!

Statistics Canada data are still sold to Federal Departments, Provincial Governments and Municipal Governments who are not allowed to share between and among them due to very stringent licensing regimes. Our taxes have paid for many of the same datasets multiple times since these are government purchases and transactions. Just think of all the bureaucracy to manage these license regimes, royalties, the lawyers, purchasing and accounting services, storage, and so on. In addition civil society organizations such as Non Governmental Organizations, Non Profit Organizations, Community Based Researchers etc. who are not wealthy yet fulfill an important democratic function cannot afford these data while it is their role to keep government accountable on a variety of issues (e.g. Environment, Homelessness, Education etc.). Further citizens who want to learn about their communities, develop a community plan or start a new business want access to data but can only do so if they have a significant amount of cash to do so. The result – a lack of informed decision making.

Dream Idea:

It would be fantastic to have the knowledge, training and infrastructure of the DLI extended to all of our public libraries and community access points. Imagine knowledge one stop shopping – picking up a video, a music CD, a novel and some demographic data related to school closures in your neighbourhood – Wow! Of course, the data should be at no cost to the citizen nor the library. Also, imagine having a data librarian in every library that can help citizens find the data they need and helping them learn how to use them? Now that is a knowledge Society.

References:

You can access the documents I referred to here – my del.icio.us – tagged with datalibre civicaccess and DLI.

Continuum of Access, By Chuck Humphrey, University of Alberta.

Charles Humphrey (2005). Collaborative Training in Statistical and Data Library Services: Lessons from the Canadian Data Liberation Initiative. Resource Sharing & Information Networks, Vol. 18 (1/2), pp. 167-181.

« Older entries § Newer entries »