Finding insight from open data with OpenRefine

Track this Workshop

Over the past few years we’ve witnessed a profusion of open data made available for citizen to explore and take action on. However, the tools available for end users haven’t fully caught up. Spreadsheets offer entry level interface to the data but are time consuming and don’t scale, while languages like R or ipython offer flexibility but have a steep learning curve for the non technical person.

Domain experts and citizen need powerful yet easy-to-use interfaces to explore new data sets, normalize them and process them via innovative services often available via an API only. OpenRefine offers the best of both worlds with a self service agile and iterative interface for data discovery and preparation and an easy-to-learn scripting language.

OpenRefine (formerly Freebase Gridworks and Google Refine) is a five year old project that has gained traction with various domain experts including librarians and researchers, data journalist, open data enthusiasts, and semantic web professionals.

During this session we will grab some data from Toronto Open Data portal and start blending and refining them.

Key Take Aways

  • Get starting with OpenRefine interface
  • Explore and gain insight from open data
  • Clean duplicate and typo
  • Reformat and normalize your data
  • Process data against API without a line of code
Intermediate Maker & Hacker Workshop

Q: What kind of volunteer opportunities are there at PodCamp Toronto?

PodCamp Toronto is entirely volunteer-driven! We always need enthusiastic volunteers in a range of areas: manning the registration desk, room hosting and greeting attendees at our traditional Saturday night social. Many previous volunteers have gotten more serious and joined our core organizing team! For more details, please contact our Volunteer Coordinator Samuel at

More FAQ's