Having co-organized csv,conf,v2 this past May, a few of us from Open Knowledge International had the awesome opportunity to travel to Berlin and sit in on a range of fascinating talks on the current state-of-the-art on wrangling messy data. I’m currently writing a series of posts about some of these neat talks which you can read about below:
One such talk was given by Stuart Harrison of the Open Data Institute (ODI) who presented on tool he is developing called Comma Chameleon. Comma Chameleon is a desktop CSV editor with validation magic .
Another such talk was given by Sadayuki Furuhashi of Treasure Data2 who presented on tool he is developing called Embulk. Embulk is an open-source tool for moving messy data.