Place for general discussion.
Hurrah! We have our first new packaged core dataset thanks to sxren CO2 PPM data Ā· Issue #56 Ā· datasets/awesome-data Ā· GitHub
US CPI has been a bit neglected and needs a maintainer: US CPI data Ā· Issue #64 Ā· datasets/awesome-data Ā· GitHub
Anyone in or near London? The ODI are having a meetup Open Data London Meet Up Tickets, Thu 12 Feb 2015 at 18:15 | Eventbrite I donāt know if I can make it but Iām going to try.
Iāve tentatively signed up. It means an overnight stay in London! If I can make it, Iāll come say hello.
Cheers
Edafe
Hi, I likely canāt make the ODIās open data london event but I will be running Open Data Maker London on Thursday 5th February (next Thursday):
http://attending.io/events/open-data-maker-london-feb-2015
This is also probably a better venue for working on the core datasets work as more making than talking oriented (plus Iāve made ācore datasetsā a theme).
I hope to make it, depending on my workshop timings. It would be great to speak to someone about the process. Itāll help tie everything together so I can get started contributing.
Cheers
Edafe
@ekoner great - the event runs 18:30 to 21:00 and its no problem if you do not turn up right at the start.
@EvilPhil how about you?
I donāt know. I will be in London but for training but Iāll see if I can make it. It would be good to meet everybody in person.
A question, if you allow - and i am not even sure this is the right forum.
When I work with datasets I try to note a few things
- where did I get it (=source, url)
- when did I get it and give an estimate when I need to refresh
- note key steps and issues i encounter when cleaning up the data
- gaps if any (= often)
I have not seen these in the introduction, but my experience would indicate this information as very helpful. What do you think of adding this? ( as comment in the package.json?)
@andreas this is the exactly the right place to be asking. I also note you are also free to start a new ātopicā in the forum (just put it in the core datasets category) - which will mean your question will get a dedicated thread.
First, these are great questions and we should adding answers to these to the primary Data Packaging docs e.g. http://data.okfn.org/doc/publish (and subpages) as we go (you can submit improvements to those pages btw!)
Looking for somewhere to make your next contribution? Hereās some places to look:
Hi all, our amazing current managing editor @sxren has less time currently due to other commitments so we are looking for someone to step up and help coordinate activity on the core datasets queue: Issues Ā· datasets/awesome-data Ā· GitHub
Youāll get one-on-one tutoring and support from me and thereās plenty of people helping out
Perhaps of topic a bit, but as Iāve finally started working on the irish House Prices index Iām wondering if anybody knows where I can get a free or very cheap linux VM in the cloud? Iām hoping to run a monthly cron job and push updates to the price index to github.
@EvilPhil this is really interesting as this is a general need on this project - i wonder if we should start a dedicated thread on this (focused on what we would want in terms of scraping).
Letās see if we can find a cheap VM which we could use for this sort of scraping: obviously thereās stuff like DigitalOcean, Linode, Dreamhost etc - but it would be nice to get something pro-bono maybe.
I should also flag Morph in case it fits your needs already: Morph, a scraper platform for hackers and would be hackers - Open Knowledge Labs
Hello @EvilPhil,
you might be interested in these small cheap hosting :
- https://www.kimsufi.com/fr/index.xml dedicated server at 6 ā¬/month
- High-performance Dedicated server Dedibox | Scaleway dedicated server at 6 ā¬/month
- https://www.scaleway.com/pricing/ cloud server at 0,0072 ā¬ /hr, ie 3.5 ā¬ / month. Itās ARM, not a normal intel x86 computeur, but it should be Ok if you are running node or python scriptsā¦