Version control for data [...]

The ability to revision and version data – to store the changes made and share them with others – especially in a distributed way, would be a huge benefit to the (open) data community. I’ve discussed why at some length before (see also this earlier post), but to summarize:

- It allows effective distributed collaboration – you can take my dataset, make changes, and share those back with me (and different people can do this at once!)
- It allows one to track provenance better (i.e. what changes came from where)
- It allows for sharing updates and synchronizing datasets in a simple, effective way – e.g. an automated way to get last month's GDP or employment data without pulling the whole file again
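To make the synchronization point concrete, here is a minimal sketch of a row-level diff and patch for tabular data. It assumes each row has a unique key column (here `month`); the function names `load_rows`, `diff` and `apply_patch`, the changeset layout and the sample figures are all hypothetical, invented for illustration rather than taken from any existing tool.

```python
import csv
import io


def load_rows(csv_text, key):
    """Parse CSV text into a dict mapping each row's key value to the row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row[key]: row for row in reader}


def diff(old, new):
    """Return a changeset describing how to turn `old` into `new`."""
    return {
        "added": {k: new[k] for k in new if k not in old},
        "removed": sorted(k for k in old if k not in new),
        "changed": {k: new[k] for k in new if k in old and new[k] != old[k]},
    }


def apply_patch(old, changeset):
    """Apply a changeset to a copy of `old`, leaving the input untouched."""
    result = dict(old)
    for k in changeset["removed"]:
        del result[k]
    result.update(changeset["changed"])
    result.update(changeset["added"])
    return result


# Made-up figures, purely for illustration.
LOCAL_COPY = "month,gdp\n2013-01,100\n2013-02,101\n"
PUBLISHED = "month,gdp\n2013-01,100\n2013-02,102\n2013-03,103\n"

old = load_rows(LOCAL_COPY, key="month")
new = load_rows(PUBLISHED, key="month")

changes = diff(old, new)            # small: one revised row, one new row
synced = apply_patch(old, changes)  # local copy brought up to date
```

Only `changes` would need to travel between publisher and consumer, and because the changeset records which rows were added, removed or revised, it also serves as a crude provenance record. A real system would additionally need to handle schema changes, conflicting edits and datasets without a natural key.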

