Data mashups of not a few but a few thousand sources are becoming possible as community efforts, enabled by new tools and Creative Commons licensing, unify the world's exploding store of free, open data. Come find out what's awesome, what's hard, and what's possible when you discover there's really only one dataset.
Questions Answered:
What are the different flavors of data that are out there (e.g. api streams, gov't databases, compendia)?
How do I know the data is accurate?
What architectural and interface challenges come up when you track versioning and provenance a permanent database at such fine granularity?
How do you reconcile different datasets that present contradictory information?
How do you classify and organize all of this data? Do traditional ontology approaches work?
How will interconnecting the world's quantitative information streams change our lives?
What kinds of tools exist to acquire / manage / process / explore these information sources?
Does all this data pose a threat to our privacy? To our Humanity?
That technical challenges exist is obvious; what are the surprisingly abstract philosophical questions?
How do the many information curators and distributors collaborate to grow the global data commons?