Jan
21

Google to Host Terabytes of Open-Source Science Data

Written by Jonathan Dingman
01/21/2008 9:42 ET - Filed under Corporate

Sources at Google have disclosed that the humble domain, http://research.google.com, will soon provide a home for terabytes of open-source scientific datasets.”

Let’s take a look at Google’s mission statement again: “Google’s mission is to organize the world’s information and make it universally accessible and useful.”

So how does this reflect on Google’s mission statement? Head on. Google has hundreds, thousands, maybe even millions of terabytes of storage available. Sure, Google is using it for additional Gmail storage, sure they’re using it for Picasa Web storage, sure they’re using it for Google Apps, but that’s still only a small fraction of what they have available.

Now Google has a place where they can “donate” this extra storage space to.

(Google people) are providing a 3TB drive array (Linux RAID5). The array is provided in “suitcase” and shipped to anyone who wants to send they data to Google. Anyone interested gives Google the file tree, and they SLURP the data off the drive. I believe they can extend this to a larger array (my memory says 20TB).

This is where Trendalyzer comes into play. Google acquired Trendalyzer nearly a year ago and we haven’t heard a single thing about it since. This is where we will see a lot about Trendalyzer because “Google will also be offering algorithms for the examination and probing of the information. The new site will have YouTube-style annotating and commenting features.”

Tags:
  • Subscribe via RSS
  • Bookmark to del.icio.us