The research community in agreement that the raw data should be shared through databases. It will enable a data mining experiments and advance the science. It still not clear how these huge datasets should be presented, managed, formatted and curated.
Now researchers have proposed a common standards for data sharing:
Led by researchers at University of Oxford (UK) and the Harvard Stem Cell Institute (HSCI) at Harvard University, (USA), more than 50 collaborators at over 30 scientific organizations around the globe have agreed on a common standard that will make possible the consistent description of enormous and radically different databases compiled in fields ranging from genetics to stem cell science, to environmental studies.
The proposal for interoperable bioscience data was published in Nature Genetics:
Here we describe a way in which a group of data producers and consumers work within an invisible metadata framework that enables the coordinated use of reporting standards by service providers and circumvents many of the problems caused by data diversity. The same framework enables researchers, bioinformaticians and data managers to operate within an open data commons.
You can also read an editorial.
One of the ways to achieve a common standard is using ISA (Investigation Study Assay) software and framework. Recently we’ve written about the Stem Cell Discovery Engine, which works on ISA-tab format. Now we can give you more examples.
- Library of Integrated Network-based Cellular Signatures (LINCS)
- Harvard Stem Cell Institute’s Blood Genomics Repository.
We hope that this initiative will work nicely and advance the science.