16 December 2002 Online scientific data curation, publication, and archiving
Author Affiliations +
Abstract
Science projects are data publishers. The scale and complexity of current and future science data changes the nature of the publication process. Publication is becoming a major project component. At a minimum, a project must preserve the ephemeral data it gathers. Derived data can be reconstructed from metadata, but metadata is ephemeral. Longer term, a project should expect some archive to preserve the data. We observe that published scientific data needs to be available forever -- this gives rise to the data pyramid of versions and to data inflation where the derived data volumes explode. As an example, this article describes the Sloan Digital Sky Survey (SDSS) strategies for data publication, data access, curation, and preservation.
© (2002) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jim Gray, Alexander S. Szalay, Ani R. Thakar, Christopher Stoughton, Jan vandenBerg, "Online scientific data curation, publication, and archiving", Proc. SPIE 4846, Virtual Observatories, (16 December 2002); doi: 10.1117/12.461524; https://doi.org/10.1117/12.461524
PROCEEDINGS
5 PAGES


SHARE
RELATED CONTENT

ESO Archive data and metadata model
Proceedings of SPIE (September 24 2012)
BIMA data archive the architecture and implementation of a...
Proceedings of SPIE (September 18 1997)
The DIRP framework Flexible HPC based post processing of...
Proceedings of SPIE (September 24 2012)
Moor web access to end to end data flow...
Proceedings of SPIE (September 16 2004)
Chandra data archive operations: lessons learned
Proceedings of SPIE (June 29 2006)

Back to Top