16 December 2002 Online scientific data curation, publication, and archiving
Abstract
Science projects are data publishers. The scale and complexity of current and future science data change the nature of the publication process. Publication is becoming a major project component. At a minimum, a project must preserve the ephemeral data it gathers. Derived data can be reconstructed from metadata, but metadata is ephemeral. Longer term, a project should expect some archive to preserve the data. We observe that published scientific data needs to be available forever; this gives rise to the data pyramid of versions and to data inflation, where the derived data volumes explode. As an example, this article describes the Sloan Digital Sky Survey (SDSS) strategies for data publication, data access, curation, and preservation.
© (2002) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jim Gray, Alexander S. Szalay, Ani R. Thakar, Christopher Stoughton, Jan vandenBerg, "Online scientific data curation, publication, and archiving", Proc. SPIE 4846, Virtual Observatories, (16 December 2002); doi: 10.1117/12.461524; https://doi.org/10.1117/12.461524
PROCEEDINGS
5 PAGES

