Paper
26 July 2016 The HARPS-N archive through a Cassandra, NoSQL database suite?
Emilio Molinari, Jose Guerra, Avet Harutyunyan, Marcello Lodi, Adrian Martin
Author Affiliations +
Abstract
The TNG-INAF is developing the science archive for the WEAVE instrument. The underlying architecture of the archive is based on a non relational database, more precisely, on Apache Cassandra cluster, which uses a NoSQL technology. In order to test and validate the use of this architecture, we created a local archive which we populated with all the HARPSN spectra collected at the TNG since the instrument's start of operations in mid-2012, as well as developed tools for the analysis of this data set. The HARPS-N data set is two orders of magnitude smaller than WEAVE, but we want to demonstrate the ability to walk through a complete data set and produce scientific output, as valuable as that produced by an ordinary pipeline, though without accessing directly the FITS files. The analytics is done by Apache Solr and Spark and on a relational PostgreSQL database. As an example, we produce observables like metallicity indexes for the targets in the archive and compare the results with the ones coming from the HARPS-N regular data reduction software. The aim of this experiment is to explore the viability of a high availability cluster and distributed NoSQL database as a platform for complex scientific analytics on a large data set, which will then be ported to the WEAVE Archive System (WAS) which we are developing for the WEAVE multi object, fiber spectrograph.
© (2016) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Emilio Molinari, Jose Guerra, Avet Harutyunyan, Marcello Lodi, and Adrian Martin "The HARPS-N archive through a Cassandra, NoSQL database suite?", Proc. SPIE 9913, Software and Cyberinfrastructure for Astronomy IV, 99132A (26 July 2016); https://doi.org/10.1117/12.2233137
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Databases

Stars

Data modeling

Data archive systems

Spectrographs

Analytics

Astronomical imaging

RELATED CONTENT

WEAVE core processing system
Proceedings of SPIE (July 18 2014)
CHARA SPICA a six telescope visible instrument for the...
Proceedings of SPIE (August 26 2022)
2dFdr Pipeline As a Web Service (PAWS) on demand...
Proceedings of SPIE (August 29 2022)
WAS: the data archive for the WEAVE spectrograph
Proceedings of SPIE (August 08 2016)
Project overview of OPTIMOS EVE the fibre fed multi...
Proceedings of SPIE (July 15 2010)

Back to Top