REQUIREMENTS FOR SDSS DATA MANAGEMENT SYSTEMS
June 5, 1995
[All requirements are subject to budgetary, manpower, and technical
limitations]
The data management systems shall maintain all processed data from the
SDSS and provide access by SDSS scientists and operators in order to maximize
the ease of the following:
- Operate the SDSS survey so as to maximize the efficiency of operations.
- Perform Quality Analysis operations on the data so as to ensure its
integrity. The operations will verify calibrations, target selection
criteria and classifications, and completeness & accuracy.
- Provide SDSS scientists with access to the data and tools to permit
selection of spectroscopic targets for certain categories (serendipity;
stars).
- Provide SDSS scientists with access to the data so as to enable
scientific analyses.
Requirements for the Science Archive
The science archive shall consist of:
- A science database that shall:
- a. Retain calibrated object catalogs (photometric CCD output)
- b. Retain parameters from spectroscopic pipeline
- c. Enhanced goal: provide ability to recalibrate
object catalogs
- d. Retain references to atlas images and extracted spectra
- e. Provide ability to carry out manual target selection
for certain target categories
- f. Provide ability to carry out offline QA activities
- g. Provide ability for SDSS scientists to extract subsets
of retained data.
- h. Enhanced goal: Provide ability to retain
scientist-derived data sets.
- i. Enhanced goal: Provide smooth transition to
public distribution system.
- A set of files tracked by the science database.
- A set of files not tracked by the science database.
I. Input to Science Archive
- Survey Definition
- a. A description of the North Imaging survey area
- b. Survey progress: A description of sky inserted into
database to date
- Final Astrometric Calibration
- a. List of r' band calibration coefficients on a
frame-by-frame basis.
[TBD: Are position errors stored on an
object-by-object basis?]
- Final Photometric Calibraton
- a. List of photometric calibration coefficients on a
frame-by-frame basis.
- Merged Object Lists
- a. A list of calibrated objects and parameters from
the Frames pipeline of photo
- b. A list of objects masks from the Frames pipeline of photo
[TBD: Do a and b provide all information about masked areas of sky?]
- c. Run and Field information. Might be needed for recalibrations)
- d. Star/Galaxy/QSO classifications
- e. Enhanced goal: Cross-identifications to other catalogs
- Target Selection
- a. A list of all targetable objects with target
selection categories
- b. A list of all objects from a selected as targets
with selection category
- c. Tiling flags for all objects in b.
- d. Reports for all targets selected manually
- Spectroscopic Pipeline
- a. Redshifts and parameters of all targeted objects
- b. Enhanced goal: Tile and plate information.
[TBD: If a target has multiple spectra obtained, is there a need
to assign one as a primary measurement?]
- Enhanced goal: Scientist derived catalogs
- Enhanced goal: Other input catalogs
- Separate files tracked from Science Database
- a. Atlas Images
- b. 1-D spectra
- Separate files not tracked or accessible from Science Archive
- a. Compressed pixel map
- b. Full corrected pixel map
- c. Corrected spectroscopic frames
- d. Unused data
- TBD: Southern Survey
II. Functional Goals
- User will be able to carry out efficient queries to locate objects
over one or more ranges of following attributes:
- a. Longitude or latitude in several spherical coordinates
- i) J2000 Ra and Dec
- ii) B1950 Ra and Dec
- iii) Enhanced goal: Ra and Dec of arbitrary epoch
- iv) Galactic coordinates
- v) Survey Coordinates
- vi) Any linear combination of the two coordinates
- b. Radius within a give point of the sky
- c. u' g' r' i' z' (One set of magnitudes per object)
- d. Any linear combination of c.
- e. Object radius (one per object)
- f. Surface brightess formed by c and d.
- g. Star/Galaxy/QSO Classification flag
- h. Object class (small/medium/big/mask)
- i. Target Selection Category
- j. Spectrum available flag
- User will be able to carry out queries on any retained
object parameter (subject to implementation constraints).
- Enhanced Goal: All calibrated quantities can be recomputed using improved
astrometric and photometric calibrations. Queries can
be performed on the recalibrated quantities.
- For all efficient queries, return an esimated number of objects to be
located.
- For all located objects, users shall be able to specify an arbitrary
subset of stored parameters to be returned (subject to implementation
constraints) plus the following derived quantities:
- a. Number of located objects
- b. TBD: Extra parameters
- Users shall be able to perform the following functions:
- a. Efficient repeated queries [e.g, get all objects
within each of 10,000 QSOs in my favorite catalog).]
- b. Make simple plots, etc. of returned parameters (e.g., SMONGO)
- c. Formulate new queries based on results of previous queries.
- d. TBD: What else?
III. Technical Goals
- User interface
- a. User interface shall be developed in a TCL/Tk/TclDp
environment.
- b. User interface shall communicate with a query support layer
via ASCII interface protocol.
- c. Data shall be returnable to files, sockets, or pipes.
Returned data shall use binary machine independent format (FITS binary,
ASCII if appropriate, TBD: FITS ASCII, other?)
- Enhanced goal: Data shall be stored in a system providing industry-standard
OSQL-like interface to enable use of commercial products to provide
alternative view of the database.
(TBD: I am not sure what this all means).
- Distributability
- a. A master copy of all data shall be maintained
(the Master Science Archive)
- b. Capability shall be present to replicate all or part of the
Master Science Archive as local databases at SDSS institutions.
Replication may consist of:
- i) Science Database in its entirety
- ii) All or part of separate files tracked by
Science Database
- iii) No capability shall be present to
replicate an arbitrarily selected
subset of the science database beyond that
described by section 1.c of USER INTERFACE.
- iv) Replication of databases shall be possible
on all SDSS supported platforms.
- d. No capability is required be present to replicate all
or part of separate files not tracked by
Science Database
- (TBD: Security ...)
- a. Master Science Archive shall be protected against
corruption by SDSS participant users
- b. Master Science Archive shall be protected against
unauthorized access by non-SDSS participants.