NASA Data Center Annual Program Plan

Program Year: FY 2003 
Data Center/Service: Multi-mission Archive at Space Telescope (MAST)
(Optical/UV Science Archive Research Center) 
Supporting Organization: Space Telescope Science Institute
3700 San Martin Drive
Baltimore, MD 21218

Overall Mission:  MAST supports active and legacy mission data sets and related catalogs and surveys, focusing primarily on data in the ultraviolet, optical, and near-IR parts of the spectrum.  Support includes curation of the data, providing expert support to users of the data, providing access to data-specific calibration and analysis software, providing user support for this software, and maintaining public access interfaces to the data.  MAST works with new mission teams in the supported wavelength regions to assist in the development of data management plans, especially in the areas of data formats, descriptive metadata and standardization of keywords, in the development of data access and data delivery plans, and in assuring data quality control.

MAST Data Holdings

Name                      Size     Number of Observations   Active Mission Duration

ACTIVE MISSIONS
FUSE                      221 GB   2,214                    1999-
HST                       8.6 TB   266,926                  1990-

LEGACY MISSIONS
ORFEUS: BEFS              4.1 GB   332                      Sept. 1993; Nov. 1996
ORFEUS: IMAPS             0.3 GB   643                      Sept. 1993; Nov. 1996
ORFEUS: TUES              0.2 GB   229                      Nov. 1996
EUVE                      96 GB    1,377                    1992-Jan. 2001
ASTRO: UIT                56 GB    1,442                    Dec. 1990; March 1995
ASTRO: HUT                0.6 GB   516                      Dec. 1990; March 1995
ASTRO: WUPPE              0.1 GB   238                      Dec. 1990; March 1995
IUE Final Archive         475 GB   103,552                  1978-1996
IUE SIPS                  125 GB   104,296                  1978-1996
Copernicus                0.8 GB   551                      1972-1981

CATALOGS & SURVEYS
SDSS: Early Data Release  1 TB     n/a                      1998-
VLA-FIRST                 109 GB   14,940                   1993-
Digitized Sky Surveys     5 TB     n/a                      1950-58, 1975-99
GSC I, II                 2 TB     n/a                      1950-58, 1975-99

Services Provided:  MAST helps users understand the properties and instrumental signatures of all archived data sets and assists them with the interfaces used to browse and retrieve these data.  Access to non-HST mission- and instrument-specific calibration and analysis software, and assistance in its use, is provided on a best-effort basis (full support for HST-related software is provided by the MAST Helpdesk and staff).

Non-HST Data Analysis Software Provided:  IUE RDAF package (IDL-based), IUE Final Archive processing software (IRAF port), EUVE analysis software package (IRAF-based), Copernicus data analysis software (IDL-based), UIT data reduction and analysis software (FORTRAN, C, and IDL routines),  WUPPE data analysis software (FORTRAN routines requiring the FITSIO library), and HUT data reduction software (IRAF-based) are available through MAST.

Mission Interfaces:  MAST staff members continued to coordinate with the FUSE mission on data ingest, creation of preview data, database queries, and web access. The FUSE Project has undertaken a reprocessing effort to improve the quality of the processed data and to provide files suitable for the "quick look" preview data that are made available through the MAST archive.

Staff members coordinated with the GALEX team in drafting the Interface Control Document (ICD), which specifies data characteristics, file structure, keyword definitions, and delivery mode. A preliminary database has been created using data models and file and keyword information. Test queries based on GALEX team input are being used to help define the user interface. A detailed testing and implementation plan has been created in preparation for the launch of GALEX later this year.

MAST has developed working relationships with the teams from three planned NASA missions: Kepler, a newly approved Discovery mission to detect planets; the Cosmic Hot Interstellar Plasma Spectrometer (CHIPS), a University-class Explorer mission; and the Spectroscopy and Photometry of the IGM's Diffuse Radiation (SPIDR), a Small Explorer class mission. All three projects plan to archive their data with MAST. Staff members continued to coordinate with teams from the ORFEUS Project, the Sloan Digital Sky Survey (SDSS), and Voyager UVS.

Interoperability Activities:  MAST is working with the Astrophysics Data Centers Executive Council (ADEC) on a project to build a simple interoperability framework among the NASA data centers. This project will initially act as a referring service in a Web-based search, pointing MAST users to relevant data at other data centers and alerting users at other data centers to MAST data that may be of interest. This project has the immediate goal of improving the interaction between data centers, and will also help lay the interoperability groundwork for the Virtual Observatory (VO).

MAST has provided leadership on a collaborative literature link project between the ADEC and the scientific journals. The goal of the project is to define ways to structure dataset identifiers and object names that can be used by authors to identify datasets and astronomical sources in a published article. This would permit the automatic association between journal articles and their associated datasets and objects, and vice versa; the current MAST literature link activity is labor-intensive. MAST staff members are developing a dataset verifier tool to facilitate the literature link project as well as data access among the various centers. The software tool parses a dataset name, verifies the format, and checks for the existence of the dataset.
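The three steps performed by the dataset verifier (parse the name, verify the format, check for existence) can be illustrated with a minimal Python sketch. This is not the MAST tool itself: the identifier layout "MISSION/dataset-name", the example dataset names, and the in-memory catalog are all hypothetical stand-ins, since the actual identifier structure was still being defined by the ADEC and the journals.

```python
import re
from dataclasses import dataclass

# HYPOTHETICAL identifier layout: "<MISSION>/<dataset-name>".
# The real convention is not given in this plan.
IDENTIFIER_PATTERN = re.compile(
    r"^(?P<mission>[A-Z][A-Z0-9]*)/(?P<name>[A-Za-z0-9_.-]+)$"
)

# Stand-in for an archive catalog query; these entries are made up.
KNOWN_DATASETS = {("IUE", "SWP12345"), ("FUSE", "P1010101")}


@dataclass
class VerificationResult:
    identifier: str
    well_formed: bool  # the name matched the expected format
    exists: bool       # the dataset was found in the catalog


def verify_dataset(identifier: str, catalog=KNOWN_DATASETS) -> VerificationResult:
    """Parse a dataset name, verify its format, and check that it exists."""
    match = IDENTIFIER_PATTERN.match(identifier.strip())
    if match is None:
        # Malformed names cannot be looked up at all.
        return VerificationResult(identifier, False, False)
    key = (match.group("mission"), match.group("name"))
    return VerificationResult(identifier, True, key in catalog)


if __name__ == "__main__":
    for candidate in ("IUE/SWP12345", "IUE/SWP99999", "not a dataset"):
        print(verify_dataset(candidate))
```

Separating the format check from the existence check lets a caller distinguish a mistyped identifier from a correctly formed one that simply is not held by the archive being queried, which matters when the same identifier may be resolved at several data centers.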

Major Activities and Accomplishments of the Past Year:

Data Ingest and Retrievals: The rate of data ingested into the archive continued to grow, with 2.0 TB of data coming from the active missions HST and FUSE. Data retrievals from the same missions totaled 9.2 TB, which exceeds the size of the entire HST and FUSE archives (8.8 TB). The addition of a RAID array to the distribution system has permitted MAST to provide faster access to non-HST/FUSE data and a larger staging area for HST and FUSE data. The non-HST/FUSE data originally stored on a CD jukebox are now also stored on the RAID array, allowing immediate access to the files.

The large size of ACS files poses logistical problems for transfer of these files to users. Staff members created a "FastAccess" area on the MAST website from which users can retrieve popular and non-proprietary ACS files via ftp, including ACS Early Release Observations (ERO) and the Great Observatories Origins Deep Survey (GOODS) observations. Over 2500 individual ACS files from the ERO and GOODS programs have been retrieved from the site by about 200 users thus far.

MAST Data Ingest & Retrieval Activity

Date      Ingest Volume (GB)      Retrieval Volume (GB)   Retrieval Volume (GB)   Datasets Retrieved      Datasets Retrieved
          Active Missions         Active Missions         Legacy Missions         Active Missions         Legacy Missions
Jul 2001  102.0                   655.5                   3.4                     78,572                  2,046
Aug 2001  129.9                   717.2                   2.0                     92,075                  1,260
Sep 2001  112.6                   640.2                   1.3                     88,996                  2,541
Oct 2001  126.4                   690.3                   7.9                     93,303                  5,869
Nov 2001  110.2                   745.6                   1.3                     125,264                 4,168
Dec 2001  105.2                   682.1                   6.8                     101,499                 6,038
Jan 2002  129.9                   735.1                   2.9                     113,238                 7,687
Feb 2002  109.2                   690.0                   7.0                     85,553                  4,310
Mar 2002  134.6                   904.7                   12.2                    136,705                 5,061
Apr 2002  257.0                   957.2                   5.2                     89,875                  7,146
May 2002  339.0                   967.4                   34.1                    76,942                  26,848
Jun 2002  382.8                   832.8                   2.0                     69,713                  7,493
TOTALS    2.0 TB                  9.2 TB                  86.1 GB                 1,151,735               80,467
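
The TOTALS row can be cross-checked against the monthly figures with a short Python sketch. The values are transcribed from the table above; the conversion of 1 TB = 1000 GB is an assumption, chosen because it is the one that reproduces the quoted 9.2 TB.

```python
# Monthly figures transcribed from the table above:
# (month, ingest GB, active retrieval GB, legacy retrieval GB,
#  active datasets retrieved, legacy datasets retrieved)
MONTHLY = [
    ("Jul 2001", 102.0, 655.5, 3.4, 78572, 2046),
    ("Aug 2001", 129.9, 717.2, 2.0, 92075, 1260),
    ("Sep 2001", 112.6, 640.2, 1.3, 88996, 2541),
    ("Oct 2001", 126.4, 690.3, 7.9, 93303, 5869),
    ("Nov 2001", 110.2, 745.6, 1.3, 125264, 4168),
    ("Dec 2001", 105.2, 682.1, 6.8, 101499, 6038),
    ("Jan 2002", 129.9, 735.1, 2.9, 113238, 7687),
    ("Feb 2002", 109.2, 690.0, 7.0, 85553, 4310),
    ("Mar 2002", 134.6, 904.7, 12.2, 136705, 5061),
    ("Apr 2002", 257.0, 957.2, 5.2, 89875, 7146),
    ("May 2002", 339.0, 967.4, 34.1, 76942, 26848),
    ("Jun 2002", 382.8, 832.8, 2.0, 69713, 7493),
]

GB_PER_TB = 1000  # ASSUMPTION: decimal terabytes


def column_totals(rows):
    """Sum each numeric column, skipping the leading month label."""
    numeric_columns = list(zip(*rows))[1:]
    return [sum(column) for column in numeric_columns]


ingest_gb, active_gb, legacy_gb, active_n, legacy_n = column_totals(MONTHLY)

if __name__ == "__main__":
    print(f"Ingest, active missions:     {ingest_gb / GB_PER_TB:.1f} TB")
    print(f"Retrieval, active missions:  {active_gb / GB_PER_TB:.1f} TB")
    print(f"Retrieval, legacy missions:  {legacy_gb:.1f} GB")
    print(f"Datasets retrieved, active:  {active_n:,}")
    print(f"Datasets retrieved, legacy:  {legacy_n:,}")
```

The monthly values sum to 2038.8 GB ingested, 9218.1 GB and 86.1 GB retrieved, and 1,151,735 and 80,467 datasets retrieved, in agreement with the TOTALS row.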


Research Tools for Data Exploration and Evaluation: MAST staff members continued to develop new capabilities for researchers to locate and evaluate data available from the archive.

High-Level Science Products: Guidelines have been established for the contribution of High-Level Science Products (HLSP) to MAST. HLSP are defined as fully reduced and processed images and spectra from the MAST missions, as well as closely related ground-based observations, theoretical data products, object catalogs, and original reduction/analysis software. Science-ready products currently available include atlases, sky surveys, and catalogs created using various MAST mission datasets including HST, IUE, FUSE, EUVE, Copernicus, and the SDSS. In addition, major contributions are expected from the HST Treasury, Archival Legacy, and Large programs begun in Cycle 11.

WFPC2 Association Project: A collaboration between MAST, the Canadian Astronomy Data Centre (CADC), and the ST-European Coordinating Facility (ST-ECF) was recently undertaken to make combined WFPC2 images available jointly from each of the three archives. Each combined image is built from the individual WFPC2 images of a given field taken with a given filter, and combined images are being created for the entire WFPC2 archive using software developed by CADC and ST-ECF. The combined images will enhance the existing archives by providing deep images suitable for scientific analysis and better previews of these fields.

User Interface Enhancements:

Scientific and Technical Publications:

Plans and Schedule for the Coming Year:

MAST plans to continue to enhance the interoperability and scientific utility of our data holdings in the coming year through activities in the following areas.

Additional Ultraviolet and Optical Data Sets

Improved Services for Archival Researchers

Inter-archive Coordination Activities