NASA Data Center Annual Program Plan

Program Year: Reporting period July 2004 through June 2005
Data Center/Service: Multi-mission Archive at Space Telescope (MAST)
(Optical/UV Science Archive Research Center) 
Supporting Organization: Space Telescope Science Institute
3700 San Martin Drive
Baltimore, MD 21218

Overall Mission:  MAST supports active and legacy mission datasets and related catalogs and surveys, focusing primarily on data in the ultraviolet, optical, and near-IR spectral retions. Support includes curation of the data, providing expert support to users of the data, providing access to data-specific calibration and analysis software, providing user support for this software, and maintaining public access interfaces to the data. MAST works with new mission teams in the supported wavelength regions to assist in the development of data management plans, especially in the areas of data formats, descriptive metadata and standardization of keywords, in the development of data access and delivery plans, and assuring data quality control.

This report covers data financially supported under the "MAST" contract. Archive and distribution activities for HST data are supported under the HST contract. Some HST statistics are included in this report, but more complete information on HST activities can be found in the STScI Newsletters and in the STSci Annual Reports. Questions about HST can be directed to archive@stsci.edu.

Total MAST Holdings by volume as of June 30, 2005



MAST holdings without HST/GSC/DSS as of June 30, 2005



MAST Data Holdings

Name Size Number of Observations Active Mission Duration
ACTIVE MISSIONS
FUSE 673.53 GB 3985 1999-
GALEX 429 GB (public)
173 GB (proprietary)
75 GB (catalog)
3252 (public)
187 (proprietary)
19,730,767 (catalog objects)
2003-
HST 23.996 TB 532628 1990-
LEGACY MISSIONS
ORFEUS: BEFS 4.1 GB 332 Sept. 1993; Nov. 1996
ORFEUS: IMAPS 0.3 GB 643 Sept. 1993; Nov. 1996
ORFEUS: TUES 0.2 GB 229 Nov. 1996
EUVE 96 GB 1377 1992-Jan. 2001
ASTRO: UIT 56 GB 1442 Dec. 1990; March 1995
ASTRO: HUT 0.6 GB 516 Dec. 1990; March 1995
ASTRO: WUPPE 0.1 GB 238 Dec. 1990; March 1995
IUE Final Archive 475 GB 103,552 1978-1996
IUE SIPS 125 GB 104,296 1978-1996
Copernicus 0.8 GB 551 1972-1981
CATALOGS & SURVEYS
VLA-FIRST 183.98 GB (compressed) 29,153 1993-
Digitized Sky Surveys 5 TB n/a 1950-58, 1975-99
GSC I, II 2 TB n/a 1950-58, 1975-99

Services Provided: MAST provides support for users seeking to understand the properties and instrumental signatures of all archived datasets and assistance with the interfaces to browse and retrieve these data. Access to non-HST mission and instrument specific calibration and analysis software and assistance in its use continues on a time-available basis. Full support for HST related software is provided by the MAST Help Desk and staff.

Non-HST Data Analysis Software Provided: IUE "RDAF" package (IDL-based), IUE Final Archive processing software (IRAF port), EUVE analysis software package (IRAF-based), Copernicus data analysis software (IDL-based), UIT data reduction/analysis software (FORTRAN, C, and IDL routines), WUPPE data analysis software (FORTRAN routines requiring the FITSIO library), and HUT data reduction software (IRAF-based) are available through MAST.

Mission Interfaces:


Committee Participation within the STScI: 

ACTIVITIES AND MAJOR ACCOMPLISHMENTS OF THE LAST YEAR

MAST Data Ingest & Retrieval Activity

Date Ingest Volume (GB) - Active Missions Retrieval Volume (GB) - Active Missions Retrieval Volume (GB) - Legacy Missions Datasets Retreived - Active Missions Datasets Retreived - Legacy Missions
Jul 1 2004 12:00AM 344.701 1382.197 5.227 54276 12473
Aug 1 2004 12:00AM 343.988 1076.612 142.030 52357 24081
Sep 1 2004 12:00AM 430.302 1104.396 27.781 65986 6315
Oct 1 2004 12:00AM 411.500 1685.704 91.422 59276 28973
Nov 1 2004 12:00AM 439.260 1567.505 101.697 65147 20529
Dec 1 2004 12:00AM 473.170 1557.429 21.437 179084 5185
Jan 1 2005 12:00AM 503.518 1859.858 39.422 56181 7298
Feb 1 2005 12:00AM 521.444 2816.290 23.474 78923 40096
Mar 1 2005 12:00AM 517.580 2581.407 58.090 96124 13943
Apr 1 2005 12:00AM 501.428 2532.458 41.562 86220 8823
May 1 2005 12:00AM 394.318 2132.460 7.361 65582 7503
Jun 1 2005 12:00AM 295.958 2598.555 1.786 61649 6417
Total5177.167 22894.870 561.292 920805 181636

As MAST does not maintain retrieval statistics for DSS, only the number of searches is displayed in order to show the general interest level in these data. Previews are not available for VLA-FIRST data. EUVE data is distributed from HEASARC.



This plot shows the number of datasets downloaded each month per mission during the reporting period










Data Discovery and Search Tools:

MAST has several search tools that complement the individual mission searches.


Scrapbook updates: In March 2005, links to 2MASS images cached at the NASA/IPAC Infrared Science Center (IRSA) were included in the scrapbook. The 2MASS data (available as both jpg images and FITS files) are 20'x20' and are centered on the listed MAST observations. New datasets are added to the scrapbook each month from FUSE and HST observations.

High Level Science Products:

MAST continued to solicit good-quality High Level Science Products this year. MAST staff members consulted with several team regarding the standards and procedures related to archiving HLSP at MAST. The guidelines for submission of HLSP were modified to include new requirements related to making these data available via MAST Virtual Observatory tools.

Several sets of High Level Science Products (HLSP) were delivered and implemented this year.

Since the HLSP are located in an anonymous FTP area, MAST cannot precisely measure the number of distinct users downloading the data. However, we can tabulate the number of distinct domains downloading the data. During the reporting period, 6165 distinct domains downloaded HLSP. The plot shown below shows the number of distinct domains for each set of HLSP . The number of domains for those HLSP sets acquired during the reporting year are shown in pink.



A complete listing of HLSP hosted at MAST is below. Although MAST provides an interface to the WFPC2 Associations, the data are held at CADC. MAST distributes the data via a proxy.

High Level Science Product Holdings
High Level Science Product Set Size
Number of Files
10 Lac Spectral Atlas (HST/GHRS)
5.3 MB
67
AGN and Quasar Spectral Atlas
73.8 MB
451
CoolCAT - Atlas of Cool Stars
1.4 GB
1388
Copernicus Atlas of 6 Selected Stars
3.6 MB
25
EUVE Spectral Atlas of Stars (EUVE)
29.4 MB
490
GOODS: The Great Observatories Origins Deep Survey
96.3 GB
1342
GRAPES - Grism-ACS Program for Extragalactic Science 78 MB
1401
Grayscale of Time Variation of gamma Cas Near SiIV Doublet
5.0 MB
7
Hubble Deep Field
2.1 GB
181
Hubble Deep Field South
7.8 GB
178
Hubble Helix Observations
13.8 GB
32
Magellanic Cloud Planetary Nebulae
721 MB
1620
OB Stars (Galactic): FUSE Spectral Atlas
30 MB
184
OB Stars (Magellanic): FUSE Spectral Atlas
1.2 MB
66
Pre-Main Sequence Stars: IUE Spectral Atlas
10.7 MB
733
Procyon (FV-IV) Spectral Atlas
1.2 MB
14
Quasar Spectrum HST/FOS
.6 MB
4
Quasar Spectrum FUSE
19.3 MB
1
Search Field from a Search for Kuiper Belt Objects
3.7 GB
8
The Medium Deep Survey
11.9 GB
4726
Ultra Deep Field
30.7 GB
2252
Ultraviolet Images of Nearby Galaxies
728.0 MB
334
WFPC2 Archival Parallels
16.6 GB
4087
alpha Ori Spectral Atlas
4.0 MB
60
chi Lupi Spectral Atlas
22.7 MB
156
TOTAL
181.73 GB
19807

New plotting and graphical display tools:


New protocols and IVOA-related services:


New interface pages and search tools:


Enhancements of User Interface Pages and Tools:


Outreach to the user community:


The Astrophysical Data Centers Executive Council (ADEC): 

ADEC representatives White and Kamp attended an ADEC meeting in Pasadena October 29, 2004. The status of implementation of dataset identifiers at the various archive centers was discussed. The use of ADS identifiers will be advertised in a variety of ways including an announcement at AAS and by announcements in participating journals. There was a brief discussion on the efficacy of reducing the number of scripting languages used by the various ADEC members. The conclusion was that this was not practical, but that ADEC members might share libraries that could be of common use.

Kamp participated in an ADEC telecon on March 25, 2005. ADEC discussed the possible joint NASA/NSF funding of an NVO/all datacenters proposal from FY07. ADEC members planned to prepare slides to show hour our major goals fit into the NASA strategy planning. There was additional discussion of the dataset identifiers. IRSA is setting up the tagging for complex Keck datasets. The link will actually start a query that presents the whole data structure, meaning that one tag is no longer attached to an individual file. Other issues such as the dismissal of a service remain to be resolved. MAST proposed that all data centers impose the FITS standard strictly. Some centers do not have the manpower to impose the FITS standard and use more relaxed criteria for flagging of datasets that are not correct. For example, IRSA checks that the file can be read using the standard cfitsio library from Goddard, while NED checks that files submitted to them can be read by Aladin and other common tools (probably an even weaker criteria). MAST prefers not to distribute data that fails when tested with a more thorough tool such as fitsverify. We reached an agreement to work towards a common datacenter policy.

Coordination activities:

The web service for the MAST Scrapbook was created to provide an easy and timely method for IRSA to see what data were marked as "representative". They use it in a tool they developed based on our Scrapbook. MAST also worked with IRSA to incorporate links to the 2MASS previews within the MAST Scrapbook.


MAST Literature Links:  

The publications database and the links between scientific papers and the referenced MAST datasets are regularly updated as new citations become available through the ADS. Below is a plot showing the number of papers published in referred journals during the reporting period (July 2004 through June 2005).

During the past year, MAST began to track the number of citations per paper in the journal database. We obtain the total number of citations per paper from the ADS. The count is updated at least once per month. Below is a chart that shows the average number of citations for papers published that year. The citation record for IUE from 1978 through 1990 is not included in this plot. The "fall-off" in the average number of citations per paper for those articles published more recently is due to the lag between publication of a paper and citation of a paper in a later publication. Meylan, Madrid and Macchetto published a paper in PASP,116:790 entitled "HST Science Metrics". These authors state that the peak of the citation rate occurs about 2 years after publication.

We show below a plot of the average number of citations per paper over the publication lifetime per mission. (If papers from the years 2004-2005 are excluded, the average number of citations per paper increases.)

STAFFING CHANGES AT MAST

During the year Rachel Somerville left MAST and Alberto Conti scaled back participation in MAST activities. Shui-Ay Tseng joined the staff. Conti and Tseng are partially salaried from the MAST contract.


PLANS FOR COMING YEAR

Future datasets:

MAST will receive no datasets from new missions during the coming year. Data will continue to be relased steadily from FUSE and as GALEX Release 2, in late 2005.

Future Services for Ongoing Missions: