Data Availability

Your SPARC data is here to stay

Guaranteed Data Availability

The SPARC Data and Resource Center (SPARC DRC) is committed to ensuring data availability and longevity for the SPARC community. To support continuous operation beyond the original SPARC Program funding, we are diversifying funding sources and securing additional mechanisms to support different aspects of the Portal. Taken together, these measures form a distributed data storage plan that guarantees all data published to the SPARC repository will remain available for at least 10 years after publication.

AWS Open Data Sponsorship Program

SPARC has been accepted into the Amazon Web Services (AWS) Open Data Sponsorship Program.

For all public data available on the SPARC Portal, AWS covers the cost of storage and transfer. Here's what these significant benefits mean for you:

  • Free long-term storage for your published data
  • Free data transfer and cloud-based access to your own AWS S3 bucket
  • Free data downloads, whether through the SPARC Portal or programmatically

SPARC datasets are now accessible on the Registry of Open Data at AWS - visit SPARC's listing now!

NIH STRIDES Initiative

SPARC provides access to datasets free of charge through the NIH STRIDES Initiative (Data Science at NIH).

Archival Data Availability

In addition, the University of Pennsylvania guarantees availability of SPARC data for 10 years after publication in the event that other efforts are no longer funded. In that scenario, data will continue to be made available through the University of Pennsylvania, and DOIs will be updated so that they continue to point to the datasets.