Data Upload FAQs

Uploading data

Can I upload to SPARC from the Cloud? My data are in BOX, Google Drive, or other cloud-based services.

We understand that you can't keep all data on your laptop. Many investigators utilize institutional resources, such as Box, Google Drive, Amazon Web Storage (AWS), and local storage servers. SPARC utilizes the Pennsieve platform (maintained by DAT-Core) for data upload. Pennsieve relies on AWS. We are currently working on a solution for loading data directly from other AWS buckets. For questions about this process, please contact DAT-Core. If you'd like the Curation team to help you manage this today, please contact ([email protected]). Our team is here to help you.


Large dataset??? -- review best practices

How much will it cost for users to download my large dataset?

As you upload your dataset, keep in mind that people pay to download your dataset, which could influence its reuse. As of June 2024, downloading costs are ~$90/TB. To encourage reuse of large datasets, include meaningful descriptions, methods, and metadata, as well as opportunities for people to preview some of your data on the SPARC Portal.


NO MORE! AWS Open Data Sponsorship Program


SODA installation Troubleshooting

slide 13, 14, 15 - https://docs.google.com/presentation/d/1yZUdH3-gxjD4-1j5IIBL8JN9JHKEVZAenadB9CKK6R0/edit#slide=id.g25798e7dc0b_0_104