Organization of a SPARC Dataset
SPARC data is organized into datasets, each containing many files.
A SPARC dataset comprises the following data and structure:
- An experimental protocol that has been submitted to Protocols.io and curated.
- Data files organized in folders by the investigators and curated according to the
SPARC Dataset Structure.
The SPARC Dataset Structure was adapted from the
These files and folders include:
- dataset_description file (xlsx, csv, or json): contains the study metadata that describe the dataset, including but not limited to a short description of the study, contributors, associated journal articles, and a Protocols.io URL.
- subjects file (xlsx, csv, or json): contains information on every subject involved in the data collection.
- samples file (xlsx, csv, or json): contains information about the samples involved in the data collection.
- primary folder: contains folders named to match the identifiers for subjects and/or samples, depending on the study design. See the dataset template for examples.
- docs folder: contains all supporting documents for the dataset, including but not limited to a representative image.
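The layout listed above can be checked with a short script. The following is a minimal sketch, not an official SPARC tool. The function name `find_missing_items`, the base file names `subjects` and `samples`, and the lowercase folder names `primary` and `docs` are assumptions taken from the descriptions above; the dataset template remains the authoritative source for exact names and for which items a given study needs.

```python
from pathlib import Path

# Extensions the text lists for the metadata files.
METADATA_EXTENSIONS = (".xlsx", ".csv", ".json")

# Base names and folder names are assumptions drawn from the list above;
# consult the dataset template for the authoritative names.
METADATA_FILES = ("dataset_description", "subjects", "samples")
FOLDERS = ("primary", "docs")


def find_missing_items(dataset_root):
    """Return the expected files and folders not found directly under dataset_root."""
    root = Path(dataset_root)
    missing = []
    for base in METADATA_FILES:
        # A metadata file counts as present if any listed extension exists.
        if not any((root / (base + ext)).is_file() for ext in METADATA_EXTENSIONS):
            missing.append(base)
    for folder in FOLDERS:
        if not (root / folder).is_dir():
            missing.append(folder + "/")
    return missing
```

Calling `find_missing_items("path/to/dataset")` returns an empty list when every item is present. The sketch only reports what is absent; it does not decide whether a missing item (for example, a samples file in a study without samples) is an error.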
Example of a dataset organized according to the SPARC Dataset Structure
A SPARC dataset also comprises experimental metadata specified by the SPARC Data Standards Committee, based on the Minimal Information for a Neuroscience Dataset (MINDS) specification. MINDS metadata fields have been incorporated into the subjects and samples templates available in the zip file. An annotated list of these fields can be found here.