General info for using ABCD data
Important
Only users registered with a named application can use this data. Check with Dr Heather Whalley if you have any question. Watch this video to know more about data protection.
ABCD protocol papers
Data dictionary for release 2.0.1
A summary of ABCD study
Data location
- Eddie: /exports/igmm/eddie/GenScotDepression/data/abcd/release2.0.1
- Eddie datastore: /exports/igmm/datastore/GenScotDepression/data/abcd/release2.0.1
- CMVM datastore:
Important
You should check with Dr. Heather Whalley to see if you have been added to the data usage agreement (i.e. permission to use the data). Once you have permission, contact Dr. Mark Adams for access to the GenScotDepression folder.
Data in these folders are the same. You can load from whichever location you prefer.
Within the data folder, you will be able to see:
- i.files_from_web: raw curated data from the NDA database in plain text format (not recommended to use).
- ii.docs: supporting documentations from the NDA database.
- iii.data: curated data sorted by category. Data is stored in rds format (recommended).
Data download
This section describes procedures for downloading data used by data managers. Those who only wish to use the data can omit this section.
Data
Downloaded
In subfolder 'i.files_from_web'
Procedure:
- Go to: https://nda.nih.gov/abcd
- Login with NDA credentials and go to 'my dashboard'
- Click username at the topright of the page to go to your profile
- Go to 'Data packages' and select the packages to download (add to 'My Data Packages' using tab 'Actions')
- Click 'Download Manager'
- Run the downloaded 'Download Manager'
- Login with NDA credentials
- Download data
Note
It takes a while before the added packages appears in your account. Large data may take up to around 30 mins.
Remember that including bulk data may take up a very large space
The latest curated data packages downloaded are: ABCDstudyNDA
Supporting documentations
Downloaded
In subfolder 'ii.docs'
Summary of the latested release
Check the hyperlinks 'DEAP-NDA variables mapping' and 'Release Notes' for more information about the current dataset.
Segment data by category
-
Summary
- process_files.R is the default R script for all main steps
- baseline.R and baseline_imputation.R are functions for baseline measures
- Other files are used by process_files.R
-
Files essential for process_files.R (all need manual check for each data release)
- Instruments.ABCD_summary_web_dictionary.csv: This file should be created manually. Copy contents from URL
- ls.category: a list of object names and categories, a master list of all object names that ABCD ever released
- ls.category.PreiousVersionOnly: like ls.category, but contains the objects only release in a previous version
- ls.category.CurrentVersion: like ls.category, but contains objects released in the lasted version
- ls.faultyobjects: sometimes ABCD have objects that need to be removed (e.g. some imaging files from release 2.0, fixed in release 2.0.1)