Mini version of Flagship Dataset of Type 2 Diabetes from the AI-READI Project
This dataset is a subset of v3.0.0 of the AI-READI dataset containing data from 100 participants, created by popular demand solely to help develop pipelines before downloading the full dataset.

Overview of the study
The Artificial Intelligence Ready and Equitable Atlas for Diabetes Insights (AI-READI) project seeks to create a flagship ethically-sourced dataset to enable future generations of artificial intelligence/machine learning (AI/ML) research to provide critical insights into type 2 diabetes mellitus (T2DM), including salutogenic pathways to return to health.
Description of the dataset
This dataset is a subset of v3.0.0 of the AI-READI dataset containing data from 100 participants, created by popular demand solely to help develop pipelines before downloading the full dataset. Refer to v3.0.0 of the AI-READI dataset for details about the structure, standards, related resources, and more.
The dataset contains 16,505 files and is around 200 GB in size.
A detailed description of the dataset is available in the AI-READI documentation for v3.0.0 of the dataset at docs.aireadi.org.
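Since the subset is meant for pipeline development, one simple first step is to check that a local copy is complete. The following is a minimal sketch, not part of the official AI-READI tooling: it walks a directory and reports the file count and total size so they can be compared against the figures stated above. The function name `summarize_dataset` and the example path are hypothetical, and the sketch assumes the dataset has been extracted to a local folder.

```python
from pathlib import Path

# Figure quoted in the dataset description; the size is only approximate
# ("around 200 GB"), so only the file count is compared exactly.
EXPECTED_FILE_COUNT = 16_505


def summarize_dataset(root):
    """Return (file_count, total_bytes) for all regular files under root."""
    file_count = 0
    total_bytes = 0
    for path in Path(root).rglob("*"):
        if path.is_file():
            file_count += 1
            total_bytes += path.stat().st_size
    return file_count, total_bytes


def report(root):
    """Print a short summary and return True if the file count matches."""
    file_count, total_bytes = summarize_dataset(root)
    size_gb = total_bytes / 1e9  # decimal gigabytes
    print(f"{file_count} files, {size_gb:.1f} GB under {root}")
    return file_count == EXPECTED_FILE_COUNT


# Example call (the path is an assumption; point it at your local copy):
# report("/data/aireadi-mini")
```

A mismatch in the file count may simply mean the copy includes extra local files (for example, logs or caches), so treat the result as a hint rather than a strict validation.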
Protocol
The protocol followed for collecting the data can be found in the AI-READI documentation for v3.0.0 of the dataset at docs.aireadi.org.
Dataset access/restrictions
Accessing the dataset requires several steps, including:
- Logging in through a verified ID system.
- Agreeing to use the data only for type 2 diabetes-related research.
- Agreeing to the license terms, which set certain restrictions and obligations for data usage (see 'License' section below).
License
This work is licensed under a custom license specifically tailored to enable the reuse of the AI-READI dataset (and other clinical datasets) for commercial or research purposes while putting strong requirements around data usage, security, and secondary sharing to protect study participants, especially when data is reused for artificial intelligence (AI) and machine learning (ML) related applications. More details are available in the License file included in the dataset and also available at https://doi.org/10.5281/zenodo.10642459.
How to cite
This dataset was created for pipeline development only and should not be used for conducting scientific investigations.
Contact
For any questions, suggestions, or feedback related to this dataset, please go to https://aireadi.org/contact.
Acknowledgement
The AI-READI project is supported by NIH grant 1OT2OD032644 through the NIH Bridge2AI Common Fund program.
Usage statistics
2.01 TB
165,051 Files
License
Health Data License
Keywords
Citation
When using this resource, please cite:
Versions
Nov 8, 2024