Skip to content

clinical-meteor/Medical-Imaging-Datasets

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

792 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Awesome Public Medical Imaging Datasets

Table of Contents

Introduction

This repository is a collection of publicly available medical imaging datasets. It aims to provide a comprehensive and valuable resource for researchers, healthcare professionals, and developers working in the field of medical imaging analysis. I wrote about this collection on my Medium blog.

  • Leaderboard The link of leaderboard.
  • paper The link of related papers.
  • licence The licence of the dataset.
  • licence The link is down. Let me know if there's a new one.

NumberOfDataSet


Head and Neck

Brain

  • 3D-2D-GS-CA
    A database of 3D-DSA, 2D-DSA and 2D-MAX images for 10 patients with two cerebrovascular pathologies and corresponding gold standard registrations obtained by aligning fiducial markers.
    licence CC BY-NC-ND

  • 3D-MR-MS
    A database of three-dimensional (3D) magnetic resonance (MR) images of multiple sclerosis (MS) patients with corresponding consensus based ground truth segmentations of white matter lesions.
    paper
    licence CC BY-NC-ND

  • 3D VoTEM (3-D Validation of Tractography with Experimental MRI)
    It has three subset challenges.
    Keyboard: Diffusion MRI, Labeled

  • 7-Tesla resting-state fMRI test-retest
    22 participants were scanned during two sessions spaced one week apart.
    Keyboard: High field fMRI, Labeled
    paper
    licence CC0

  • AANLIB
    Harward Atlas the Whole Brain
    Keyboard: Multi-modality
    licence Commercial reproduction or multiple distribution of any kind is prohibited.

  • ABCD Neurocognitive Prediction
    T1-weighted MRI scans and fluid intelligence scores for children aged 9–10 year
    Keyboard: MRI, Segmentation, Labeled
    paper

  • ABIDE (Autism Brain Imaging Data Exchange)
    Keyboard: Autism spectrum disorders (ASDs), MRI
    paper
    licence CC BY-NC-SA 3.0

  • ACNS0332
    Chemotherapy and Radiation Therapy in Treating Young Patients With Newly Diagnosed, Previously Untreated, High-Risk Medulloblastoma/PNET
    paper

  • ACPI DU LaBar (Addiction Connectome Preprocessed Initiative)
    This dataset includes Scan Parameters, Demographic Information and Demographic Key
    licence Attribution - Non-Commercial

  • ACPI MTA (Addiction Connectome Preprocessed Initiative)
    Multimodal Treatment of Attention Deficit Hyperactivity Disorder (MTA) - Preprocessed
    licence Attribution - Non-Commercial

  • ACPI NYU (Addiction Connectome Preprocessed Initiative)
    These data were collected to study functional and structural connectivity in cocaine addiction. This release contains R-fMRI and behavioral assessments and phenotypic information data from 29 cocaine-dependent individuals and 24 healthy comparison participants.
    licence Attribution - Non-Commercial

  • ADAM (Aneurysm Detection And segMentation)
    Detection of unruptured intracranial aneurysms and segmentation of unruptured intracranial aneurysms from Time of Flight MRAs (TOF-MRAs).
    Leaderboard

  • ADHD-200 (Attention Deficit Hyperactivity Disorder)
    776 resting-state fMRI and anatomical datasets aggregated across 8 independent imaging sites.
    Leaderboard | paper
    licence Consistent with the policies of the 1000 Functional Connectome Project

  • ADNI (Alzheimer's Disease Neuroimaging Initiative)
    Keyboard: Multi-modality
    paper

  • AIMS-TBI (Automated Identification of Mod-Sev Traumatic Brain Injury)
    It is an extension of the dataset ToothFairy
    Keyboard: MRI, Segmentation, Labeled
    Leaderboard

  • Age-ility
    This data set consists of 136 subjects
    Keyboard: MRI, EEG
    paper
    licence Attribution Non-Commercial Share Alike

  • ANT
    It contains 46 healthy aging participants and participants with Parkinson's disease at two sessions each.
    Keyboard: MRI
    licence CC0

  • AOMIC (the Amsterdam Open MRI Collection)
    It is a collection of three datasets with multimodal (3T) MRI data
    Keyboard: MRI
    paper Dataset is described

  • APIS
    A Paired CT-MRI Dataset for Ischemic Stroke Segmentation
    paper
    licence CC BY 4.0

  • ATLAS R1.1 (Anatomical Tracings of Lesions After Stroke)
    An dataset of 229 T1-weighted MRI scans (n=220) with manually segmented lesions and metadata.
    paper

  • ATLAS R2.0 (Anatomical Tracings of Lesions After Stroke)
    A larger dataset of T1w MRIs and manually segmented lesion masks
    Keyboard: MRI, Segmentation, Labeled
    Leaderboard | paper

  • Beijing Enhanced
    These data include 180 healthy controls from a community sample.
    Keyboard: resting state fMRI
    licence Attribution - Non-Commercial

  • Beijing Short TR Sample
    Data is obtained with a short TR (0.4 seconds) and a long TR (2.0 seconds).
    Keyboard: Resting state fMRI
    licence Attribution - Non-Commercial

  • BeijingEOEC (Eyes Open Eyes Closed)
    These data include 48 healthy controls from a community (student) sample.
    Keyboard: Resting state fMRI
    licence Attribution - Non-Commercial

  • BHSD (Brain Hemorrhage Segmentation Dataset)
    A 3D multi-class ICH dataset containing 192 volumes with pixel-level annotations and 2200 volumes with slice-level annotations across five categories of ICH.
    Keyboard: Intracranial hemorrhage (ICH), CT scan, Labeled
    paper

  • BigBrain
    Microscopic resolution 3D model of the human brain.
    Keyboard: X-ray, Labeled
    licence CC BY-NC-SA 4.0

  • BONBID-HIE2023 (BOston Neonatal Brain Injury Dataset for Hypoxic Ischemic Encephalopathy)
    Keyboard: MRI, Segmentation, Labeled
    Leaderboard | paper Data Descriptor
    licence CC BY NC ND

  • BONBID-HIE2024 (BOston Neonatal Brain Injury Dataset for Hypoxic Ischemic Encephalopathy)
    Keyboard: Infant brain, MRI, Segmentation, Labeled
    Leaderboard
    licence CC BY NC ND

  • BR35H
    Brain Tumor Detection
    Keyboard: MRI, Detection, Classification, Labeled

  • Brain Tumor Classification
    Classify MRI images into four classes
    Keyboard: MRI, Labeled

  • Brian Tumor Dataset
    This dataset consists of the scanned images of brain of patient diagnosed of brain tumour.
    Keyboard: X-ray, Cancer, Labeled
    licence GPL 2

  • brain tumor dataset
    Containing 3064 T1-weighted contrast-inhanced images from 233 patients with three kinds of brain tumor: meningioma, glioma, and pituitary tumor.
    Keyboard: Cancer, MRI, Labeled
    paper | paper
    licence CC BY 4.0

  • Brain-Tumor-Progression
    Each patient had two MR exams acquired: within ninety days after completing chemi-radiation therapy and at the progression state which was based on the integration of the clinical performance and/or imaging outcomes.
    Keyboard: Cancer, MRI, Labeled
    paper
    licence TCIA Restricted

  • BrainMetShare
    The dataset includes 156 whole brain MRI studies, including high-resolution, multi-modal pre- and post-contrast sequences in patients with at least 1 brain metastasis.
    Keyboard: Detection, MRI, Segmentation, Labeled
    paper
    licence Their Research Use Agreement, as well as to the Terms of Use of the Stanford University School of Medicine website

  • BrainPTM 2021 (Brain Pre-surgical white matter Tractography Mapping)
    Data consists of 75 cases
    Keyboard: MRI, Cancer, Segmentation, Labeled
    Leaderboard

  • BRAINS (Brain Images of Normal Subjects)
    Keyboard: MRI
    paper

  • BRATS2012 BrokenLink (Brain Tumor Segmentation)
    The tumor and edema regions have been manually delineated.
    Keyboard: Multimodal MRI, Cancer, Labeled
    paper

  • BRATS2013 BrokenLink (Brain Tumor Segmentation)
    A collection of 60 de-identified clinical cases.l2
    Keyboard: Multiparametric, MRI, Cancer, Labeled
    paper

  • BRATS2014 BrokenLink (Brain Tumor Segmentation)
    Keyboard: MRI, Cancer, Labeled
    paper

  • BRATS2015 BrokenLink (Brain Tumor Segmentation)
    Keyboard: MRI, Cancer, Labeled

  • BRATS2017 (Brain Tumor Segmentation)
    Keyboard: MRI, Cancer, Labeled

  • BRATS2018 (Brain Tumor Segmentation)
    The dataset utilizes multi-institutional pre-operative MRI scans and focuses on the segmentation of intrinsically heterogeneous brain tumors. Furthemore, it also focuses on the prediction of patient overall survival, via integrative analyses of radiomic features and machine learning algorithms.
    Keyboard: MRI, Cancer, Labeled

  • BRATS2019 (Brain Tumor Segmentation)
    Keyboard: MRI, Cancer, Labeled
    paper

  • BRATS2020 (Brain Tumor Segmentation)
    Keyboard: MRI, Cancer, Labeled
    paper

  • BRATS2021 (Brain Tumor Segmentation)
    Keyboard: MRI, Cancer, Labeled
    paper

  • BRATS2022 (Brain Tumor Segmentation)
    Keyboard: MRI, Cancer, Labeled
    Leaderboard

  • BRATS2023 (Brain Tumor Segmentation)
    This version addressing additional, populations, tumors (e.g., meningioma), clinical concerns, and technical considerations.
    Keyboard: MRI, Cancer, Labeled
    Leaderboard

  • BRATS2024 (Brain Tumor Segmentation)
    This dataset is substantially expanded to ~4,500 cases towards addressing additional populations, tumors, clinical concerns, and technical considerations.
    Keyboard: MRI, Cancer, Labeled
    Leaderboard

  • CADA (Cerebral Aneurysm Detection and Analysis)
    Data of patients with cerebral aneurysms without vasospasm were collected for diagnostic and treatment decision purposes.
    Keyboard: X-ray rotational angiography (3DRA), Segmentation, Labeled
    Leaderboard | paper
    licence CC BY-NC-ND 4.0

  • CADDementia (Computer-Aided Diagnosis of Dementia)
    Keyboard: Alzheimer's disease (AD), MRI
    Leaderboard | paper | paper
    licence The data of the evaluation framework may only be used for the evaluation of methods for computer-aided diagnosis dementia through this challenge.

  • Calgary-Campinas
    It is comprised of 359 datasets, approximately 60 subjects per vendor and magnetic field strength.
    Keyboard: MRI, Segmentation, Labeled
    paper | paper

  • Cam-CAN (Cambridge Centre for Ageing and Neuroscience)
    Nearly 700 adults were scanned using structural Magnetic Resonance Imaging, functional MRI, magnetoencephalography, and completed multiple cognitive experiments.
    Keyboard: lifespan, MRI, fMRI, MEG
    paper | paper

  • CAUSE07 (Caudate Segmentation Evaluation 2007)
    Keyboard: MRI
    licence CC0

  • CEREBRuM (Convolutional Encoder-decodeR for Fully Volumetric Fast sEgmentation of BRain MRI)
    paper
    licence CC0

  • Changes Associated with Heavy Cannabis Use
    T1-weighted structural MRI study of cannabis users at baseline and 3 years follow up.
    Keyboard: MRI
    paper
    licence CC BY-NC

  • Cleveland CCF
    It includes 31 control adults (11M/20F; ages: 24-60). In addition to the resting state scan this sample includes physiological measurements (heart rate and breathing) obtained during the resting state scan.
    Keyboard: Resting state fMRI (R-fMRI)
    licence Attribution - Non-Commercial

  • CMI-HBN (Child Mind Institute Healthy Brain Network)
    Data from 10,000 children and adolescents (ages 5-21).
    Keyboard: Neuroimaging, MRI, EEG
    paper Data Descriptor

  • COBRE (Center for Biomedical Research Excellence)
    functional MR data from 72 patients with Schizophrenia and 75 healthy controls (ages ranging from 18 to 65 in each group)
    Keyboard: fMRI
    licence Attribution - Non-Commercial

  • Computed Tomography Images for Intracranial Hemorrhage Detection and Segmentation
    A dataset of 82 CT scans was collected, including 36 scans for patients diagnosed with intracranial hemorrhage (ICH).
    Keyboard: CT scan, Labeled
    paper
    licence PhysioNet Restricted Health Data License 1.5.0

  • CoRR (Consortium for Reliability and Reproducibility)
    It has aggregated 1,629 typical individuals resting state fMRI data.
    Keyboard: Resting state fMRI (rfMRI)
    paper

  • CPTAC-GBM (Clinical Proteomic Tumor Analysis Consortium Glioblastoma Multiforme)
    Keyboard: Multi-modality, Cancer
    licence CC BY 3.0 - TCIA Restricted

  • CQ500
    A dataset of 491 scans with 193,317 slices
    Keyboard: CT Scan
    paper

  • Cross-Sectional Multidomain Lexical Processing
    This dataset explores the neural mechanisms and development of lexical processing through task based fMRI of rhyming, spelling, and semantic judgement tasks in both the auditory and visual modalities.
    Keyboard: fMRI
    paper
    licence CC0

  • crossMoDA 2021 (Cross-Modality Domain Adaptation)
    The goal is to segment two key brain structures involved in the follow-up and treatment planning of vestibular schwannoma (VS): the tumour and the cochlea
    Keyboard: MRI, Segmentation
    Leaderboard | paper

  • crossMoDA 2022 (Cross-Modality Domain Adaptation)
    The goal is to segment two key brain structures involved in the follow-up and treatment planning of vestibular schwannoma (VS): the tumour and the cochlea, and to automatically classify hrT2 images with VS according to the Koos grade
    Keyboard: MRI, Segmentation
    Leaderboard

  • crossMoDA 2023 (Cross-Modality Domain Adaptation)
    The 2023 edition extends the segmentation task by including multi-institutional, heterogenous data acquired for routine surveillance purposes and introduces a sub-segmentation for the tumour (intra- and extra-meatal components) thereby leading to a 3 class problem.
    Keyboard: MRI, Segmentation

  • cSeg-2022
    Multi-domain Cross-time-point Infant Cerebellum MRI Segmentation
    Keyboard: Labeled
    Leaderboard

  • CUNMET (Clínica Universidad de Navarra Methylphenidate)
    Examination of the neural correlates of differential treatment response to stimulants (methylphenidate and lisdexamfetamine) in boys and girls with ADHD treated in a naturalistic context.
    Keyboard: MRI, Resting state fMRI, Perfusion/arterial spin labeling (ASL)

  • Curious2022
    Keyboard: MRI, Intra-operative Ultrasound (iUS), Segmentation, Cancer
    Leaderboard

  • dbGaP (Genotypes and Phenotype)
    Study about Neurodevelopmental Genomics: Trajectories of Complex Phenotypes
    Keyboard: MRI, Multimodal Neuroimaging
    paper | paper

  • DFCI-BCH-BWH-PEDs-HGG
    MR imaging of pediatric subjects with high-grade gliomas. It is a subset of the BraTS-PEDs 2023 challenge
    Keyboard: Cancer
    licence CC BY 4.0

  • DLBS (Dallas Lifespan Brain Study)
    350 healthy adults, aged 20-89 who are thoroughly characterized in terms of cognition, brain structure and brain function across the adult lifespan
    Keyboard: MRI, PET, Cognitive Data
    licence Attribution - Non-Commercial

  • EGD (Erasmus Glioma Database)
    It is a collection of 774 patients with glioma.
    Keyboard: MRI, Cancer

  • EPISURG
    A dataset of postoperative MRI images for quantitative analysis of resection neurosurgery for refractory epilepsy.
    paper
    licence CC BY-NC-SA 4.0

  • FeTA (Fetal Tissue Annotation)
    A dataset of manually segmented pathological and non-pathological fetal magnetic resonance brain volume reconstructions across a range of gestational ages into different tissue categories
    Keyboard: MRI, Labeled, Segmentation
    paper

  • FeTS 2022 (Federated Tumor Segmentation)
    Keyboard: multi-parametric MRI (mpMRI), Cancer, Labeled
    Leaderboard

  • FeTS 2024 (Federated Tumor Segmentation)
    FeTS borrows its data from the BraTS Continuous Evaluation, but additionally providing a data partitioning according to the acquisition origin for the training data.
    Keyboard: multi-parametric MRI (mpMRI), Cancer, Labeled
    Leaderboard

  • FIND Lab ( Functional Imaging in Neuropsychiatric Disorders)
    This dataset is comprised of 13 subjects, ages 18-29, 8 female, with both strutural and functional MRI. The functional paradigms collected are as Episodic Memory, Music, Subtraction
    paper
    licence Attribution - Non-Commercial

  • GMSC (Grey matter segmentation challenge)
    Keyboard: MRI, Labeled
    paper
    licence Data is intended for research and educational purposes only

  • Gray matter segmentation at 7T MRI
    The dataset consist of 7 Tesla MRI anatomical images of living human brains and hand labeled cortical gray matter images.
    Keyboard: High field MRI, Labeled, Segmentation
    paper
    licence CC BY 4.0

  • GSP (Genomics Superstruct Project)
    Personality and cognitive measures were obtained on a subset of participants. Each dataset contains a T1-weighted structural MRI scan and either one (n=1,570) or two (n=1,139) resting state functional MRI scans.
    Keyboard: MRI
    paper

  • HARDI 2012 BrokenLink
    Keyboard: Diffusion MRI
    paper

  • HARDI 2013 BrokenLink
    It focuses on the effect of the local reconstruction accuracy on the quality of connectivity reconstruction.
    Keyboard: Diffusion MRI

  • HBN-SSI (Healthy Brain Network Serial Scanning Initiative)
    The primary goal is to assess and compare test-retest reliabilities for full-brain connectivity patterns derived from functional MRI data obtained during different scan conditions.

  • HCP (Human Connectome Project)
    Keyboard: MRI
    paper

  • Head CT - hemorrhage
    This dataset contains 100 normal head CT slices and 100 other with hemorrhage. No distinction between kinds of hemorrhage.
    Keyboard: CT scan, Labeled
    licence CC0: Public Domain

  • Hippocampus Segmentation
    This dataset contains T1-weighted MR images of 50 subjects, 40 of whom are patients with temporal lobe epilepsy and 10 are nonepileptic subjects.
    Keyboard: MRI, Labeled
    paper
    licence The dataset is free to use for research and education.

  • HNU Short TR
    Short-TR Eyes-open/Eyes-closed Resting State fMRI Data
    paper
    licence Attribution - Non-Commercial

  • Human V4 size predicts crowding distance
    Paired measurements of brain and behavior in 50 observers
    Keyboard: fMRI
    paper
    licence CC0

  • Hypothalamus Segmentation
    1343 hypothalamus masks from three different datasets
    paper

  • IBSR (Internet Brain Segmentation Repository)
    Manually-guided expert segmentation results along with magnetic resonance brain image data
    Keyboard: MRI, Labeled
    licence Free For Non-Commercial Use Only

  • INSTANCE2022 (INtracranial hemorrhage SegmenTAtioN ChallengE)
    A training set of 100 cases with ground-truth and a validation set with 30 cases without ground-truth labels.
    Keyboard: Intracranial hemorrhage (ICH), CT Scan, Labeled
    Leaderboard | paper

  • iSeg2017
    6 month old Infant Brain Segmentation
    Keyboard: MRI, Labeled
    Leaderboard | paper

  • iSeg2019
    6 month old Infant Brain Segmentation
    Keyboard: MRI, Labeled
    Leaderboard | paper

  • ISLES (Ischemic Stroke Lesion Segmentation)
    It has multi versions in 2015 to 2018
    Keyboard: MRI
    paper

  • ISLES'22 (Ischemic Stroke Lesion Segmentation)
    Multimodal MRI infarct segmentation in acute and sub-acute stroke
    Keyboard: MRI
    Leaderboard

  • IXI
    This dataset have been collected nearly 600 MR images from normal, healthy subjects.
    Keyboard: MRI
    licence CC BY-SA 3.0

  • LGG-1p19qDeletion
    It performed in 159 subjects with Low Grade Gliomas.
    Keyboard: MRI, Segmentation, Labeled
    paper
    licence TCIA Restricted - CC BY 3.0

  • long-MR-MS
    A database of longitudinal magnetic resonance (MR) images of patients with multiple sclerosis (MS) with corresponding ground truth segmentations of white matter lesion changes.
    licence CC BY-NC-ND

  • Longitudinal Neuroimaging on Arithmetic Processing
    Brain Correlates of Math Development in Children.
    Keyboard: MRI
    paper
    licence CC0

  • Longitudinal Neuroimaging on Multisensory Lexical Processing
    Longitudinal Brain Correlates of Multisensory Lexical Processing in Children.
    Keyboard: MRI
    paper
    licence CC0

  • M4Raw
    A multi-contrast, multi-repetition, multi-channel MRI k-space dataset for low-field MRI research.
    Keyboard: 0.3 Tesla MRI
    paper
    licence CC BY 4.0

  • Maclaren test-retest brain volume
    The dataset comprises three participants, each of whom was scanned 40 times.
    Keyboard: MRI
    paper
    licence CC0

  • MASSIVE (Multiple Acquisitions for Standardization of Structural Imaging Validation and Evaluation)
    The database consist of 8000 diffusion-weighted volumes and ten 3D FLAIR, T1-, and T2-weighted datasets of a single healthy subject.
    Keyboard: diffusion MRI
    paper

  • MEMENTO (Mri whitE Matter rEcoNstrucTiOn)
    The aim is evaluating and advancing the state of the microstructural modeling field.
    Keyboard: Diffusion MRI

  • MICA-MICs (Microstructure-Informed Connectomics)
    The dataset provides raw and fully processed multimodal neuroimaging data acquired in 50 healthy control participants at a filed strength of 3T.
    Keyboard: multimodal MRI
    paper
    licence CC0

  • Mindboggle
    Manually labeled human brain image data.
    Keyboard: MRI, Labeled
    paper
    licence CC BY 4.0

  • MIRIAD (Minimal Interval Resonance Imaging in Alzheimer's Disease)
    Dataset is a series of longitudinal volumetric T1 MRI scans of 46 mild–moderate Alzheimer's subjects and 23 controls.
    Keyboard: Alzheimer's disease (AD), MRI
    paper Overview
    licence BIRN Data License

  • mm-brain-MR
    A database of simulated multimodal (mm) MR images of brains with tumors of varying volumes with anatomical ground truth.
    Keyboard: Cancer, Labeled
    licence CC BY-NC-ND

  • MMRR (Multi-Modal MRI Reproducibility Resource)
    Scan-rescan imaging sessions on 21 healthy volunteers.
    Keyboard: MRI, resting state fMRI
    paper

  • MNI-HiSUB25 (Montreal Neurological Institute)
    A multi-contrast and submillimetric 3-Tesla MRI hippocampal subfield segmentation protocol and dataset
    paper
    licence Attribution - Non-Commercial

  • MPILMBB (Max Planck Institut Leipzig Mind Brain Body)
    A functional connectome phenotyping dataset including cognitive state and personality measures.
    Keyboard: MRI, Cognitive Data
    paper
    licence CC0

  • MPI-LEMON
    It presents a dataset of 228 healthy participants comprising a young and an elderly group acquired cross-sectionally to study mind-body-emotion interactions.
    Keyboard: MRI, EEG
    paper

  • MRBrainS13
    Evaluation Framework for Brain Image Segmentation in 3T MRI Scans
    paper

  • MRBrainS18
    The purpose is to directly compare methods for segmentation of gray matter, white matter, cerebrospinal fluid, and other structures on 3T MRI scans of the brain, and to assess the effect of (large) pathologies on segmentation and volumetry.
    Leaderboard

  • MRI and Alzheimers
    Magnetic Resonance Imaging Comparisons of Demented and Nondemented Adults
    Keyboard: Alzheimer's Disease (AD), Labeled

  • MS (Multiple sclerosis)
    82 data sets had the white matter lesions associated with multiple sclerosis delineated by two human expert raters.
    Keyboard: MRI, Labeled, Segmentation
    paper | paper

  • MSC (Midnight Scan Club)
    This dataset focused on the precise characterization of ten individual subjects via collection of large amounts of per-individual data.
    Keyboard: Resting-state fMRI, MRI, Neuropsychological testing
    paper
    licence CC0

  • MSSeg 2008
    The goal is to compare algorithms to segment the multiple sclerosis (MS) lesions.
    Keyboard: MRI, Segmentation

  • MSSEG 2016
    A total of 100 multiple sclerosis patients
    Keyboard: MRI, Segmentation
    paper

  • MTOP2016 BrokenLink (Mild Traumatic Brain Injury Outcome Prediction)
    Keyboard: MRI, Labeled

  • Multi-shell diffusion MRI
    It was collected from three traveling subjects with identical acquisition setting in ten imaging centers.
    Keyboard: MRI
    paper
    licence CC BY 4.0

  • Multimodal MRI of chess players
    It is a MRI dataset of 29 professional Chinese chess players.
    licence Attribution - Non-Commercial

  • Narratives
    fMRI data for evaluating models of naturalistic language comprehension.
    Keyboard: fMRI, Labeled
    licence CC0

  • Naturalistic Viewing
    The dataset represents simultaneously collected electroencephalography (EEG) and function magnetic resonance imaging (fMRI) recordings obtained from 22 individuals between the ages of 23 and 51 years-old.
    paper
    licence CC BY 4.0

  • NCANDA-A (National Consortium on Alcohol and Neurodevelopment in Adolescence - Adulthood)
    The aim is to determine the effects of alcohol use on the developing adolescent brain, and examine brain characteristics that predict alcohol use problems.
    paper | paper

  • NEO2012
    The dataset consists of male and female adults, all healthy controls with no psychiatric history used in the 2011 PLoS ONE study.
    paper
    licence Attribution - Non-Commercial

  • NeuAtlas Labeled Brain Scans
    Keyboard: MRI, Labeled, Segmentation

  • NeuroImage article by Power et al.
    The dataset consists of children, adolescents, and adults, all of which are controls with no diagnosis.
    Keyboard: MRI
    paper
    licence Attribution - Non-Commercial

  • NKI-RS (Nathan Kline Institute-Rockland Sample)
    NKI-RS is an ongoing, institutionally centered endeavor aimed at creating a large-scale (N > 1000), deeply phenotyped, community-ascertained, lifespan sample (ages 6–85 years old) with advanced neuroimaging and genetics.
    Keyboard: MRI
    paper

  • North Shore - LIJ
    It includes 6 patients with medically intractable epilepsy that underwent implantation of intracranial electrodes for seizure onset localization prior to resective neurosurgery.
    Keyboard: Resting state fMRI (R-fMRI)
    paper
    licence Attribution - Non-Commercial

  • NSD (Natural Scenes Dataset)
    High-resolution fMRI responses to tens of thousands of richly annotated natural scenes
    Keyboard: fMRI, Labeled
    paper Description of the dataset

  • NYUIQ
    It consists of datasets from 49 psychiatrically neurotypical adults, with age, gender and intelligence quotient (IQ) information provided.
    Keyboard: T1 weighted MRI, Resting state fMRI scans (R-fMRI)
    licence Attribution - Non-Commercial

  • OASIS (Open Access Series of Imaging Studies)
    It has multi versions.
    Keyboard: Multi modality, Neuroimaging
    paper | paper | paper | paper

  • Parkinson's Disease Datasets
    The data are comprised of 27 PD patients and 16 age-matched normal controls in the Neurocon dataset, and 20 PD patients and 20 age-matched controls in the Tao Wu dataset. Both sets contain T1 and resting-state scans.
    paper
    licence CC BY-NC-SA

  • PERFORM
    Functional Magnetic Resonance Imaging (fMRI), electroencephalography (EEG), sleep and nutrition assessments were performed on one male control subject.

  • PING (Pediatric Imaging, Neurocognition, and Genetics)
    The study includes 1400 children between the ages of 3 and 20 years so that links between genetic variation and developing patterns of brain connectivity can be examined.
    Keyboard: MRI
    paper

  • PPMI (Parkinson’s Progression Markers Initiative)
    Data from Parkinson’s Disease, Prodromal Cohort, and Healthy Controls.
    paper

  • Prenatal brain
    It was collected from three traveling subjects with identical acquisition setting in ten imaging centers.
    Keyboard: fetal MRI, Segmentation
    licence CC0 1.0

  • PREVENT-AD (Pre-symptomatic Evaluation of Experimental or Novel Treatments for Alzheimer Disease)
    Keyboard: MRI, Labeled
    paper | paper | paper

  • PRIME-DE (Data Exchange)
    A data collections for nonhuman primate imaging

  • QIN GBM Treatment Response
    It collection contains double baseline multi-parametric MRI images collected on patients with newly diagnosed glioblastoma.
    Keyboard: Cancer
    licence TCIA Restricted

  • Quiron-Valencia
    The first release includes data for 45 participants. Each participant has an anatomical as well as a resting state fMRI scan.
    Keyboard: Resting state fMRI (R-fMRI)
    licence Attribution - Non-Commercial

  • RealNoiseMRI
    Evaluating the performance of markerless prospective motion correction and selective reacquisition in a general clinical protocol
    Keyboard: MRI
    licence CC0

  • REMBRANDT
    It contains data generated through the Glioma Molecular Diagnostic Initiative from 874 glioma specimens comprising approximately 566 gene expression arrays, 834 copy number arrays, and 13,472 clinical phenotype data points.
    Keyboard: MRI
    licence TCIA Restricted - CC BY 3.0

  • RESECT BrokenLink (REtroSpective Evaluation of Cerebral Tumors)
    A clinical database of pre-oper, ative MRI and intra-operative ultrasound in low-grade glioma surgeries
    Keyboard: Cancer, Registration, Labeled
    paper
    licence CC BY 4.0

  • RIDER NEURO MRI (Reference Image Database to Evaluate Therapy Response)
    It contains data on 19 patients with recurrent glioblastoma who underwent repeat imaging sets.
    Keyboard: Cancer
    licence TCIA Restricted - CC BY 3.0

  • RSNA Brain Tumor (Radiological Society of North America 2021)
    A dataset for brain tumor segmentation and radiogenomic classification
    Keyboard: MRI, Labeled

  • RSNA Intracranial Hemorrhage Detection (Radiological Society of North America 2019)
    A dataset of more than 25,000 annotated cranial CT exams
    Keyboard: CT scan, Labeled
    paper

  • SALD (Southwest University Adult Lifespan Dataset)
    494 healthy adults (age range: 19-80 years; Males=187) were recruited and completed two multi-modal MRI scan sessions.
    Keyboard: MRI, resting-state functional MRI (rs-fMRI)
    paper Detailed description
    licence Attribution - Non-Commercial

  • SCA2 Diffusion Tensor Imaging
    Nine SCA2 (Spinocerebellar ataxia type II) patients and 16 age-matched healthy controls, were examined twice on the same 1.5T MRI scanner
    paper
    licence CC-BY 4.0

  • Shifts Challenge 2022
    White Matter Multiple Sclerosis (MS) lesion segmentation in 3D Magnetic Resonance Imaging (MRI) of the brain
    Keyboard: MRI
    Leaderboard

  • SHINY-ICARUS (Segmentation over tHree dImensional rotational aNgiographY of Internal Carotid ArteRy with aneUrySm)
    Keyboard: Brain vasculature, Labeled

  • SIMON (Single Individual volunteer for Multiple Observations across Networks)
    A sample of convenience of one healthy male aged between 29 and 46 years old, scanned in 73 sessions at multiple sites and with various scanner models.
    Keyboard: MRI
    licence CC BY-SA

  • SinoCT
    This dataset contains over 9,000 head CT scans, each labeled as normal or abnormal. Each scan contains a reconstructed image and a corresponding sinogram.
    Keyboard: Labeled
    paper
    licence Stanford university dataset research use aggrement

  • SLCN (Surface Learning for Clinical Neuroimaging)
    Part of the dHCP (Developing Human Connectome Project)
    Keyboard: MRI
    Leaderboard

  • SLIM (Southwest University Longitudinal Imaging Multimodal)
    A Long-term Test-Retest Sample of Young Healthy Adults in Southwest China.
    Keyboard: Resting state fMRI (rs-fMRI)
    licence Attribution - Non-Commercial

  • SMILE-UHURA (Small Vessel Segmentation at MesoscopIc ScaLE from Ultra-High ResolUtion 7T Magnetic Resonance Angiograms)
    Keyboard: MRI, Labeled

  • StudyForrest
    A Collection of datasets
    paper List of publications
    licence ODC Public Domain Dedication and Licence (PDDL)

  • Synthetic skull bone defects
    For automatic patient-specific craniofacial implant design
    Keyboard: CT scan
    paper
    licence CC BY 4.0

  • SynthStrip
    Keyboard: Multi-modality, Labeled, Segmentation
    paper

  • T1-weighted with 250 μm resolution
    T1-weighted in vivo human whole brain MRI dataset with an ultrahigh isotropic resolution of 250 μm.
    Keyboard: MRI, High field MRI
    paper

  • TADPOLE (The Alzheimer's Disease Prediction Of Longitudinal Evolution)
    In collaboration with ADNI
    Keyboard: MRI, Labeled
    Leaderboard

  • TCGA-LGG (The Cancer Genome Atlas Low Grade Glioma)
    Data from 199 subjects.
    Keyboard: Multi-Modality
    paper
    licence TCIA Restricted

  • Thalamus Segmentation
    1063 subjects that includes registered T1w and dMRI, automatically generated masks, and a subset with manual annotation. The data is derived from the HCP.
    paper

  • The Neuro Bureau - Berlin: Mind & Brain
    It represents a community sample including individuals ranging in age from 18 to 60 years old. Each participant copmleted at least two 7.5-minute resting state scans.
    Keyboard: Resting state fMRI (R-fMRI)
    licence Attribution - Non-Commercial

  • TopCoW23
    Topology-Aware Anatomical Segmentation of the Circle of Willis
    Keyboard: Magnetic Resonance Angiography (MRA) and Computed Tomography Angiography (CTA)
    Leaderboard | paper
    licence CC BY-NC

  • TopCoW24
    Topology-Aware Anatomical Segmentation of the Circle of Willis
    Keyboard: Magnetic Resonance Angiography (MRA) and Computed Tomography Angiography (CTA)
    Leaderboard
    licence Open use. Must provide the source. Use for commercial purposes requires permission of the data owner.

  • TrackRAD2025
    Real-time Tumor Tracking for MRI-guided Radiotherapy
    Keyboard: Cancer
    licence CC-BY-NC.

  • TRAIN-39
    The overall goal of this project was to better understand how the brain successfully acquires skills relevant to complex tasks.
    Keyboard: fMRI
    licence Attribution - Non-Commercial

  • UNAM Hynosis (Universidad Nacional Autónoma de México)
    Resting state of the static hypnotic state.
    Keyboard: Resting state fMRI scans (rs-fMRI)
    paper
    licence Attribution - Non-Commercial

  • UPenn-GBM (University of Pennsylvania glioblastoma)
    Multi-parametric magnetic resonance imaging scans for de novo Glioblastoma patients.
    Keyboard: Cancer, mpMRI, Segmentation, Labeled
    paper
    licence CC BY 4.0

  • VALDO (VAscular Lesions DetectiOn)
    Keyboard: MRI, cerebral small vessel disease (CSVD), Labeled
    paper

  • Virginia Tech
    The Virginia Tech Carillon Research Institute sample is a collection of past and present scans obtained from psychiatrically screened individuals ranging in age from 18 to 65 years old. The initial release consists of datasets from 25 healthy (community sample) adults, with age, sex, education level, and ethnicity provided.
    Keyboard: T1 weighted MRI, Resting state fMRI scans (R-fMRI)
    licence Attribution - Non-Commercial

  • Wayne 10
    The Wayne State longitudinal data set for the Brain Aging in Detroit Longitudinal Study, comprises 114 extensively-sampled healthy individuals. The overarching aim of the Brain Aging in Detroit Longitudinal Study, is understanding the mechanisms driving human brain changes over the adult lifespan, identifying the risk factors and protective influences that modify the rate of change, and elucidating the relationships between changes in brain properties and cognitive performance.
    ;8 Keyboard: MRI
    licence CC-BY-NC-SA

  • Wayne 11
    The Wayne State longitudinal data set (collected on 4T Bruker scanner, with Siemens user interface) for the Brain Aging in Detroit Longitudinal Study, comprises 200 healthy individuals.
    Keyboard: MRI
    licence CC-BY-NC-SA

  • Wayne EF (Executive Functions)
    The Wayne State executive function data set, comprises 112 extensively-sampled healthy individuals. The overarching aim of the executive function study is to explore the mediating role of differences in brain structure, EF, and processing speed in age-related differences in episodic memory.
    Keyboard: MRI
    paper
    licence CC-BY-NC-SA

  • WMH (White Matter Hyperintensity)
    Keyboard: MRI, Segmentation
    paper
    licence CC-BY-NC-4.0

  • WU-Minn HCP (Washington University, University of Minnesota, Human Connectome Project)
    It includes behavioral and 3T MR imaging data from1206 healthy young adult
    paper

  • Yale Hires
    The Yale High-Resolution Controls (Yale Hires) comprises 120 healthy individuals and was collected with the purpose of assessing the intrinsic organization of the human brain at rest.
    Keyboard: fMRI
    paper
    licence CC-BY-NC-SA

  • Yale Lowres
    The Yale Low-Resolution Controls (Yale Lowres) comprises 100 healthy individuals and was collected with the purpose of assessing the intrinsic organization of the human brain at rest.
    Keyboard: fMRI
    licence CC-BY-NC-SA

  • Yale TRT
    The Yale Test-Retest Dataset (Yale TRT) comprises 12 extensively-sampled healthy individuals and was collected with the purpose of assessing the intrinsic organization of the human brain at rest.
    Keyboard: fMRI
    paper
    licence CC-BY-NC-SA

Ears, Nose, Teeth, and Throat

  • 3DTeethSeg22
    A total of 1800 3D intra-oral scan for 900 patients covering their upper and lower jaws separately.
    Keyboard: Labeeld, Segmentation
    paper | paper
    licence CC BY-NC-ND 4.0

  • Cl-Detection 2023
    Cephalometric Landmark (CL) Detection in Lateral X-ray Images.
    Keyboard: Labeled
    Leaderboard

  • CTooth
    The gathered data set consists of 5803 CBCT slices in total, out of which 4243 contain tooth annotations.
    Keyboard: 3D dental CBCT, Segmentation, Labeled
    paper | paper
    licence CC BY 4.0

  • DDTI
    Thyroid Ultrasound Images to Classify Benign&Malign Cases.
    Keyboard: Labeled

  • DENTEX
    Dental Enumeration and Diagnosis on Panoramic X-rays
    Keyboard: X-rays, Labeled
    Leaderboard | paper | paper
    licence CC BY-SA 4.0

  • OPC-Radiomics
    Radiomic Biomarkers in Oropharyngeal Carcinoma
    Keyboard: CT scan, Cancer
    licence CC BY 3.0 - TCIA Restricted

  • OpenEar
    A library consisting of eight three-dimensional models of the human temporal bone.
    Keyboard: Cone Beam Computed Tomography (CBCT)
    paper
    licence CC BY 4.0

  • Panoramic Dental X-rays
    This dataset consists of anonymized and deidentified panoramic dental X-rays of 116 patients.
    Keyboard: Labeeld, Segmentation
    paper
    licence CC BY NC 3.0

  • Panoramic radiography database
    This database contains 598 panoramic radiographs.
    Keyboard: X-ray
    paper
    licence CC BY 4.0

  • Pulpy3D
    It is an extension of the dataset ToothFairy
    Keyboard: Cone Beam Computed Tomography (CBCT), Segmentation, Labeled
    paper

  • SegThy
    Thyroid and Neck Segmentation.
    Keyboard: MRI, Ultrasound
    licence CC BY

  • STS-2D (Semi-supervised Tooth Segmentation)
    The training dataset consists of 4000 panoramic images of teeth.
    Keyboard: Panoramic X-ray, Labeled
    Leaderboard
    licence Any individual or company is prohibited from using it for commercial purposes.

  • STS-3D (Semi-supervised Tooth Segmentation)
    Training dataset consists of 312 CT scans, containing about 62400 slices.
    Keyboard: Cone Beam Computed Tomography (CT scan), Labeled
    Leaderboard
    licence Any individual or company is prohibited from using it for commercial purposes.

  • TCGA-THCA (The Cancer Genome Atlas Thyroid Cancer)
    Data from 6 subjects and 2780 images
    Keyboard: CT scan
    licence CC BY 3.0

  • Teeth Segmentation Dataset
    The dataset consists of 598 images from other dataset with a total of 15,318 polygons, where each tooth is segmented manually with a different class.
    Keyboard: Panoramic X-ray, Segmentation, Labeled
    licence CC0 1.0

  • Thyroid Ultrasound Cine-clip
    Data is collected from 167 patients with biopsy-confirmed thyroid nodules (n=192).
    Keyboard: Ultrasound cine-clip images, Labeeld, Segmentation
    licence Stanford university dataset research use aggrement

  • TN-SCUI2020 (Thyroid Nodule Segmentation and Classification in Ultrasound Images)
    A dataset of thyroid nodule with over 4,500 patient
    Keyboard: Ultrasound Image, Thyroid
    Leaderboard
    licence The publish right of this dataset is limited to the purpose of this challenge only

  • ToothFairy
    A dataset of dental scans obtained by 3D CBCT
    Keyboard: Cone Beam Computed Tomography (CBCT), Segmentation
    Leaderboard | paper

  • ToothFairy2
    Multi-Structure Segmentation in CBCT Volumes
    Keyboard: Cone Beam Computed Tomography (CBCT)
    Leaderboard
    licence CC BY-SA

  • ToothFairy3
    Multi-Structure Segmentation in CBCT Volumes
    Keyboard: Cone Beam Computed Tomography (CBCT), Segmentation
    Leaderboard
    licence CC-BY-NC-SA

  • Vestibular Schwannoma SEG
    242 consecutive patients with vestibular schwannoma (VS) undergoing Gamma Knife stereotactic radiosurgery (GK SRS).
    Keyboard: MRI, Segmentation, Labeled
    paper | paper
    licence CC BY 4.0

Eyes

  • ADAM
    Diagnosis of Age-related Macular degeneration (AMD) and segmentation of lesions in fundus photos from AMD patients
    Keyboard: Labeled
    Leaderboard | paper

  • AGE (Angle closure Glaucoma Evaluation)
    A dataset of 4800 annotated AS-OCT images
    Keyboard: OCT
    Leaderboard | paper Clinical Background | paper

  • AIROGS (Artificial Intelligence for RObust Glaucoma Screening)
    This dataset includes around 113,000 images from about 60,000 patients
    Keyboard: Fundus Images
    Leaderboard | paper Summary Paper
    licence CC BY-NC-ND 4.0

  • APTOS 2019 (Asia Pacific Tele-Ophthalmology Society)
    Keyboard: Fundus photography, Diabetic retinopathy
    Leaderboard

  • CATARACTS
    Surgical tool detection in 50 videos of cataract surgeries
    Keyboard: Video, Labeled
    Leaderboard | paper
    licence CC BY 4.0

  • CHASE-DB1 BrokenLink
    Keyboard: Retinal, Labeled

  • DDR
    13,673 fundus images from 9598 patients.
    Keyboard: Diabetic retinopathy (DR), Segmentation, Detection
    paper

  • DRAC 2022 (Diabetic Retinopathy Analysis Challenge)
    A ultra-wide optical coherence tomography angiography (UW-OCTA) dataset addressing three primary clinical tasks: DR lesion segmentation, image quality assessment, and DR grading.
    Keyboard: Diabetic retinopathy, Segmentation, Classification
    Leaderboard | paper | paper

  • DRiDB (Diabetic Retinopathy Image Dataset)
    Keyboard: Fundus Images, Diabetic retinopathy
    paper
    licence The data included in the dataset can be used, free of charge, for research and educational purposes. Copy, redistribution, and any unauthorized commercial use is prohibited.

  • DRIVE (Digital Retinal Images for Vessel Extraction)
    Keyboard: Retinal, Segmentation
    Leaderboard

  • Duke Dataset for Fluorescein Angiography in DME eyes
    Fluorescein angiography images obtained from 24 eyes of 24 subjects.
    Keyboard: Video, Segmentation, Labeled
    paper

  • E-ophtha
    Keyboard: Diabetic retinopathy (DR), Color fundus images, Labeled

  • EyePACS Diabetic Retinopathy Detection
    Keyboard: Retina Images, Labeled
    Leaderboard

  • FIRE (Fundus Image Registration Dataset)
    Keyboard: Retinal, Labeled

  • GAMMA
    The dataset consists of 2D fundus images and 3D optical coherence tomography (OCT) images of 300 patients. The dataset was annotated with glaucoma grade in every sample, and macular fovea coordinates as well as optic disc/cup segmentation mask in the fundus image.
    Keyboard: OCT images
    Leaderboard | paper

  • HRF (High-Resolution Fundus)
    The database contains 15 images of healthy patients, 15 images of patients with diabetic retinopathy and 15 images of glaucomatous patients.
    Keyboard: Fundus Images, Segmentation, Labaled
    paper

  • IDRiD (Indian Diabetic Retinopathy Image Dataset)
    paper First Results and Analysis | paper Data Descriptor
    licence CC BY 4.0

  • JustRAIGS (Justified Referral in AI Glaucoma Screening)
    The dataset is divided into a training subset with 101,442 gradable fundus images, spanning both referable and no referable glaucomatous cases, and a test subset comprising 9,741 fundus images.
    Keyboard: Fundus Images, Labeled
    Leaderboard | paper
    licence CC BY-NC-SA

  • MeDAL Retina Dataset
    Keyboard: Retinal, Labeled
    paper Comprehensive details
    licence CC BY 4.0

  • Messidor MA Groundturth BrokenLink
    Microaneurysm (MA) detection in 20 retinal images
    Keyboard: Retinal, Labeled
    paper | paper

  • OCTA-500
    It contains OCTA imaging under two fields of view (FOVs) from 500 subjects.
    Keyboard: Optical coherence tomography angiography (OCTA), Segmentation, Labeled
    paper
    licence CC BY 4.0

  • ODIR 2019 (Ocular Disease Intelligent Recognition)
    A database of 5000 patients with age, color fundus photographs from left and right eyes
    Keyboard: Labeled
    Leaderboard

  • PALM
    Investigation and development of algorithms associated with the diagnosis of Pathological Myopia (PM) and segmentation of lesions in fundus photos from PM patients.
    Keyboard: Labeled
    Leaderboard

  • PRIME-FP20
    It provides 15 high-resolution ultra-widefield (UWF) fundus photography (FP) images
    Keyboard: Retinal vessel, Segmentation, Labeled
    paper
    licence CC BY 4.0

  • RAVIR
    A Dataset and Methodology for the Semantic Segmentation and Quantitative Analysis of Retinal Arteries and Veins in Infrared Reflectance Imaging
    Leaderboard | paper | paper
    licence CC BY-NC-SA 4.0

  • REFUGE (Retinal Fundus Glaucoma)
    A data set of 1200 fundus images with ground truth segmentations and clinical glaucoma labels
    Keyboard: Segmentation, Classification, Labeled
    Leaderboard | paper | paper

  • RETOUCH (Retinal OCT Fluid Challenge)
    Detect and segment various types of fluids on a common dataset of optical coherence tomography (OCT) volumes representing different retinal diseases, acquired with devices from different manufacturers.
    Keyboard: OCT images
    Leaderboard | paper
    licence The data shared in this challenge is strictly limited to research purpose only, any commercial use is prohibited.

  • RFMiD (Retinal Fundus Multi-Disease Image Dataset)
    It consists of 3200 fundus images
    Keyboard: Fundus Images, Classification
    Leaderboard | paper Data Descriptor

  • RIGA (Retinal fundus images for glaucoma analysis)
    It is derived from three sources for a total of 750 images. The optic cup and disc boundaries for each image was marked and annotated.
    Keyboard: Fundus images, Labeled
    paper
    licence CC BY-NC 4.0

  • RITE (Retinal Images vessel Tree Extraction)
    Segmentation or classification of arteries and veins on retinal fundus images, which is established based on the DRIVE database
    paper

  • ROC (Retinopathy Online Challenge)
    50 training images and 50 test images
    Keyboard: Diabetic retinopathy, Fundus Images, Labeled
    paper Overview

  • ROCC (Retinal OCT Classification Challenge)
    A dataset of OCT volumes, acquired with Topcon SD-OCT devices
    Keyboard: OCT images, Diabetic retinopathy

  • ROSE (Retinal OCT-Angiography Vessel SEgmentation)
    It includes two subsets: ROSE-1 and ROSE-2.
    paper
    licence CC BY 4.0

  • Segmentation of OCT images
    Images for segmentation of optical coherence tomography images with diabetic macular edema (DME).
    Keyboard: OCT images
    paper

  • STAGE (Structural-Functional Transition in Glaucoma Assessment)
    400 OCT data and corresponding Visual Field test reports with Mean Deviation (MD) values, sensitivity maps and pattern deviation probability map labels.
    Keyboard: OCT images
    Leaderboard

  • STARE (STructured Analysis of the Retina)
    Keyboard: Labeled

  • UK Biobank BrokenLink
    2 sets of manual segmentations for 20 UK Biobank retinal images
    Keyboard: Retinal, Labeled
    paper

  • UoA-DR (University of Auckland Diabetic Retinopathy)
    This database consists of 200 retinal images mostly affected with diabetic retinopathy.
    Keyboard: Segmentation, Labeled
    paper
    licence CC0 1.0


Chest and Abdomen

Bowel

  • ASU-Mayo
    Containing 19,400 frames and a total of 5,200 polyp instances from 10 unique polyps.
    Keyboard: Colonoscopy videos, Segmentation, Labeled
    paper
    licence Contact provider

  • CMB-CRC (Cancer Moonshot Biobank - Colorectal Cancer)
    Keyboard: Multi-modality, Cancer
    licence CC BY 4.0 - TCIA Restricted

  • Collection of textures in colorectal cancer histology
    Keyboard: Labeled, Mulyi tissue
    paper
    licence CC BY 4.0

  • CoNIC (Colon Nuclei Identification and Counting)
    Keyboard: whole-slide images (WSI), Nuclear segmentation and classification
    Leaderboard | paper

  • CPTAC-COAD (Clinical Proteomic Tumor Analysis Consortium Colon Adenocarcinoma)
    Keyboard: Histopathology, Cancer
    licence CC BY 3.0

  • CVC colon DB
    It contains 15 short colonoscopy sequences.
    Keyboard: Colonoscopy video, Segmentation, Classification
    paper

  • Digestpath2019 (Digestive-System Pathological 2019)
    Colonoscopy tissue segmentation and classification and Signet ring cell detection dataset
    Keyboard: Whole slide image (WSI), Cancer, Labeled
    paper

  • EDD2020 (Endoscopy Disease Detection)
    Annotated data consists of 5 different disease classes.
    Keyboard: Video, Segmentation, Detection, Labeled
    paper
    licence CC BY 4.0

  • El Salvador atlas of Gastrointestinal
    It displays 5154 video clips.
    Keyboard: Video Endoscopy
    paper

  • EndoCV2021 (Endoscopy Computer Vision 2021)
    Addressing generalisability in polyp detection and segmentation
    Keyboard: Colonoscopy, Labeled

  • HyperKvasir
    The dataset contains 110,079 images and 373 videos where it captures anatomical landmarks and pathological and normal findings.
    Keyboard: Gastrointestinal tract, Labeled, Segmentation
    paper
    licence The data is released fully open for research and educational purposes.

  • Kvasir-Capsule
    The dataset consists of 117 videos which can be used to extract a total of 4,741,504 image frames.
    Keyboard: Video capsule endoscopy (VCE) , Labeled
    paper
    licence The data is released fully open for research and educational purposes.

  • Kvasir-SEG
    It contains 1000 images.
    Keyboard: Gastrointestinal polyp, Labeled, Segmentation, Colonoscopy
    paper
    licence The use of the dataset is restricted for research and educational purposes.

  • PAIP2020
    Classification of molecular subtypes in colorectal cancer for whole-slide image analyses
    Leaderboard
    licence CC BY-NC 4.0

  • PAIP2023
    Tumor cellularity prediction in pancreatic cancer (supervised learning) and colon cancer (transfer learning)
    Leaderboard
    licence CC BY-NC 4.0

  • PolypGen
    This dataset is composed of a total of 8037 frames including both single and sequence frames.
    Keyboard: Detection, Segmentation, Video
    paper
    licence CC BY 4.0

  • TCGA-COAD (The Cancer Genome Atlas Colon Adenocarcinoma)
    Data from 25 subjects.
    Keyboard: CT scan
    licence CC BY 3.0

  • TCGA-READ (The Cancer Genome Atlas Rectum Adenocarcinoma)
    Data from 3 subjects and 1,796 images.
    Keyboard: MRI, CT scan
    licence CC BY 3.0

  • The National CT Colonography Trial (ACRIN 6664)
    A collection contains 825 cases of CT colonography imaging with accompanying spreadsheets that provide polyp descriptions and their location within the colon segments.
    paper
    licence CC BY 3.0

Breast

  • ACRIN-FLT-Breast (ACRIN 6688)
    Examination both pre-therapy and post-therapy
    Keyboard: 18F-FLT PET imaging, CT Scan, Cancer
    licence CC BY 3.0

  • ACROBAT (AutomatiC Registration Of Breast cAncer Tissue)
    Consisting of 4212 WSIs from 1153 patients
    Keyboard: whole-slide images (WSI), Cancer
    Leaderboard | paper

  • Advanced-MRI-Breast-Lesions
    Standard and Delayed Contrast-Enhanced MRI of Malignant and Benign Breast Lesions with Histological and Clinical Supporting Data
    Keyboard: Segmentation, Cancer
    licence CC BY 4.0

  • BACH (BreAst Cancer Histology)
    Keyboard: Biopsy, Cancer
    Leaderboard | paper
    licence CC BY-NC-ND

  • BCI (Breast Cancer Immunohistochemical)
    Keyboard: hematoxylin and eosin (HE) stained images, Image Generation, Labeled
    Leaderboard | paper

  • BCNB (Breast Cancer Core-Needle Biopsy)
    A dataset of Early Breast Cancer Core-Needle Biopsy WSI, which includes core-needle biopsy whole slide images of early breast cancer patients and the corresponding clinical data.
    Keyboard: Whole-Slide Images (WSIs), Labeled
    paper

  • BCSS (Breast Cancer Semantic Segmentation)
    The dataset contains over 20,000 segmentation annotations of tissue region from breast cancer images from TCGA.
    Keyboard: Cancer, Labeled
    paper
    licence CC0 1.0 Universal (CC0 1.0)

  • BreakHis (Breast Cancer Histopathological)
    A dataset of 7909 breast cancer histopathology images acquired on 82 patients
    Keyboard: Cancer, Labeled
    paper
    licence The Database may be used for non-commercial research

  • Breast Cancer CT
    Keyboard: CT Scan, Labeled, Cancer
    licence CC BY-NC-SA 4.0

  • Breast-Cancer-Screening-DBT (Digital Breast Tomosynthesis)
    It contains 22,032 reconstructed DBT volumes belonging to 5,610 studies from 5,060 patients.
    Keyboard: Mammography, Cancer
    paper
    licence CC BY-NC 4.0

  • BREAST-DIAGNOSIS
    It contains cases that are high-risk normals, DCIS, fibroids and lobular carcinomas.
    Keyboard: Cancer
    licence CC BY 3.0

  • Breast-MRI-NACT-Pilot
    Single site breast DCE-MRI data and segmentations from patients undergoing neoadjuvant chemotherapy
    Keyboard: Labeled, Cancer
    licence CC BY 3.0

  • BreastPathQ
    Development of quantitative biomarkers for the determination of cancer cellularity from whole slide images (WSI) of breast cancer hematoxylin and eosin (H&E) stained pathological slides
    Keyboard: Cancer, Haematoxylin and eosin (H&E) stained slides
    Leaderboard

  • BUS-UCLM
    It is comprised of breast ultrasound images from 38 patients. It consists of 683 images, of which 174 are benign, 90 are malignant, and 419 are normal.
    Keyboard: Cancer
    licence CC BY-NC 4.0

  • BUSI (Breast Ultrasound Images)
    The data collected at baseline include breast ultrasound images among women in ages between 25 and 75 years old.
    Keyboard: Cancer, Labeled
    paper

  • BUSIS (Breast Ultrasound Image Segmentation)
    Keyboard: Ultrasound image, Labeled, Cancer
    paper

  • CBIS-DDSM (Curated Breast Imaging Subset of Digital Database for Screening Mammography)
    The DDSM is a database of 2,620 scanned film mammography studies. It contains normal, benign, and malignant cases with verified pathology information.
    Keyboard: Cancer, Labeled
    paper
    licence CC BY 3.0

  • DMR-IR
    Keyboard: Cancer, Thermography image

  • Duke Breast Cancer MRI
    Dynamic contrast-enhanced magnetic resonance images of breast cancer patients with tumor locations.
    Keyboard: Cancer, Labeled
    paper
    licence CC BY-NC 4.0

  • HEROHE (HER2 on hematoxylin and eosin)
    The dataset consists of annotated, whole-slide images dataset (509), specifically collected for predicting human epidermal growth factor receptor 2 (HER2) status
    Keyboard: whole-slide images (WSI), Cancer
    paper
    licence CC BY-NC-ND 3.0

  • INbreast
    The database has a total of 115 cases (410 images) from which 90 cases are from women with both breasts affected and 25 cases are from mastectomy patients.
    Keyboard: Mammography, Cancer
    paper

  • ISPY1 - Trial (Investigation of Serial Studies to Predict Your Therapeutic Response with Imaging and moLecular Analysis)
    Keyboard: MRI, Cancer, Segmentation
    paper
    licence CC BY 3.0

  • ISPY2 - Trial (Investigation of Serial Studies to Predict Your Therapeutic Response with Imaging and moLecular Analysis)
    Keyboard: MRI, Cancer, Segmentation
    paper
    licence CC BY 4.0

  • MIAS
    Keyboard: Mammography, Cancer, Labeled
    licence For research purposes ONLY

  • MIDOG 2021 (Mitosis Domain Generalization 2021)
    Detect mitotic figures (cells undergoing cell division) from histopathology images (object detection)
    Keyboard: Whole-Slide Images (WSI), Cancer, Labeled
    Leaderboard | paper
    licence CC BY-NC-ND

  • MIDOG 2022 (Mitosis Domain Generalization 2022)
    Detect mitotic figures (cells undergoing cell division) from histopathology images (object detection)
    Keyboard: Whole-Slide Images (WSI), Cancer, Labeled
    Leaderboard | paper
    licence CC BY-NC-ND

  • Mini-DDSM (Digital Database for Screening Mammography)
    This is the light-weight version of the popular DDSM.
    Keyboard: Mammography, Cancer
    paper
    licence CC BY-ND 4.0

  • MITOS-ATYPIA-14
    It is made up of two parts: Detection of mitosis on the one hand, and evaluation of nuclear atypia score on the other hand.
    Keyboard: Cancer, Haematoxylin and eosin (H&E) stained slides
    Leaderboard

  • NuCLS
    The datasets contain over 220000 labeled nuclei from breast cancer images from TCGA
    Keyboard: Cancer, Labeled
    paper
    licence CC0 1.0 license

  • Post-NAT-BRCA
    Assessment of Residual Breast Cancer Cellularity after Neoadjuvant Chemotherapy using Digital Pathology
    Keyboard: Histopathology
    paper
    licence CC BY 3.0

  • QIN-Breast
    This collection contains longitudinal PET/CT and quantitative MR images collected for the purpose of studying treatment assessment in breast cancer in the neoadjuvant setting.
    Keyboard: PET/CT, MRI, Cancer
    licence CC BY 3.0

  • QIN-Breast-02
    This data is from a multi-site, multi-parametric quantitative MRI study of adult (18+ years old) females diagnosed with invasive breast cancer.
    Keyboard: Cancer
    licence TCIA Limited CC0 1.0 license - CC BY 3.0

  • RIDER Breast MRI (Reference Image Database to Evaluate Therapy Response)
    RIDER is a targeted data collection used to generate an initial consensus on how to harmonize data collection and analysis for quantitative imaging methods applied to measure the response to drug or radiation therapy.
    Keyboard: Cancer
    licence CC BY 3.0

  • RSNA Screening Mammography Breast Cancer Detection (Radiological Society of North America 2023)
    Keyboard: Radiographic breast images, Labeled
    Leaderboard

  • SLN-Breast
    Breast Metastases to Axillary Lymph Nodes
    Keyboard: Histopathology, Cancer
    paper
    licence CC BY 3.0

  • TCGA-BRCA (The Cancer Genome Atlas Breast Invasive Carcinoma)
    Data from 139 subjects.
    Keyboard: MRI, Cancer
    licence CC BY 3.0

  • TDSC-ABUS2023 (Tumor Detection, Segmentation and Classification Challenge on Automated 3D Breast Ultrasound 2023)
    Keyboard: Ultrasound, Cancer, Labeled
    Leaderboard

  • TIGER (Tumor InfiltratinG lymphocytes in breast cancER)
    Keyboard: H&E Whole-Slide Images (WSI), Cancer, Detecion, Segmentation
    Leaderboard
    licence CC BY-NC 4.0

  • TUPAC (Tumor Proliferation Assessment Challenge)
    The dataset consisted of 500 training and 321 testing breast cancer histopathology WSIs.
    Keyboard: Whole-Slide Images, Cancer
    paper

  • UTA4
    Keyboard: Multi-modality, Cancer
    paper
    licence CC-BY-SA-4.0

Heart and Blood Vessels

  • ACDC (Automated Cardiac Diagnosis Challenge)
    The dataset contains data from 150 multi-equipments CMRI recordings with reference measurements and classification from two medical experts.
    Keyboard: Cardiac MRI (CMR), Segmentation
    Leaderboard | paper

  • AMRG Cardiac Atlas
    There are 4 protocols used to create the cardiac atlas: T1-Weighted Images, True FISP Cines, MR tagging and contrast MRI.

  • Angiographic dataset for stenosis detection
    All patients had angiographically and/or functionally confirmed one-vessel coronary artery disease.
    paper
    licence CC BY 4.0

  • AortaSeg24
    Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography (CTA)
    Leaderboard

  • ARCADE
    Automatic Region-based Coronary Artery Disease Diagnostics Using X-Ray Angiography Images
    Keyboard: X-ray coronary angiography, Labeled
    Leaderboard | paper

  • ASOCA (Automated Segmentation of Coronary Arteries)
    A set of Cardiac Computed Tomography Angiography (CCTA) with contrast agent showing the coronary arteries
    Keyboard: CCTA, Labeled
    Leaderboard

  • Atria Segmentaion 2018
    A total of 154 3D MRIs from patients with Atrial fibrillation (AF) are used.
    Keyboard: Labeled
    paper

  • Blood Cell Images
    This dataset contains 12,500 augmented images of blood cells with accompanying cell type labels. The cell types are Eosinophil, Lymphocyte, Monocyte, and Neutrophil.
    licence MIT

  • CAMUS (Cardiac Acquisitions for Multi-structure Ultrasound Segmentation)
    The dataset consists of clinical exams from 500 patients
    Keyboard: 2D echocardiographic images
    Leaderboard | paper

  • CardiacUDA (Unsupervised Domain Adaption)
    Keyboard: Echocardiogram Videos
    paper
    licence Apache 2.0

  • CAVAREV (CArdiac VAsculature Reconstruction EValuation)
    The goal is to enable an easy and objective comparison of different dynamic reconstruction algorithms.

  • cDEMRIS (Cardiac Delayed Enhancement Segmentation Challenge)
    The dataset includes Late Gadolinium enhancement (LGE) cardiovascular magnetic resonance imaging used to visualise regions of fibrosis and scarring in the left atrium (LA) myocardium.
    Keyboard: Cardiac MRI (CMR), Segmentation
    paper
    licence CC BY 4.0

  • CETUS (Challenge on Endocardial Three-dimensional Ultrasound Segmentation)
    The dataset is composed of 45 sequences of 3D ultrasound volumes of one cardiac cycle from 45 patients to compare left ventricle segmentation methods for both End Diastolic and End Systolic phase instances.
    Keyboard: Ultrasound imaging, Segmentation
    Leaderboard

  • CHD (Congenital Heart Disease)
    Physiologic clinical data, and computational models from adults and children with various congenital heart defects.
    Keyboard: MRI
    paper | paper

  • COCA
    Coronary Calcium and chest CT’s.
    Keyboard: Segmentation
    licence Stanford university dataset research use aggrement

  • COSMOS (CarOtid vessel wall SegMentation and atherosclerOsis diagnosiS)
    Keyboard: 3D-VISTA (volume isotropic turbo spin echo acquisition) images
    Leaderboard

  • CMRxMotion
    Extreme Cardiac MRI Analysis under Respiratory Motion
    paper | paper

  • CMRxRecon
    It aims to establish a platform for fast CMR image reconstruction
    Keyboard: 3T MRI, Segmentation, Labeled

  • CMRxUniversalRecon
    Also known as CMRxRecon2024
    Keyboard: Cardiac MRI Reconstruction

  • CT Pulmonary Angiography
    A collection of CT pulmonary angiography (CTPA) for patients susceptible to Pulmonary Embolism (PE). In addition to slice-level PE labels, labels for PE location, RV/LV ratio, and PE type are provided.
    Keyboard: Labeled
    licence Stanford university dataset research use aggrement

  • DETERMINE (Defibrillators to Reduce Risk by Magnetic Resonance Imaging Evaluation)
    It consists of MR images and some 3D left ventricular models derived semi-automatically.

  • EchoNet-Dynamic
    A Cardiac Motion Video Data Resource for Medical Machine Learning includes 10,030 labeled echocardiogram videos
    Keyboard: Echocardiography, Labeled
    paper

  • EchoNet-LVH
    A Parasternal Long Axis Echocardiography Video Data Resource
    Keyboard: Echocardiography, Labeled
    paper

  • EchoNet-Pediatric
    A Pediatric data resource includes 7,643 labeled echocardiogram videos
    Keyboard: Echocardiography, Labeled
    paper

  • EMIDEC (automatic Evaluation of Myocardial Infarction from Delayed-Enhancement Cardiac MRI)
    The database consists of 150 exams divided into 50 cases with normal MRI after injection of a contrast agent and 100 cases with myocardial infarction.
    Keyboard: Segmentation, Classification
    Leaderboard | paper | paper
    licence CC BY-NC-SA 4.0

  • HMC-QU
    The dataset includes a collection of apical 4-chamber (A4C) and apical 2-chamber (A2C) view 2D echocardiography.
    Keyboard: Myocardial Infarction, Detection, Segmentation, Labeled
    paper
    licence Attribution-NonCommercial-ShareAlike 3.0 IGO (CC BY-NC-SA 3.0 IGO)

  • ImageALCAPA (Anomalous left coronary artery from pulmonary artery)
    30 3D CTA images.
    Keyboard: CTA (Computed tomography angiography), Labeled, Segmentation
    paper
    licence Apache 2.0

  • ImageCAS (Coronary Artery Segmentation)
    A Dataset and for Coronary Artery Segmentation based on CT.
    Keyboard: CTA (Computed tomography angiography), Segmentation
    paper
    licence Apache 2.0

  • ImageCHD (Congenital Heart Disease)
    A 3D CT Image Dataset for classification of Congenital Heart Disease.
    Keyboard: CT scan, Labeled
    paper
    licence Apache 2.0

  • ImageTBAD
    A 3D CT Image Dataset for Automatic Segmentation of of Type-B Aortic Dissection.
    Keyboard: CTA (Computed tomography angiography), Labeled, Aorta
    paper
    licence Apache 2.0

  • LASC'13 (Left Atrial Segmentation Challenge 2013)
    The benchmark consists of 30 CT and 30 MRI datasets.
    Keyboard: Labeled
    paper
    licence CC BY 4.0

  • LAScarQS (Left Atrial and Scar Quantification & Segmentation)
    It provides 194 LGE MRIs from patients suffering atrial fibrillation (AF).
    Keyboard: Labeled
    paper
    licence CC BY NC ND

  • LivScar
    The image database consists of 30 Late Gadolinium enhancement cardiovascular magnetic resonance images of both humans and pigs that were acquired from two separate imaging centres.
    Keyboard: Cardiac MRI (CMR), Segmentation
    paper
    licence CC BY 4.0

  • LV Landmark Detection Challenge
    This challenge uses the same data set as in the LV Segmentation Challenge with manually annotated landmark positions were placed in the training data set as annotation data.
    Keyboard: MRI, Labeled

  • LV Segmentation Challenge
    This challenge was aimed to establish a set of ground truth or consensus segmentation derived from participants.
    Keyboard: MRI

  • LV Statistical Shape Challenge
    The training dataset will comprise one hundred (100) cases with myocardial infarction and an additional one hundred (100) asymptomatic cases from the DETERMINE and MESA datasets respectively.
    Keyboard: MRI

  • LVQuan19 (Left Ventricle Full Quantification)
    A dataset with processed SAX MR sequences of 86 subjects.
    paper

  • M&Ms
    Multi-Centre, Multi-Vendor and Multi-Disease Cardiac Segmentation
    Keyboard: Cardiac MRI (CMR)
    paper

  • M&Ms-2
    Multi-Disease, Multi-View & Multi-Center Right Ventricular Segmentation in Cardiac MRI
    Keyboard: Cardiac MRI (CMR)
    Leaderboard | paper

  • MBAS24 (Multi-class Bi-Atrial 2024)
    Concluding 70 3D LGE-MRI scans for training, 30 for validation, and an additional 100 designated for the final test phase.
    licence CC BY NC ND

  • MESA (Multi-Ethnic Study of Atherosclerosis)
    It aims to investigate the manifestation of subclinical to clinical cardiovascular disease before signs and symptoms develop.
    Keyboard: MRI

  • MITEA (MR-Informed Three-dimensional Echocardiography Analysis)
    The dataset consists of annotated 3D echocardiography (3DE) data using labels derived from paired CMR scans acquired in a mixed cohort of 134 human subjects (82 healthy controls and 52 patients with acquired cardiac disease).
    paper

  • MM-WHS (Multi-Modality Whole Heart Segmentation)
    It provides multi-modality cardiac images acquired in real clinical environment.
    Keyboard: Anonymized clinical MRI and CT scan, Labeled
    paper | paper

  • Motion Correction Challenge
    The dataset consists of 10 cases.
    Keyboard: MRI, Labeled
    paper

  • MS-CMRSeg (Multi-sequence Cardiac MR Segmentation)
    Data from 45 patients.
    Keyboard: Cardiac MRI (CMR), Segmentation, Labeled
    paper

  • MyoPS 2020
    Myocardial pathology segmentation combining multi-sequence cardiac magnetic resonance (CMR)
    paper

  • MYOSAIQ (MYOcardial Segmentation with Automated Infarct Quantification)
    The full dataset is composed of 467 Late gadolinium enhanced magnetic resonance images from two different cohorts to quantify myocardial infarction (MI) lesions at different phases of the longitudinal evolution of the disease
    Keyboard: MRI, Segmentation

  • OCMR
    Open-Access Multi-Coil k-Space Dataset for Cardiovascular Magnetic Resonance Imaging
    Keyboard: MRI
    paper

  • orCaScore
    Cardiac CT exams of 72 patients
    Keyboard: CT scan
    Leaderboard | paper

  • Parse2022 (Pulmonary Artery Segmentation 2022)
    Our dataset contains 200 3D volumes with refined pulmonary artery label
    Keyboard: CT Pulmonary Angiography (CTPA)
    Leaderboard | paper

  • PASCAL
    Carotid Artery TOF MRA Data Set

  • RadFusion
    The dataset collected data from 1794 patients susceptible to pulmonary embolism. It consists of Chest CT, patient demographics and medical history.
    Keyboard: CT scan, Labeled
    paper
    licence Stanford university dataset research use aggrement

  • RVSC (Right Ventricle Segmentation Challenge)
    Keyboard: Cardiac cine MRI
    paper

  • SCD (Sunnybrook Cardiac Data)
    The dataset consist of 45 cine-MRI images from a mixed of patients and pathologies: healthy, hypertrophy, heart failure with infarction and heart failure without infarction.
    Keyboard: Segmentation, Labeled
    paper
    licence Public Domain (CC0 1.0 Universal)

  • SCMR Consensus Contours (Society for Cardiovascular Magnetic Resonance)
    This dataset is designed to have the most reliable ground truth myocardial contours from short-axis MRI with multiple pathologies (1 healthy and 3 cardiac disease).
    Keyboard: Segmentation, Labeled
    paper

  • Second Annual Data Science Bowl
    Keyboard: cardiac MRI, Registration
    Leaderboard

  • SEG.A. 2023 (Segmentation of the Aorta)
    Keyboard: CTA (Computed tomography angiography), Labeled, Aorta
    Leaderboard

  • SLAWT (Segmentation of Left Atrial Wall for Thickness)
    The image database consisted of cardiac CT (n=10) and MRI (n=10) of healthy and diseased subjects.
    Keyboard: CT scan, MRI, Segmentation, Labeled
    paper

  • Vessel Segmentation
    Leaderboard

Kidneys and Urinary Tract

  • CPTAC-CCRCC (The Clinical Proteomic Tumor Analysis Consortium Clear Cell Renal Cell Carcinoma)
    Data from 262 subjects.
    Keyboard: Multi-modality
    licence CC BY 3.0

  • KiPA22 (Kidney PArsing 2022)
    Multi-Structure Segmentation for Renal Cancer Treatment
    Keyboard: Computed Tomography Angiography (CTA), Labeled
    Leaderboard

  • KiTS19 (Kidney Tumor Segmentation 2019)
    Keyboard: CT scan, Cancer, Labeled
    Leaderboard | paper
    licence CC BY-NC-SA 4.0

  • KiTS21 (Kidney Tumor Segmentation 2021)
    Keyboard: CT scan, Cancer, Labeled
    Leaderboard | paper | paper

  • KiTS23 (Kidney Tumor Segmentation 2023)
    Keyboard: CT scan, Cancer, Labeled

  • HuBMAP - Hacking the Kidney (Human BioMolecular Atlas Program)
    Identify glomeruli in human kidney tissue images. It includes 11 fresh frozen and 9 Formalin Fixed Paraffin Embedded (FFPE) PAS kidney images.
    Keyboard: Segmentation
    Leaderboard

  • MONKEY (Machine-learning for Optimal detection of iNflammatory cells in the KidnEY)
    Detection of mononuclear, inflammatory cells and detect and distinguish inflammatory cells
    Keyboard: WSI, Labeled
    Leaderboard

  • TCGA-BLCA (The Cancer Genome Atlas Urothelial Bladder Carcinoma)
    Data from 120 subjects and 111,781 images
    Keyboard: Multi-modality
    licence CC BY 3.0

  • TCGA-KICH (The Cancer Genome Atlas Kidney Chromophobe)
    Data from 15 subjects.
    Keyboard: MRI, CT scan
    licence CC BY 3.0

  • TCGA-KIRC (The Cancer Genome Atlas Kidney Renal Clear Cell Carcinoma)
    Data from 267 subjects.
    Keyboard: Multi-modality
    licence CC BY 3.0

  • TCGA-KIRP (The Cancer Genome Atlas Cervical KIdney Renal Papillary cell carcinoma)
    Data from 33 subjects.
    Keyboard: Multi-modality
    licence CC BY 3.0

  • Urinary tract infections
    A dataset containing 300 images and 3,562 manually annotated urinary cells labelled into seven classes of clinically significant urinary content.
    Keyboard: Segmentation , Labeled
    paper
    licence CC BY 4.0

Liver

  • 3D-IRCADb-01 (3D Image Reconstruction for Comparison of Algorithm Database)
    10 women and 10 men with hepatic tumours in 75% of cases.
    Keyboard: 3D CT scan, Cancer, Labeled, Segmentation
    licence CC BY-NC-ND 4.0

  • AHEP0731
    Risk-Based Therapy in Treating Younger Patients With Newly Diagnosed Liver Cancer
    paper
    licence NCTN/NCORP Data Archive License (With Collaborative Agreement)

  • ATLAS (A Tumour and Liver Automatic Segmentation)
    60 Public images
    Keyboard: MRI, Cancer, Labeled
    Leaderboard | paper | paper

  • CLUST (Challenge on Liver Ultrasound Tracking)
    It has two versions
    Leaderboard | paper

  • Colorectal Liver Metastases
    This collection consists of images for 197 patients with Colorectal Liver Metastases (CRLM).
    Keyboard: CT scan, Cancer, Segmentation, Labeled
    paper
    licence CC BY 4.0

  • Duke Liver DataSet
    It provides over 2000 anonymized MRI image series acquired in routine liver MRI protocols across 105 subjects.
    Keyboard: MRI, Cancer, Labeled
    paper
    licence CC-BY-NC-ND-4.0

  • HCC-TACE-Seg
    Multimodality annotated Hepatocellular carcinoma (HCC) cases with and without advanced imaging segmentation.
    Keyboard: CT scan, Cancer, Segmentation, Labeled
    paper
    licence CC BY 4.0

  • LiTS (Liver Tumor Segmentation)
    Keyboard: CT scan, Cancer, Labeled
    Leaderboard | paper
    licence Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

  • P2ILF (Preoperative to Intraoperative Laparoscopy Fusion)
    Keyboard: Laparoscopic video images, Segmentation, Registration
    Leaderboard

  • PAIP2019
    Keyboard: Whole-slide images (WSIs), Cancer, Segmentation, Labeled, Hepatocellular Carcinoma (HCC)
    Leaderboard

  • SHAPE 2014 BrokenLink
    The dataset is part of the training data from the "VISCERAL Organ Segmentation and Landmark Detection Challenge"
    Keyboard: MRI, CT scan, Labeled, Segmentation

  • SLIVER07 (Segmentation of the Liver Competition 2007)
    Keyboard: 3D CT scan
    Leaderboard | paper

  • TCGA-LIHC (The Cancer Genome Atlas Liver Hepatocellular Carcinoma)
    It has used in LiverHccSeg
    Keyboard: MRI, CT scan, Cancer
    licence CC BY 3.0

Lungs

  • 4D Lung
    The images include four-dimensional (4D) fan beam (4D-FBCT) and 4D cone beam CT (4D-CBCT)
    Keyboard: Cancer
    paper The dataset is described | paper
    licence CC BY 3.0

  • 4DCT
    Image sets and reference data of thoracic 4DCT images acquired as part of the radiotherapy planning process for the treatment of thoracic malignancies.
    paper | paper

  • ACDC-LungHP (Automatic Cancer Detection and Classification in Lung Histopathology)
    Keyboard: Cancer, H&E staining, Pathology
    Leaderboard | paper | paper

  • ACRIN-NSCLC-FDG-PET
    Positron Emission Tomography Pre- and Post-treatment Assessment for Locally Advanced Non-small Cell Lung Carcinoma
    Keyboard: Multi modality, Cancer
    licence CC BY 3.0

  • Airway
    Airway Segmentation and Centerline Extraction from Thoracic CT scan
    paper
    licence CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

  • ANODE09 (Automatic Nodule Detection 2009)
    Automatic detection of pulmonary nodules in chest
    Keyboard: CT-scan
    Leaderboard | paper
    licence Data downloaded from their site may only be used for the purpose of preparing an entry to be submitted on the site

  • ATM22 (Airway Tree Modeling)
    Dataset provides CT scans with detailed pulmonary airway annotation.
    Keyboard: CT-scan, Labeled
    Leaderboard | paper

  • BIMCV COVID-19
    These iterations of the database include 21342 CR, 34829 DX and 7918 CT studies.
    Keyboard: Chest radiography (CXR), CT scan, Labeled
    paper

  • BrixIA
    COVID19 severity score assessment project and database.
    Keyboard: Chest radiography (CXR), Labeled
    paper

  • BSTI COVID19 (British Society of Thoracic Imaging)
    Keyboard: CT scan
    licence It is not intended for download or research applications.

  • CANDID-PTX
    19,237 anonymized adult chest x-ray datasets in 1024 x 1024 pixel
    Keyboard: X-ray, Segmentation, Labeled
    licence CC BY-NC-SA 4.0

  • Chest CT-Scan images
    CT-Scan images with different types of chest cancer
    Keyboard: CT-scan, Labeled

  • Chest X-Ray Images (Pneumonia)
    There are 5,863 X-Ray images and 2 categories (Pneumonia/Normal).
    paper
    licence CC BY 4.0

  • Chest XR COVID-19 detection
    The dataset contains 20,000+ images and 3 classes: COVID-19, Pneumonia and Normal (healthy).
    Keyboard: X-ray, Labeled
    licence The dataset can only be used for this challenge

  • ChestX-Det
    It consists of 3578 images from NIH ChestX-14
    Keyboard: X-ray, Segmentation, Labeled

  • ChestX-ray8 (ChestXray-NIHCC)
    100,000 anonymized chest x-ray images
    Keyboard: X-ray, Labeled
    paper

  • ChestX-ray14 (ChestXray-NIHCC)
    It is a dataset which comprises 112,120 frontal-view X-ray images of 30,805 unique patients with the text-mined fourteen common disease labels, mined from the text radiological reports via NLP techniques. It expands on ChestX-ray8 by adding six additional thorax diseases.
    Keyboard: X-ray, Labeled

  • CheXlocalize
    234 images with 643 expert segmentations.
    Keyboard: X-ray, Labeled
    paper
    licence Stanford university dataset research use aggrement

  • CheXmask
    The database aggregates 657,566 anatomical segmentation masks from five public databases.
    Keyboard: X-ray, Labeled
    paper
    licence Creative Commons Attribution 4.0 International Public License

  • CheXpert
    A Large Chest X-Ray Dataset.
    Keyboard: X-ray, Labeled
    paper

  • CheXphoto
    It comprises a training set of natural photos and synthetic transformations of 10,507 x-rays from 3,000 unique patients that were sampled at random from the CheXpert training set, and a validation and test set of natural and synthetic transformations applied to all 234 x-rays from 200 patients and 668 x-rays from 500 patients in the CheXpert validation and test sets, respectively.
    licence Stanford university dataset research use aggrement

  • CMB-LCA (Cancer Moonshot Biobank - Lung Cancer)
    Keyboard: Multi-modality, Cancer
    licence CC BY 4.0 - TCIA Restricted

  • COPDgene
    Image sets and reference data of inspiratory and expiratory breath-hold CT image pairs acquired from the National Heart Lung Blood Institute COPDgene study archive.
    paper

  • COVID-19-20
    COVID-19 Lung CT Lesion Segmentation
    Keyboard: CT scan, Labeled
    Leaderboard | paper
    licence Annotation data are available under CC0 license

  • COVID-19-AR
    Chest Imaging with Clinical and Genomic Correlates Representing a Rural COVID-19 Positive Population
    paper
    licence CC BY 4.0

  • COVID-19 chest xray
    It contains COVID-19 cases as well as MERS, SARS, and ARDS.
    Keyboard: X-ray, CT scan
    licence CC0: Public Domain

  • COVID-19 CT Images Segmentation
    The data was provided by medicalsegmentation.
    Keyboard: CT scan, Labeled
    Leaderboard

  • COVID-19 CT Lung and Infection
    The dataset contains 20 labeled COVID-19 CT scans
    Keyboard: Segmentation
    licence CC BY 4.0

  • COVID-19 CT scans
    20 CT scans and expert segmentations of patients with COVID-19
    Keyboard: Labeled
    licence Coronacases (CC BY NC 3.0) - Radiopedia (CC BY NC SA 3.0) - Annotations (CC BY 4.0)

  • Covid-19 Image
    3 Way Classification - COVID-19, Viral Pneumonia, Normal
    Keyboard: X ray, Labeled
    licence CC BY-SA 4.0

  • COVID-19 Image Repository
    An anonymized data set of COVID-19 cases with a focus on radiological imaging
    licence CC BY 3.0

  • Covid-19 Infection Percentage Estimation
    Keyboard: CT scan, Labeled
    paper

  • COVID-19-NY-SBU
    This collection of cases was acquired at Stony Brook University from patients who tested positive for COVID-19.
    licence CC BY 4.0

  • COVID-19 Radiography Database
    3616 COVID-19 Chest X-ray images and lung masks
    paper | paper

  • COVID-19 Ultrasound
    200 LUS videos labelled with a diagnostic outcome
    paper Abstract | paper Full paper | paper

  • COVID-19 xray
    This dataset contains 6500 images of AP/PA chest x-rays with pixel-level polygonal lung segmentations

  • COVID-BLUES (Bluepoint Lung Ultrasound)
    It contains bluepoint-specific lung ultrasound videos recorded included 63 patients (33 COVID-positive and 30 COVID-negative), with the inclusion criteria being symptoms of a lung infection.

  • COVID-ChestXRay
    An database of COVID-19 cases with chest X-ray or CT images
    Keyboard: CT scan, X-ray
    paper | paper

  • COVID-CT
    It contains 349 COVID-19 CT images from 216 patients and 463 non-COVID-19 CTs
    Keyboard: CT scan, Classification
    Leaderboard | paper

  • COVID-CT-MD
    The dataset contains volumetric chest CT scans of 169 patients positive for COVID-19 infection, 60 patients with Community Acquired Pneumonia, and 76 normal patients.
    Keyboard: CT scan, Labeled
    paper Desription of the Dataset

  • COVID-CTset
    The dataset contains 63849 images from 377 patients
    Keyboard: CT scan, Labeled
    paper Desription of the Dataset

  • COVID19-CT
    An Chest CT Image Repository of 1000+ Patients with Confirmed COVID-19 Diagnosis
    licence CC0 1.0

  • COVIDx
    It is a collection of 8 datasets
    Keyboard: CT scan, X-ray
    paper

  • CPTAC-LSCC (Clinical Proteomic Tumor Analysis Consortium Lung Squamous Cell Carcinoma)
    Keyboard: CT scan, PT, Histopathology, Cancer
    licence CC BY 3.0 - TCIA Restricted

  • CRASS12 (Chest Radiograph Anatomical Structure Segmentation)
    Automatic segmentation of anatomical structures in chest radiographs
    paper

  • CT Images in COVID-19
    A dataset from 632 patients with COVID-19 infections at initial point of care, and a dataset of 121 CTs from 29 patients with COVID-19 infections with serial / sequential CTs.
    paper A classification model derived
    licence CC BY 4.0

  • CT-RATE
    The 3D medical imaging dataset that pairs images with textual reports.
    Keyboard: CT scan, Labeled
    paper
    licence CC BY-NC-SA

  • CTVIE19
    Data consist of PFTs, multi-inflation non-contrast CT (4D or breath-hold) and contrast-based ventilation images (nuclear imaging or hyperpolarised gas MRI) for patients with lung cancer and several non-oncological obstructive respiratory diseases including cystic fibrosis, asthma and COPD.
    Keyboard: CT scan, Labeled, Segmentation

  • CXLSeg
    Segmented Chest X-ray radiographs based on the MIMIC-CXR dataset.
    Keyboard: Segmentation
    licence PhysioNet Credentialed Health Data License 1.5.0

  • DIR-Lab (Deformable Image Registration Laboratory)
    Thoracic 4DCT images. Inspiratory and expiratory breath-hold CT image pairs.
    Keyboard: CT scan, Labeled
    paper

  • ELCAP (Early Lung Cancer Action Program)
    The database currently consists of an image set of 50 low-dose documented whole-lung CT scans for detection.
    Keyboard: CT scan, Nodules, Labeled

  • EMPIRE10 (Evaluation of Methods for Pulmonary Image Registration 2010)
    Keyboard: CT, Registration of thoracic
    paper
    licence Data downloaded from this site may only be used for the purpose of preparing an entry to be submitted on this site.

  • EXACT09 (Extraction of Airways from CT 2009)
    The images are volumetric chest CT scans acquired at different sites using several different scanners, scanning protocols, and reconstruction parameters.
    paper

  • Indiana U. Chest X-rays
    Each image classified manually into frontal and lateral chest X-ray categories.
    paper

  • INSPECT
    It contains data from 19,402 patients, including CT images, radiology report impression sections, and structured electronic health record (EHR) data.
    Keyboard: CT scan, Cancer, Labeled
    paper
    licence Stanford university dataset research use aggrement

  • IQ-OTH/NCCD (Iraq-Oncology Teaching Hospital/National Center for Cancer Diseases)
    The dataset contains a total of 1190 images representing CT scan slices of 110 cases.
    Keyboard: CT scan, Cancer, Labeled
    licence CC0: Public Domain

  • JSRT (Japanese Society of Radiological Technology)
    The database includes 154 conventional chest radiographs with a lung nodule and 93 radiographs without a nodule
    Keyboard: X-ray
    paper Cited DB | paper Cited DB

  • LCTSC (Lung CT Segmentation Challenge)
    Keyboard: CT scan, Cancer
    paper
    licence CC BY 3.0

  • LIDC-IDRI (Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI))
    Keyboard: CT scan, Cancer, Labeled
    paper
    licence CC BY 3.0

  • LNDb (Lung Nodule Database)
    Lung nodule detection, segmentation and characterization as well as prediction of patient follow-up
    Keyboard: Cancer, CT-scan
    Leaderboard | paper
    licence CC BY-NC-ND 4.0

  • LOLA11 (LObe and Lung Analysis 2011)
    Compare methods for (semi-)automatic segmentation of the lungs and lobes from chest
    Keyboard: segmentation, CT-scan
    Leaderboard
    licence Data downloaded from this site may only be used for the purpose of preparing an entry to be submitted on this site. ...

  • LUMIC
    Keyboard: CT-scan, registration, Labeled
    paper

  • LUNA16 (LUng Nodule Analysis 2016)
    Nodule location detection
    Keyboard: Cancer, CT-scan
    Leaderboard | paper Overview paper
    licence Creative Commons Attribution 4.0 International License

  • Lung-Fused-CT-Pathology
    Mapping the extent of Invasive Adenocarcinoma onto in vivo lung CT
    Keyboard: Cancer, CT scan, Labeled
    paper
    licence CC BY 3.0

  • Lung-PET-CT-Dx
    A Large-Scale CT and PET/CT Dataset for Lung Cancer Diagnosis
    Keyboard: Cancer, Labeled
    licence CC BY 4.0

  • MedSeg Covid Dataset
    This is a dataset of 100 axial CT images from >40 patients with COVID-19
    licence CC0

  • MELA (Mediastinal Lesion Analysis)
    Detectition mediastinal lesions from 1100 CT scans, consisting of 770 CTs for training, 110 CTs for validation, and 220 CTs for testing.
    Keyboard: CT Scan
    Leaderboard

  • MIDRC-RICORD-1A (Medical Imaging Data Resource Center - RSNA International COVID-19 Open Radiology Database Release 1a)
    120 Chest CT Covid+
    Keyboard: CT Scan, Labeled
    paper
    licence CC BY-NC 4.0

  • MIDRC-RICORD-1B (Medical Imaging Data Resource Center - RSNA International COVID-19 Open Radiology Database Release 1b)
    120 Chest CT Covid+
    Keyboard: CT Scan, Labeled
    paper
    licence CC BY-NC 4.0

  • MIDRC-RICORD-1C (Medical Imaging Data Resource Center - RSNA International COVID-19 Open Radiology Database Release 1c)
    998 Chest X-rays Covid+
    Keyboard: X-rays, Labeled
    paper
    licence CC BY-NC 4.0

  • MIMIC-CXR
    The dataset contains 377,110 images corresponding to 227,835 radiographic.
    Keyboard: X-ray
    paper
    licence PhysioNet Credentialed Health Data License 1.5.0

  • MIMIC-CXR-JPG
    The MIMIC-CXR-JPG dataset is wholly derived from MIMIC-CXR, providing JPG format files derived from the DICOM images and structured labels derived from the free-text reports.
    Keyboard: X-ray
    paper
    licence PhysioNet Credentialed Health Data License 1.5.0

  • NLST (National Lung Screening Trial)
    26,254 low-dose CT scans
    Keyboard: CT Scan, Labeled
    paper
    licence CC BY 4.0

  • NODE21
    Detection and generation of lung nodules.
    Keyboard: X-ray
    Leaderboard | paper
    licence CC BY-NC-ND 4.0

  • NSCLC Radiogenomics (Non-Small Cell Lung Cancer)
    The dataset comprises Computed Tomography (CT), Positron Emission Tomography (PET)/CT images, semantic annotations of the tumors as observed on the medical images using a controlled vocabulary, segmentation maps of tumors in the CT scans, and quantitative values obtained from the PET/CT scans.
    paper
    licence CC BY 3.0

  • NSCLC Radiogenomics-Stanford (Non-Small Cell Lung Cancer)
    This collection contains images from patients with NSCLC imaged prior to surgical excision with both thin-section computed tomography (CT) and whole body positron emissions tomography (PET)/CT scans
    paper
    licence CC BY 3.0

  • NSCLC-Radiomics (Non-Small Cell Lung Cancer)
    This collection contains images from 422 patients.
    Keyboard: CT Scan, Labeled
    licence CC BY-NC 3.0

  • NSCLC-Radiomics-Interobserver1 (Non-Small Cell Lung Cancer)
    This collection contains clinical data and computed tomography from 22 non-small cell lung cancer radiotherapy patients.
    Keyboard: CT Scan, Labeled
    licence CC BY-NC 3.0

  • PadChest
    This dataset includes more than 160,000 images of 67,000 patients.
    Keyboard: X-ray, Labeled
    paper

  • Phantom FDA
    As part of a more general effort to probe the interrelated factors impacting the accuracy and precision of lung nodule size estimation, it has been presented with an anthropomorphic thoracic phantom containing a vasculature insert on which synthetic nodules were inserted or attached.
    Keyboard: CT scan
    paper
    licence CC BY 3.0

  • Pulmonary Chest X-Ray Abnormalities
    Diagnose tuberculosis and other diseases from x-rays.
    Keyboard: Labeled

  • QIN Lung CT
    Keyboard: Nodule, CT Scan, Segmentation
    licence CC BY 3.0

  • RIDER Lung CT
    Coffee-break lung CT collection with scan images reconstructed at multiple imaging parameters
    licence CC BY 4.0

  • RSNA Pneumonia Detection (Radiological Society of North America 2018)
    30,000 frontal view chest radiographs
    Keyboard: X-ray, Labeled

  • RSNA Pulmonary Embolism (Radiological Society of North America 2020)
    Detect and characterize instances of pulmonary embolism (PE) on chest CT studies
    Keyboard: CT scan, Labeled
    Leaderboard | paper

  • SARS-CoV-2 CT-scan
    Containing 1252 CT scans that are positive for SARS-CoV-2 infection (COVID-19) and 1230 CT scans for patients non-infected by SARS-CoV-2, 2482 CT scans in total
    paper
    licence CC BY-NC-SA 4.0

  • SHCXR Lung Mask
    Manually Segmented Lungs Masks for Shenzhen Hospital Chest X-ray Set
    paper | paper
    licence CC BY-NC-SA 4.0

  • SIIM-ACR Pneumothorax Segmentation
    Identify Pneumothorax disease in chest x-rays
    Leaderboard

  • SIMBA
    Chest Health Analysis System Public Lung Image Database.
    Keyboard: CT scan, Labeled

  • SPIE-AAPM Lung CT Challenge
    SPIE-AAPM-NCI Lung Nodule Classification Challenge Dataset
    paper | paper
    licence CC BY 3.0

  • STOIC2021
    Study of Thoracic CT in COVID-19
    Keyboard: CT scan
    Leaderboard | paper
    licence CC BY-NC 4.0

  • TCGA-LUAD (The Cancer Genome Atlas Lung Adenocarcinoma)
    Data from 69 Participants
    Keyboard: Multi-modality
    licence CC BY 3.0

  • VAMPIRE (Ventilation And Medical Pulmonary Image Registration Evaluation)
    It includes 50 pairs of 4DCT scans and corresponding clinical or experimental ventilation scans, referred to as reference ventilation images (RefVIs). The dataset includes 25 humans imaged with Galligas 4DPET/CT, 21 humans imaged with DTPA-SPECT, and 4 sheep imaged with Xenon-CT.
    Keyboard: CT-scan, Animal
    paper

  • VESSEL12 (VESsel SEgmentation in the Lung 2012)
    Automatic (and semi-automatic) segmentation of blood vessels in the lungs from CT images
    Keyboard: CT-scan
    Leaderboard | paper Overview paper

  • VIA/I-ELCAP
    The database contains a number of annotated CT image scans that highlight many of the key issues in measuring large lesions in the lung.
    Keyboard: CT-scan
    paper The dataset is described

  • VinDr-PCXR
    The dataset consists of 9,125 posteroanterior (PA) view CXR scans in patients younger than ten years and comes with both the localization of critical findings and the classification of common thoracic diseases.
    Keyboard: Labeled
    licence PhysioNet Restricted Health Data License 1.5.0

  • WSSS4LUAD (Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma)
    Segment tumor epithelial, tumor-associated stroma and normal tissue with only patch-level labels.
    Keyboard: H&E stained Whole Slide Image (WSI), Cancer
    Leaderboard | paper
    licence CC BY 4.0


Musculoskeletal System

Bones

  • 2D-3D-GS
    A database of CT, MR and X-ray images of cadaver lumbar spine (L1 - L5) and the gold standard registrations between these images, along with the software for registration evaluation and user manual.
    licence CC BY-NC-ND

  • IVDM3Seg
    Intervertebral Disc Localization and Segmentation from 3D Multi-modality MR (M3) Images
    Keyboard: MRI
    Leaderboard

  • Fractured Limbs
    5684 CT images (upper limbs: 2057 and lower limbs: 3627) to understand bone injurie.
    Keyboard: CT scan, Labeled
    paper

  • lumbar-CT-vertebrae
    A database of computed tomography (CT) images with reference segmentations of lumbar vertebrae.
    licence CC BY-NC-ND

  • Leg-3D-US
    The dataset assembles pairs of Ultrasound volumes and 3-labels muscles of the low-limb leg from 44 healthy volunteers, aged between 18 and 45 years.
    paper
    licence GNU General Public License

  • MURA (MUsculoskeletal RAdiographs)
    Large Dataset for Abnormality Detection in Musculoskeletal Radiographs
    Keyboard: X-ray, Labeled
    paper
    licence Stanford university dataset research use aggrement

  • Osteosarcoma-Tumor-Assessment
    Osteosarcoma data from UT Southwestern/UT Dallas for Viable and Necrotic Tumor Assessment
    Keyboard: Cancer, Histopathology, Labeled
    paper | paper | paper
    licence CC BY 3.0

  • PENGWIN
    Pelvic fracture segmentation techniques in both 3D CT scans and 2D X-ray images
    Keyboard: MRI, Cancer, Labeled
    Leaderboard | paper
    licence CC BY 4.0

  • RibFrac
    A dataset for detect and classify around 5,000 rib fractures from 660 computed tomography (CT) scans
    Keyboard: CT scan, Labeled
    Leaderboard
    licence CC BY-NC 4.0

  • RibSeg
    It includes two subsets.
    Keyboard: CT scan, Segmentation
    paper | paper
    licence CC BY-NC 4.0

  • RSNA Bone Age (Radiological Society of North America 2017)
    Identify the age of a child from an X-ray of their hand
    Keyboard: X-ray, Labeled
    paper | paper | paper

  • RSNA Cervical Spine Fracture (Radiological Society of North America 2022)
    Including approximately 3,000 CT
    Keyboard: CT scan, Labeled
    Leaderboard

  • SPIDER (SPIne Segmentation: Discs, vERtebrae, and spinal canal)
    A lumbar spine MR dataset with reference segmentations of vertebrae, intervertebral discs (IVDs), and spinal canal
    Keyboard: MRI
    Leaderboard | paper
    licence CC-BY 4.0

  • Spinal Disease Dataset
    The dataset includes MRI images of T1 and T2 sagittal plane and T2 axial plane (FSE/TSE).
    Keyboard: MRI, Segmentation, Labeled
    licence CC-BY-SA-NC 4.0

  • SpineWeb BrokenLink
    16 spinal imaging datasets

  • VerSe
    Large Scale Vertebrae Segmentation
    Keyboard: CT scan, Segmentation
    Leaderboard | paper | paper
    licence CC BY-SA 4.0

  • xVertSeg
    Classify and segment vertebrae from the spine images that include fractured and non-fractured cases
    Keyboard: CT scan, Segmentation, Classification

Joints

  • DDH x-ray images
    The hip X-ray images (in anteroposterior view) including 354 subjects (120 DDH, 234 normal)
    Keyboard: Developmental dysplasia of the hip (DDH), Labeled
    paper
    licence CC BY 4.0

  • K2S
    A dataset of high-resolution 3D knee MRI including raw k-space data and post-processing annotations with masks for tissue segmentation.
    Keyboard: MRI, Labeled
    licence CC BY NC ND

  • Knee Osteoarthritis Dataset with Severity Grading
    Keyboard: X-ray, Labeled
    licence CC BY 4.0

  • kneeMRI
    The dataset consists of 917 12-bit grayscale volumes of either left or right knees.
    Keyboard: MRI scans, Labeled
    paper
    licence CC BY-NC-ND 4.0

  • KNOAP2020 (KNee OsteoArthritis Prediction)
    Keyboard: MRI scans, X-ray, Labeled
    paper
    licence The provided data may only be used for preparing an entry to be submitted to this challenge.

  • LERA (Lower Extremity Radiographs)
    Data is collected from 182 patients who underwent a radiographic examination. The dataset consists of images of the foot, knee, ankle, or hip associated with each patient.
    Keyboard: X-ray
    licence Stanford University School of Medicine LERA- Lower Extremity RAdiographs Dataset Research Use Agreement

  • MRNet
    Diagnosis of abnormalities from Knee MRs
    Keyboard: MRI, Labeled
    paper
    licence Stanford University School of Medicine MRNet Dataset Research Use Agreement

  • OAI (Osteoarthritis Initiative)
    The dataset contains cases of moderate and severe OA.
    Keyboard: MRI
    paper

  • SKM-TEA (Stanford Knee MRI Multi-Task Evaluation)
    The dataset consists of 86 scans for training, 33 scans for validation, and 36 scans for testing.
    Keyboard: MRI, Segmentation, Labeled
    paper
    licence Stanford University School of Medicine SKM-TEA Dataset Research Use Agreement

  • X-ray images of the hip joints
    A dataset consisting of x-ray examinations of the lower legs performed as part of routine medical service.
    licence CC BY 4.0


Pelvis and Reproductive Organs

Female Reproductive Organs

  • A-AFMA (Automatic amniotic fluid measurement and analysis)
    The goal is measurement of the maximum vertical pocket (MVP)
    Keyboard: Ultrasound Video Clip
    licence No publication rights are given on this data. The data may only be used for the purpose of this challenge.

  • ACOUSLIC-AI (Abdominal Circumference Operator-agnostic UltraSound measurement)
    Diagnosing fetal growth restriction is challenging in low-resource settings
    Leaderboard
    licence CC BY-NC-SA

  • ATEC23
    Automated prediction of treatment effectiveness in ovarian cancer using histopathological images
    Keyboard: Whole Slide Images (WSIs), Cancer
    paper
    licence CC BY-NC 4.0

  • Cervix93
    A Cervical Cytology Dataset for Nucleus Detection and Image Classification and Methods for Cervical Nucleus Detection
    Keyboard: Pap smear, Cancer, Labeled
    paper

  • CMB-OV (Cancer Moonshot Biobank - Ovarian Carcinoma Cancer)
    Keyboard: Histopathology
    licence CC BY 4.0

  • CPTAC-UCEC (Clinical Proteomic Tumor Analysis Consortium Uterine Corpus Endometrial Carcinoma)
    Keyboard: Multi-modality, Cancer
    licence CC BY 3.0

  • DTU/HERLEV (PAP-SMEAR)
    Keyboard: Pap smear, Labeled
    paper Benchmark

  • ENDO-AID
    The dataset consists of 91 digital pathology whole-slide images (WSI) of endometrium carcinoma Pipelle biopsies, stained with hematoxylin and eosin (H&E).
    Keyboard: Whole-slide images (WSI), Cancer
    licence Creative Commons Attribution Non Commercial 4.0 International

  • Fetoscopy Placenta
    The dataset contains 483 frames with ground-truth vessel segmentation annotations taken from six different in vivo fetoscopic procedure videos.
    Keyboard: Segmentation, Labeled
    paper
    licence CC BY-NC-SA 4.0

  • HC18
    Measurement of fetal head circumference (HC)
    Keyboard: Ultrasound imaging, Labeled
    Leaderboard

  • Intel & MobileODT Cervical Cancer Screening
    Keyboard: Colposcopy, Classification, Cancer

  • IUGC (Intrapartum Ultrasound Grand Challenge 2024)
    Intrapartum ultrasound videos, aiming for participants to develop an automated fetal biometry measurement method
    Keyboard: Classification, Segmentation

  • JNU-IFM
    An intrapartum transperineal ultrasound dataset of the Intelligent Fetal Monitoring
    Keyboard: Ultrasound videos, Labeled
    paper
    licence CC BY 4.0

  • Liquid based-cytology Pap smear
    The repository consists of a total of 963 LBC images
    Keyboard: Pap Smear, Labeled
    paper
    licence CC BY 4.0

  • MMOTU (Multi-Modality Ovarian Tumor Ultrasound Image)
    It consists of two sub-sets with two modalities, which are OTU_2d and OTU_CEUS respectively including 1469 2d ultrasound images and 170 CEUS images.
    Keyboard: Labeled, Segmentation, Classification, Cancer
    paper

  • Overlapping Cervical Cytology Image Segmentation
    The targets are to extract the boundaries of individual cytoplasm and nucleus from overlapping cervical cytology images.
    Keyboard: Segmentation, Cancer
    Leaderboard

  • Overlapping Cervical Cytology Image Segmentation 2
    The targets are to extract the boundaries of individual cytoplasm and nucleus from overlapping cervical cytology images.
    Keyboard: Segmentation, Cancer
    Leaderboard

  • Ps-Fh-Aop-2023 (Pubic Symphysis-Fetal Head Segmentation and Angle of Progression)
    Keyboard: Ultrasound imaging, Labeled
    Leaderboard

  • TCGA-CESC (The Cancer Genome Atlas Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma Collection)
    Data from 54 subjects.
    Keyboard: MRI
    licence CC BY 3.0

  • TCGA-OV (The Cancer Genome Atlas Ovarian Cancer)
    Data from 143 subjects and 53,662 images.
    Keyboard: Ovary, MRI, CT scan
    licence CC BY 3.0

  • TCGA-UCEC (The Cancer Genome Atlas Uterine Corpus Endometrial Carcinoma)
    Data from 65 subjects and 77214 images
    Keyboard: Multi-modality
    licence CC BY 3.0

  • SIPaKMeD
    The database consists of 4049 images of isolated cells that have been manually cropped from 966 cluster cell images of Pap smear slides.
    Keyboard: Pap smear, Labeled
    paper
    licence It can be used for experimental purposes with the request to cite the paper.

Male Reproductive Organs

  • AGGC22 (Automated Gleason Grading Challenge 2022)
    Dataset of prostatectomy and biopsy specimens with annotations
    Keyboard: H&E-stained whole slide image
    Leaderboard | paper
    licence CC BY-NC-SA 4.0

  • AUTO-RTP (Fully Automated Radiotherapy Treatment Planning)
    Keyboard: Cancer
    Leaderboard
    licence Use of the data is restricted to this challenge and related publications

  • BIMCV-Prostate
    It includes a total of 9,341 prostate MRI sessions, distributed among 8,441 subjects.
    Keyboard: Cancer
    licence CC BY 4.0

  • CMB-PCA (Cancer Moonshot Biobank - Prostate Cancer)
    Keyboard: Multi-modality
    licence CC BY 4.0 - TCIA Restricted

  • Gleason 2019
    Gleason grading of prostate cancer in digital histopathology images
    Keyboard: H&E-stained histopathology image, Cancer, Labeled
    Leaderboard

  • I2CVB (Initiative for Collaborative Computer Vision Benchmarking)
    It provides a multi-parametric MRI dataset to help at the development of computer-aided detection and diagnosis system.
    Keyboard: MRI, Segmentation, Labaled
    paper

  • Multi-site Dataset for Prostate MRI Segmentation
    It contains prostate T2-weighted MRI data (with segmentation mask) collected from six different data sources out of three public datasets.

  • NCI-ISBI 2013
    Automated Segmentation of Prostate Structures. Image data were selected from PROSTATE-DIAGNOSIS and Prostate-3T collections
    Keyboard: MRI, Labeled
    licence CC BY 3.0

  • PANDA (Prostate cANcer graDe Assessment)
    Classifying the severity of prostate cancer from microscopy scans of prostate biopsy samples
    Keyboard: whole-slide images (WSI), Cancer
    paper
    licence Subject to Competition Rules

  • PI-CAI (Prostate Imaging: Cancer AI)
    Keyboard: Prostate, MRI, Cancer, Labeled
    Leaderboard

  • PROMISE12 (Prostate MR Image Segmentation 2012)
    Compare interactive and (semi)-automatic segmentation algorithms for MRI of the prostate
    Keyboard: T2-weighted MRI, Labeled
    Leaderboard | paper Overview paper

  • Prostate-3T
    Prostate transversal T2-weighted magnetic resonance images acquired on a 3.0T Siemens TrioTim using only a pelvic phased-array coil were acquired for prostate cancer detection.
    Keyboard: MRI
    licence CC BY 3.0

  • PROSTATE-DIAGNOSIS
    Prostate cancer T1- and T2-weighted magnetic resonance images were acquired at 1.5 T
    Keyboard: MRI
    licence CC BY 3.0

  • Prostate segmentation 2009
    Keyboard: MRI

  • PROSTATEx
    This dataset have been included in the PI-CAI Public Training and Development dataset.
    Keyboard: MRI, Cancer
    licence CC BY 3.0

  • QIN-PROSTATE-Repeatability
    This is a dataset with multiparametric prostate MRI applied in a test-retest setting, allowing to evaluate repeatability of the MRI-based measurements in the prostate.
    Keyboard: MRI, Labeled
    paper
    licence CC BY 4.0

  • TCGA-PRAD (The Cancer Genome Atlas Prostate Adenocarcinoma)
    Data from 14 subjects and 16,790 images.
    Keyboard: Multi-modality
    licence CC BY 3.0


Other Organs and Systems

Lymph Nodes

  • CALGB50303
    Rituximab and Combination Chemotherapy in Treating Patients With Diffuse Large B-Cell Non-Hodgkin's Lymphoma
    Keyboard: Cancer, Multi-modality
    paper | paper

  • CAMELYON16
    Detection of metastases in hematoxylin and eosin (H&E) stained whole-slide images of lymph node sections
    Keyboard: Cancer, Digital pathology, Lymph node detection
    Leaderboard | paper | paper

  • CAMELYON17
    Evaluate new and existing algorithms for automated detection and classification of breast cancer metastases in whole-slide images of histological lymph node sections
    Keyboard: Cancer, Digital pathology, Lymph node detection
    Leaderboard | paper | paper

  • CT Lymph nodes
    90 CTs dataset of lymph nodes
    Keyboard: CT scan, Lymph node detection
    licence CC BY 3.0

  • DLBCL-Morphology
    H&E and immunohistochemical stain images of 209 cases of diffuse large B-cell lymphoma linked with cytogenetic features and clinical outcomes.
    Keyboard: Histopathology, Cancer
    licence CC BY-NC 4.0

  • LNQ2023 (Mediastinal Lymph Node Quantification)
    Segmentation of Heterogeneous CT Data
    Keyboard: CT scan, Cancer
    Leaderboard

  • LyNoS
    15 CTs with corresponding lymph nodes, azygos, esophagus, and subclavian carotid arteries
    Keyboard: CT scan, Segmentation
    paper

  • Mediastinal-Lymph-Node-SEG
    Mediastinal Lymph Node Quantification (LNQ): Segmentation of Heterogeneous CT Data
    Keyboard: CT scan, Labeled
    licence CC BY 4.0

  • PatchCamelyon
    Image classification dataset consists of 327.680 color images extracted from histopathologic scans of lymph node sections.
    Keyboard: Labeled
    Leaderboard | paper

Skin

  • 7-point criteria evaluation
    For evaluating computerized image-based prediction of the 7-point skin lesion malignancy checklist.
    Keyboard: Labeled
    paper
    licence CC BY-NC-ND

  • Anti-PD-1_MELANOMA
    This collection includes 47 melanoma cases treated with anti-PD1 immunotherapy, each with pre-treatment and 1 or more imaging follow-up timepoints.
    Keyboard: Multi-modality, Cancer
    licence TCIA Restricted

  • Asan and Hallym Datasets
    paper | paper

  • Atlas Dermatologico
    It contains more than 12500 pictures of dermatology diseases.

  • CPTAC-CM (Clinical Proteomic Tumor Analysis Consortium Cutaneous Melanoma)
    Keyboard: Multi-modality, Cancer
    licence CC BY 3.0

  • DDI (Diverse Dermatology Images)
    A biopsy-proven skin disease dataset with diverse skin tone representation.
    Keyboard: Labeled
    paper

  • Dermofit Image Library
    1300 High quality skin lesion images.
    Keyboard: Segmentation, Labeled

  • Dermoscopy and Dermatoscopy Atlas
    It contains 433 diagnosis.

  • Fitzpatrick 17k
    16,577 clinical images sourced from two dermatology atlases — DermaAmin and Atlas Dermatologico
    paper
    licence CC BY-NC-SA 3.0

  • FUSC (Foot Ulcer Segmentation Challenge)
    In the dataset provided, over 1200 images are collected over 2 years from hundreds of patients.
    Keyboard: Labeled
    Leaderboard | paper

  • HAM10000
    A collection of multi-source dermatoscopic images of common pigmented skin lesions
    Keyboard: Dermatoscopic images

  • ISIC (International Skin Imaging Collaboration)
    It has multi versions.
    Keyboard: Dermoscopic images
    paper | paper | paper

  • MED-NODE
    It consists of 170 clinical images (70 melanoma and 100 nevi cases).
    paper

  • Melanoma Dataset
    Keyboard: Classification, Dermatoscopic images
    paper

  • PAD-UFES-20
    There are 1,373 patients, 1,641 skin lesions, and 2,298 images present in the dataset.
    Keyboard: Cancer, Detection
    paper
    licence CC BY 4.0

  • PH2
    The database contains a total of 200 dermoscopic images of melanocytic lesions
    Keyboard: Classification, Segmentation, Dermoscopic images, Labeled
    paper

  • SD-198 (Skin Disease)
    It contains 198 different diseases from different types of eczema, acne and various cancerous conditions.
    Keyboard: Labeled
    paper

  • Skin Cancer Detection
    This includes images extracted from the public databases DermIS and DermQuest, along with manual segmentations of the lesions.
    Keyboard: Segmentation, Cancer


Multi Organs Datasets

  • AAPM-RT-MAC
    The data contains a total of 55 MRI cases, each from a single examination from a distinct patient.
    Keyboard: Head and Neck
    licence TCIA Restricted

  • AbdomenAtlas
    8,448 CT volumes, totaling 3.2 million CT slices.
    Keyboard: Spleen, liver, kidneys, stomach, gallbladder, pancreas, aorta, and IVC, CT Scan, Segmentation, Labeled
    paper
    licence CC BY-NC 4.0

  • AbdomenCT-1K
    The dataset with more than 1000 (1K) abdominal organ scans.
    Keyboard: liver, kidney, spleen, and pancreas, CT Scan, Segmentation, Labeled
    paper

  • Abdominal ultrasound simulation scans
    The scans were acquired from 11 subjects without any abdominal pathology or known disease.
    Keyboard: Liver, kidney, pancreas, vessels, adrenals, gallbladder, bones, spleen, Ultrasound (US) imaging, Segmentation, Labeled

  • AIDA-E (Analysis of Images to Detect Abnormalities in Endoscopy)
    Keyboard: Multi tissues, Endoscopy, Cancer

  • AMOS
    A large-scale abdominal multi-organ benchmark for versatile medical image segmentation
    Keyboard: Multi-tissue (15 abdominal organs), MRI, CT scan
    Leaderboard

  • ANHIR (Automatic Non-rigid Histological Image Registration)
    They have assembled 8 datasets, containing 355 images with 18 different stains, resulting in 481 image pairs to be registered.
    Keyboard: Multi-tissue, Whole-slide images
    Leaderboard | paper

  • APOLLO-5 BrokenLink (Applied Proteogenomics OrganizationaL Learning and Outcomes)
    A collection of 31 datasets for different organs.
    Keyboard: Cancer, Multi-modality
    licence TCIA Limited (contact Support)

  • AutoPET
    A whole-body FDG-PET/CT dataset with manually annotated tumor lesions (FDG-PET-CT-Lesions)
    Keyboard: PET - CT scan, Labeled
    licence TCIA Restricted

  • Cell Segmentation in Multi-modality Microscopy Images
    Keyboard: Multi tissues, High-Resolution Microscopy Images, Segmentation, Labeled
    Leaderboard | paper Summary Paper
    licence CC BY-NC-ND

  • CHAOS (Combined (CT-MR) Healthy Abdominal Organ Segmentation)
    There are 20 training and 20 testing cases in the CT dataset. MRI dataset contains 20 training and 20 testing cases with T1-Dual and T2 SPIR sequences.
    Keyboard: Liver, Kidneys, Spleen, CT Scan, MRI, Labeled
    Leaderboard | paper

  • COSAS (Cross-Organ and Cross-Scanner Adenocarcinoma Segmentation)
    Segmenting normal gland and adenocarcinoma regions
    Keyboard: Whole slide images
    Leaderboard
    licence CC BY-NC-ND

  • CPTAC-SAR (The Clinical Proteomic Tumor Analysis Consortium SARcomas)
    Keyboard: Abdomen, Arm, Bladder, Chest, Head-Neck, Kidney, Leg, Retroperitoneum, Stomach, and Uterus, Multi-modality
    licence CC BY 3.0

  • CT-ORG
    This dataset consists of 140 computed tomography scans come from a wide variety of sources.
    Keyboard: Bone, Liver, Lung, Kidney and Bladder, CT scan
    licence CC BY 3.0

  • CURVAS (Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation)
    Keyboard: Pancreas, Kidney and Liver, CT scan
    Leaderboard
    licence CC BY-NC

  • Decathlon
    Medical Segmentation Decathlon (MSD) Generalisable 3D Semantic Segmentation.
    Keyboard: Liver, Brain, Hippocampus, Lung, Prostate, Cardiac, Pancreas, Colon, Hepatic Vessels and Spleen, Multi-modality, Labeled
    Leaderboard | paper | paper
    licence CC-BY-SA 4.0

  • DeepLesion
    A dataset with 32,735 lesions in 32,120 CT slices from 10,594 studies of 4,427 unique patients.
    Keyboard: Bone, Abdomen, Mediastinum, Liver, Lung, Kidney, Soft tissue, and Pelvis, CT scan, Cancer
    paper

  • EAD 2019 (Endoscopy artifact detection)
    Facilitating diagnosis and treatment of diseases in hollow organs.
    Keyboard: Multi-tissue, Multi-modality, Video Endoscopy, Labeled
    Leaderboard | paper

  • EAD 2020 (Endoscopy artifact detection)
    The 8 classes in this challenge include specularity, bubbles, saturation, contrast, blood, instrument, blur and imaging artefacts.
    Keyboard: Multi-tissue, Multi-modality, Video Endoscopy, Labeled
    paper

  • EndoVis
    A large collection of publicly accessible datasets comprising various computer vision tasks (classification, segmentation, detection, localization,…) and subdisciplines ranging from laparoscopy to coloscopy and surgical training.

  • fastMRI
    Keyboard: Knee, Brain, and Prostate, MRI
    paper | paper | paper

  • fastPET-LD (Fast PET-CT lesion detection)
    Keyboard: Hot spots, PET - CT scan, Labeled, Detection
    Leaderboard
    licence Participants cannot share the data, cannot use it for any commercial purpose.

  • FLARE 2022 (Fast and Low-resource semi-supervised Abdominal oRgan sEgmentation)
    A small number of labeled cases (50) and a large number of unlabeled cases (2000) in the training set.
    Keyboard: Multi-tissue (13 organs), CT scan, Labeled
    Leaderboard | paper

  • FLARE21 (Fast and Low GPU memory Abdominal oRgan sEgmentation)
    A abdominal CT organ dataset with 500 CT scans from 11 countries, including multi-center, multi-phase, multi-vendor, and multi-disease cases.
    Keyboard: Liver, Kidney, Spleen, and Pancreas, CT scan, Cancer, Segmentation
    Leaderboard | paper Summary Paper

  • HaN-Seg (Head and Neck Segmentation)
    Images of 60 patients aged 34–79 years that were appointed for image-guided Radiotherapy in the HaN region
    Keyboard: 30 organs-at-risk, CT Scan, MRI, Labeled
    Leaderboard | paper

  • Head and Neck Auto Segmentation
    With manual segmentation of left and right parotid glands, brainstem, optic chiasm, optic nerves (both left and right), mandible, submandibular glands (both left and right) and manual identification of bony landmarks..
    Keyboard: CT scan, Labeled
    paper

  • HEAD-NECK-RADIOMICS-HN1
    This collection contains clinical data and computed tomography (CT) from 137 head and neck squamous cell carcinoma (HNSCC) patients treated by radiotherapy.
    Keyboard: CT Scan, Segmentation, Labeled
    paper
    licence TCIA No Commercial Limited - CC BY-NC 3.0

  • Healthy-Total-Body-CTs
    This data set includes low-dose whole body CT images and tissue segmentations of thirty healthy adult research participants who underwent PET/CT imaging.
    paper
    licence TCIA Restricted - CC BY 4.0

  • HECKTOR 2020 (HEad and neCK TumOR)
    Automatic bi-modal approaches for the segmentation of H&N tumors in PET-CT scans, focusing on oropharyngeal cancers.
    Keyboard: Head, Neck, FDG-PET/CT scan, Cancer, Labeled
    Leaderboard | paper Overview paper

  • HECKTOR 2021 (HEad and neCK TumOR)
    The automatic segmentation of Head and Neck (H&N) primary tumors in FDG-PET and CT images and the prediction of patient outcomes, namely Progression Free Survival (PFS)
    Keyboard: Head, Neck, FDG-PET/CT images, Cancer, Labeled
    Leaderboard | paper Overview paper

  • HECKTOR 2022 (HEad and neCK TumOR)
    The data were collected for a total of 883 cases consisting of FDG-PET/CT images and clinical information.
    Keyboard: Head, Neck, Lymph nodes, FDG-PET/CT images, Cancer, Segmentation, Labeled
    Leaderboard | paper Overview paper

  • LC25000
    The dataset contains color 25,000 Lung and colon histopathological images
    Keyboard: Lung and Colon, Cancer, Labeled
    paper

  • LDCT-and-Projection-data (Low Dose CT)
    Reconstructed images, patient age and gender, and pathology annotation are also provided for these de-identified data sets.
    Keyboard: Head, Chest, and Abdomen, CT scan
    paper | paper
    licence TCIA Restricted - CC BY 3.0

  • Learn2Reg 2024
    The dataset has over 46,000 nuclei, 71 patients, four organs, and four nucleus types.
    Keyboard: Multi-modality, Registration
    Leaderboard

  • LYON19
    The test set contains Region of Interests (ROIs) selected from whole-slide images (WSI) of immunohistochemistry (IHC) stained specimens
    Keyboard: breast, colon, prostate, whole-slide images (WSI)
    Leaderboard | paper

  • MedFMC
    Foundation Model Prompting for Medical Image Classification
    Keyboard: Thoracic and Colon, Multli Modalities
    Leaderboard

  • MedIMeta (Medical Imaging Meta-Dataset)
    A multi-domain multi-task medical imaging meta-dataset containing 19 medical imaging datasets spanning 10 different domains.
    paper

  • MedMNIST
    18x Standardized Datasets for 2D and 3D Biomedical Image Classification with Multiple Size Options: 28 (MNIST-Like), 64, 128, and 224
    paper | paper
    licence CC BY 4.0 - CC BY-NC 4.0

  • MedPix
    Medical images, teaching cases, and clinical topics, integrating images and textual metadata including over 12,000 patient case scenarios, 9,000 topics, and nearly 59,000 images.

  • MedSeg
    An AI tool for segmentation CT scan and MRI images. There are segmented images of other public dataset in their website

  • MedShapeNet
    This dataset contains over 100,000 3D medical shapes, including bones, organs, vessels, muscles, etc., as well as surgical instruments. It has used in AutoImplant
    paper
    licence CC BY NC 4.0

  • MoNuSAC2020
    The dataset has over 46,000 nuclei, 71 patients, four organs, and four nucleus types.
    Keyboard: Lung, Prostate, Kidney, and Breast, H&E staining, Classification, Segmentation
    Leaderboard | paper Details
    licence CC BY-NC-SA 4.0

  • MoNuSeg
    Keyboard: Multi tissues, H&E stained tissue images, Segmentation, Labeled
    Leaderboard | paper
    licence CC BY-NC-SA 4.0

  • MRIdata
    It is a list of magnetic resonance imaging raw k-space datasets.
    Keyboard: MRI
    licence CC BY-NC 4.0

  • Multi-Atlas Labeling Beyond the Cranial Vault
    It has two subdatasets: Abdominal and Cervix
    Keyboard: Multi tissues, CT scan, Segmentation, Labeled
    Leaderboard

  • Multi-organ Abdominal CT
    The data comprises reference segmentations for 90 abdominal CT images delineating multiple organs.
    Keyboard: Spleen, Left kidney, Gallbladder, Esophagus, Liver, Stomach, Pancreas and Duodenum, CT scan
    paper

  • NuInsSeg (Nuclei Instance Segmentation)
    This dataset contains 665 image patches with more than 30,000 manually segmented nuclei from 31 human and mouse organs.
    Keyboard: Multi organs, H&E-Stained Images, Labeled
    paper

  • OCELOT
    A dataset purposely dedicated to the study of cell-tissue relationships for cell detection in histopathology
    Keyboard: Kidney, Head-neck, Prostate, Stomach, Endometrium, and Bladder, Whole-slide images (WSIs)
    Leaderboard | paper
    licence CC-BY-NC 4.0

  • OCT and Chest X-Ray images
    Keyboard: Eye and Chest, OCT, X-ray, Classification, Labeled
    paper
    licence CC BY 4.0

  • Olympus EndoAtlas
    It is solely for use by qualified medical professionals.
    Keyboard: Esophagus, stomach, Pancreatobiliary, Small-intestine, and Colorectum, Gastrointestinal video endoscopy, EndoCapsule

  • PAIP2021
    Detection of Perineural Invasion in Multiple Organ Cancer
    Keyboard: Colon, Prostate and Pancreatobiliary tract, Whole-slide images (WSIs), Cancer, Labeled
    Leaderboard
    licence CC BY-NC 4.0

  • PanNuke
    Nuclei labels across 19 different tissue types.
    Keyboard: Multi-tissue, whole-slide images (WSIs), Classification, Segmentation
    paper Details | paper Details
    licence CC BY-NC-SA 4.0

  • PathVQA (Pathology Visual Question Answering)
    This version of the dataset contains a total of 5,004 images and 32,795 question-answer pairs.
    Keyboard: Labeled, Text
    paper
    licence MIT License

  • PMC-VQA
    It contains 227k VQA (Visual Question Answerin) pairs of 149k images, covering various modalities or diseases
    Keyboard: Multi-modality, Labeled
    paper

  • Radimagenet
    Annotated medical images from multiple modalities and of multiple pathologies.
    paper

  • RSNA Abdominal Trauma Detection (Radiological Society of North America 2023)
    Including more than 4,000 CT exams with various abdominal injuries and a roughly equal number of cases without injury.
    Keyboard: Liver, Spleen, Kidneys, and Bowel, CT scan, Labeled
    Leaderboard

  • SCR
    Segmentations Chest X-Rays images from JSRT.
    Keyboard: Lungs, Heart, Clavicle, Labeled
    paper
    licence CC BY 4.0

  • SegRap2023
    A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma
    Keyboard: 45 organs-at-risk, CT Scan, Cancer
    Leaderboard | paper

  • SegTHOR (Segmentation of THoracic Organs at Risk)
    The dataset includes 60 3D CT scans, divided into a training set of 40 and a test set of 20 patients.
    Keyboard: Heart, Aorta, Trachea, and Esophagus, CT Scan, Cancer, Labeled
    Leaderboard | paper

  • SMIR BrokenLink
    This collection contains post mortem CT scans of the whole body.
    licence CC_BY_NC_SA_3.0

  • Soft-tissue-Sarcoma
    A radiomics model from joint FDG-PET and MRI texture features for the prediction of lung metastases in soft-tissue sarcomas of the extremities.
    paper
    licence CC BY 3.0

  • StructSeg2019
    Segmentation of organs-at-risk (OAR) and gross target volume (GTV) of tumors of two types of cancers, nasopharynx cancer and lung cancer, for radiation therapy planning.
    Keyboard: Head & neck, Lung, CT scans, Cancer, Labeled

  • SynthRAD2025
    MRI-to-sCT generation to facilitate MRI-only and MRI-based adaptive radiotherapy and CBCT-to-sCT generation to facilitate CBCT-based adaptive radiotherapy.
    Keyboard: Head and Neck, Thorax and Abdomen

  • TCGA-SARC (The Cancer Genome Atlas Sarcoma)
    Data from 5 subjects and 5653 images
    Keyboard: Chest-Abdomen-Pelvis, Leg, and TSpine, Multi-modality, Segmentation
    licence CC BY 3.0

  • tma
    Stanford Tissue Microarray Database
    Keyboard: Multi tissue
    paper

  • TotalSegmentator
    1228 images with segmented 117 anatomical structures Keyboard: Multi-tissue, CT scan, Segmentation, Labeled
    paper
    licence CC BY 4.0

  • Transverse musculoskeletal
    The dataset included 3917 images of biceps brachii, tibialis anterior and gastrocnemius medialis acquired on 1283 subjects.
    Keyboard: Ultrasound images, Segmentation, Neuromuscular
    paper
    licence CC BY 4.0

  • Ultra-low Dose PET Imaging
    The dataset contains 1447 subjects of whole-body 18F-FDG PET imaging
    Keyboard: Positron emission tomography (PET)

  • USenhance 2023 (Ultrasound Image Enhancement)
    Keyboard: Thyroid, Carotid artery, Breast, Liver, and Kidney, Ultrasound imaging
    Leaderboard

  • WORD
    This dataset contains 150 abdominal CT volumes (30495 slices).
    Keyboard: 16 organs, CT scan, Segmentation
    paper
    licence GNU General Public License v3.0


Animals

  • Brain Catalogue
    A collection of the diversity of the vertebrate brain.

  • Canine cutaneous mast cell tumor
    It consists of 32 whole slide images.
    Keyboard: Canine, Cancer
    paper

  • CATCH (CAnine CuTaneous Cancer Histology Dataset)
    Data from 282 subjects
    Keyboard: Canine, Skin Cancer, Histopathology
    licence CC BY 4.0

  • CRT-EPIGGY19 (Cardiac Resynchronization Therapy Electrophysiological)
    Keyboard: Pig

  • ICDC-Glioma (Integrated Canine Data Commons)
    Data from 78 subjects
    Keyboard: Canine, Head, Cancer, MRI, Histopathology
    paper
    licence CC BY 4.0

  • Magnetic Resonance Histology
    High-Resolution Magnetic Resonance Histology of the Embryonic and Neonatal Mouse
    Keyboard: Mouse

  • MCP (Mouse Connectome Project)
    It is an NIH-funded venture that aims to create a complete mesoscale connectivity atlas of the mouse brain and to subsequently generate its global neural networks. Keyboard: Mouse

  • MITOS_WSI_CMC
    Keyboard: Canine, Whole slide image, Cancer, Breast
    paper

  • Mouse-Astrocytoma
    Data from 48 subjects
    Keyboard: Mouse, Head, Glioblastoma Multiforme, Cancer, MRI
    licence CC BY 3.0

  • Mouse Brain atlas
    Keyboard: Mouse, Segmentation, MRI

  • Mouse-Mammary
    Data from 32 subjects
    Keyboard: Mouse, Breast, Cancer, MRI
    licence CC BY 3.0

  • RabbitCT
    Keyboard: Rabbit, CT scan
    paper | paper | paper

  • Rat brain endothelium RNAseq
    Sprague Dawley rats were subjected to transient middle cerebral artery occlusion or sham surgery and 3 days later brains were harvested and single cell suspensions generated. Cells were sorted to enrich for brain endothelium and processed using bulk RNAseq.
    licence CC0 1.0

  • RNR-EXM (Robust Non-rigid Registration Challenge for Expansion Microscopy)
    It released 24 pairs of 3D image volumes from three different species.
    Keyboard: Zebrafish, Mouse, C. elegans, Labeled
    Leaderboard


Notes and Contributions

The papers mentioned only use or explain the datasets. They are here to make it easier to find them. You don't have to cite them.
To know how to refer to each dataset and be sure of their latest usage license, check its description.
If you find any issues with the datasets (like broken links, order, description, etc.), please let me know.
If you know of any other datasets that aren't on the list, please contribute and add them to make the list more complete.


About

Publicly available medical imaging datasets for research and analysis.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%