Quality-control (QC) experiments for segmentation reliability. This repo trains U-Net models, trains score predictors for QC, and evaluates multiple QC baselines (score agreement, Mahalanobis distance, and predictive entropy) with a shared analysis notebook.
- src/model - model implementations:
  - src/model/unet - U-Net models.
  - src/model/score_predictor - score-predictor implementation.
  - src/model/mahalanobis - Mahalanobis-distance model.
  - src/model/calibration - calibration utilities.
- src/apps - training and evaluation entrypoints.
- src/notebooks - evaluation notebooks.
- results/ - saved outputs (per-dataset/split/method runs).
- pre-trained/ - pretrained checkpoints.
U-Net training and evaluation:
Score-predictor training and evaluation (a Beta$_{\mu,\kappa}$ QC head on top of the U-Net):
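For intuition, a minimal PyTorch sketch of one common mean/concentration parameterization of a Beta QC head: the network predicts $\mu \in (0,1)$ (the expected quality score) and $\kappa > 0$ (the concentration), mapped to $\alpha = \mu\kappa$, $\beta = (1-\mu)\kappa$. The class and function names here (`BetaQCHead`, `beta_nll`) are illustrative assumptions, not the repo's actual API:

```python
import torch
import torch.nn as nn

class BetaQCHead(nn.Module):
    """Predicts a Beta(mu, kappa) distribution over a quality score in (0, 1).

    Mean/concentration parameterization: alpha = mu * kappa and
    beta = (1 - mu) * kappa, so mu is the predicted score and kappa
    controls how concentrated the prediction is.
    """

    def __init__(self, in_features: int):
        super().__init__()
        self.mu_head = nn.Linear(in_features, 1)
        self.kappa_head = nn.Linear(in_features, 1)

    def forward(self, features: torch.Tensor) -> torch.distributions.Beta:
        mu = torch.sigmoid(self.mu_head(features))                        # mean in (0, 1)
        kappa = nn.functional.softplus(self.kappa_head(features)) + 1e-4  # concentration > 0
        return torch.distributions.Beta(mu * kappa, (1.0 - mu) * kappa)

def beta_nll(dist: torch.distributions.Beta, score: torch.Tensor) -> torch.Tensor:
    """Negative log-likelihood of an observed score (e.g. Dice), clamped off {0, 1}."""
    score = score.clamp(1e-4, 1.0 - 1e-4)
    return -dist.log_prob(score).mean()
```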
These scripts compute the QC signals and save them into results files for later aggregation (a sketch of the entropy and Mahalanobis computations follows the list):
- Score agreement (SA): src/apps/eval_score_agreement.sh
- Mahalanobis distance (Maha): src/apps/eval_mahalanobis.sh
- Predictive entropy (PE): src/apps/eval_comp_entropy.sh
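Predictive entropy and the Mahalanobis distance are standard signals. Below is a minimal NumPy sketch of both, assuming the U-Net exposes per-pixel softmax probabilities and a per-case feature vector; the function names are illustrative, not the repo's API:

```python
import numpy as np

def predictive_entropy(probs: np.ndarray) -> float:
    """Mean per-pixel entropy of softmax probabilities with shape (C, H, W)."""
    eps = 1e-12
    pixel_entropy = -(probs * np.log(probs + eps)).sum(axis=0)  # (H, W)
    return float(pixel_entropy.mean())

def fit_gaussian(train_features: np.ndarray):
    """Fit mean and regularized inverse covariance of in-distribution features, shape (N, D)."""
    mean = train_features.mean(axis=0)
    cov = np.cov(train_features, rowvar=False) + 1e-6 * np.eye(train_features.shape[1])
    return mean, np.linalg.inv(cov)

def mahalanobis(feature: np.ndarray, mean: np.ndarray, cov_inv: np.ndarray) -> float:
    """Mahalanobis distance of one case's feature vector to the training distribution."""
    diff = feature - mean
    return float(np.sqrt(diff @ cov_inv @ diff))
```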
The QC analysis workflow is in src/notebooks/QC_eval.ipynb. It:
- Loads results for multiple datasets/splits and all runs.
- Fits calibrators (thresholding for correlation-based methods; beta adapters for beta-based predictors).
- Computes ranking metrics (Pearson's $\rho$, MAE, eAURC) and risk-control metrics (Rec+ / Rec− at t=0.8, α=0.95); see the metric sketch after this list.
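For reference, a minimal sketch of eAURC (excess area under the risk-coverage curve) together with one plausible reading of Rec+ / Rec−. The exact α=0.95 risk-control procedure follows the paper and is not reproduced here; all names and the Rec+ / Rec− definitions below are assumptions for illustration:

```python
import numpy as np

def eaurc(quality: np.ndarray, predicted: np.ndarray) -> float:
    """Excess AURC: AURC of the QC method's ranking minus the oracle AURC.

    `quality` holds true scores (e.g. Dice); risk is taken as 1 - quality,
    and `predicted` ranks cases from most to least trusted.
    """
    risk = 1.0 - quality
    coverage = np.arange(1, len(risk) + 1)
    aurc = (np.cumsum(risk[np.argsort(-predicted)]) / coverage).mean()
    oracle_aurc = (np.cumsum(np.sort(risk)) / coverage).mean()
    return float(aurc - oracle_aurc)

def recall_pos_neg(quality: np.ndarray, predicted: np.ndarray, t: float = 0.8):
    """One plausible reading of Rec+ / Rec− (an assumption, not the paper's code):
    treating quality >= t as 'good', Rec+ is the fraction of good cases the QC
    method also rates >= t, and Rec− the fraction of bad cases it rates < t.
    Assumes `predicted` is calibrated to the same [0, 1] scale as `quality`."""
    good = quality >= t
    rec_pos = float((predicted[good] >= t).mean()) if good.any() else float("nan")
    rec_neg = float((predicted[~good] < t).mean()) if (~good).any() else float("nan")
    return rec_pos, rec_neg
```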
U-Net evaluation is in src/notebooks/unet_eval.ipynb.
- Results are organized by dataset, split, method, and run ID under results/.
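For orientation, the implied hierarchy looks roughly like this (placeholder names in angle brackets; exact folder names may differ):

```
results/
  <dataset>/
    <split>/
      <method>/      # e.g. score agreement, Mahalanobis, entropy
        <run_id>/...
```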
- Dataset shifts used in the paper:
  - M&Ms scanner drift → scanner-symphonytim
  - M&Ms pathology drift → pathology-norm-vs-fall-scanners-all
  - PMRI dataset shift → promise12
  - PMRI 3T→1.5T shift → threet-to-onepointfivet