This project aims to improve YOLO's performance at segmenting surgical instruments in real-time surgical video. It is an implementation of the VQGAN-based version of BigDatasetGAN: I rearranged some of the code from Taming Transformers and implemented a segmentation head for VQGAN modeled on the segmentation head from BigDatasetGAN.
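BigDatasetGAN-style heads work by pooling intermediate generator features, upsampling them to a common resolution, concatenating them, and classifying each pixel. The following is a minimal NumPy sketch of that fusion step only; the feature shapes, class count, and function names are illustrative assumptions, not the project's actual architecture:

```python
import numpy as np

def upsample_nearest(feat, target_hw):
    """Nearest-neighbor upsample of a (C, H, W) feature map."""
    c, h, w = feat.shape
    th, tw = target_hw
    assert th % h == 0 and tw % w == 0
    return feat.repeat(th // h, axis=1).repeat(tw // w, axis=2)

def segmentation_head(features, weights, bias):
    """Fuse multi-scale generator features and classify each pixel.

    features: list of (C_i, H_i, W_i) arrays from different decoder layers.
    weights:  (num_classes, sum(C_i)) matrix acting as a 1x1 convolution.
    """
    target_hw = max((f.shape[1], f.shape[2]) for f in features)
    fused = np.concatenate(
        [upsample_nearest(f, target_hw) for f in features], axis=0
    )  # (sum(C_i), H, W)
    c, h, w = fused.shape
    logits = weights @ fused.reshape(c, h * w) + bias[:, None]
    return logits.reshape(-1, h, w)  # (num_classes, H, W)

# Toy features at two scales (channel counts are made up for the sketch)
rng = np.random.default_rng(0)
feats = [rng.standard_normal((8, 16, 16)), rng.standard_normal((4, 32, 32))]
W = rng.standard_normal((2, 12))  # 2 classes: instrument vs. background
b = np.zeros(2)
masks = segmentation_head(feats, W, b)
print(masks.shape)  # (2, 32, 32)
```

In the real model the 1x1 classifier would typically be a small learned MLP over the fused channels rather than a single fixed matrix.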
I am currently working on improving image and segmentation-mask quality by improving the training data and using transfer learning: VQGAN is trained on a subset of the SurgVu dataset (900k images), fine-tuned on the SARAS-MEAD dataset (23k images), and then further fine-tuned on a smaller private dataset specific to Transorbital Robotic Surgery (2k images). The idea is to train on a large dataset of porcine tissue (SurgVu), fine-tune on a medium-sized dataset of human tissue (SARAS-MEAD), and then specialize the model on a small dataset of domain-specific human tissue (TORS).
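The three-stage schedule can be expressed as a simple loop over dataset configs, where each stage resumes from the previous stage's checkpoint. This is only an illustrative sketch: the dataset sizes come from the text above, but the learning rates, epoch counts, and function names are invented for the example:

```python
# Hypothetical staged transfer-learning schedule. Image counts match the
# datasets described above; lr/epochs are illustrative guesses.
STAGES = [
    {"name": "surgvu",     "images": 900_000, "lr": 4.5e-6, "epochs": 10},
    {"name": "saras_mead", "images": 23_000,  "lr": 1.0e-6, "epochs": 20},
    {"name": "tors",       "images": 2_000,   "lr": 5.0e-7, "epochs": 40},
]

def run_schedule(stages, train_fn):
    """Run each stage, resuming from the checkpoint of the previous one."""
    checkpoint = None
    history = []
    for stage in stages:
        checkpoint = train_fn(stage, resume_from=checkpoint)
        history.append((stage["name"], checkpoint))
    return history

# Stub trainer, just to show the checkpoint hand-off between stages.
def fake_train(stage, resume_from=None):
    return f"ckpt_{stage['name']}"

log = run_schedule(STAGES, fake_train)
print(log)
```

Shrinking the learning rate at each stage is a common way to avoid catastrophically forgetting the general features learned on the large dataset while adapting to the small domain-specific one.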
The VQDatasetGAN model generated these images at 256 x 256 resolution; they were then upsampled to 512 x 512.
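The 256 to 512 step simply doubles each spatial dimension. A minimal example of 2x upsampling with nearest-neighbor interpolation in NumPy (the actual pipeline may use bilinear or a learned upsampler instead):

```python
import numpy as np

def upsample_2x(img):
    """Double H and W of an (H, W, C) image by nearest-neighbor repetition."""
    return img.repeat(2, axis=0).repeat(2, axis=1)

img = np.zeros((256, 256, 3), dtype=np.uint8)
big = upsample_2x(img)
print(big.shape)  # (512, 512, 3)
```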



