OrganSegRSTN_PyTorch: an end-to-end coarse-to-fine organ segmentation framework

This is a re-implementation of OrganSegRSTN in PyTorch 0.4, Python 3.6

v1.2 - Nov 9 2018 - by Tianwei Ni and Lingxi Xie

Credit: Original version of OrganSegRSTN is implemented in CAFFE by Qihang Yu, Yuyin Zhou and Lingxi Xie.

Before you start, please note that there is a LAZY MODE, which allows you to run the entire framework with ONE click. Check the contents before Section 4.3 for details.

1. Introduction

OrganSegRSTN is a code package for our paper:

Qihang Yu, Lingxi Xie, Yan Wang, Yuyin Zhou, Elliot K. Fishman, Alan L. Yuille, "Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation", in IEEE Conference on CVPR, Salt Lake City, Utah, USA, 2018.

OrganSegRSTN is a segmentation framework designed for 3D volumes. It was originally designed for segmenting abdominal organs in CT scans, but we believe that it can also be used for other purposes, such as brain tissue segmentation in fMRI-scanned images.

OrganSegRSTN is based on state-of-the-art deep learning techniques. This code package is to be used with PyTorch, a deep learning library.

It is highly recommended to use one or more modern GPUs for computation. Using CPUs will take at least 50x more time in computation.

We provide an easy implementation in which the training stage has only 1 fine-scaled iteration. If you wish to add more, please modify model.py accordingly. As we said in the paper, our strategy of using 1 stage in training and multiple iterations in testing works very well.

2. File List

Folder/File	Description
`README.md`	the README file

DATA2NPY/	codes to transfer the NIH dataset into NPY format
`dicom2npy.py`	transferring image data (DICOM) into NPY format
`nii2npy.py`	transferring label data (NII) into NPY format

OrganSegRSTN/	primary codes of OrganSegRSTN
`coarse2fine_testing.py`	the coarse-to-fine testing process
`coarse_fusion.py`	the coarse-scaled fusion process
`coarse_testing.py`	the coarse-scaled testing process
`Data.py`	the data layer
`model.py`	the models of RSTN
`init.py`	the initialization functions
`oracle_fusion.py`	the fusion process with oracle information
`oracle_testing.py`	the testing process with oracle information
`training.py`	training the coarse and fine stages jointly
`training_parallel.py`	training the coarse and fine stages jointly with multi-GPU
`run.sh`	the main program to be called in bash shell
`utils.py`	the common functions

SWIG_fast_functions/	C codes for acceleration in testing process
logs/	training log files on the NIH dataset

3. Installation

Python 3.6+
PyTorch 0.4

4. Usage

Please follow these steps to reproduce our results on the NIH pancreas segmentation dataset.

NOTE: Here we only provide basic steps to run our codes on the NIH dataset. For more detailed analysis and empirical guidelines for parameter setting (this is very important especially when you are using our codes on other datasets), please refer to our technical report (check our webpage for updates).

4.1 Data preparation

4.1.1 Download NIH data from https://wiki.cancerimagingarchive.net/display/Public/Pancreas-CT .

You should be able to download image and label data individually.
Suppose your data directory is $RAW_PATH:
    The image data are organized as $RAW_PATH/DOI/PANCREAS_00XX/A_LONG_CODE/A_LONG_CODE/ .
    The label data are organized as $RAW_PATH/TCIA_pancreas_labels-TIMESTAMP/label00XX.nii.gz .

4.1.2 Use our codes to transfer these data into NPY format.

Put dicom2npy.py under $RAW_PATH, and run: python dicom2npy.py .
    The transferred data should be put under $RAW_PATH/images/
Put nii2npy.py under $RAW_PATH, and run: python nii2npy.py .
    The transferred data should be put under $RAW_PATH/labels/

4.1.3 Suppose your directory to store experimental data is `$DATA_PATH`:

Put images/ under $DATA_PATH/
Put labels/ under $DATA_PATH/
Download the FCN8s pretrained model below and put it under $DATA_PATH/models/pretrained/

Update (2021-04-14): The download link for the PyTorch FCN8s pretrained model is broken, so you will have to pre-train the model yourself using this repo.

NOTE: If you use other path(s), please modify the variable(s) in run.sh accordingly.
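
Before moving on to initialization, it can help to confirm that the layout described in 4.1.3 is in place. The following is a minimal sketch and not part of this repo: the helper name `missing_data_dirs` is hypothetical, and it only checks for the three sub-directories named above.

```python
from pathlib import Path

# Sub-directories that Section 4.1.3 asks you to place under $DATA_PATH.
EXPECTED_SUBDIRS = ("images", "labels", "models/pretrained")


def missing_data_dirs(data_path):
    """Return the expected sub-directories that do not exist under data_path.

    Hypothetical helper for illustration only; it is not part of this repo.
    """
    root = Path(data_path)
    return [sub for sub in EXPECTED_SUBDIRS if not (root / sub).is_dir()]
```

An empty list means all three sub-directories were found; anything else names what is still missing.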

4.2 Initialization (requires: 4.1)

4.2.1 Check `run.sh` and set $DATA_PATH accordingly.

4.2.2 Set `$ENABLE_INITIALIZATION=1` and run this script.

Several folders will be created under $DATA_PATH:
    $DATA_PATH/images_X|Y|Z/: the sliced image data (data are sliced for faster I/O).
    $DATA_PATH/labels_X|Y|Z/: the sliced label data (data are sliced for faster I/O).
    $DATA_PATH/lists/: used for storing training, testing and slice lists.
    $DATA_PATH/logs/: used for storing log files during the training process.
    $DATA_PATH/models/: used for storing models (snapshots) during the training process.
    $DATA_PATH/results/: used for storing testing results (volumes and text results).
Depending on the I/O speed of your hard drive, the time cost may vary.
    For a typical HDD, around 20 seconds are required for a 512x512x300 volume.
This process needs to be executed only once.

NOTE: if you are using another dataset which contains multiple targets,
    you can modify the variables "ORGAN_NUMBER" and "ORGAN_ID" in run.sh,
    as well as the "is_organ" function in utils.py to define your mapping function flexibly.
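
As an illustration of the kind of mapping meant in this note, the sketch below treats a set of raw label values as one target organ. It is a hedged example only: the actual signature of is_organ in utils.py may differ, and the names `is_organ_sketch` and `binarize_slice`, as well as the label values, are hypothetical.

```python
def is_organ_sketch(label_value, organ_ids):
    """Return True if a raw label value belongs to the target organ.

    Hypothetical stand-in for the is_organ function in utils.py; organ_ids
    is the set of raw label values that should count as foreground.
    """
    return label_value in organ_ids


def binarize_slice(label_slice, organ_ids):
    """Map a 2D label slice (a list of rows) to a 0/1 foreground mask."""
    return [[1 if is_organ_sketch(v, organ_ids) else 0 for v in row]
            for row in label_slice]


# Example: labels 2 and 3 are treated as the same organ (made-up values).
mask = binarize_slice([[0, 1, 2], [3, 0, 2]], organ_ids={2, 3})
```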

LAZY MODE!

You can run all the following modules with one execution!

a) Enable everything (except initialization) in the beginning part.
b) Set all the "PLANE" variables as "A" (4 in total) in the following part.
c) Run this script!

4.3 Training (requires: 4.2)

4.3.1 Check `run.sh` and set `$TRAINING_PLANE` , `$TRAINING_GPU` , `$CURRENT_FOLD`.

You need to run X|Y|Z planes individually, so you can use 3 GPUs in parallel.
You can also set TRAINING_PLANE=A, so that the three planes are trained sequentially on one GPU.
You need to set CURRENT_FOLD in {0, 1, ..., FOLDS-1}, which means the testing fold is $CURRENT_FOLD, and training folds are the rest.

4.3.2 Set `$ENABLE_TRAINING=1` and run this script.

The following folders/files will be created:
    Under $DATA_PATH/models/snapshots/, a folder named by training information.
        Snapshots will be stored in this folder.
On the axial view (training image size is 512x512, small input images make training faster),
    every 20 iterations cost ~10s on a Titan-X Pascal GPU, or ~8s on a Titan-Xp GPU.
    As described in the code, we need ~80K iterations, which take less than 5 GPU-hours.

4.3.3 Important notes on initialization, model mode and model convergence.

It is very important to provide a reasonable initialization for our model. In the previous step of data preparation, we provide a scratch model for the NIH dataset, in which both the coarse and fine stages are initialized using the weights of an FCN-8s model (please refer to the FCN project). This model was pre-trained on PASCAL VOC.

What does `mode` in RSTN stand for?

We train the RSTN model in three sequential modes, S, I and J, so that the model converges well.

S stands for Separate. The input pair (image, label) of fine FCN is irrelevant to the outputs of coarse FCN.
I stands for Individual. The input (image) of fine FCN is relevant to the outputs of coarse FCN, but label is irrelevant.
J stands for Joint. The input pair (image, label) of fine FCN is relevant to the outputs of coarse FCN.
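
The three modes above can be restated as a small lookup table. This is only a sketch of the definitions given here, not how model.py implements them; the names `MODE_DEPENDENCE` and `describe_mode` are hypothetical.

```python
# For each training mode: (fine-stage image depends on coarse output,
#                          fine-stage label depends on coarse output).
MODE_DEPENDENCE = {
    "S": (False, False),  # Separate
    "I": (True, False),   # Individual
    "J": (True, True),    # Joint
}

# The order in which the modes are run during training.
MODE_ORDER = ["S", "I", "J"]


def describe_mode(mode):
    """Return a one-line description of what the fine FCN input depends on."""
    image_dep, label_dep = MODE_DEPENDENCE[mode]
    image_text = "depends on" if image_dep else "is independent of"
    label_text = "depends on" if label_dep else "is independent of"
    return (f"mode {mode}: image {image_text} the coarse output, "
            f"label {label_text} the coarse output")
```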

How to determine if a model converges and works well?

The coarse loss at the beginning of training is almost 1.0. If a model converges, you should observe the loss values decreasing gradually. For the model to work well, confirm at the end of each training stage that the average loss is sufficiently low (e.g. 0.3 in S, 0.2 in I, 0.15 in J).
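
One way to apply this check is to average the most recent loss values and compare the result with the example targets quoted above. The sketch below assumes you have already collected the loss values into a Python list (how they are parsed from the logs is not shown); the window size and the function names are assumptions, not part of this repo.

```python
# Example target losses quoted above for the end of each training mode.
TARGET_LOSS = {"S": 0.3, "I": 0.2, "J": 0.15}


def average_recent_loss(losses, window=100):
    """Average of the last `window` loss values (or of all, if fewer)."""
    recent = losses[-window:]
    if not recent:
        raise ValueError("no loss values given")
    return sum(recent) / len(recent)


def looks_converged(losses, mode, window=100):
    """True if the recent average loss is at or below the example target."""
    return average_recent_loss(losses, window) <= TARGET_LOSS[mode]
```

These targets are rough guides for the NIH dataset; other datasets may need different values.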

Training RSTN on other CT datasets?

If you are experimenting on other CT datasets, we strongly recommend using a pre-trained model, such as the pre-trained models described in the last part of this file. We also provide a mixed model (to be provided soon), which was tuned using all X|Y|Z images of 82 training samples for pancreas segmentation on NIH. Of course, do not use it to evaluate any NIH data, as all cases have been used for training.

4.3.4 Multi-GPU training

For your convenience, we provide training_parallel.py to support multi-GPU training; simply run it instead of training.py in the training stage. Please note the following:

The image size must be uniform. Normally the shape of each medical image case is not identical; in the NIH dataset, only the Z plane can be trained in parallel without padding the images to the same size.
batch_size should be no less than the number of GPUs (set by os.environ["CUDA_VISIBLE_DEVICES"]).
parallel in get_parameters() must be set to True.
The prefix `module.` should be added to the keys of pretrained_dict.
The last incomplete batch is dropped in trainloader.
In coarse_testing.py and coarse2fine_testing.py, you should wrap the model in nn.DataParallel.
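
Two of these points are easy to sketch in plain Python. A model wrapped in nn.DataParallel stores its parameters under keys that start with `module.`, which is why the keys of pretrained_dict need the same prefix. The helper names below are hypothetical, and the code is an illustration rather than what training_parallel.py does.

```python
import os


def add_module_prefix(pretrained_dict):
    """Return a copy of pretrained_dict whose keys all start with 'module.'.

    Keys that already carry the prefix are left unchanged.
    """
    prefix = "module."
    return {
        (key if key.startswith(prefix) else prefix + key): value
        for key, value in pretrained_dict.items()
    }


def visible_gpu_count(env=None):
    """Count the device ids listed in CUDA_VISIBLE_DEVICES (0 if unset or empty)."""
    env = os.environ if env is None else env
    devices = env.get("CUDA_VISIBLE_DEVICES", "")
    return len([d for d in devices.split(",") if d.strip()])


def check_batch_size(batch_size, env=None):
    """Raise ValueError if batch_size is smaller than the number of visible GPUs."""
    n_gpus = visible_gpu_count(env)
    if batch_size < n_gpus:
        raise ValueError(f"batch_size={batch_size} < {n_gpus} visible GPUs")
```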

4.4 Coarse-scaled testing (requires: 4.3)

4.4.1 Check `run.sh` and set `$COARSE_TESTING_PLANE` and `$COARSE_TESTING_GPU`.

You need to run X|Y|Z planes individually, so you can use 3 GPUs in parallel.
You can also set COARSE_TESTING_PLANE=A, so that the three planes are tested sequentially on one GPU.

4.4.2 Set `$ENABLE_COARSE_TESTING=1` and run this script.

The following folder will be created:
    Under $DATA_PATH/results/, a folder named by training information.
Testing each volume costs ~30 seconds on a Titan-X Pascal GPU, or ~25s on a Titan-Xp GPU.

4.5 Coarse-scaled fusion (optional) (requires: 4.4)

4.5.1 Fusion is performed on CPU and all X|Y|Z planes are combined and executed once.

4.5.2 Set `$ENABLE_COARSE_FUSION=1` and run this script.

The following folder will be created:
    Under $DATA_PATH/results/coarse_testing_*, a folder named by fusion information.
The main cost in fusion includes I/O and post-processing (removing non-maximum components).
We have implemented post-processing in C for acceleration (see 4.8.3).
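
To illustrate what removing non-maximum components can look like, here is a minimal pure-Python sketch that keeps only the largest connected component of a set of foreground voxels. It is one common reading of that step, not the C implementation in this repo: the 6-connectivity, the set-of-coordinates representation and the function name are all assumptions.

```python
from collections import deque

# 6-connectivity in 3D: neighbors share a face (an assumption of this sketch).
NEIGHBOR_OFFSETS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0),
                    (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def largest_component(voxels):
    """Return the largest 6-connected component of a set of (x, y, z) voxels."""
    remaining = set(voxels)
    best = set()
    while remaining:
        # Start a breadth-first search from any voxel not yet visited.
        seed = remaining.pop()
        component = {seed}
        queue = deque([seed])
        while queue:
            x, y, z = queue.popleft()
            for dx, dy, dz in NEIGHBOR_OFFSETS:
                neighbor = (x + dx, y + dy, z + dz)
                if neighbor in remaining:
                    remaining.remove(neighbor)
                    component.add(neighbor)
                    queue.append(neighbor)
        if len(component) > len(best):
            best = component
    return best
```

A pure-Python traversal like this is slow on full CT volumes, which is the motivation for a compiled implementation.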

4.6 Oracle testing (optional) (requires: 4.3)

NOTE: Without this step, you can also run the coarse-to-fine testing process. This stage is still recommended, so that you can check the quality of the fine-scaled models.

4.6.1 Check `run.sh` and set `$ORACLE_TESTING_PLANE` and `$ORACLE_TESTING_GPU`.

You need to run X|Y|Z planes individually, so you can use 3 GPUs in parallel.
You can also set ORACLE_TESTING_PLANE=A, so that the three planes are tested sequentially on one GPU.

4.6.2 Set `$ENABLE_ORACLE_TESTING=1` and run this script.

The following folder will be created:
    Under $DATA_PATH/results/, a folder named by training information.
Testing each volume costs ~10 seconds on a Titan-X Pascal GPU, or ~8s on a Titan-Xp GPU.

4.7 Oracle fusion (optional) (requires: 4.6)

NOTE: Without this step, you can also run the coarse-to-fine testing process. This stage is still recommended, so that you can check the quality of the fine-scaled models.

4.7.1 Fusion is performed on CPU and all X|Y|Z planes are combined and executed once.

4.7.2 Set `$ENABLE_ORACLE_FUSION=1` and run this script.

The following folder will be created:
    Under $DATA_PATH/results/, a folder named by fusion information.
The main cost in fusion includes I/O and post-processing (removing non-maximum components).

4.8 Coarse-to-fine testing (requires: 4.4)

4.8.1 Check run.sh and set `$COARSE2FINE_TESTING_GPU`.

Fusion is performed on CPU and all X|Y|Z planes are combined.
Currently the X|Y|Z testing processes are executed on one GPU, but this is not time-consuming.

4.8.2 Set `$ENABLE_COARSE2FINE_TESTING=1` and run this script.

The following folder will be created:
    Under $DATA_PATH/results/, a folder named by coarse-to-fine information (very long).
This function calls both fine-scaled testing and fusion codes, so both GPU and CPU are used.
    In our future release, we will implement post-processing in C for acceleration.

4.8.3 How to compile `fast_functions` for other Python versions?

We provide _fast_functions.so, built for Python 3.6, to accelerate coarse2fine_testing.py; it can only be run in a Python 3.6 environment. Other Python 3 versions are also supported. Here are the instructions:

First, check your default python3 version with `ls -l /usr/bin/python*` and ensure that /usr/bin/python3 is linked to the python3.* version you want. Otherwise, please install the python3.* version you want and run `ln -s /usr/bin/python3.* /usr/bin/python3`.

Then go to SWIG_fast_functions directory, run

$ swig -python -py3 fast_functions.i
$ python3 setup.py build_ext --inplace
$ mv _fast_functions.cpython-3*m-x86_64-linux-gnu.so _fast_functions.so # * depends on py3.x
$ python test.py # test
$ cp {_fast_functions.so,fast_functions.py} ../OrganSegRSTN/ # no space in {}

Finally, you can run coarse2fine_testing.py successfully.
- If this fails, you can comment out the post_processing and DSC_computation calls that use ff, and uncomment their vanilla Python counterparts. The vanilla Python functions run successfully, but are slower than the C re-implementation.

NOTE: currently we set the maximum number of iteration rounds to 10 in order to observe convergence. Most often, it reaches an inter-DSC of >99% after 3-5 iterations. If you hope to save time, you can slightly modify the code in coarse2fine_testing.py. Testing each volume costs ~40 seconds on a Titan-X Pascal GPU, or ~32s on a Titan-Xp GPU. If you set the threshold to be 99%, this stage will be done within 2 minutes (on average).
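
The stopping rule described in this note can be sketched as follows: compute the DSC between the predictions of two consecutive rounds and stop once it reaches the threshold or the round limit. This is an illustrative sketch only; the set-of-voxels representation and the function names are assumptions, and coarse2fine_testing.py may organize the loop differently.

```python
def dsc(a, b):
    """Dice-Sorensen coefficient between two sets of foreground voxels."""
    if not a and not b:
        return 1.0  # two empty masks agree perfectly (a convention of this sketch)
    return 2.0 * len(a & b) / (len(a) + len(b))


def iterate_until_stable(predictions, threshold=0.99, max_rounds=10):
    """Consume per-round predictions and stop once consecutive rounds agree.

    `predictions` is any iterable yielding one set of foreground voxels per
    round. Returns (rounds_used, last_prediction).
    """
    previous = None
    rounds = 0
    for current in predictions:
        rounds += 1
        # Inter-DSC: agreement between this round and the previous one.
        if previous is not None and dsc(previous, current) >= threshold:
            return rounds, current
        previous = current
        if rounds >= max_rounds:
            break
    return rounds, previous
```

Passing a generator as `predictions` means later rounds are never computed once the loop stops early.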

Congratulations! You have finished the entire process. Check your results now!

5. Pre-trained Models on the NIH Dataset

NOTE: all these models were trained following our default settings.

The 82 cases in the NIH dataset are split into 4 folds:

Fold #0: testing on Cases 01, 02, ..., 20;
Fold #1: testing on Cases 21, 22, ..., 40;
Fold #2: testing on Cases 41, 42, ..., 61;
Fold #3: testing on Cases 62, 63, ..., 82.
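
The split above can be reproduced with a few lines of Python. The sketch below gives the last `total_cases % folds` folds one extra case each, which matches the four ranges listed here; it is not necessarily how utils.py computes the folds, and the function names are hypothetical.

```python
def fold_ranges(total_cases, folds):
    """Split case numbers 1..total_cases into `folds` contiguous test ranges.

    The last (total_cases % folds) folds receive one extra case.
    Returns a list of inclusive (first_case, last_case) tuples.
    """
    base, extra = divmod(total_cases, folds)
    ranges = []
    start = 1
    for fold in range(folds):
        size = base + (1 if fold >= folds - extra else 0)
        ranges.append((start, start + size - 1))
        start += size
    return ranges


def training_cases(total_cases, folds, current_fold):
    """Case numbers used for training when `current_fold` is the testing fold."""
    first, last = fold_ranges(total_cases, folds)[current_fold]
    return [c for c in range(1, total_cases + 1) if not first <= c <= last]
```

For example, `fold_ranges(82, 4)` yields the four testing ranges above, and `training_cases(82, 4, 0)` lists the 62 cases used to train the Fold #0 models.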

We provide the trained models on each plane of Fold #0, 3 files in total, on Google Drive. They achieve 84.62% accuracy in the coarse-to-fine testing. Each of these models is around 1.07GB, approximately the size of two (coarse+fine) FCN models.

We also attach the log files and testing results for your reference here. Please refer to the logs/ folder.

6. Contact Information

If you encounter any problems in using these codes, please open an issue in this repository. You may also contact Tianwei Ni (twni2016@gmail.com) or Lingxi Xie (198808xc@gmail.com).

Thanks for your interest! Have fun!

7. Citation

If you use our codes, please cite our paper accordingly:

@inproceedings{yu2018recurrent,
  title={Recurrent saliency transformation network: Incorporating multi-stage visual cues for small organ segmentation},
  author={Yu, Qihang and Xie, Lingxi and Wang, Yan and Zhou, Yuyin and Fishman, Elliot K and Yuille, Alan L},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={8280--8289},
  year={2018}
}

and possibly, our previous work (the basis of this work):

@inproceedings{zhou2017fixed,
  title={A fixed-point model for pancreas segmentation in abdominal CT scans},
  author={Zhou, Yuyin and Xie, Lingxi and Shen, Wei and Wang, Yan and Fishman, Elliot K and Yuille, Alan L},
  booktitle={International conference on medical image computing and computer-assisted intervention},
  pages={693--701},
  year={2017},
  organization={Springer}
}

All the materials released in this library can ONLY be used for RESEARCH purposes.

The authors and their institution (JHU/JHMI) preserve the copyright and all legal rights of these codes.
