Skip to content

Latest commit

 

History

History
306 lines (235 loc) · 20 KB

README.md

File metadata and controls

306 lines (235 loc) · 20 KB

Diffeomorphic shape modelling

This software implements a statistical shape model based on diffeomorphic transforms.

Learning shape parameters – i.e., a mean shape (or template) and a principal space of deformation – from a series of 2D or 3D images is cast as a maximum a posterori inference problem in a graphical model.

This repository contains core functions and executable scripts written in Matlab.

Standalone release

TODO - there's no such release yet

We provided compiled command line routines that can be used without Matlab license. To use them, you must:

  • Install Matlab runtime
  • Download your platform release
  • Run the train or fit functions in a terminal:
    ./run_shape_train /path/to/matlab/runtime input.json option.json
    ./run_shape_fit /path/to/matlab/runtime input.json option.json

Basic use cases

The illustrative JSON files are commented. However, comments are not allowed in JSON, so they should be removed from any real file.

Learn a shape model from grey and white matter segmentations obtained with SPM's New Segment

input.json

{"f": [["rc1_sub001.nii","rc2_sub001.nii"],     // Grey and white classes for the first subject
       ["rc1_sub002.nii","rc2_sub002.nii"],
       ["rc1_sub003.nii","rc2_sub003.nii"],
       ...
       ["rc1_sub100.nii","rc2_sub100.nii"]]}    // Grey and white classes for the last subject

option.json

{"dir":   {"model": "~/my_shape_model/",        // Write all (model and subject) data in the same folder
           "dat":   "~/my_shape_model/"},
 "model": {"name": "categorical",               // Observed images are segmentations
           "nc":   3},                          // There are 3 classes, even though only 2 were provided.
                                                //   This allows the third class to be automatically created
                                                //   by "filling probabilities up to one"
 "pg":    {"K": 32},                            // Use 32 principal components
 "par":   {"subjects": "for"}                   // Do not use parallelisation
 }

Command line

./run_shape_train /path/to/matlab/runtime input.json option.json

Apply a shape model to grey and white matter segmentations obtained with SPM's New Segment

input.json

{"f": [["rc1_sub101.nii","rc2_sub101.nii"],     // Grey and white classes for the first subject
       ["rc1_sub102.nii","rc2_sub102.nii"],
       ["rc1_sub103.nii","rc2_sub103.nii"],
       ...
       ["rc1_sub200.nii","rc2_sub200.nii"]],    // Grey and white classes for the last subject
 "w": "~/my_shape_model/subspace.nii",          // Learnt principal subspace
 "a": "~/my_shape_model/log_template.nii"}      // Learnt log-template

option.json

{"dir":   {"model": "~/normalised_images/",     // Write all (model and subject) data in the same folder
           "dat":   "~/normalised_images/"},
 "model": {"name": "categorical",               // Observed images are segmentations
           "nc":   3},                          // There are 3 classes, even though only 2 were provided.
                                                //   This allows the third class to be automatically created
                                                //   by "filling probabilities up to one"
 "pg":    {"K": 32},                            // Use 32 principal components
 "z":     {"A0": [0.0016, 0.0030, ...]},        // Diagonal elements of the latent precision matrix
 "r":     {"l0": 18.8378},                      // Residual precision
 "par":   {"subjects": "for"}                   // Do not use parallelisation
 }

Command line

./run_shape_fit /path/to/matlab/runtime input.json option.json

The model

This work relies on a generative model of shape in which individual images (of brains, in particular) are assumed to be generated from a mean shape – commonly named template – deformed according to a transformation of the coordinate space. Here, these transformations are diffeomorphisms, i.e., one-to-one invertible mappings that allow for very large deformations. By using the geodesic shooting framework, we parameterise these transformations by their initial velocity, which can be seen as an infinitesimal (very small) deformation. The a posteriori covariance structure of these velocity fields is inferred by making use of a technique related to the well-known principal component analysis, adapted to the particular structure of the space on which lie velocity fields, called a Riemannian manifold. Our model also includes a rigid-body transform, whose role is to factor out all deformations induces by brains misalignment.

Inputs can be binary or categorical images (i.e., segmentations), in which case a Bernoulli or Categorical data term is used, or multimodal intensity images, in which case a uniform Gaussian or Laplace noise data term is used.

All considered, the following variables are infered:

  • W = [w1 .. wK]: the principal subspace of deformation, made of K principal geodesics;
  • z: transformation coordinates in the principal subspace, which is a low-dimensional representation of each subject in terms of deformation of the template;
  • A: the precision matrix (i.e., inverse covariance) of the latent coordinates. At the optimum, it should be a diagonal matrix that contains the variance along each principal component, or in other words, their scale;
  • v: the velocity field of each subject;
  • lam: the precision of the residual field, also named anatomical noise;
  • q: parameters of the rigid-body transform. Note that there are options to use different kind of affine transforms instead, however it is not advised, as differences in size should be captured by the shape model.

The following parameters are manually set and impact the model's behaviour:

  • K: the number of principal components ;
  • L: a Riemannian metric, in the form of a mixture of membrane, bending and linear-elastic energies. A small penalty on absolute displacements should also be set in order to ensure the diffeomorphism to be invertible and that it can be shot with a low number of integration steps ;
  • A0 and n0: the prior expected value of the latent precision matrix and its degrees of freedom, which should be seen as the virtual number of subjects that weight this prior belief ;
  • l0 and n0: the prior expected value of the residual precision matrix and its degrees of freedom, which should be seen as the virtual number of subjects that weight this prior belief ;

Algorithm

This algorithm was described in a MICCAI 2018 proceeding:

The current implementation, however, differs significantly:

  • The model described in the paper assumes that the posterior distribution factorises (mean field) over W, z and r, where r is the residual field. This means that the posterior over each of these variables is obtained by making a Laplace approximation, where the mean and precision are assumed to be the maximum and Hessian matrix about that maximum of the negative log-posterior. This maximum is obtained numerically using Gauss-Newton optimisation.

    Pros:

    • Principal directions are directly picked up by optimising the data term, which might be faster and more robust.

    Cons:

    • Laplace approximations are used for both the residual field and latent coordinates.
    • A line search is necessary when updating the principal subspace, which is quite costly as we need to reshoot all diffeomorphisms for each factor of the line search.
  • The current implementation assumes a different mean field approximation, where the posterior factorises over W, z and v, where v is the complete velocity. This allows us to write the model in terms of the probability distribution p(v | W, z), which yields a closed-form posterior distribution for W and z. Hence, only the posterior distribution over v relies on a Laplace approximation.

    Pros:

    • The principal subspace and latent coordinates have closed-form update equations, meaning that only initial velocities are fit by Gauss Newton (and Laplace approximated).

    Cons:

    • Since latent coordinates are obtained from velocity fields, it might take a few iterations before the shape model appropriately captures the main modes of variation.

Noise models

Three noise models are implemented:

  • Categorical: This model is designed for categorical inputs, typically segmentations into C classes. In this case, the template is called a tissue probability map as it embeds the prior probability of observing each class in a given location. Each input image must be a 4D volume (or a list of 3D volumes), where the fourth dimension corresponds to those classes. In each voxel, the sum over classes should be one, meaning that input volumes can be hard segmentations (where, in each voxel, exactly one class is one and all the other classes are zero) or soft probabilities. This model should be preferred when possible as it is non-parameteric and thus more robust. To learn from MRIs, we would typically advise to segment the images using SPM (or another neuroimaging software) first, and to use the grey and white classes as input for the shape model. Note that it is possible to provide uncomplete classes (which do not sum to one): in this case, the model.nc option should be used to specify the complete number of classes, allowing the missing (background) class to be automatically inferred by filling probabilities up to one.

  • Bernoulli: This is a special case of the Categorical model when there are only two classes. It would typically be used when working with MNIST (intensities should then be normalised to the range [0,1]).

  • Normal/Gaussian: This model is classically used in shape modelling applications. Note that a good estimate of the noise variance σ2 (model.sigma2) must be provided. We leave this estimation to the user. Also, if the noise is spatially correlated, regularisation parameters should be made stronger by incorporating a fudge factor.

User documentation

Input format

If you use compiled routines or their Matlab equivalents (shape_train, shape_fit), inputs should be JSON files containing dictionaries. These routines wrap around the main script (shape_model).

If you directly use the main scripts in Matlab, inputs are structures.

Regarding of the above choice, there are one mandatory input and one optional one:

  • input: [mandatory] a dictionary of filenames with fields:
    • 'f': observed images (list or matrix, the first dimension is for subjects, the second for classes or modalities) ;
    • 'v': observed velocity fields (list)
    • 'w': principal subspace to use or initialise with
    • 'a': [Bernoulli/Categorical only] log-template to use or initialise with
    • 'mu': [Gaussian only] template to use or initialise with
  • opt: [optional] a dictionary of options
    • All possible fields are provided below.

Options

Options will be provided in their Matlab structure form. Their JSON equivalent are dictionaries whose keys are the structure field names.

Model

model.name     - Generative data model ('normal'/'categorical'/'bernoulli') - ['normal']
model.sigma2   - If normal model: initial noise variance estimate           - [1]
model.nc       - [categorical only] Number of classes                       - [from input]
pg.K           - Number of principal geodesics                              - [32]
pg.prm         - Parameters of the geodesic operator                        - [1e-4 1e-3 0.2 0.05 0.2]
pg.bnd         - Boundary conditions for the geodesic operator              - [0 = circulant]
pg.geod        - Additional geodesic prior on velocity fields               - [true]
tpl.vs         - Lattice voxel size                                         - [auto]
tpl.lat        - Lattice dimensions                                         - [auto]
tpl.prm        - Parameters of the field operator                           - [1e-3  1e-1 0] (cat/ber)
                                                                              [0     0    0] (normal)
tpl.bnd        - Boundary conditions                                        - [1 = mirror]
tpl.itrp       - Interpolation order                                        - [1]
v.l0           - Prior expected anatomical noise precision                  - [17]
v.n0           - Prior DF of the anatomical noise precision                 - [10]
z.init         - Latent initialisation mode ('auto'/'zero'/'rand')          - ['auto']
z.A0           - Prior expected latent precision matrix                     - [eye(K)]
z.n0           - Prior DF of the latent precision matrix                    - [K]
q.A0           - Prior expected affine precision matrix                     - [eye(M)]
q.n0           - Prior DF of the affine precision matrix                    - [M]
q.B            - Affine_basis                                               - ['rigid']
q.hapx         - Approximate affine hessian                                 - [true]
f.M            - Force same voxel-to-world to all images                    - [read from file]

Optimise parameters

optimise.pg.w       - Optimise subspace                                     - [true]
optimise.z.z        - Optimise latent coordinates                           - [true]
optimise.z.A        - Optimise latent precision                             - [true]
optimise.q.q        - Optimise affine coordinates                           - [true]
optimise.q.A        - Optimise affine precision                             - [true]
optimise.v.v        - Optimise velocity fields                              - [true]
optimise.v.l        - Optimise residual precision                           - [true]
optimise.tpl.a      - Optimise template                                     - [true]

Processing

iter.em        - Maximum number of EM iterations                            - [1000]
iter.gn        - Maximum number of Gauss-Newton iterations                  - [1]
iter.ls        - Maximum number of line search iterations                   - [6]
iter.itg       - Number of integration steps for geodesic shooting          - [auto]
iter.pena      - Penalise Gauss-Newton failures                             - [true]
lb.threshold   - Convergence criterion (lower bound gain)                   - [1e-5]
lb.moving      - Moving average over LB gain                                - [3]
par.subjects   - How to parallelise subjects (see distribute_default)       - [no parallelisation]
ui.verbose     - Talk during processing                                     - [true]
ui.debug       - Further debugging talk                                     - [false]
ui.fig_pop     - Plot lower bound                                           - [true]
ui.fig_sub     - Plot a few subjects                                        - [true]

I/O

dir.model      - Directory where to store model arrays and workspace       - ['.']
dir.dat        - Directory where to store data array                       - [next to input]
fnames.result  - Filename for the result environment saved after each EM   - ['shape_model.mat']
                 iteration
fnames.model   - Structure of filenames for all file arrays
fnames.dat     - Structure of filenames for all file arrays
ondisk.model   - Structure of logical for temporary array                  - [default_ondisk]
ondisk.dat     - "      "       "       "       "       "

Content of the repository

  • backward-compatibility: A few basic Matlab functions that only appeared in recent versions of Matlab and that were reimplemented for backward compatibility.
  • core: Core functions. Their signature is as straightforward as possible, so they can be used in other contexts as some sort of library. They should in general work on both numerical arrays and SPM's file_array (i.e., memory mapped arrays).
    • core/register contains functions related to the registration of single subjects.
    • core/shape contains functions related to population parameters learning (template, principal subspace...).
  • scripts: Executable functions that implement the complete model.
    • shape_model.m implements a purely variational model based on a Riemannian PCA applied to fitted velocity fields. It can also work with known (observed) velocity fields.
    • General subfunctions:
      • shape_input.m deals with input files (individual images, eventually some parameters of the model...),
      • shape_default.m sets all default parameters,
      • shape_data.m sets all working data structures,
      • shape_init_*.m initialises the model,
      • shape_plot_*.m plotting utilities,
      • shape_ui.m verbose utilities,
      • ...
  • update: Specialised subfunctions that initialise, update or aggregate specific variables.
  • utility-functions: Various very basic utility functions. Some of them might get moved to the auxiliary-functions toolbox in the future.

Dependencies

This project has strong dependencies to SPM12 and its Shoot toolbox. Both of them should be added to Matlab's path. SPM can be downloaded at www.fil.ion.ucl.ac.uk/spm.

Core functions also depend on our auxiliary-functions toolbox, which gathers lots of low-level functions.

Furthermore, executable scripts depend on our distributed-computing toolbox, which helps parallelising parts of the code either on the local workstation (using Matlab's parallel processing toolbox) or on a computing cluster (see the toolbox help file for use cases and limitations).

Note that if these toolboxes are all located in the same folder, i.e.:

  • ./shape-toolbox
  • ./auxiliary-functions
  • ./distributed-computing

a call to setpath.m adds all necessary folders to Matlab's path.

How to compile

If no executable is provided for a given platform and/or runtime version, it is possible to compile one yourself using the script shape_compile.m. You will need:

  • Matlab (I used R2017b, I am not sure how far back the code is compatible),
  • A compatible compiler,
  • This code source and all dependencies on your Matlab path.

Then, all you need is to run:

shape_compile('/output/directory')

which will generate the files:

  • shape_fit
  • shape_train
  • run_shape_fit.sh
  • run_shape_train.sh
  • ... plus a few others.

Contributors

This software was developed under the Human Brain Project (SP2) flagship by John Ashburner's Computational Anatomy Group at the Wellcome Centre for Human Neuroimaging in UCL.

  • The shape toolbox was mainly developed by Yaël Balbastre with invaluable help from John Ashburner.
  • The auxiliary-functions and distributed-computing toolboxes were developed by Mikael Brudfors and Yaël Balbastre.

If you encounter any difficulty, please send an email to y.balbastre or j.ashburner at ucl.ac.uk

License

This software is released under the GNU General Public License version 3 (GPL v3). As a result, you may copy, distribute and modify the software as long as you track changes/dates in source files. Any modifications to or software including (via compiler) GPL-licensed code must also be made available under the GPL along with build & install instructions.

TL;DR: GPL v3