Building an Atlas for T2-MRI
Overview
The image to the right illustrates the overall organization of the ASHS software. Previous sections in this tutorial executed the ASHS segmentation pipeline using one of the atlases provided by the ASHS team. This atlas package uses a specific segmentation protocol, and is optimized for a specific MRI protocol. While this may be just fine for many users, some might be interested in using ASHS to label MRI scans following another segmentation protocol. The ASHS training pipeline makes that possible.
Required Data
In this tutorial, we will use the ASHS atlas pipeline to build a new atlas package. As the diagram above illustrates, you will need a set of 20 or so manually segmented MRI scans with
- T1-weighted whole-brain MRI for each subject in NIFTI format
- T2-weighted oblique coronal MRI for each subject in NIFTI format
- Manual segmentation of the structures of interest in the T2-MRI in NIFTI format
- Each voxel should be assigned a numerical label, with label 0 representing background. For example, CA1=1, DG=2, and so on.
To build an ASHS package, a Linux cluster with Sun Grid Engine (SGE) is required. Building a package is a time consuming process that may take a day or so on a cluster, and would take weeks or months on a personal workstation.
How many subjects?
There is no right answer to this question, at least not until we perform some additional validation studies. However, our experience suggests that to achieve good results you should include on the order of 20-30 subjects when constructing an atlas. If the goal is to study a heterogeneous population (e.g., some controls and some patients), then it's best to include representative examples from each subset of subjects in the atlas set.
For optimal performance, you should be able to run twice as many jobs in parallel on your cluster as there are subjects. For example, if there are 20 subjects, and you can utilize 40 cores in parallel, you will have good performance. If however, you have 21 subjects and 40 cores, time will not be optimally used, as there will be times when only 2 cores are utilized.
Preparing the Data
Please see "Examining the Image Data" section under Getting Started. All the input images should satisfy the requirements in that section.
Some additional preparation is required to build an atlas package. You will need to create the following files to run ASHS. Each file is described below.
- Data manifest file (required)
- Label description file (required)
- Configuration file (optional)
- Slice heuristics file (optional)
- Cross-validation file (optional)
Data Manifest File
The data manifest file is just a text file that describes the data used to build the atlas. Create it using your favorite text editor (Vim, Emacs, or gEdit). Each row in the file corresponds to one subject. Each row contains 5 entries, separated by spaces. The entries are:
- Subject ID (no spaces)
- Path to T1-weighted MRI in NIFTI format
- Path to T2-weighted MRI in NIFTI format
- Path to the Left MTL segmentation in NIFTI format
- Path to the Right MTL segmentation in NIFTI format
All entries must not contain spaces. Example manifest file follows:
XZY001 data/xzy001_t1.nii data/xzy001_t2.nii seg/xzy001_l.nii seg/xzy001_r.nii
XZY002 data/xzy002_t1.nii data/xzy002_t2.nii seg/xzy002_l.nii seg/xzy002_r.nii
To avoid errors, we recommend using full paths in the manifest file (in the example above we used relative paths for brevity)
Label Description File
The label description file is used to specify the names of the anatomical labels used in your protocol, as well as the color that should be used when generating figures. The best way to create this file is to use ITK-SNAP.
- Load your T2-weighted MRI in ITK-SNAP (
File->Open Image
) - Load your left or right segmentation (
Segmentation->Open Segmentation
) - Open the label editor (
Tools->Label Editor
) - Specify a name and a color for each of the labels used in your segmentation.
- Save your labels to a file (
Segmentation->Export Label Descriptions
)
The saved label description file should look like this:
################################################
# ITK-SnAP Label Description File
# File format:
# IDX -R- -G- -B- -A-- VIS MSH LABEL
# Fields:
# IDX: Zero-based index
# -R-: Red color component (0..255)
# -G-: Green color component (0..255)
# -B-: Blue color component (0..255)
# -A-: Label transparency (0.00 .. 1.00)
# VIS: Label visibility (0 or 1)
# IDX: Label mesh visibility (0 or 1)
# LABEL: Label description
################################################
0 0 0 0 0 0 0 "Clear Label"
1 255 0 0 1 1 1 "CA1"
2 0 255 0 1 1 1 "CA23"
3 0 0 255 1 1 1 "DG"
4 255 255 0 1 1 1 "Sub"
5 0 255 255 1 1 1 "EC"
6 255 0 255 1 1 1 "PRC"
7 255 239 213 1 1 1 "Cyst"
As an alternative to using ITK-SNAP, create a file in the same format using a text editor instead.
Handling Left vs. Right: In our lab, we use the same labels for the left and right structures. So left CA1 and right CA1 would both have label 1. If you prefer to assign different labels to the left CA1 and right CA1, you may. ASHS should still work.
Configuration File (Optional)
The configuration file can be used to tune ASHS performance to your own data. You should not need to provide your own configuration file when working with image resolution and dimensions comparable to the ones used in the ASHS-provided atlas packages, i.e.:
- T2-MRI has ~0.4 x 0.4 x 2.0 mm3 resolution and 20-40 slices
- T1-MRI has ~1.0 x 1.0 x 1.0 mm3 resolution and covers whole brain
- Structures are < 5 cm3 in volume
However, if your images are much larger than that, you may need to override some settings in the configuration file. To do so, copy the configuration file in the ASHS bin directory and edit the entries in this file to suit your situation. The entries are documented in the file itself.
> cp $ASHS_ROOT/bin/ashs_config.sh work/my_ashs_config.sh
> gedit work/my_ashs_config.sh
Here is an example of a configuration file used for a dataset with very large T2-weighted images that a nearly isotropic, with ~ 0.4 x 0.4 x 0.4 mm3 resolution.
Slice Heuristics File (Optional)
Because the T2 images used by ASHS tend to be highly anisotropic, some segmentation protocols include additional heuristic rules. For example, we may choose to label the subfields only in the body of the hippocampus, while all voxels in the head and tail of the hippocampus are assigned the summary head/tail label. This is the case for the VACIND atlas package provided with ASHS. A heuristics file can be used to specify certain rules that labels on each T2-MRI slice should satisfy. Example rules are:
- Label HEAD and label CA1 should not appear on the same MRI slice
- Label ERC should not appear more than one slice posterior of label HEAD
- If the number of voxels assigned label ERC on a slice is less than 20% of the median number of such voxels across all slices, then ERC voxels on the given slice should be assigned the background label instead.
Rules are specified using a specific syntax. Please see a separate page with documentation of the rule syntax language and examples.
Cross-validation File (Optional)
During atlas generation, you may choose to perform some cross-validation to measure segmentation accuracy. This cross-validation is invoked using the -x
option to ashs_train
. When using this option, you must supply a file that specifies which subjects are treated as test subjects in each cross-validation experiment.
Notes:
- Cross-validation involves leave-K-out segmentation of the input datasets. However, some of the registrations between input datasets performed during atlas building are reused during cross-validation.
- Cross-validation can take as much or more time than atlas building.
- Currently, cross-validation statistics (overlaps, etc.) are not generated by ashs_train and must be computed by the user.
The cross-validation file is a text file. Each row corresponds to one cross-validation experiment, and contains the Subject IDs of the subjects who should be used as test subjects for that experiment. For simple leave-one-out cross-validation, simply provide a file with the ID of each subject on a separate row:
xzy001
xzy002
...
For leave-five-out cross-validation, the file would look like this:
xzy001 xzy023 xzy025 xzy039 xzy042
xzy007 xzy011 xzy015 xzy026 xzy041
...
Building the Atlas
To build the atlas, execute the command ashs_train.sh
. We recommend writing a small bash script file that will run the command (let's call it runme.sh
). This way you can re-run ashs_train.sh later without having to retype all the parameters. Here is an example script:
#!/bin/bash
# Script for running ASHS training pipeline
$ASHS_ROOT/bin/ashs_train.sh \
-D manifest.txt \
-L snaplabels.txt \
-w /home/pauly/work/ashs/testatlas \
-Q \
| tee ashs_train.log
This script runs the program with the minimal set of required options:
- -D specifies the manifest file
- -L specifies the label description file
- -w gives the working directory where the atlas will be created
- -Q specifies that we are running atlas building in parallel on a cluster that has the Grid Engine (qsub) scheduling system.
To provide your own configuration, heuristic rules, and cross-validation files, use the options -C, -r and -x, respectively. For example:
#!/bin/bash
# Script for running ASHS training pipeline
$ASHS_ROOT/bin/ashs_train.sh \
-D manifest.txt \
-L snaplabels.txt \
-w /home/pauly/work/ashs/testatlas \
-Q \
-C my_config.sh \
-r my_rules.txt \
-x my_xval.txt \
| tee ashs_train.log
To see a full listing of options, call
> $ASHS_ROOT/bin/ashs_train.sh -h
Now, use nohup
to launch your script:
> nohup bash ./runme.sh &
Examining the Output and Using the Atlas
When atlas building completes, the working directory should contain the folder final
. Inside that folder you should see the following files and directories:
> ls -l final
drwxr-xr-x 4 pauly jboss 29 Jul 23 2013 adaboost
-rw-r--r-- 1 pauly jboss 36 Jul 23 2013 ashs_atlas_vars.sh
-rw-r--r-- 1 pauly jboss 7403 Jul 17 2013 ashs_system_config.sh
-rwxr-xr-x 1 pauly jboss 132 Jul 17 2013 ashs_user_config.sh
drwxr-xr-x 2 pauly jboss 64 Aug 2 2013 ref_hm
drwxr-xr-x 2 pauly jboss 38 Sep 12 2013 snap
drwxr-xr-x 2 pauly jboss 4096 Apr 16 12:24 template
drwxr-xr-x 31 pauly jboss 4096 Jul 23 2013 train
The contents of the final directory should be on the order of several gigabytes (depending on the number and size of the input images, of course). Run the following command to examine the size of the output to check that everything is there.
> du -h --max-depth=1
58M ./template
12M ./ref_hm
2.0G ./train
4.0K ./snap
1.4M ./adaboost
2.1G .
If the sizes of your subdirectories don't look similar, check the output for errors.
The final directory and its contents constitute the atlas package. You should be able to provide the path the final directory to ashs_main script in order to segment new datasets using your atlas package. This is the best way to test whether package building worked.