Skip to content

Latest commit

 

History

History
201 lines (160 loc) · 8.93 KB

README.md

File metadata and controls

201 lines (160 loc) · 8.93 KB

Image-to-image translation by CNNs trained on paired data (AUTOMAP + Pix2pix)

Written by Shizuo KAJI

This is an implementation of image-to-image translation using a paired image dataset. It can be used for various tasks including

  • denoising, super-resolution, modality-conversion, and reconstruction

The details can be found in our paper: Overview of image-to-image translation using deep neural networks: denoising, super-resolution, modality-conversion, and reconstruction in medical imaging by Shizuo Kaji and Satoshi Kida, Radiological Physics and Technology, Volume 12, Issue 3 (2019), pp 235--248, arXiv:1905.08603

If you use this software, please cite the above paper.

Background

This code is based on

An encoder-decoder network with a discriminator (conditional GAN) is used. Fully-convolutional encoder-decoder network is precomposed with fully-connected layers to make the receptive field the whole domain so that it is capable of learning image translations involving global information such as the radon transform (CT reconstruction).

In other words, it is a combination of AUTOMAP

  • Image reconstruction by domain transform manifold learning, Zhu B, Liu JZ, Cauley SF, Rosen BR, Rosen MS, Nature. 555(7697):487-492, 2018

and Pix2pix

  • Image-to-Image Translation with Conditional Adversarial Networks, Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros, CVPR, 2017

with some improvements.

Requirements

  • a modern GPU
  • python 3: Anaconda is recommended
  • chainer >= 7.2.0, cupy, chainerui, chainercv, opecv, pydicom: install them by
pip install cupy chainer chainerui chainercv opencv-contrib-python pydicom

Note that with GeForce 30 RTX series, the installation of chainer and cupy can be a little tricky for now. You need CUDA >= 11.1 for these GPUs, and it is supported by CuPy >= v8. The latest version of Chainer v7.7.0 available on pip is not compatible with the latest version of CuPy. See here. You can install the latest Chainer directly from the github repository, which is compatible with the latest version of CuPy. For example, follow the following procedure:

You will see some warning messages, but you can ignore them.

Licence

MIT Licence

Data preparation

We need input-output paired images (x_1,y_1), (x_2,y_2), (x_3,y_3),... so that the input image x_i is spatially aligned with the output image y_i. In particular, the image sizes should be same.

Method 1: pairs specified by a text file

  • Make a text file "ct_reconst_train.txt" consisting of:
filename of x_1     filename of y_1
filename of x_2     filename of y_2
...

The columns are separated by a tab.

  • Make another text file "ct_reconst_val.txt" for validation.

Method 2: pairs specified by file names

  • Alternatively, create four directories named "trainA", "trainB", "testA", "testB". Put x_i in "trainA" and y_i in "trainB" with the same filename. They are paired and used for training. Files under "testA" and "testB" with the same filenames are paired and used for validation.

Training

Pixelwise regression (super-resolution, denoising, modality conversion, etc.)

Typical usage cases should be covered by pixelwise regression.

Let us use an example created by

python utility/create_shapes.py

We obtain sample images under "shapes" directory and two text files (Method 1) containing the file names of the images.

We can train a model to convert from rectangle images to ellipse images by

python train_cgan.py -R shapes -t shapes/shapes_train.txt --val shapes/shapes_val.txt -o result -g 0 -e 100 -l1 0 -l2 10.0 -ldis 1.0 -ltv 1e-3

which learns translation of images in shapes_train.txt placed under "shapes/" and outputs the result under "result/".

An example command for Method 2 is

python train_cgan.py -R images -o result -it jpg -g 0 -e 100 -l1 0 -l2 1.0 -ldis 0.1 -ltv 1e-3

which learns translation of jpg images from "images/trainA" to "images/trainB" and outputs the result under "result/".

By specifying, say -it dcm, the code searches for DICOM files instead of jpeg images. For example,

python train_cgan.py -R images -o result -it dcm -l1 0 -l2 1.0 -ldis 0.1 --btoa

learns translation of images from "images/trainB" to "images/trainA" (note that --btoa means translation in the opposite way from B to A).

During training, it occasionally produces image files under "results/vis" containing original, ground truth, and converted images in each row.

Non-local pixelwise regression (reconstruction)

To account for non-local mapping, we use fully-connected (FC) layer before convolution (-gfc 1). Note that FC layers are very expensive in terms of GPU memory.

Let us use an example created by

python utility/create_shapes_sinogram.py

It produces sample images of Radon transform and text files ct_reconst_train.txt and ct_reconst_val.txt.

An example command-line arguments for training is

python train_cgan.py -t radon/ct_reconst_train.txt --val radon/ct_reconst_val.txt -R radon -o result  -cw 128 -ch 128 -rt 0 --grey -g 0 -e 200 -gfc 1 -u none -l1 0 -l2 1.0 -ldis 0.1 -ltv 1e-3 --btoa

which learns translation of images in ct_reconst_train.txt placed under "radon/" and outputs the result under "result/".

The images are cropped to 128 x 128 (-cw 128 -ch 128) and converted to greyscale (--grey). Train the model with GPU (-g 0) and 200 epochs (-e 200). Use the network with 1 FC layer (-gfc 1) at the beginning (as in AUTOMAP) without UNet-like skip connections (-u none). The loss consists of L2 reconstruction error, discriminator, and total variation with weights 1.0 0.1 0.01 respectively.

Crop size may have to be a power of two, if you encounter any error regarding the "shape of array".

Pixelwise classification (segmentation)

If the target images encodes classes for each pixel,

python train_cgan.py -R segmentation -o result -it dcm -e 200 -l1 0 -l2 1.0  -ldis 0 -cn 4

where (-cn 4) tells that there are four classes which are labeled 0,1,2, and 3.

List of command-line arguments

For a list of command-line arguments,

python train_cgan.py -h

To achieve high performance, we have to tune the parameters of the model via command-line arguments.

Conversion with a trained model

python convert.py -b 10 -a results/args --val ct_reconst_val.txt -R radon -o converted -m results/gen_200.npz

converts image files in ct_reconstruct_val.txt using a learnt model "results/gen_200.npz" with parameters recorded in "results/args" (automatically created during training) and outputs the result to "converted/". A larger batch size (-b 10) increases the conversion speed but may consume too much memory.

Denoising demo with DICOM files

We use the CPTAC-SAR dataset hosted by the Cancer Imaging Archive (TCIA). Note that this demo is just to show how to use the code, and we do not aim at training a practical model. (for example, training images should be collected from multiple patients for a practical use).

  • Download the dcm files from the above URL.
  • Create a training dataset containing pairs of clean images and images with artificial noise.
python utility/noising.py -n 700 -it dcm -R "CPTAC-SAR/C3N-00843/02-25-2009-92610/629-ARTERIA-66128" -o images > CPTAC-SAR.txt

You will get "images/trainA" containing images with noise and "images/trainB" containing clean images. Also, you will get a text file "CPTAC-SAR.txt" containing image file names.

  • Split the dataset into training and validation. Just split the text file into two files. Name them "CPTAC-SAR_train.txt" and "CPTAC-SAR_val.txt" You can do this by:
python utility/random_split.py dataset/CPTAC-SAR.txt --ratio 0.8 
  • Start training by
python train_cgan.py -t CPTAC-SAR_train.txt --val CPTAC-SAR_val.txt -it dcm -ch 480 -cw 480 -g 0 -e 50 -gc 32 64 128 -l1 1.0 -l2 0 -ldis 0.1 -ltv 1e-4 -R images -o result -vf 2000

It takes a while. You will see some intermediate outputs under "result".

  • Denoise validation images (which are not used for training) using the trained model "gen_50.npz".
python convert.py -cw 512 -ch 512 -b 10 --val CPTAC-SAR_val.txt -it dcm -R images -o converted -m result/??/gen_50.npz -a result/??/args 

where ?? denotes the date_time automatically assigned for the output of the training. The converted images will be found under "converted".

You can also use this trained model to denoise any other images placed under "testImg" with a similar noise characteristics

python convert.py -cw 512 -ch 512 -b 10 -it dcm -R testImg -o converted -m result/??/gen_50.npz -a result/??/args