CISC 472 - Similarity Measures

An image is a 2D or 3D array of pixels, each containing a grey value (typically in 0..255).

Give two images, image registration finds the transformation of one image so that it "best" aligns with the other image.

Image registration can be intra-modal, where both images are acquired with the same modality (e.g. x-ray, CT, MR, PET, US), or can be inter-modal, where the images are acquired with different modalities.

Image registration can be

2D/2D: These are typically slices of a patient from a CT or MR scan, or projections of a patient from an x-ray. In lung cancer, for example, multiple x-rays of the same patient may be taken over a long period and the radiologist would like to align the images to see if any changes occur between one image and the next.
2D/3D: This is done when a slice of the patient, or a projection of the patient, must be aligned with a 3D volume. For example, an ultrasound (US) slice could be aligned with a CT volume to determine where the US is in the CT. Or an intra-op 2D fluoro image (i.e. projection) of a patient could be aligned with pre-op CT scan so that the patient can be registered to the CT, in which surgical planning might have been made.
3D/3D: Not as common, but whole 3D volumes are aligned like 2D images are aligned.

Any image registration method needs:

A similarity measure to measure how well two images align. One of the images will have been transformed to lie on top of the other image, and the similarity measure gives an indication of how well that transform causes the images to align.
An optimization algorithm to determine how to find the transformation of one image so that it optimally aligns with the other image. An optimal alignment maximizes (or minimizes) the similarity measure.
An interpolation method to allow pixel values to be evaluated at positions that are not pixel centres. When a transformation of scaling, rotation, and translation is applied to one image, its pixel centres will almost never align perfectly with the pixel centres of the other image. Since the similarity measure usually iterates over pixels of one image and compares them to the pixels of the other (transformed) image, we need to get pixel values at any position in the transformed image ... not only at the pixel centres.

We will consider only 2D/2D image registration for the moment.

Given two images, $I_1$ and $I_2$, where one of the images has already been transformed to overlap the other image, a similarity measure,

$S( I_1, I_2 )$

determines how "well" the images align.

Some similarity measures are

Root-mean-square error (RMSE)
Cross correlation
Normalized cross-correlation (NCC)
Mutual information (MI)

RMS Error

RMS Error just measures the squared difference between corresponding pixels:

$S(I_1,I_2) = \sqrt{ {1 \over N} \sum_{ij} (I_1(i,j) - I_2(i,j) )^2 }$

This is a poor measure, as it requires that the two images be of the same modality and captured with the same imaging parameters.

Cross-Correlation

The cross-correlation is the product of corresponding pixels:

$S(I_1,I_2) = \sum_{ij} I_1(i,j)\ I_2(i,j) $

This is really a dot product, where the pixel values of each image are laid out in a one-dimensional "image vector", and the dot product of the two image vectors is computed.

The dot product is maximized when the vectors are parallel. Similary, the correlation is maximized when each pixel of one image is a scalar multiple of the corresponding pixel of the other image ... that is, when the images are perfectly aligned.

This measure is also known as the Pearson correlation coefficient.

Normalized Cross-Correlation

NCC determines the correlation of the two images, with each image normalized to so that its mean pixel value is zero and its standard deviation is one:

$S(I_1,I_2) = \large \sum_{ij} ({ I_1(i,j) - \mu_1 \over \sigma_1 }) ({ I_2(i,j) - \mu_2 \over \sigma_2 } )$

where the $\mu$ and $\sigma$ are the means and standard deviations of their respective images.

This is a better measure, as it doesn't matter if the images have different brightnesses (which is corrected by the subtraction of the mean) or different contrasts (which is corrected by the division by the standard deviation). However, there's an expectation that the pixels of the two images have the same normalized values at optimal alignment. This will not be true if the images come from different modalities.

Mutual Information

Mutual Information (MI) is a very commonly-used similarity measure. But we need to discuss joint histograms and entropy before considering MI.

Those topics will be discussed in the next two lectures.

Image Registration

Similarity Measures

RMS Error

Cross-Correlation

Normalized Cross-Correlation

Mutual Information