An Introduction to Smoothing

Smoothing is a process by which data points are averaged with their neighbours in a series, such as a time series, or image. This (usually) has the effect of blurring the sharp edges in the smoothed data. Smoothing is sometimes referred to as filtering, because smoothing has the effect of suppressing high frequency signal and enhancing low frequency signal. There are many different methods of smoothing, but here we discuss smoothing with a Gaussian kernel. We hope we will succeed in explaining this phrase in the explanation below.

This page is designed to be read in conjunction with the matlab commands that were used to make the figures. The code makes some of the ideas a little clearer, to those with some familiarity with matlab. The matlab commands are contained in the file smoothtalk.m

Some example data for smoothing

Here is a set of data, made out of random numbers, that we will use as a pretend time series, or a single line of data from one plane of an image.

The numbers were generated with matlab, by creating 40 successive random numbers from a normal distribution. See the smoothtalk.m file for how to do this.

The Gaussian kernel

The 'kernel' for smoothing, defines the shape of the function that is used to take the average of the neighbouring points. A Gaussian kernel is a kernel with the shape of a Gaussian (normal distribution) curve. Here is a standard Gaussian, with a mean of 0 and a sigma (=population standard deviation) of 1.

In the standard statistical way, we have defined the width of the Gaussian shape in terms of sigma. However, when the Gaussian is used for smoothing, it is usual to describe the width of the Gaussian with another related measure, the Full Width at Half Maximum (FWHM).

The FWHM is the width of the kernel, at half of the maximum of the height of the Gaussian. Thus, for the standard Gaussian above, the maximum height is ~0.4. The width of the kernel at 0.2 (on the Y axis) is the FWHM. As x = -1.175 and 1.175 when y = 0.2, the FWHM is in fact 2.35. The FWHM is related to sigma by the formula (matlab format):

FWHM = sigma * sqrt(8*log(2))

Smoothing with the kernel

The basic process of smoothing is very simple. We proceed through the data point by point. For each data point we generate a new value that is some function of the original value at that point and the surrounding data points.

With Gaussian smoothing, the function that is used is our Gaussian curve..

So, let us say that we are generating the new, smoothed value for the 14th value in our example data set. We are using a Gaussian with FWHM of 4 units on the x axis. To generate the Gaussian kernel average for this 14th data point, we first move the Gaussian shape to have its centre on 14 on the x axis. In order to make sure that we don't do an overall scaling of the values aftern smoothing, we then divide the values in the Gaussian curve by the total area under the curve, so that the values add up to one (see the smoothtalk.m file for how to do this in matlab):

We take the values of our resulting function (from the y axis), at each of the points in the data (the x axis). Thus we generate Gaussian function values for 13 12 11 etc, and 15 16 17 etc. This gives us a discrete Gaussian.

In fact the Gaussian values for 12 13 14 15 and 16 are:

    0.1174    0.1975    0.2349    0.1975    0.1174

and the data values for the same points are:

    1.0645    0.3893    0.3490   -0.6566   -0.1946

We then multiply the Gaussian values by the values of our data, and sum the results to get the new smoothed value for point 14. Thus, the new value for point 14 is ... + 0.1174*1.0645 + 0.1975*0.3893 + 0.2349*0.3490 + 0.1975*-0.6566 + 0.1174*-0.1946 + ...

We then store this new smoothed value for future use, and move on, to x = 15, and repeat the process, with the Gaussian kernel now centred over 15. If we do this for each point, we eventually get the smoothed version of our original data (see the smoothtalk.m file again for a simple but inefficient way of doing this):

Other kernels

Of course, we could have used any shape for the kernel - such as a square wave:

This would have the effect of replacing each data point with a straight average of itself and the nieghbouring points.

Smoothing in 2D

Smoothing in two dimensions follows simply from smoothing in one dimension. This time the Gaussian kernel is not a curve, but a cone. Here is what such a cone looks like when placed over the central point of a plane:

and the same thing with discrete values for each pixel in the image.

We then proceed as before, multiplying the values of the kernel (as shown in the figure above) by the data in the image, to get the smoothed value for that point, and doing the same for every point on the image.

The procedure is the same for 3D data, except the kernel is rather more difficult to visualize, being something like a sphere with edges that fade out, as the cone fades out at the edges in the 2D case.

In fact, it turns out that we don't have to generate these 2D and 3D versions of the kernel for the computations, because we get the same result as we do by applying the full 2 or 3D kernel, if we simply apply a one dimensional smooth sequentially in the 2 or 3 dimenensions. Thus, for 2 dimensions, we could first smooth in the x direction, and then smooth the x-smoothed data, in the y direction.

Why smooth?

The primary reason for smoothing is to increase signal to noise. Smoothing increases signal to noise by the matched filter theorem. This theorem states that the filter that will give optimum resolution of signal from noise is a filter that is matched to the signal. In the case of smoothing, the filter is the Gaussian kernel. Therefore, if we are expecting signal in our images that is of Gaussian shape, and of FWHM of say 10mm, then this signal will best be detected after we have smoothed our images with a 10mm FWHM Gaussian filter.

The next few images show the matched filter theorem in action. First we can generate a simulated signal in a one dimensional set of data, by creating a Gaussian with FWHM 8 pixels, centred over the 14th data point:

Next, we add some random noise to this signal:

We then smooth with a matching 8 pixel FWHM filter:

and recover our signal well from the noisy data.

Thus, we smooth with a filter that is of matched size to the activation we wish to detect. This is of particular relevance when comparing activation across subjects. Here, the anatomical variability between subjects will mean that the signal across subjects may be expected to be rather widely distributed over the cortical surface. In such a case it may be wiser to use a wide smoothing to detect this signal. In contrast, for a single subject experiment, where you want to detect (for example) a thalamic signal, which may be in the order of a few mm across, it would be wiser to use a very narrow smoothing, or even no smoothing.

Finding the signal for any smoothing level

Sometimes you do not know the size or the shape of the signal change that you are expecting. In these cases, it is difficult to choose a smoothing level, because the smoothing may reduce signal that is not of the same size and shape as the smoothing kernel. There are ways of detecting signal at different smoothing level, that allow appropriate corrections for multiple corrections, and levels of smoothing. This Worsley 1996 paper describes such an approach:

Worsley KJ, Marret S, Neelin P, Evans AC (1996) Searching scale space for activation in PET images. Human Brain Mapping 4:74-90

Another promising method is to use wavelet transforms; see: Federico Turkheimer's wavelet introduction

Matthew Brett (FB) 19/8/99

CBU Imaging Website.
© Copyright MRC Cognition and Brain Sciences Unit 2005.
Page maintained by: daniel.bor@mrc-cbu.cam.ac.uk

Guidelines/Help

Theory

Computer Support/Software

Admin/Mailing Lists (internal)

EEG/MEG Lab

CBU Homepage

fMRI Guidelines

PET Guidelines

Getting Help

Links

Planning an fMRI Study (internal)

fMRI Design

MRI Scanning Protocols

fMRI Preprocessing

fMRI Analysis

fMRI Graphical

Pre-SPM Preprocessing

Image Diagnostics

SPM Preprocessing

Getting The Data (internal)

Data Reconstruction

Converting to Analyse

From 4D to 3D (ana4dto3d)

Using pvconv

Using ImageJ (internal)

Problems With Voxel Sizes

Slice Timing Issues

Removing Bad Scans

EPI Undistortion

Mpr (coreg tool)

Masked Normalisation

Skull Stripped Normalisation

Background

Improvement in Power

Acquiring fieldmaps

What You Need

Guide and Reference

Step by Step Example

Batch Mode

Download

SPM 2 Analysis Defaults

General Analysis How-to (External)

Analysing without Bad Scans

Small Volume Correction

Getting % signal change

Converting between MNI and Talairach

SPM Related

Plots and Adjusted Data

Display Slices

Graphical Output

How to plan/run a PET Study (internal)

PET Scanning

PET Preprocessing

PET Analysis

PET Graphical

Mpr (coreg tool)

SPM 99 Analysis protocol

PET Contrasts

Small Volume Correction

Automating PET Analyses

Converting between MNI and Talairach

Old (SPM 96) Analysis Protocol

Removing Time and Movement Effects

Graphical Output

Comprehensive Imaging Guide (internal)

People to ask for help (internal)

Points of Contact (internal)

Getting SPM Help

Brain Anatomy

fMRI Related

Imaging Journals

SPM Theory

fMRI Design

Genes and fMRI (internal)

The Commissures

Brodmann Areas/Talairach Co-ords

Bruker Data Format

Analyze Data format

Intro To PET Headcurves

Rik Henson's SPM Mini-Course

SPM Data format

Slice Timing Issues

Intro to Smoothing

SPM Stats

Random Field Theory

Small Volume Correction

Examples of headcurve timings

SPM Software Related

Programming Help

Unix Help

Software to Download

Misc

VNC Advice (internal)

Versions of SPM Available

SPM Batch Mode

SPM Tips

Getting SPM Help

Configuring P4 Machines for SPM

SPM Benchmarks

MarsBar Tutorial

Batch to Display thresholded SPMs

SPM 2 Batch Scripts

Visual Basic Help

Learning Matlab

DMDX Help

Scannersync

Zil (DMDX) Utility

Cygwin

MEx and GCC

Dearchiving tar.gz Files

Byte Conversion Utilities

MakefMRIBatch Utility

DBedit

Flirt

pvconv.pl

Ana4Dto3D

Image Viewer

SPM Utilities

Zil (DMDX) Utility

Display Slices

Marsbar (ROI tool)

Mpr (coreg tool)

Vol_corr

Fieldmap Undistortion

Using Analyze

Reading pdf and ps Files

Archiving and Restoring Data (internal)

MakefMRIBatch Utility

Byte Conversion Utilities

DBedit

Flirt

Image Viewer

IV_Volumetool

Intro to EEG/MEG

Tools For Basic Processing

Tools For Source Analysis

Comprehensive Imaging Guide (internal)

Imagers Interest Group Meetings

Proposing a study (internal)

LREC Form (internal)

Arsac Form (internal)

Insurance Form (internal)

Requesting fMRI Slots Form (doc) (internal)

fMRI Booking Sheet (internal)

CBU Imaging Management Committee (internal)

Points of Contact (internal)

Recruiting Subjects (internal)

Mailing Lists (internal)