thesis ...
"Phase-Only Optical Information Processing"
University of Edinburgh, Duncansapien, 1992.
Index Chapter 1 2 3 4 5 6 7 8 9 (Edinburgh Research Archive version)
Chapter 4
Optical Correlation
Introduction
Chapters one, two and three of this thesis are concerned with elementary phase filtering operations on the spatial frequency spectra of a class of objects exhibiting spatial variations in phase only. Within each type of phase filtering operation (Schlieren, Dark Ground, Phase Contrast etc.) the filtering operation is independent of the particular phase object characteristics. Commonly, the purpose of the filter is to render the phase structure of the object visible as an intensity variation as described in the last chapter.The second half of this thesis concentrates on a class of phase filters which allow much more intricate operations to be performed on the object spectra, and are subsequently more complicated in form. Specifically, attention is turned to the subject of spatial filters which allow certain pattern recognition operations to be performed. The object is usually an amplitude-only object so that whereas previously the complexity of the problem lay in the object structure, now the complexity lies in the structure of the filter.
Pattern Recognition
The purpose of this chapter is to introduce the general terminology of two dimensional spatial filtering, and in particular, pattern recognition filters. The construction and principles of operation of a two dimensional reconfigureable Fourier plane filter (a Spatial Light Modulator or SLM) is detailed in chapters five and six. Specifically, optical pattern recognition experiments have been performed with this device, and form the subject of chapters seven and eight. As such, this chapter lays an essential framework within which the subsequent work of this project should be set against.In the first section, the concept of two dimensional phase filters with a frequency dependent complex transmittance is introduced. Spatial filters which allow pattern recognition tasks to be performed optically, such as the classical Vander Lugt matched spatial filter, are defined. Examples of practical modern day filters (SLMs) are given in section two together with a mathematical background to such effects as filter pixellation. In section three a major constraint upon many spatial light modulators, that of single parameter modulation, is discussed with relevance to its effect on phase-only correlation filters. Finally, section four discusses the merit of alternative correlation techniques and their relevance to this project assessed.
1 Two Dimensional Spatial Filtering
The alteration of the spatial frequency spectrum of an object is known as `spatial filtering'. Appendix one provides a brief review of the standard optical processing bench on which such operations are usually performed. A single spatial frequency present in the object has both an associated amplitude and phase, which is to say a number describing the spatial offset of that component from the origin of the object plane. In the frequency plane, the amplitude information of a single frequency is represented by the Point Spread Function of the transform lens, which is usually a sharply peaked function, and the phase as a temporal delay of the light field at that point.
1.1 Mathematics of Correlation Filters
[Update 2006:An excellent external link: Convolution of Two Functions (PDF) from Will Hossack's home page, my PhD supervisor. ]
One particular optical processing operation has been studied
extensively, that of optical correlation. Consider the spatial
frequency spectrum of an object g(x,y) which is conventionally denoted
by G(nx,ny). The spectrum can be separated into a product of
two functions, one entirely REAL and describing the amplitude of
G(nx,ny) and the other complex, describing the phase
distribution so that
| (1) |
It is well known [28] that a filter H(nx,ny) of form
| (2) |
These
results are at the very heart of most pattern recognition filters, for if
it is desired that a particular object is identified in an input scene
to a processor, one way of detecting the presence of that object is to
use a spatial filter having identical amplitude and conjugate phase to
the object in the frequency plane. The function g(x,y) is called the
`target' object. Suppose an input scene to the correlator contains several
different, spatially distinct objects fi(x,y).
The image plane will contain the
cross-correlations of the target g(x,y) with the input functions
fi(x,y). Should one of the fi(x,y) prove identical to the target
function then the image plane will contain the auto-correlation of
g(x,y) instead. Mathematically, the cross-correlation of
two functions g(x,y) and f(x,y) is
| (3) |
| (4) |
If the power P contained in two functions g(x,y) and f(x,y) is
identical, power defined in the usual way as
| (5) |

Figure 4.1: Optical Correlation
1.2 Vander Lugt Matched Filter
The first optical implementation of matched spatial filtering was carried out holographically by Vander Lugt [29]. The spectrum of a target object g(x,y) is caused to interfere with an off axis reference beam in the Fourier Plane and the resulting interference pattern recorder on photographic emulsion. Once the emulsion is developed, it may be reinserted into the Fourier plane to act as a matched spatial filter. If the input object is replaced by another of transmittance f(x,y), illuminated by a plane wave and the reference beam is removed, it can be shown [30] that three beams emerge from the far side of the holographic filter. One beam emerges at zero angle to the optical system and is focussed to the origin of the image plane, but contains no extractable information. Two further beams emerge at equal but opposite angles from the optical axis of the system and are again focussed to the image plane. One contains the convolution of the target function f(x,y) with the object function g(x,y) and the other contains the cross-correlation of the two functions. Until the advent of fast, reconfigurable spatial filters (to be described in section 4.2) much work on optical correlation utilised this experimental arrangement. See, for instance, the work of Casasent & Furman [31]. Although the matched spatial filter produces the highest signal-to-noise ratio (SNR, to be discussed shortly) in the image plane [32], there exist several disadvantages to its use in practice. Section 4.2 lists some of the drawbacks of this technique and describes alternative methods of implementing correlation optically. All of these, however, are modifications of the basic Vander Lugt matched spatial filter described here.
Figure 4.2: Vander Lugt Holographic Filter
Improved Discrimination
For many objects however the difference in brightness between the autocorrelation and cross-correlation arising from use of a matched spatial filter is quite small. It is thus difficult to discriminate between a recognised target by the brightness of the correlation peak compared with the surrounding cross-correlation peaks. Also, the peak need not necessarily be very sharp, so that even the auto-correlation may be a broad, dim function.It has been shown however that discrimination is greatly improved if the input scene has undergone an edge enhancing operation before being used as input to the correlator (as commonly performed by the subtraction of the mean value of the object signal throughout the function, for example). The low spatial frequencies present in the object spectrum, which `fill in' large, fairly uniform areas of the object, are suppressed in such an operation in deference to the high spatial frequencies which define the edges of the object. Research performed by Horner and Bartelt [33] also indicates that binarisation of the input scene combined with a filter matched to the binary object, rather than a continuous tone object, results in a brighter correlation peak value and an improved signal to noise ratio (SNR). The precise specification of what the SNR measures, together with a parameter known as the optical efficiency hH, are defined as follows:
- The SNR used was defined as the ratio of the peak correlation value to the rms noise outside of a 50% of peak threshold level.
- HORNER efficiency hH is defined as the ratio of the energy within the 50% of peak threshold level to the total energy falling on the output plane of the correlator.
Table 4.1 compares the correlation peak intensity (as a fraction of that obtained by a classical matched filter), SNR and HORNER efficiency for several combinations of input and filter type. Subscripts CTS and BIN refer to the object type from which the filter was made. These results are from Horner and Bartelt, where the input scene was a doll's face, all results being computer simulations.
| Input Object | FilterCTS | FilterBIN | |
| Continuous | SNR | 4.1 | 4.1 |
| R02 | 1.0 | 1.3 | |
| hH | 6.3% | 3.1% | |
| Binary | SNR | 4.1 | 7.1 |
| R02 | 1.1 | 3.4 | |
| hH | 1.5% | 0.7% |
Thus the highest SNR ratio and intensity of autocorrelation peak occur when a filter is made from a binarised input scene and the same binary object is presented at the input. In this case however optical efficiency decreases because although the peak is made sharper and higher, less energy goes into the peak region as a whole than it does with a classical matched filter using a continuous object.
In section 4.3.3, the same three parameters used here are again compared but for a filter which is matched only to the phase of the target object. It will be argued that a phase-only filter should give much improved results and a similar table to that shown above is presented, again quoted from Horner and Bartelt, to quantify this improvement.
2 Practical Filters
In practice, the matched spatial filters of Vander Lugt suffer from the severe drawback of lengthy preparation time and the need for a physically new filter for each target object. A significant increase in the speed of the whole process may be achieved by encoding several matched filters on a single piece of film. Such a filter is known as a `frequency-multiplexed' or `composite' filter [34], [35]. To avoid overlapping of the autocorrelations in the image plane, the holographically stored matched spectra of a number of different input objects are recorded using a plane wave reference beam with an object dependent angle to the optical axis. An important strength of this technique is the ability to store, on a single filter, a number of filters matched to both scaled and rotated versions of a single target object. Using computer generated holograms, this field has been investigated by Leger and Lee [36].A practical use of an optical correlator has been suggested by Johnson [37] which serves to highlight this problem. If a small video camera views the entrance to a doorway it can send signals which, if suitably displayed, can be used as the input scene to a compact optical processor (The means whereby this might be effected are discussed shortly). The frequency spectrum of the input scene can then be filtered so as to produce a correlation peak whenever a particular face appears at the input to the system. This has obvious security implications. Due to the large number of variations in subject distance from camera, aspect ratios of the face and facial expressions, it would be required to scan through a number of filters in rapid succession for each time selected input scene, each filter pre-stored and calculated to recognise the face at various distances, etc. Holographic filters could be used if mechanically replaced rapidly enough but owing to the large number envisaged this scheme would be severely limited in the number of different people it could `recognise', even if frequency-multiplexing were used.
Spatial Light Modulators
It would clearly be advantageous to update the filter pattern without physical removal of the filter. To date, a large number of devices exist which can act as reprogrammable spatial filters, and are known as `Spatial Light Modulators' or `SLM's. In general, a spatial light modulator is a device which may be used to impress information onto a wavefront, so that the information may represent an input image scene as an amplitude variation or a spatial filter as phase variations, for example.Actual spatial light modulators are constrained by many factors to perform modulation of either amplitude or phase, but not both together. However, it is a frequent occurence for a phase modulator to also, sometimes unavoidably, introduce a small degree of amplitude modulation as well and vice versa for an amplitude modulator. These are aberrations of the filter and can usually be tolerated if small enough, but have led to some authors referring to specific SLMs as either `phase-mostly' or `amplitude-mostly' modulators [38], [39].
A further consideration in physical devices is that the modulation parameter frequently takes one or other of only two values, and such devices are said to have a binary mode of operation. For amplitude-only modulators this is usually adequate if it is desired either to pass or block a group of spatial frequencies, for example. Of far more concern is the effect this has on phase modulating filters. This forms the subject of section 4.3.3.
SLMs are subclassified as either pixellated, a necessity arising from the need to assign a stored memory location value to a particular position on the surface of the modulator, memory locations being discrete, or non-pixellated. The process of loading the information onto the SLM is known as `addressing'. Pixellated devices are usually electrically addressed whereas non-pixellated devices are commonly addressed optically, though this is a generalisation.
4.1 Optical Addressing
As an example of what is meant by optical addressing, it will prove instructive to examine the operation of an SLM known as the `Hughes Liquid Crystal Light Valve' [40], [41], the operation of which is depicted in figure 4.3.
Figure 4.3: Operation of the Hughes LC Light Valve SLM
In this device, an image is focussed onto the front face of the SLM using an incoherent beam of light. A photoconductive layer of cadmium sulphide behind the glass substrate experiences a decrease in resistance as light falls on the surface, causing the voltage dropped across a layer of nematic liquid crystal to decrease. The light modulating effect is actually quite involved and uses a phenomena known as the hybrid field effect, but the specific effect is not of concern here. A coherent beam of light is reflected off the back face of the device and as it travels through the liquid crystal layer experiences a spatially varying optical effect which is primarily one of polarisation guidance in this case. By this means, the information of an incoherent signal is imprinted onto the wavefront of a coherent signal which can be used, for example, as the input stage to an optical processor.
4.2 Pixellated Devices
Whilst optically addressed devices are suitable for incoherent to coherent conversion in the input plane to an optical processor, they are of limited practical use as frequency filters with one notable exception - the Joint Transform correlator (See section 4.4). Most filters are calculated computationally and pixellation of the devices allows interfacing of the SLM to a computer for filter update. Consequently, most SLMs are pixellated devices requiring a discrete number of values to describe the modulation characteristics of each pixel. Figure 4.4 illustrates the general characteristics of a pixellated spatial light modulator in the frequency plane.
Figure 4.4: Characteristics of a Pixellated Spatial Light Modulator
The pixels may either be transparent (a transmissive SLM) or reflecting
(reflection mode SLM), in which case the reflected beam is isolated by
using a beamsplitter placed in front of the device.
The finite extent of the filter results in a maximum frequency passed by
the device, each pixel covering a range of spatial frequencies which it
ideally modulates in an identical fashion. If `FL' denotes the
focal length of the lenses used in the processor, `xT' the physical
distance along the x-axis of the frequency plane then the
frequency-distance relationship has been given as
| (6) |
| (7) |
If the pixels are separated by a physical distance `dp', the
corresponding separation in frequency can be found. According to the
`sampling theorem' [] a sampling in frequency space at interval
dn results in a replicated image of the object, each
replication separated by a distance [ 1/(2dn)]. To avoid
overlapping images the object width OW must not exceed this value,
so that
| (8) |
| (9) |
This result shall be of central importance in chapter 7 and shall be discussed with reference to a specific optical processing system, and is noted here as a figure of merit of an SLM. Notably, the greater the number of pixels the better becomes the SBP becomes allowing either improved image resolution or larger object dimensions for the same degree of resolution.
4.3 Electrical Addressing
One commercially available electrically addressed SLM is known as the Litton magneto-optic SLM or LIGHT-MOD [42], [43] as it is more commonly known, utilises the effects of Faraday rotation as a light modulation mechanism. The LIGHT-MOD is a 48×48 pixellated transmissive array2, the pixels defined by the areas of intersect of a network of `drive lines' in which currents are caused to flow. By suitably addressing the device, the resulting currents in the drive line network cause the magnetisation vector in the region of a pixel to lie in one of two possible directions. Consequently, a linearly polarised light field normally incident on the array has its polarisation vector rotated in either a positive or negative sense according to the magnetic field on each pixel by the Faraday effect. By a suitable arrangement of polarisers before and after the SLM either binary amplitude or binary phase modulation may be achieved.This device has been used extensively in experimental studies, most notably by Flannery et al [44] in 1986 where the results of an initial investigation into optical binary phase-only filtering were published. `Excellent agreement' between computer simulation and the experimental correlations was found and a photograph was presented showing two bright, distinct spots of light in the output plane of the optical correlation bench. Further experimental correlation results from a very thorough investigation were published in 1988 where an impressive agreement with predictions from simulations was obtained [45].
The space-bandwidth product (as determined by the number of pixels along one axis of the filter) of this device is a factor of three times higher than that of the 16×16 array used in this project. Thus the results of Flannery et al, particularly those of reference [44], may be used as a benchmark for comparison with those of this project. More shall be said of this in chapter eight.
4.4 Information Function
In the mathematical description of a pixellated filter it is often useful to form an expression describing the modulation parameter of each pixel of the SLM. Individual pixel modulation parameter settings are described by an information function v(x,y), the local value of which is identical to the modulation parameter at any given pixel location. Consider the description of an SLM used in the object plane: the array of pixels is commonly described by a Dirac comb function [], convolved with a pixel function describing the shape of each pixel (assumed not to vary over the SLM). Reducing to one dimension for simplicity, and assuming the filter is so large that the summation may be extended to ±¥, the light modulation performed by the SLM can be written as
| (10) |
|
| (12) |
| (13) |
Optimum Sampling
Note that if the bandlimit of the information function is too wide ( > [ 1/(D)]) then the spectral replicas overlap and aliasing occurs, so that high frequencies from the outer replications wrongly appear in lower frequency locations of the zero order replication. (Figure 4.5).
Figure 4.5: Aliasing arising from too low a sampling interval
This occurs if the SLM sampling
interval D is too large as may be seen from equation 4.13. `Optimum
sampling' refers to the situation where the spectral replications just
touch at their peripheries, so that if nL is the highest
frequency required in the information function then
| (14) |
3 Single Parameter Correlation Filters
The general characteristics of practical filters (pixellation, bandwidth, etc.) have been introduced. The primary limitation of many pixellated SLMs is the binary mode of operation. As stated in chapter one, it is a primary objective of this project to demonstrate the capabilities of a low space-bandwidth-product, pixellated, binary-mode SLM as an optical correlator. Therefore it is required now to ascertain the likely performance of correlation filters which have only two phase values allowed - binary phase-only filters or BPOFs, as they are commonly known. Specifically there are three points to be addressed.- Given that the vast majority of SLMs available can function either as binary amplitude-only or binary phase-only filters, which modulation parameter should be chosen to represent a pixellated correlation filter ?
- The classical `matched' correlation filter requires both amplitude and phase filtering operations to be performed. Whichever modulation parameter is chosen, how will the filter perform with only one modulation parameter ?
- What effect on filter performance does the quantisation of the modulation parameter to only two values have ?
3.1 Choice of Modulation Parameter
Consider an object centered in the object plane of an optical processor. At the image plane it is desired to obtain the cross-correlation of this object with some target object, which is a sharply peaked function centered about the origin of the image plane. The phase of the filter serves to cancel out the phase of the object spectrum so that no phase variations exist over the frequency plane. As such, the complex light field immediately behind the filter behaves like a plane wave but with a spatially varying amplitude over the wavefront. This wave is focussed down to the center of the image plane and the resulting sharply peaked image plane light distribution is the cross-correlation of the object with the target object from which the filter was calculated. It is likely that sharper still focussing would occur if the amplitude over the surface of the plane-type wave was uniform, so that the light field immediately after the filter perfectly resembles a plane wave in both amplitude and phase.
Looking at the process in another way, setting the phase of the
spectrum to be zero for each and every Fourier component means that the
spatial offset of each spatial frequency in the image is zero. Thus if
the image amplitude g(xi,yi) is described as a Fourier integral
| (15) |
As such, it would seem that the phase information of the filter is primarily responsible for the correlation process. Indeed, the phase spectrum of an object is unique, whereas the amplitude spectrum need not be [24]. Correlation performed with an amplitude-only filter is compared to that from phase-only filters in reference [46] where it is also concluded that an amplitude-only filter is virtually useless.
Having established which parameter is the more important, the effect of dropping the other modulation parameter - amplitude - is now considered. This will be aided by studying some particular types of phase filter.
3.2 Phase Only Filters
Actual correlation filters commonly differ from the matched filter thus far introduced. The matched filter, as described by equation 4.2, has an amplitude equal to that of the Fourier Transform of the target object. The phase-only filter however (POF) has an amplitude everywhere equal to unity, and is of form
| (16) |
As mentioned earlier, practical filters (SLMs) most commonly modulate only a single parameter. Further, this parameter frequently is allowed to take on just one of two possible values. From the discussion above the choice of modulation parameter in such a device should be phase rather than amplitude in a practical optical correlation system. Knowing the relative importance of the phase information of a spectrum, it is important that a theoretical basis be laid which shows that binarisation of phase is an allowable procedure in the first place. Such an analysis has already been published and is summarised here.
Phase Quantisation
As mentioned earlier, the vast majority of phase modulating SLMs cannot perform continuous phase modulation. Their modulation capabilities are quantised, frequently to only two levels, and a proper understanding of the effects of phase quantisation is thus required.
The effects of phase quantisation of the Fourier Transform on the image
has been analysed in detail by Goodman and Silvestri [].
They found that the
image was described by a series of terms relating to the number of phase
quantisation levels. Specifically, if N denotes the number of
quantisation levels and g¢(x) the actual image amplitude obtained then
| (17) |
|
This work forms the basis of a theoretical explanation as to why binary phase-only filters still work. The ideal Fourier Transform of a continuous phase-only filter should resemble the object function from which the filter was calculated. In reducing the allowable phase values to only two the above analysis reveals that the primary image still exists, though attenuated, together with a number of unwanted (modified) images superimposed. In the next subsection, simulations performed by Horner et al indirectly provide a quantitative comparison of the effects of these `false' images on the correlation arising from a binarised phase filter with the correlation arising from a continuous phase-only filter.
3.3 Binary Phase Only filters (BPOFs)
Binarisation of the filter phase must be done in many spatial light modulators due to their binary mode of operation, more of which shall be discussed later. A very large number of binarisation algorithms have been proposed, the aim of which is to produce a single number with which to characterise the phase of a single SLM pixel. This number most commonly takes the value 0 or p radians, arising partly because some SLMs can only retard one pixel by p radians relative to another. In order that a binary phase value be calculated, the target object is modelled on a computer and the Fourier Transform taken. The phases F(nx,ny) of all points lying in the physical extent of any pixel are binarised according to a relation such as
|
| MF | POF | BPOF | |
| FilterCTS, Object Continuous | 1 | 125 | 36 |
| FilterBIN, Object Binary | 3.4 | 1191 | 471 |







