Free PhD eTheses 


1.  A Modified Probabilistic Network for Signal Processing and Pattern Recognition, by Anthony Zaknich, 1995. Download zipped pdf (1.7 MB)

    

2.  A Swept-Carrier Technique for Underwater Acoustic Communications, by Gareth J. Cook, 1999. Download zipped pdf (5.5 MB)

 

3.  Approximate Dynamic Programming with Adaptive Critics and the Algebraic Perceptron as a Fast Neural Network related to Support Vector Machines, by Thomas Hanselmann, 2003. Download zipped pdf (4.8 MB)

 

4.  Equalization of Nonlinear Communications Channels with Practical Large Margin Classifiers, by James Ping Young, 2003. Download zipped pdf (1.7 MB)


A Modified Probabilistic Network for Signal Processing and Pattern Recognition

Anthony Zaknich

The University of Western Australia, May 1995

 

This thesis identifies some of the features and major limitations of current artificial neural network architectures and learning laws and presents an alternative class of practical non-parametric and semi-parametric  networks which are quick and easy to train and can be realised with present or foreseeable technology. An approach is adopted which guarantees engineering solutions to a wide range of nonlinear signal processing problems. This has been achieved by utilising as a basis an artificial neural network structure which is theoretically well founded and understood and easily realisable. The modified probabilistic neural network structure has been invented and extended from the probabilistic neural network classifier. The probabilistic neural network is derived from Bayes' theory and Parzen non-parametric windowing techniques. The main problems with utilising the probabilistic neural network are that it is only a classifier and that its implementation requires a large memory storage and access for complex problems.  An adequate memory size can not be predetermined for general network design. The modified probabilistic neural network, however, is based on a semi-parametric approach and can be implemented with a fixed memory size which is related to the resolution of the data quantisation devices and the required accuracy. It is most similar to Specht's general regression neural network and the network developed by Moody and Darken and has significant similarities with Albus' cerebellar model articulation controller and Kohonen's self-organising map. The generic aspect of the work is shown by giving solutions to a number of applications to show the utility and effectiveness of the modified probabilistic neural network. Performance comparisons are made with other artificial neural networks and more established methods including the multi-layer perceptron with backpropagation-of-error, adaptive least mean squares linear finite impulse response filter and nonlinear second and third order Volterra filters. These comparisons demonstrate the applicability and implementation advantages of the new artificial neural network architecture. Some optoelectronic and VLSI implementation ideas are developed to show how easily these technologies can be used to build modified probabilistic neural network hardware realisations. A significant contribution of the thesis is the development of an effective and practical general artificial neural network methodology and design for application to a broad class of nonlinear time series signal processing problems.

 

Download zipped pdf (1.7 MB)


A Swept-Carrier Technique for Underwater Acoustic Communications

Gareth J. Cook

The University of Western Australia, December 1999

 

Underwater acoustic channels present a number of challenges to the designers of communications systems. One of the their most important characteristics is multipath propagation, which severely limits data rate and system stability.

 

This thesis addresses this problem of multipath propagation in underwater acoustic channels through the proposal of a novel communications signal. The signal was inspired by a spectral measurement known as time-delay spectrometry (TDS). It has a form which identifies it with a class of signals previously defined as swept-carrier signals. The original material presented in this thesis can be considered either as a special application of TDS or an extension to earlier work in swept-carrier communications.

 

TDS was originally devised as a means of measuring the frequency response of loudspeakers in reverberant environments. As such, it has two characteristics which the signals proposed in this thesis seek to exploit. These are:

1.    multipath suppression;

2.    system identification

 

Under certain somewhat idealised conditions the problem of multipath interference can be avoided through careful design of the swept-carrier signal. Most of the theoretical results derived in this thesis assume idealised suppression of multipath.

 

The spectral characteristics of the swept-carrier signal were examined through use of convenient intuitive approximations and also through rigorous mathematical analysis, resulting in expressions for the power density spectrum. The error performance of the swept-carrier signal was determined for a particular subclass of discrete multipath channels (which includes the additive white Gaussian noise channel).

 

The proposed communication technique was observed to be similar in principle to the classical frequency stepping approach commonly supported in many commercial acoustic modems. The channel conditions under which the proposed technique would be considered appropriate coincide with the conditions normally associated with the frequency stepping approach. It was shown that in certain discrete multipath channels, the swept-carrier technique is superior to the classical approach, both in terms of error performance and bandwidth efficiency.

 

An experimental communications system was developed which made us of largely non-specialised hardware. The successful trial of this system revealed the suitability of the technique for practical implementation. The field work also incorporated unusual sounding channel techniques, using chirp signals, and a TDS-based transducer calibration.

 

In cases where some level of multipath interference cannot be avoided the ability to generate and exploit channel estimates from the swept-carrier signal becomes important. These issues were explored resulting in a novel receiver design heavily influenced by TDS theory. A computer simulation of this receiver showed a performance improvement over simpler receivers which do not incorporate TDS-type channel estimates in the detection process.

 

Download zipped pdf (5.5 MB)


Approximate Dynamic Programming with Adaptive Critics and the Algebraic Perceptron as a Fast Neural Network related to Support Vector Machines

Thomas Hanselmann

The University of Western Australia, 2003

 

This thesis treats two aspects of intelligent control: The first part is about long-term optimization by approximating dynamic programming and in the second part a specific class of a fast neural network, related to support vector machines (SVMs), is considered.

 

The first part relates to approximate dynamic programming, especially in the framework of adaptive critic designs (ADCs). Dynamic programming can be used to find an optimal decision or control policy over a long period. However, in practice it is difficult, and often impossible, to calculate a dynamic programming solution, due to the ‘curse of dimensionality’. The adaptive critic design framework addresses this issue and tries to find a good solution by approximating the dynamic programming process for a stationary environment.

 

In adaptive critic design there are three modules, the plant or environment to be controlled, a critic to estimate the long-term cost and action or controller module to produce the decision or control strategy. Even though there have been many publications on the subject over the past two decades, there are some points that have had less attention. While most of the publications address the training of the critic, one of the points that has not received systematic attention is training of the action module. Normally, training starts with an arbitrary, hopefully stable, decision policy and its long-term cost is then estimated by the critic. Often the critic is a neural network that has to be trained, using a temporal difference and Bellman’s principle of optimality. Once the critic network has conveyed, a policy improvement step is carried out by gradient descent to adjust the parameters of the controller network. Then the critic is retrained again to give the new long-term cost estimate. However, it would be preferable to focus more on extremal policies earlier in the training. Therefore, the Calculus of Variations is investigated to discard the idea of using the Euler equations to train the actor. However, an adaptive critic formulations for a continuous plant with a short-term cost as an integral cost density is made and the chain rule is applied to calculate the total derivative of the short-term cost with respect to the actor weights. This is different from the discrete systems, usually used in adaptive crtitics, which are used in conjunction with total order derivatives. This idea is then extended to second order derivatives such that Newton’s method can be applied to speed up convergence. Based on this, an almost concurrent actor and critic training was proposed. The equations are developed for any non-linear system and short-term cost density function and these were tested on a linear quadratic (LQR) setup. With this approach the solution to the actor and critic weights can be achieved in only a few actor-critic cycles.

 

Some other, more minor issues, in the adaptive critic framework are investigated, such as the influence of discounting factor in the Bellman equation on total ordered derivatives, the target interpretation in backpropagation through time as moving and fixed targets, the relation between simultaneous recurrent networks and dynamic programming is stated and reinterpreted of the recurrent generalized multilayer perceptron (GMLP) as a recurrent generalized finite impulse (GFIR-MLP) is made.

 

Another subject in this area that is investigated, is that of a hybrid dynamic system, characterized as a continuous plant and a set of basic feedback controllers, which are used to control the plant by finding a switching sequence to select one basic controller at a time. The special but important case is considered when the plant is linear but with some uncertainty in the state space and in the observation vector, and a quadratic cost function. This is a form of robust control, where a dynamic programming solution has to be calculated. Due to the special form of, the recursive dynamic programming solution can be approximated by a certain form of an adaptive critic design, sometimes called Q-learning. However, extra care has to be taken to avoid instability due to the approximation errors and the recursive procedure of dynamic programming, which tends to considerably amplify errors.

 

The problem of fast learning with limited data is addressed in the second part. This area has been investigated for many decades but only recently started to blossom in full with Vapnik’s statistical learning theory and a class of algorithms, the so-called support vector machines, which make use of very high dimensional feature spaces and linear separations therein. In this thesis, the special case of binary pattern recognition is investigated. An algorithm, called the algebraic perceptron algorithm is introduced. It extends the well-known perceptron algorithm to achieve a linear dichotomy in high dimensional spaces of dimensionality such 350 and which represents polynomial curves in the input space. This is achieved through inner product kernels, similar to support vector machines. However, in contrast to SVMs, the algebraic perceptron will not find an optimal solution to separate the classes. Nevertheless, it can be optimized towards an optimal solution, if necessary. Sometimes, especially with many densely placed data points, it can even achieve a better solution than the theoretically superior optimal support vector machines whose solutions is often tricky to calculate for large data sets.

 

There are many interesting geometrical interpretations and possibilities that can be used to extend the algebraic perceptron. One such application suggested is to use it as a tool to decompose more complicated objects into simpler ones. This potential is demonstrated on an artificially created binary object of overlapping ellipses.

 

Download zipped pdf (4.8 MB)


Equalization of Nonlinear Communications Channels with Practical Large Margin Classifiers

James Ping Young

The University of Western Australia, September 2003

 

The subject of this thesis is about particular methods that make solving the problem of communications channel equalization more manageable. The thesis follows the popular choice of using neural networks for dealing with nonlinear input-output problems. The two main neural networks investigated were the Modified Probabilistic Neural Network (MPNN) and the Algebraic Perceptron (AP).

 

The MPNN is modelled directly from the Bayesian estimation technique. When employed to equalise communications channels it was found that with appropriate settings noise effects can be reduced and that the network size can be quite lean for some problems. In this thesis, further improvements to the MPNN method are proposed to make it more suitable for use in communications environments. First of all, the MPNN is developed to process complex signals. Secondly, a stochastic gradient method is proposed so that the MPNN is able to self adapt to changing channel characteristics. Thirdly, a method which further reduces the size of the network is proposed, because the size of the network is directly proportional to implementation cost. Lastly, the MPNN is shown to be able to be tuned to possess the same quality as an optimal equalizer. The MPNN can serve as a performance benchmark in determining the usefulness of the other equalizers.

 

The second focus of the thesis is on the AP. The AP is a binary kernel classifier. The advantages of the method are its ability to obtain a solution fast during training and its slimline structure which allows fast run-time implementation. The downside is that performance is unstable due to arbitrariness in the construction of its decision boundaries. An improvement to the method is proposed to reduce the level of arbitrariness. The goal of the proposed improvement is two fold. It is to increase the margin between the patterns and the separating hyperplane and at the same time to keep the size of the network small. The resulting development is given the name Algebraic Perceptron with Large Margin (APlm). The APlm is compared with the Support Vector Machine (SVM) and the Sequential Support Vector machine (SVMseq) in the problem of communications channel equalization. The SVMseq is an approximate implementation of the SVM. It is, in principle, identical to the more popular technique named the Kernel Adatron (KA). It is found that the APlm possess many desirable qualities and is superior in terms of implementation ease, computational load, robustness and even performance.

 

In general, it is found that both the APlm and SVMseq can be well suited to the problem. However, there is room to make the technique even more versatile. One way is to incorporate the ability to adapt to changing channels. In the thesis, Decision-directed self adaptation is combined with the batch training technique to allow the property of large margin to coexist with self adaptation. In this way the advantage of having a large margin is retained even when the equalizer adapts. The other development is to structure the nonlinear classifiers to accommodate decision feedback. By adopting the decision feedback structure, difficult channels do not have to be solved by progressively increasing the input dimension of the equalizers. Therefore, the performance of the equalizer can be improved without requiring a drastic amount of additional computational load.

 

Download zipped pdf (1.7 MB)


Return to AZTRONIX Homepage