network developed in the "learning" process represents a pattern detected in the data.
Thus, in principle, ANN methods can be applied to many research issues such as
those in coastal engineering and oceanography. Theoretically, as long as the training
data set covers the maximum range of the forecasting boundary data, a short-term
data set can be used to train an ANN model for long-term predictions. A trained
neural network can provide a much faster simulation for forecasting long-term events
than traditional hydrodynamic models since its calculation requires no computational
iteration. The implementation of an ANN model is similar to calculating a multiple-variable linear regression function: Output Y(t) = ANN[w1∗X1(t), w2∗X2(t), ..., wn∗Xn(t)], where wi (i = 1, ..., n) are the weights of the ANN network, Xi (i = 1, ..., n) are the input signals, and Y(t) is the output signal.
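To make the analogy concrete, the sketch below (written in Python/NumPy rather than the Matlab code used in the study) evaluates a single forward pass of a three-layer network: the weighted inputs wi∗Xi(t) feed a log-sigmoid hidden layer, whose responses are combined linearly into one output Y(t). The weight values, the three-input layout, and the function names are hypothetical and chosen only for illustration.

import numpy as np

def logsig(z):
    # Log-sigmoid transfer function used in the hidden layer
    return 1.0 / (1.0 + np.exp(-z))

def ann_forward(x, W1, b1, w2, b2):
    # Weighted inputs wi*Xi(t) feed the hidden layer; the output layer
    # combines the hidden responses linearly into Y(t).  No iteration is
    # needed, which is why a trained network forecasts so quickly.
    hidden = logsig(W1 @ x + b1)
    return float(w2 @ hidden + b2)

# Illustrative call with arbitrary (hypothetical) weights
rng = np.random.default_rng(0)
x = rng.normal(size=3)                      # three input signals X1(t)..X3(t)
W1, b1 = rng.normal(size=(5, 3)), np.zeros(5)
w2, b2 = rng.normal(size=5), 0.0
y = ann_forward(x, W1, b1, w2, b2)          # scalar output Y(t)

Once the weights are fixed by training, every forecast is just this single pass through the network, with no iterative solution of governing equations.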
3.5. ANN optimization and improvement
The standard gradient-descent training method sometimes suffers from slow convergence due to the presence of one or more local minima. This is generally a characteristic of the particular error surface, which is often composed of several flat and steep regions. There are, however, several optimization methods that can be used to improve the convergence speed and the performance of network training. Foo's (2002) study shows that training speed increases by almost three times when the conjugate gradient optimization technique is used.
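As a rough illustration of why a better optimizer helps, the following sketch trains the weights of a small three-layer log-sigmoid network by minimizing the mean-squared training error with SciPy's conjugate gradient routine; plain gradient descent would work on the same error surface but typically needs many more iterations. The synthetic data, network size, and helper names are assumptions made for this example, not part of the original study.

import numpy as np
from scipy.optimize import minimize

def logsig(z):
    return 1.0 / (1.0 + np.exp(-z))

def unpack(theta, n_in, n_hid):
    # Split the flat parameter vector into layer weights and biases
    k = n_hid * n_in
    W1 = theta[:k].reshape(n_hid, n_in)
    b1 = theta[k:k + n_hid]
    w2 = theta[k + n_hid:k + 2 * n_hid]
    b2 = theta[-1]
    return W1, b1, w2, b2

def mse(theta, X, y, n_in, n_hid):
    # Mean-squared training error that the optimizer drives down
    W1, b1, w2, b2 = unpack(theta, n_in, n_hid)
    pred = logsig(X @ W1.T + b1) @ w2 + b2
    return np.mean((pred - y) ** 2)

# Synthetic stand-in data; a real application would use observed water levels
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3))
y = np.sin(X[:, 0]) + 0.1 * X[:, 1]
n_in, n_hid = 3, 5
theta0 = rng.normal(scale=0.1, size=n_hid * n_in + 2 * n_hid + 1)

# method='CG' selects a conjugate gradient search, which usually reaches a
# low training error in far fewer iterations than simple steepest descent
result = minimize(mse, theta0, args=(X, y, n_in, n_hid), method='CG')
print(result.fun, result.nit)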
Overfitting is another problem that may occur during neural network training. The
error on the training set is driven to a very small value, but when new data is
presented to the network, the error is large. In this case, the network has memorized
the training examples, but has not learned to generalize to new situations. One useful
approach for improving network generalization is to use a network that is just large enough to provide an adequate fit. The larger a network is, the more complex the functions it can create, which may lead to overfitting. If a sufficiently small network is used, it will not have enough power to overfit the data, and overfitting can thus be prevented. However, it is difficult to know beforehand just how large
a network should be for a specific application. In general, the optimal network size
to prevent overfitting can be determined through model sensitivity experiments.
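A minimal version of such a sensitivity experiment is sketched below, assuming synthetic data and scikit-learn's multilayer perceptron in place of the Matlab toolbox used in the study: the hidden-layer size is varied, and the error on a held-out validation set indicates when the network becomes large enough to start memorizing the training data.

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor

# Synthetic stand-in data; the real experiments would use observed water levels
rng = np.random.default_rng(2)
X = rng.normal(size=(500, 3))
y = np.sin(X[:, 0]) + 0.2 * X[:, 1] + 0.05 * rng.normal(size=500)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

# Vary the hidden-layer size: the training error keeps falling as the network
# grows, but the validation error levels off (or rises) once the network is
# large enough to memorize the training set
for n_hidden in (2, 4, 8, 16, 32):
    net = MLPRegressor(hidden_layer_sizes=(n_hidden,), activation='logistic',
                       max_iter=3000, random_state=0)
    net.fit(X_tr, y_tr)
    print(n_hidden,
          round(1.0 - net.score(X_tr, y_tr), 3),    # training error (1 - R^2)
          round(1.0 - net.score(X_val, y_val), 3))  # validation error (1 - R^2)

The smallest hidden-layer size beyond which the validation error no longer improves is a reasonable choice of network size for the application.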
4. RNN-WL model design
In this study, the standard three-layer feed-forward backpropagation network
(Haykin, 1999) with a nonlinear differentiable log-sigmoid transfer function in the
hidden layer (Fig. 5) was employed. The network programming was done using the
Matlab software (MathWorks, 1999). Huang and Foo's (2002) study indicates that using an optimized conjugate gradient training method results in improvement of
both training speed and accuracy. In general, network training with the conjugate gradient method is about three times faster than with the standard gradient descent method.