The term "neural net" is more or less a misnomer, implying that some sort of cognitive process is being developed or used by a computing system. In reality, the backpropagation (BP) neural net is a well defined numerical calculation for classifying an input parameter set into two or more categories. The BP neural net can be given a simple geometric interpretation. Before it can be used to classify data, a neural net must be trained by presenting the training routine a series of points with values for each input parameter and the dependent variable. The training algorithm for the backpropagation neural net is given in the primary reference
Consider a BP neural net used to classify its input into two categories using two input units (parameters), the coordinates X and Y. Each hidden unit has the form:
Fk(X,Y) = AkX + BkY + Ck
Zk = S(Fk(X,Y))
W(X,Y) = DZ0 + EZ1 + FZ2 + G
output = S(W(X,Y))
The figures at the bottom of this page provide examples of using the two input unit back propagation neural net with 2, 3 and 4 hidden units to create a decision "surface" which in the plane is a curve. The two categories of points, asterisks, and squares, are generated using a random number generator and have no significance beyond a simple demonstration of the neural net technique. The asterisks (red) are uniform in a statistical sense over the entire area. The squares (green) are generated from three overlapping Gaussian distributions concentrated near the center of the plots. The solid curve (blue) enclosing most of the squares clearly shows a number of lines corresponding to the number of hidden units and the blending of them into a single boundary by the output unit. The data for each figure is the same. There are 500 data points in each category and in the following table the number of squares and asterisks correctly classified is given for the three hidden unit values. By design, it is impossible to correctly classify all of the points.
Hidden Correct . . . . . Total
Units Squares Asterisks %
2 479 362 84.1
3 488 412 90.0
4 493 488 91.1
Moving to three dimensions, the hidden units describe planes and give the distance of three dimensional points to the plane. The simplest solid object composed from planes is the four sided tetrahedron. Four hidden units are appropriate when there are three input units. While I can't visualize geometry in higher dimensions, the equations have the same form. This discussion provides the reason for using eleven hidden units in the IBB situation classification neural net. In practice the classification error changes, but only slightly, as the number of hidden units is increased or decreased.
The wikipedia contains a general discussion on neural networks that include extensive references and links. Many different kinds of neural net have been invented. There is no assurance that the back propagation net is the optimum method for the demonstration on this page or the Intentional Base on Balls classification task. The theory of this particular form of neural net used is covered in “Neural Networks”, Laurene Fausett, Prentice Hall, 1994, and other textbooks.


