Neural Networks in Soil Science: A Tool or Just Cool?
Yakov Pachepsky
USDA-ARS, Remote Sensing and Modeling Laboratory, BARC-WEST
Beltsville, MD, 20705
and
Duke University Phytotron
Duke University
Durham, NC, 27708
e-mail: ypachepsky@asrr.arsusda.gov
The use of artificial neural networks (ANN) in geosciences is on the rise. An appealing claim made for ANNs is
that they are based on biological learning principles. Publications on ANN applications are mostly success stories.
An ANN is a mathematical model (or physical device) in which many simple, small, nonlinear submodels (or
sub-devices) are connected with unidirectional communication channels. The terms "neuron", "node", "element"
or "neurode" are often used to describe the submodels. Each submodel processes numeric inputs coming from
other submodels via the connections. Weights are given to each connection of each submodel. A graduate adjustment
is made to the weights in all submodels as the observed "input-output" patterns are sequentially presented to
the ANN. This process is commonly called "learning" or "training". ANNs are manifold nowadays. Multilayered
feed-forward neural networks (MFNN) became very popular because of their relative simplicity, stable
performance, and multiple applications. Quite often, MFNNs are equated to ANNs. This misconception may lead to
selecting inappropriate ANN. The task-dependent selection of the ANN is a necessary precondition of its successful
application. It is not easy to do, though, because the software for many types of networks is either proprietary and
costly or just not developed. ANNs have been developed primarily as pattern recognition tools, and ANN are
used classification or discrimination tools. It has been demonstrated and in some cases proven that ANNs can be
good approximation tools and successfully compete with regression techniques. ANNs are useful in making short
term predictions in time series that are registered in observations or generated by simulation models. Some ANNs
are effective at identifying relevant input variables. There are essential differences between ANNs and
conventional classification, discrimination, or regression algorithms. ANNs are not as predictable as conventional
algorithms. They must be trained several times, and there is no guarantee that the best net will be found. Computer time
and computer memory requirements can be prohibitively large. ANN learning depends on selection of the
learning sample. This seems to be the main encumbrance in ANN applications. No general recipe exists to build a
learning sample or to select the network architecture and parameters of the network learning process. Examples of
ANN applications in soil science are found mostly in soil hydrology. With ANNs, estimating water retention and
hydraulic conductivity from readily available data and determining drainage patterns from digital elevation models
was reasonable successful. Other applications will undoubtedly appear soon. The existing applications in
geosciences show that ANNs are a complement rather than a replacement for conventional techniques. Building a good
ANN requires understanding how the ANN works, involves a heuristic trial-and-error process, and may demand
an ability to change and/or amend the algorithm. A compensation for this effort is an efficient classifier or predictor.
Back to the Abstracts Page
|