TY - JOUR
T1 - Noninvertibility in neural networks
AU - Rico-Martínez, Ramiro
AU - Adomaitis, Raymond A.
AU - Kevrekidis, Ioannis G.
N1 - Funding Information:
The authors would like to acknowledge the support of the National Science Foundation and UTRC. One of the authors (R.R.M.) was partially supported by CONACyT; the hospitality of the Center for Nonlinear Studies at the Los Alamos National Laboratory is gratefully acknowledged.
PY - 2000/11/1
Y1 - 2000/11/1
N2 - We present and discuss an inherent shortcoming of neural networks used as discrete-time models in system identification, time series processing, and prediction. Trajectories of nonlinear ordinary differential equations (ODEs) can, under reasonable assumptions, be integrated uniquely backward in time. Discrete-time neural network mappings derived from time series, on the other hand, can give rise to multiple trajectories when followed backward in time: they are in principle noninvertible. This fundamental difference can lead to model predictions that are not only slightly quantitatively different, but qualitatively inconsistent with continuous time series. We discuss how noninvertibility arises, present key analytical concepts and some of its phenomenology. Using two illustrative examples (one experimental and one computational), we demonstrate when noninvertibility becomes an important factor in the validity of artificial neural network (ANN) predictions, and show some of the overall complexity of the predicted pathological dynamical behavior. These concepts can be used to probe the validity of ANN time series models, as well as provide guidelines for the acquisition of additional training data. (C) 2000 Elsevier Science Ltd.
AB - We present and discuss an inherent shortcoming of neural networks used as discrete-time models in system identification, time series processing, and prediction. Trajectories of nonlinear ordinary differential equations (ODEs) can, under reasonable assumptions, be integrated uniquely backward in time. Discrete-time neural network mappings derived from time series, on the other hand, can give rise to multiple trajectories when followed backward in time: they are in principle noninvertible. This fundamental difference can lead to model predictions that are not only slightly quantitatively different, but qualitatively inconsistent with continuous time series. We discuss how noninvertibility arises, present key analytical concepts and some of its phenomenology. Using two illustrative examples (one experimental and one computational), we demonstrate when noninvertibility becomes an important factor in the validity of artificial neural network (ANN) predictions, and show some of the overall complexity of the predicted pathological dynamical behavior. These concepts can be used to probe the validity of ANN time series models, as well as provide guidelines for the acquisition of additional training data. (C) 2000 Elsevier Science Ltd.
KW - Artificial neural networks
KW - Noninvertibility
KW - System identification
KW - Time-series processing
UR - http://www.scopus.com/inward/record.url?scp=0034333253&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0034333253&partnerID=8YFLogxK
U2 - 10.1016/S0098-1354(00)00599-8
DO - 10.1016/S0098-1354(00)00599-8
M3 - Article
AN - SCOPUS:0034333253
SN - 0098-1354
VL - 24
SP - 2417
EP - 2433
JO - Computers and Chemical Engineering
JF - Computers and Chemical Engineering
IS - 11
ER -