TY - JOUR
T1 - Exploring unsupervised pre-training for echo state networks
AU - Steiner, Peter
AU - Jalalvand, Azarakhsh
AU - Birkholz, Peter
N1 - Publisher Copyright:
© 2023, The Author(s).
PY - 2023/12
Y1 - 2023/12
N2 - Echo State Networks (ESNs) are a special type of Recurrent Neural Networks (RNNs), in which the input and recurrent connections are traditionally generated randomly, and only the output weights are trained. However, recent publications have addressed the problem that a purely random initialization may not be ideal. Instead, a completely deterministic or data-driven initialized ESN structure was proposed. In this work, an unsupervised training methodology for the hidden components of an ESN is proposed. Motivated by traditional Hidden Markov Models (HMMs), which have been widely used for speech recognition for decades, we present an unsupervised pre-training method for the recurrent weights and bias weights of ESNs. This approach allows for using unlabeled data during the training procedure and shows superior results for continuous spoken phoneme recognition, as well as for a large variety of time-series classification datasets.
AB - Echo State Networks (ESNs) are a special type of Recurrent Neural Networks (RNNs), in which the input and recurrent connections are traditionally generated randomly, and only the output weights are trained. However, recent publications have addressed the problem that a purely random initialization may not be ideal. Instead, a completely deterministic or data-driven initialized ESN structure was proposed. In this work, an unsupervised training methodology for the hidden components of an ESN is proposed. Motivated by traditional Hidden Markov Models (HMMs), which have been widely used for speech recognition for decades, we present an unsupervised pre-training method for the recurrent weights and bias weights of ESNs. This approach allows for using unlabeled data during the training procedure and shows superior results for continuous spoken phoneme recognition, as well as for a large variety of time-series classification datasets.
KW - Clustering
KW - ESN
KW - RCN
KW - State machine
UR - http://www.scopus.com/inward/record.url?scp=85169611449&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85169611449&partnerID=8YFLogxK
U2 - 10.1007/s00521-023-08988-x
DO - 10.1007/s00521-023-08988-x
M3 - Article
AN - SCOPUS:85169611449
SN - 0941-0643
VL - 35
SP - 24225
EP - 24242
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 34
ER -