TY - GEN
T1 - Robust speaker verification over the telephone by feature recuperation
AU - Li, X.
AU - Mak, M. W.
AU - Kung, S. Y.
PY - 2001
Y1 - 2001
N2 - The performance of speaker verification systems is often compromised in real-world environments. For example, variations in handset characteristics can cause severe performance degradation. This paper presents a novel method to overcome this problem by using a non-linear handset mapper. Under this method, a mapper is constructed by training an elliptical basis function network using distorted speech features as inputs and the corresponding clean features as the desired outputs. During feature recuperation, clean features are recovered by feeding the distorted features to the feature mapper. The recovered features are then presented to a speaker model as if they were derived from clean speech. Experimental evaluations based on 258 speakers of the TIMIT and NTIMIT corpora suggest that the feature mappers improve the verification performance remarkably.
UR - http://www.scopus.com/inward/record.url?scp=0009652961&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0009652961&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:0009652961
SN - 9628576623
SN - 9789628576623
T3 - Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing, ISIMP 2001
SP - 433
EP - 436
BT - Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing, ISIMP 2001
T2 - 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing, ISIMP 2001
Y2 - 2 May 2001 through 4 May 2001
ER -