Kernel-based probabilistic neural networks with integrated scoring normalization for speaker verification

Kwok Kwong Yiu, Man Wai Mak, Sun Yuan Kung

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper investigates kernel-based probabilistic neural networks for speaker verification in clean and noisy environments. In particular, it compares the performance and characteristics of speaker verification systems that use probabilistic decision-based neural networks (PDBNNs), Gaussian mixture models (GMMs), and elliptical basis function networks (EBFNs) as speaker models. The original PDBNN training algorithm was modified to make PDBNNs appropriate for speaker verification. Experimental evaluations were conducted on 138 speakers of the YOHO corpus and its noisy variants. These evaluations, together with visualization of the decision boundaries, indicate that GMM- and PDBNN-based speaker models are superior to EBFN-based ones in terms of performance and generalization capability. This work also finds that PDBNNs and GMMs are more robust than EBFNs in verifying speakers in noisy environments.

Original language: English (US)
Title of host publication: Advances in Multimedia Information Processing - PCM 2002 - 3rd IEEE Pacific Rim Conference on Multimedia, Proceedings
Editors: Yung-Chang Chen, Long-Wen Chang, Chiou-Ting Hsu
Publisher: Springer Verlag
Pages: 623-630
Number of pages: 8
ISBN (Print): 3540002626, 9783540002628
State: Published - Jan 1 2002
Event: 3rd IEEE Pacific Rim Conference on Multimedia, PCM 2002 - Hsinchu, Taiwan, Province of China
Duration: Dec 16 2002 to Dec 18 2002

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2532
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 3rd IEEE Pacific Rim Conference on Multimedia, PCM 2002
Country: Taiwan, Province of China
City: Hsinchu
Period: 12/16/02 to 12/18/02

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science (all)
