Speaker verification from coded telephone speech using stochastic feature transformation and handset identification

Eric W.M. Yu, Man Wai Mak, Sun Yuan Kung

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

7 Scopus citations

Abstract

A handset compensation technique for speaker verification from coded telephone speech is proposed. The technique combines handset selectors with stochastic feature transformation to reduce the acoustic mismatch between different handsets and different speech coders. Coder-dependent handset selectors based on Gaussian mixture models (GMMs) are trained to identify the most likely handset used by the claimants. Stochastic feature transformations are then applied to remove the acoustic distortion introduced by the coder and the handset. Experimental results show that the proposed technique outperforms cepstral mean subtraction (CMS) and significantly reduces the error rates under six different coders with bit rates ranging from 2.4 kb/s to 64 kb/s. A strong correlation between speech quality and verification performance is also observed.
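The two-stage pipeline described in the abstract (select the most likely handset, then transform the features) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the handset names `carbon` and `electret`, the toy two-dimensional features, the single-component diagonal-covariance GMMs, and the per-dimension affine form of the transformation. In a coder-dependent setup, one such set of selectors and transforms would presumably be kept per speech coder.

```python
import math

def gmm_log_likelihood(frames, gmm):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    gmm is a list of (weight, means, variances) tuples, one per component.
    """
    total = 0.0
    for x in frames:
        comp_logs = []
        for weight, means, variances in gmm:
            ll = math.log(weight)
            for xi, mi, vi in zip(x, means, variances):
                ll += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
            comp_logs.append(ll)
        # log-sum-exp over components for numerical stability
        m = max(comp_logs)
        total += m + math.log(sum(math.exp(c - m) for c in comp_logs))
    return total / len(frames)

def select_handset(frames, selectors):
    """Return the handset whose GMM gives the highest likelihood."""
    return max(selectors, key=lambda name: gmm_log_likelihood(frames, selectors[name]))

def transform(frames, scale, offset):
    """Apply a per-dimension affine map y = scale * x + offset (assumed form)."""
    return [[a * xi + b for xi, a, b in zip(x, scale, offset)] for x in frames]

# Hypothetical handset selectors for one coder (toy parameters).
selectors = {
    "carbon": [(1.0, [2.0, 2.0], [1.0, 1.0])],
    "electret": [(1.0, [0.0, 0.0], [1.0, 1.0])],
}
# Hypothetical transforms that map each handset back to a reference condition.
transforms = {
    "carbon": ([1.0, 1.0], [-2.0, -2.0]),
    "electret": ([1.0, 1.0], [0.0, 0.0]),
}

frames = [[2.1, 1.9], [1.8, 2.2]]             # toy feature vectors from a claimant
handset = select_handset(frames, selectors)   # stage 1: handset identification
scale, offset = transforms[handset]
compensated = transform(frames, scale, offset)  # stage 2: feature transformation
```

The compensated features would then be scored against the claimed speaker's model; that scoring step is outside the scope of this sketch.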

Original language: English (US)
Title of host publication: Advances in Multimedia Information Processing - PCM 2002 - 3rd IEEE Pacific Rim Conference on Multimedia, Proceedings
Editors: Yung-Chang Chen, Long-Wen Chang, Chiou-Ting Hsu
Publisher: Springer Verlag
Pages: 598-606
Number of pages: 9
ISBN (Print): 3540002626, 9783540002628
DOI: https://doi.org/10.1007/3-540-36228-2_74
State: Published - Jan 1 2002
Externally published: Yes
Event: 3rd IEEE Pacific Rim Conference on Multimedia, PCM 2002 - Hsinchu, Taiwan, Province of China
Duration: Dec 16 2002 to Dec 18 2002

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2532
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 3rd IEEE Pacific Rim Conference on Multimedia, PCM 2002
Country: Taiwan, Province of China
City: Hsinchu
Period: 12/16/02 to 12/18/02

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science (all)


  • Cite this

    Yu, E. W. M., Mak, M. W., & Kung, S. Y. (2002). Speaker verification from coded telephone speech using stochastic feature transformation and handset identification. In Y-C. Chen, L-W. Chang, & C-T. Hsu (Eds.), Advances in Multimedia Information Processing - PCM 2002 - 3rd IEEE Pacific Rim Conference on Multimedia, Proceedings (pp. 598-606). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2532). Springer Verlag. https://doi.org/10.1007/3-540-36228-2_74