Divergence-based out-of-class rejection for telephone handset identification

Chi Leung Tsang, Man Wai Mak, Sun Yuan Kung

Research output: Contribution to conferencePaperpeer-review

12 Scopus citations

Abstract

Research has shown that handset selectors can be used to assist telephone-based speech/speaker recognition. Most handset selectors, however, simply select the most likely handset from a set of known handsets even for speech coming from an 'unseen' handset. This paper proposes a divergence-based handset selector with out-of-handset (OOH) rejection capability to identify the 'unseen' handsets. This is achieved by measuring the Jensen difference between the selector's output and a constant vector with identical elements. The resulting handset selector is combined with a feature-based channel compensation algorithm for telephonebased speaker verification. Utterances whose handsets were identified as 'unseen' are either transformed by a global bias vector or normalized by cepstral mean subtraction (CMS). On the other hand, if the handset can be identified (considered as 'seen'), its corresponding transformation parameters will be used to transform the utterances. Experiments based on ten handsets of the HTIMIT corpus show that using the transformation parameters of the 'seen' handsets to transform the utterances with correctly identified handsets and processing those utterances with 'unseen' handsets by CMS achieve the best result.

Original languageEnglish (US)
Pages2329-2332
Number of pages4
StatePublished - 2002
Event7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States
Duration: Sep 16 2002Sep 20 2002

Other

Other7th International Conference on Spoken Language Processing, ICSLP 2002
Country/TerritoryUnited States
CityDenver
Period9/16/029/20/02

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Divergence-based out-of-class rejection for telephone handset identification'. Together they form a unique fingerprint.

Cite this