Abstract
Research has shown that handset selectors can be used to assist telephone-based speech/speaker recognition. Most handset selectors, however, simply select the most likely handset from a set of known handsets even for speech coming from an 'unseen' handset. This paper proposes a divergence-based handset selector with out-of-handset (OOH) rejection capability to identify the 'unseen' handsets. This is achieved by measuring the Jensen difference between the selector's output and a constant vector with identical elements. The resulting handset selector is combined with a feature-based channel compensation algorithm for telephonebased speaker verification. Utterances whose handsets were identified as 'unseen' are either transformed by a global bias vector or normalized by cepstral mean subtraction (CMS). On the other hand, if the handset can be identified (considered as 'seen'), its corresponding transformation parameters will be used to transform the utterances. Experiments based on ten handsets of the HTIMIT corpus show that using the transformation parameters of the 'seen' handsets to transform the utterances with correctly identified handsets and processing those utterances with 'unseen' handsets by CMS achieve the best result.
Original language | English (US) |
---|---|
Pages | 2329-2332 |
Number of pages | 4 |
State | Published - 2002 |
Event | 7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States Duration: Sep 16 2002 → Sep 20 2002 |
Other
Other | 7th International Conference on Spoken Language Processing, ICSLP 2002 |
---|---|
Country/Territory | United States |
City | Denver |
Period | 9/16/02 → 9/20/02 |
All Science Journal Classification (ASJC) codes
- Language and Linguistics
- Linguistics and Language