Abstract
Differences in educational background, accent, and other factors give each person a distinctive way of pronouncing words. This paper exploits these speaker-specific pronunciation characteristics and proposes a new conditional pronunciation modeling (CPM) technique for speaker verification. The technique aims to link articulatory properties (e.g., manner and place of articulation) to the phoneme sequences produced by a speaker. It does so by aligning two articulatory feature (AF) streams with a phoneme sequence determined by a phoneme recognizer and modeling the probabilities of articulatory classes conditioned on the phonemes as speaker-dependent probabilistic models. The scores from the AF-based pronunciation models are then fused with those from a spectral-based speaker verification system, with the frame-by-frame fused scores weighted by the confidence of the pronunciation models. Evaluations on the SPIDRE corpus demonstrate that AF-based CPM systems can recognize speakers even from short utterances and can be readily combined with spectral-based systems to further improve the reliability of speaker verification.
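To make the idea of probabilities of articulatory classes conditioned on phonemes concrete, the following is a minimal sketch, not the paper's actual formulation. It assumes that each frame has already been aligned into a `(phoneme, af_class)` pair, that the two AF streams are merged into a single `(manner, place)` tuple, and that scoring uses an average log-likelihood ratio against a background model. The function names `train_cpm` and `llr_score`, the add-alpha smoothing, and the toy data are all illustrative assumptions; the confidence-weighted fusion with spectral scores is not shown.

```python
import math
from collections import Counter, defaultdict


def train_cpm(frames, af_classes, alpha=1.0):
    """Estimate P(af_class | phoneme) from aligned (phoneme, af_class) frames.

    Add-alpha smoothing (an assumption of this sketch) keeps unseen
    (phoneme, af_class) pairs at a nonzero probability.
    """
    counts = defaultdict(Counter)
    for phoneme, af in frames:
        counts[phoneme][af] += 1
    model = {}
    for phoneme, c in counts.items():
        total = sum(c.values()) + alpha * len(af_classes)
        model[phoneme] = {a: (c[a] + alpha) / total for a in af_classes}
    return model


def llr_score(frames, speaker_model, background_model):
    """Average per-frame log-likelihood ratio, speaker vs. background.

    Frames whose phoneme is missing from either model are skipped;
    every af_class must be one of the classes used in training.
    """
    total, n = 0.0, 0
    for phoneme, af in frames:
        if phoneme in speaker_model and phoneme in background_model:
            ratio = speaker_model[phoneme][af] / background_model[phoneme][af]
            total += math.log(ratio)
            n += 1
    return total / n if n else 0.0


# Toy data (hypothetical): this speaker tends to produce /t/ as a dental stop,
# while the background population mostly produces it as an alveolar stop.
AF = [("stop", "alveolar"), ("stop", "dental"), ("vowel", "front")]
spk = train_cpm([("t", AF[1])] * 3 + [("t", AF[0])], AF)
bkg = train_cpm([("t", AF[0])] * 3 + [("t", AF[1])], AF)
score = llr_score([("t", AF[1])], spk, bkg)
```

In this toy case a test frame with a dental /t/ yields a positive score, meaning the speaker model explains it better than the background model. A real system would accumulate the ratio over many frames and phonemes before making a decision.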
| Original language | English (US) |
|---|---|
| Pages | 2597-2600 |
| Number of pages | 4 |
| State | Published - 2004 |
| Event | 8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of |
| Duration | Oct 4 2004 → Oct 8 2004 |
All Science Journal Classification (ASJC) codes
- Language and Linguistics
- Linguistics and Language
Title: Articulatory feature-based conditional pronunciation modeling for speaker verification