Speaker verification via articulatory feature-based conditional pronunciation modeling with vowel and consonant mixture models

Ka Yee Leung, Man Wai Mak, Manhung Siu, Sun Yuan Kung

Research output: Contribution to conferencePaperpeer-review

Abstract

Articulatory feature-based conditional pronunciation modeling (AFCPM) aims to capture the pronunciation characteristics of speakers by modeling the linkage between the states of articulation during speech production and the actual phones produced by a speaker. Previous AFCPM systems use one discrete density function for each phoneme to model the pronunciation characteristics of speakers. This paper proposes using a mixture of discrete density functions for AFCPM. In particular, the pronunciation characteristics of each phoneme is modeled by two density functions: one responsible for describing the articulatory features that are more relevant to vowels and the other for consonants. Verification scores are the weighted sum of the outputs of the two models. To enhance the resolution of the pronunciation models, four articulatory properties (front-back, liprounding, place of articulation, and manner of articulation) are used for pronunciation modeling. The proposed AFCPM is applied to a speaker verification task. Results show that using four articulatory features achieves a lower error rate as compared to using two features (manner and place of articulation) only. It was also found that dividing the articulatory properties into two groups is an effective means of solving the data-sparseness problem encountered in the training phase of AFCPM systems.

Original languageEnglish (US)
Pages3089-3092
Number of pages4
StatePublished - 2005
Event9th European Conference on Speech Communication and Technology - Lisbon, Portugal
Duration: Sep 4 2005Sep 8 2005

Other

Other9th European Conference on Speech Communication and Technology
Country/TerritoryPortugal
CityLisbon
Period9/4/059/8/05

All Science Journal Classification (ASJC) codes

  • General Engineering

Fingerprint

Dive into the research topics of 'Speaker verification via articulatory feature-based conditional pronunciation modeling with vowel and consonant mixture models'. Together they form a unique fingerprint.

Cite this