• Journal of Internet Computing and Services
    ISSN 2287 - 1136(Online) / ISSN 1598 - 0170 (Print)
    http://jics.or.kr/

An Amplitude Warping Approach to Intra-Speaker Normalization for Speech Recognition


Kim Dong-Hyun, Hong Kwang-Seok, Journal of Internet Computing and Services, Vol. 4, No. 3, pp. 9-14, Jun. 2003
Full Text:
Keywords: speaker normalization, speaker adaptation, vocal tract normalization, speech recognition

Abstract

The method of vocal tract normalization is a successful method for improving the accuracy of inter-speaker normalization. In this paper, we present an intra-speaker warping factor estimation based on pitch alteration utterance. The feature space distributions of untransformed speech from the pitch alteration utterance of intra-speaker would vary due to the acoustic differences of speech produced by glottis and vocal tract. The variation of utterance is two types: frequency and amplitude variation. The vocal tract normalization is frequency normalization among inter-speaker normalization methods. Therefore, we have to consider amplitude variation, and it may be possible to determine the amplitude warping factor by calculating the inverse ratio of input to reference pitch. k, the recognition results, the error rate is reduced from 0.4% to 2.3% for digit and word decoding.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
Kim Dong-Hyun and Hong Kwang-Seok (2003). An Amplitude Warping Approach to Intra-Speaker Normalization for Speech Recognition. Journal of Internet Computing and Services, 4(3), 9-14.

[IEEE Style]
K. Dong-Hyun and H. Kwang-Seok, "An Amplitude Warping Approach to Intra-Speaker Normalization for Speech Recognition," Journal of Internet Computing and Services, vol. 4, no. 3, pp. 9-14, 2003.

[ACM Style]
Kim Dong-Hyun and Hong Kwang-Seok. 2003. An Amplitude Warping Approach to Intra-Speaker Normalization for Speech Recognition. Journal of Internet Computing and Services, 4, 3, (2003), 9-14.