Using the modulation complex wavelet transform for feature extraction in automatic speech recognition

Proc. of the European Conf. on Speech Communication and Technology (Eurospeech), Vol. 4, pp. 2639-2642, Aalborg, 2001

Using the modulation complex wavelet transform for feature extraction in automatic speech recognition

Y. Momomura, K. Okada, T. Arai, N. Kanedera and Y. Murahara

Abstract: In this paper we examine robust feature extraction methods for automatic speech recognition (ASR) in noise-distorted environments. Previous research showed that combining the coefficients of multi-resolutional approach can be achieved using a wavelet transform instead of the Fourier transform. Taking the FFT phase into consideration, we applied the Gabor function, which is a complex function, as mother wavelet. This approach yielded a 1.7% increase in recognition accuracy compared to the FFT-based multi-resolutional approach.

[PDF (322 kB)]