On the importance of various modulation frequencies for speech recognition

Proc. of the European Conf. on Speech Communication and Technology (Eurospeech), Vol. 3, pp. 1079-1082, Rhodes, 1997

On the importance of various modulation frequencies for speech recognition

N. Kanedera, T. Arai, H. Hermansky and M. Pavel

Abstract: Temporal processing of the time trajectories in the logarithmic spectrum domain, performed in cepstral mean subtraction, in computation of dynamic features in speech, or in RASTA processing, is becoming a common procedure in current ASR. Such temporal processing effectively enhances some components of the modulation spectrum of speech while suppressing others. It is therefore important to know the relative importance of various components of the modulation spectrum of speech. In this study we report on the effect of band-pass filtering of the time trajectories of spectral envelopes on speech recognition. Results indicate the relative importance of different components of the modulation spectrum of speech for ASR.

[PDF (130 kB)]