Yuichi Ishimoto, Masashi Unoki, and Masato Akagi,
"A fundamental frequency estimation method for noisy speech based on
periodicity and harmonicity,"
Proc. of ICASSP2001, SPEECH-SF3.1, USA, May 2001.
Last modified:
2 June 2001
Abstract
This paper proposes a robust and accurate F0 estimation method for noisy
speech. This method uses two different principles: (1) an F0 estimation
based on periodicity and harmonicity of instantaneous amplitude for a
robust estimation in noisy environments, and (2) TEMPO2 proposed by
Kawahara et al. as an accurate estimation method. The proposed method
also uses a comb filter with controllable pass-bands to combine the two
estimation methods. Simulations were carried out to estimate F0s from
real speech in noisy environments and to compare the proposed method
with other methods. The results showed that this method can not only
estimate F0s for clean speech with similar accuracy as TEMPO2 but also
robustly estimate F0s from noisy speech in comparison with the other
method such as TEMPO2 and cepstrum method.