next up previous
Next: Separation block Up: IMPROVEMENTS TO PREVIOUS MODEL Previous: IMPROVEMENTS TO PREVIOUS MODEL

F0(t) estimation block

In the proposed model, the fundamental frequency F0(t) is estimated using Comb filtering on the auditory-motivated filterbank. This Comb filter is defined by

\begin{displaymath}{\rm{Comb}}(k,\ell)=\left\{\begin{array}{ll}
2, & \omega_k=n\...
...ll, 4\leq n \leq N\\
0, & \mbox{otherwise}
\end{array}\right.
\end{displaymath} (15)

where k and $\ell$ are indices, $\omega_k$ and $\omega_\ell$ are the center frequencies in channels, and N is the number of harmonics of the highest order. Then, $\hat{\ell}$, which corresponds to the channel containing the fundamental wave, is determined by

\begin{displaymath}\hat{\ell} = \mathop{\arg \max}_{\ell\leq L}\sum_{k=1}^K {\rm{Comb}}(k,\ell)S_k(t),
\end{displaymath} (16)

where L is the upper-limited search region of $\ell$. The estimated F0(t) is determined by $\omega_{\hat{\ell}}/2\pi$.

Since the number of channels in the auditory-motivated filterbank is finite, the estimated fundamental frequency F0(t) takes a discrete value. In addition, the fluctuation of the estimated F0(t) behaves like a stair shape and the temporal differentiation of F0(t) is zero at any segment. Therefore, this paper assumes that E0,R(t)=0 in constraint (ii) for a segment. Here, the above segment is determined using the following equation, as the duration for which the temporal variation of F0(t) has variance of zero as F0(t).

 \begin{displaymath}\frac{1}{T_{h}-T_{h-1}}\int_{T_{h-1}}^{T_h} \left\vert F_0(t)-\overline{F_0(t)}\right\vert^2 dt = 0,
\end{displaymath} (17)

where the length of the segment is Th-Th-1. In this paper, let the parameters be N=10 and L=K/4.


next up previous
Next: Separation block Up: IMPROVEMENTS TO PREVIOUS MODEL Previous: IMPROVEMENTS TO PREVIOUS MODEL
Masashi Unoki
2000-10-26