next up previous
Next: Separation block Up: Auditory sound segregation model Previous: F estimation block

Grouping block

The grouping block determines the concurrent time-frequency region of the desired signal using constraints (i) and (iii) in Table 1, and then reconstructs the segregated instantaneous amplitude and phase using the inverse wavelet transform [Unoki and Akagi1999]. $\hat{f}_1(t)$ and $\hat{f}_2(t)$ are the reconstructed f1(t) and f2(t).

Constraint (i) is implemented by comparing the onset/offset ( $T_{k,\rm{on}},T_{k,\rm{off}}$) of Xk(t) with the onset/offset ( $T_{\rm{S}},T_{\rm{E}}$) of $X_{\hat{\ell}}(t)$ corresponding to F0(t), where $\Delta T_{\rm{S}}=25$ ms and $\Delta T_{\rm{E}}=50$ ms [Unoki and Akagi1999]. In this paper, onset $T_{k,\rm{on}}$ and offset $T_{k,\rm{off}}$ in Xk(t) are determined as follows.

1.
Onset $T_{k,\rm{on}}$ is determined by the nearest maximum point of $\vert{d\phi_k(t)}/{dt}\vert$ (within 25 ms) to the maximum point of dSk(t)/dt.
2.
Offset $T_{k,\rm{off}}$ is determined by the nearest maximum point of $\vert{d\phi_k(t)}/{dt}\vert$ (within 25 ms) to the minimum point of dSk(t)/dt.

Constraint (iii) is implemented by determining the channel number corresponding to the integer multiples of F0(t). The channel number $\ell$ of $X_\ell(t)$, in which the harmonic components exist in the output of the $\ell$-th channel, is determined by

 \begin{displaymath}\ell=\frac{K}{2}-\left\lceil \frac{\log(n\cdot F_0(t)/f_0)}{\log\alpha} \right\rceil,\quad n=1,2,\cdots, N_{F_0},
\end{displaymath} (13)

where $\alpha$ is the scale parameter and $\lceil\cdot\rceil$ is the ceil symbol, meaning the approximation of the closest integer value toward positive infinity. In addition, K is an even number and f0 is the center frequency of the analyzing wavelet in the constant Q gammatone filterbank (f0=600) [Unoki and Akagi1999].


next up previous
Next: Separation block Up: Auditory sound segregation model Previous: F estimation block
Masashi Unoki
2000-10-26