next up previous
Next: Calculation of the four Up: Model A: Auditory segregation Previous: Auditory filterbank

Separation and Grouping

First, we can observe only the signal f(t), where f(t)=f1(t)+f2(t), f1(t) is the desired signal and f2(t) is a noise masker. The observed signal f(t) is decomposed into its frequency components by an auditory filterbank. Second, outputs of the k-th channel, which correspond to f1(t) and f2(t), are assumed to be

 \begin{displaymath}f_1(t): A_k(t)\sin(\omega_k t + \theta_{1k}(t))
\end{displaymath} (2)

and

 \begin{displaymath}f_2(t): B_k(t)\sin(\omega_k t + \theta_{2k}(t)).
\end{displaymath} (3)

Here, $\omega_k$ is the center frequency of the auditory filter and $\theta_{1k}(t)$ and $\theta_{2k}(t)$ are the input phases of f1(t) and f2(t), respectively. Since the output of the k-th channel Xk(t) is the sum of Eqs. ([*]) and ([*]),

 \begin{displaymath}X_k(t)=S_k(t)\sin(\omega_k t + \phi_k(t)).
\end{displaymath} (4)

Therefore, the amplitude envelopes of the two signals Ak(t) and Bk(t) can be determined by

 \begin{displaymath}A_k(t)={S_k(t)\sin(\theta_{2k}(t)-\phi_k(t))}/{\sin\theta_k(t)}
\end{displaymath} (5)

and

 \begin{displaymath}B_k(t)={S_k(t)\sin(\phi_k(t)-\theta_{1k}(t))}/{\sin\theta_k(t)},
\end{displaymath} (6)

where $\theta_{k}(t)=\theta_{2k}(t)-\theta_{1k}(t)$ and $\theta_k(t)\not=n\pi, n\in {\bf{Z}}$. Since the amplitude envelope Sk(t) and the output phase $\phi_k(t)$ are observable, then if $\theta_{1k}(t)$ and $\theta_{2k}(t)$ are determined, Ak(t) and Bk(t) can be determined by the above equations. Finally, all the components are synthesized from Eqs. ([*]) and ([*]) in the grouping block. Then f1(t) and f2(t) can be reconstructed by the grouping block using the inverse wavelet transform. Here, $\hat{f}_{1,{\rm {A}}}(t)$ and $\hat{f}_{2,{\rm{B}}}(t)$ are the reconstructed f1(t) and f2(t), respectively.

In this paper, we assume that the center frequency of the auditory filter corresponds to the signal frequency. Therefore, we consider the problem of segregating f1(t) from f(t) when $\theta_{1k}(t)=0$ and $\theta_k(t)=\theta_{2k}(t)$.


next up previous
Next: Calculation of the four Up: Model A: Auditory segregation Previous: Auditory filterbank
Masashi Unoki
2000-10-26