next up previous
Next: Auditory filterbank Up: Auditory segregation model Previous: Auditory segregation model

Formulation of the problem of segregating two acoustic sources

In this paper, we define the problem of segregating two acoustic sources as ``the segregation of the mixed signal into original signal components, where mixed signal is composed of two signals generated by any two acoustic sources''. The problem of segregating two acoustic sources is formulated as follows.

Firstly, we can observe only the signal f(t):

 
f(t)=f1(t)+f2(t), (1)

where f1(t) is the desired signal and f2(t) is a noise. The observed signal f(t) is decomposed into its frequency components by an auditory filterbank. Secondly, outputs of the k-th channel, which correspond to f1(t) and f2(t), are assumed to be

 \begin{displaymath}f_1(t) : A_k(t)\sin(\omega_k t + \theta_{1k}(t))
\end{displaymath} (2)

and

 \begin{displaymath}f_2(t) : B_k(t)\sin(\omega_k t + \theta_{2k}(t)),
\end{displaymath} (3)

respectively. Since the output of the k-th channel Xk(t) is represented by

 \begin{displaymath}X_k(t)=S_k(t)\sin(\omega_k t + \phi_k(t)),
\end{displaymath} (4)

where

 \begin{displaymath}S_k(t)=\sqrt{A_k^2(t)+2A_k(t)B_k(t)\cos\theta_k(t)+B_k^2(t)}
\end{displaymath} (5)

and

 \begin{displaymath}\phi_k(t)=\tan^{-1}\left(\frac{A_k(t)\sin\theta_{1k}(t)+B_k(t...
...t)}{A_k(t)\cos\theta_{1k}(t)+B_k(t)\cos\theta_{2k}(t)}\right),
\end{displaymath} (6)

then the amplitude envelopes of the two signals Ak(t) and Bk(t) can be determined by

 \begin{displaymath}A_k(t)=\frac{S_k(t)\sin(\theta_{2k}(t)-\phi_k(t))}{\sin\theta_k(t)}
\end{displaymath} (7)

and

 \begin{displaymath}B_k(t)=\frac{S_k(t)\sin(\phi_k(t)-\theta_{1k}(t))}{\sin\theta_k(t)},
\end{displaymath} (8)

respectively, where $\theta_k(t)=\theta_{2k}(t)-\theta_{1k}(t)$ and $\theta_k(t)\not= n\pi, n\in{\bf {Z}}$. Thus, if the four parameters, Sk(t), $\phi_k(t)$, $\theta_{1k}(t)$, and $\theta_{2k}(t)$ are calculated, Ak(t) and Bk(t) can be determined by the above equations. Finally, f1(t) and f2(t) can be reconstructed by grouping constraints. $\hat{f}_1(t)$ and $\hat{f}_2(t)$ are the reconstructed f1(t) and f2(t), respectively.

In this paper, we assume $\theta_{1k}(t)=0$ and $\theta_{k}(t)=\theta_{2k}(t)$. Moreover, we consider the problem of segregating two acoustic sources in which the localized f1(t) is added to f2(t).


next up previous
Next: Auditory filterbank Up: Auditory segregation model Previous: Auditory segregation model
Masashi Unoki
2000-10-26