next up previous
Next: Calculation of the four Up: Auditory segregation model Previous: Auditory segregation model

Formulation of the problem of segregating two acoustic sources

In this paper, we define the problem of segregating two acoustic sources as ``the segregation of the mixed signal into original signal components, where the mixed signal is composed of two signals generated by any two acoustic sources.'' We formulate it as follows:

Firstly, we can observe only the signal f(t):

f(t)=f1(t)+f2(t), (1)

where f1(t) is the desired signal and f2(t) is a noise. The observed signal f(t) is decomposed into its frequency components by an auditory filterbank. Secondly, outputs of the k-th channel, which correspond to f1(t) and f2(t), are assumed to be

\begin{displaymath}A_k(t)\sin(\omega_k t +\theta_{1k}(t))
\end{displaymath} (2)

and

\begin{displaymath}B_k(t)\sin(\omega_k t + \theta_{2k}(t)),
\end{displaymath} (3)

respectively. Since the output of the k-th channel Xk(t) is represented by

\begin{displaymath}X_k(t)=S_k(t)\sin(\omega_k t + \phi_k(t)),
\end{displaymath} (4)

where

 \begin{displaymath}S_k(t)=\sqrt{A_k^2(t)+2A_k(t)B_k(t)\cos\theta_k(t)+B_k^2(t)}
\end{displaymath} (5)

and

 \begin{displaymath}\phi_k(t)=\tan^{-1}\left(\frac{A_k(t)\sin\theta_{1k}(t)+B_k(t...
...t)}{A_k(t)\cos\theta_{1k}(t)+B_k(t)\cos\theta_{2k}(t)}\right),
\end{displaymath} (6)

then the amplitude envelopes of the two signals Ak(t) and Bk(t) can be determined by

 \begin{displaymath}A_k(t)=\frac{S_k(t)\sin(\theta_{2k}(t)-\phi_k(t))}{\sin\theta_k(t)}
\end{displaymath} (7)

and

\begin{displaymath}B_k(t)=\frac{S_k(t)\sin(\phi_k(t)-\theta_{1k}(t))}{\sin\theta_k(t)},
\end{displaymath} (8)

respectively, where $\theta_k(t)=\theta_{2k}(t)-\theta_{1k}(t)$ and $\theta_k(t)\not= n\pi, n\in {\bf {Z}}$. Thus, if the four parameters, Sk(t), $\phi_k(t)$, $\theta_{1k}(t)$, and $\theta_{2k}(t)$ are calculated, Ak(t) and Bk(t) can be calculated by the above equations. Finally, f1(t) and f2(t) can be reconstructed by grouping constraints. $\hat{f}_1(t)$ and $\hat{f}_2(t)$ are reconstructed f1(t) and f2(t), respectively.

In this paper, we assume $\theta_{1k}(t)=0$ and $\theta_{k}(t)=\theta_{2k}(t)$. Additionally, we consider the problem of segregating two acoustic sources in which the localized f1(t) is added to f2(t).


next up previous
Next: Calculation of the four Up: Auditory segregation model Previous: Auditory segregation model
Masashi Unoki
2000-10-26