Last modified:
19 August 2019
|
Audio/Speech information hiding |
Research area:
- Audio information hiding scheme based on cochlear delay
characteristics
- Speech information hiding scheme based on formant tuning
- Audio information hiding scheme based on singular spectrum analysis
|
Speech perception using synthesizing noise-vocoded speech |
Research area:
- Speech intelligibility
- Vocal emotion perception
- Speaker individuality (speech identification)
- Speech enhancement for Cochlear implant users
|
Blind estimation/extraction of speech features |
Research area:
- Blind estimation of speech transmission index (STI)
- Acurate and robust estimation method of foundamental frequency (F0)
- Acurate and robust detection of voice activities (VAD)
|
Computational theory of auditory sound
segregation |
Research area:
- Global: Acoustical Information Science
- (1) Blind dereverberation method based on the Modulation Transfer
Function (MTF)
- (2) Computational theory of auditory sound segregation
- (3) Audio watermarking method
- (4) Bone-conducted speech restoration
Research programme in detail:
The study of computational theory of the auditory system tries to answer
the following questions: "what is the purpose of auditory processing?"
and "why must the auditory system compute it?" based on research in
psychology, physiology, and information science. This has high
correlation to the auditory edition of the computational theory of
vision proposed by Marr.
If this computational theory behind our auditory system can be
constructed, it may, not only clarify human auditory functions but also
contribute to some applications such as preprocessors for robust speech
recognition systems and modelling of the psychoacoustical phenomena.
However, thus far it has not been constructed completely using Marr's
interpretation. This is due to our lack of understanding of the
psychoacoustical and physiological knowledge of auditory system in order
to construct such a model, as compared to that of vision.
The aim of my work is the development of computational theory behind the
auditory system.
I have conducted a study on the computational theory of audition focused
on auditory sound segregation. My approach is based on two main ideas:
(1) constraints on sound source and environment conditions are necessary
to uniquely solve the sound segregation problem (ill-posed inverse
problem) and (2) psychoacoustical heuristic regularities proposed by
Bregman can be used to uniquely solve the real sound segregation problem
(auditory scene analysis).
As a result of my research, I have proposed a strategy for the auditory
sound segregation problem such as vowel segregation. Other researchers
often failed in representing real sound segregation such as consonants,
burst, and complex sounds.
However, based on the above two ideas I obtained an indication for
constructing a computational theory of auditory sound segregation.
Since I have taken to develop a highly accurate sound segregation system
based on this indication, it must be extended to represent real sound
segregation if it also uses reasonable constraints except for Bregman's
regularities.
This required a higher-level study of the algorithm of auditory sound
segregation, i. e., whether a strategy using reasonable constraints can
simulate auditory sound segregation.
Hence, I need to have clear boundaries on what are psychoacoustical,
physiological, and mathematical constraints. Are these boundaries
reasonable for simulating auditory sound segregation so that we can
construct a computational theory of auditory sound segregation.
This required a study on the adequacy and necessity of these constraints
based on the following steps:
Step. 1, a study of the required constraints for auditory sound
segregation using computer simulation and hearing tests.
Step 2, a study of the constraints for auditory sound segregation using
psychoacoustical and physiological tests.
These I believe will lead me towards constructing a computational theory
of auditory sound segregation based on results from the above steps and
my current status of research and knowledge.
|
DSAM (Development System for Auditory
Modelling) Project |
|
HomePage
|
![]() |
Created by M. Unoki, 13 Nov. 2000
|