プロフィール

Masato AKAGI's Home Page

RESEARCH TOPICS　　　　　　　　　　　　　　　　　　　　(01/Jul/2017)

Interests

Interests of Akagi Lab. are speech signal processing and modeling the speech perception mechanism of human beings. Speech is the most natural means of communication between human beings. Therefore, it is believed that modeling these mechanisms is able to realize high-performance speech processing systems.

Research Areas

Fig. 1 shows the principle procedures of speech communication (production and perception). Akagi laboratory .is mainly focusing on the topics indicated by the red blocks, such as speech production, speech communication in real environments, and speech perception. In our research, not only engineering (digital signal processing) but also knowledge of physiology and/or psychology is required as shown in Fig. 2.

Fig. 1 Principle of speech communication

Fig. 2 Related research fields

Research Issues

In Figure 3;
Production: Through modeling the speech production mechanisms, the research is aimed to synthesize the natural speech. The research includes: to investigate the relationship between speech spectrum and shape of vocal tract, to synthesize natural speech with non-linguistic information (e.g., individuality and emotion), to synthesize singing voice.
Perception: Through modeling the perception mechanisms of human beings, the research is aimed to recognize speech in real-world conditions. Specifically, the research includes: to realize cock-tail-party effect, to enhance speech.

Fig. 3 View of research topics

1. Non-linguistic Information

1-1 Singing Voice
1-2 Speaker Indivisuality
1-3 Emotional Speech
1-4 Voice Conversion
1-5 Speech Coding

2. Speech recovery

2-1 Noise Reduction
2-2 F0 Extraction
2-3 De-reverberation
2-4 Bone-conducted Speech
2-5 Speech Recognition
2-6 DOA

3. Cocktail-party Effect Modeling

3-1 Sound Segregation
3-2 Privacy Protection
3-3 Noisy Sound Perception

4. Psychoacoustics

4-1 Auditory Model
4-2 Contextual Effect
4-3 Auditory Filter
4-4 Phase Perception
4-5 Speech Perception
4-6 Noise Evaluation
4-7 Pitch Perception
4-8 Direction Perception

5. Physiological Auditory Modeling

6. Abnormal Speech

6-1 Abnormal Speech Perception
6-2 3D Vocal Tract Modeling

7. Interaction between Perception and Production

8. Signal Analysis