Fundamental frequency (F0) control models, which can cope with F0
dynamic characteristics related to singing-voice perception, are
required to construct natural singing-voice synthesis systems. This
paper discusses the importance of F0 dynamic characteristics in singing
voices and demonstrates how much it influence on singing voice
perception through psychoacoustic experiments. This paper, then,
proposes an F0 control model that can generate F0 fluctuations in
singing voices, and a singing-voice synthesis method. The results show
that F0 contour including fluctuations: Overshoot, Vibrato, Preparation,
and Fine-fluctuation, affects singing voice perception, and the proposed
synthesis method can generate natural singing voices by controlling
these F0 fluctuations.
|