By handling fundamental frequency contours in the framework of the generation process model, flexible prosody control becomes possible for speech synthesis. The model can be used to solve problems resulting from hidden Markov model (HMM)-based speech synthesis,...
synthesisHMMMDCTOverlap-and-addMel-cepstralanalysisHidden Markov model (HMM) based text-to-speech (TTS) has become one of the most promising approaches, as it has proven to be a particularly flexible and robust framework to generate synthetic speech. However, several factors such as mel-cepstral...
In this paper, first experiments on statistical parametric HMM-based speech synthesis for the Czech language are described. In this synthesis method, trajectories of speech parameters are generated from the trained hidden Markov models. A final speech waveform is synthesized from those speech parameters...
[2] MANIEZZO V. Genetic evolution of the topology and weight distribution of neural networks[J]. IEEE Transactions on Neural Networks, 1994, 5(6):900~909. [3] TERASHIMA R, YOSHIMURA T, WAKITA T. Prediction method of speech recognition performance based on HMM-based speech synthesis technique...
a two-level based model is introduced for duration modeling and prediction,and the duration prediction RMSE was improved from 29.56ms to 27.01ms.From the evaluation results of the final system,the synthetic speech is stable,fluent and rhythmed.As the speech synthesis system only requires very ...
Sinusoidal+All-Pole Modification Based Spectral Smoothing for Concatenative Speech Synthesis 热度: SIMULTANEOUSMODELINGOFSPECTRUM,PITCHANDDURATION INHMM-BASEDSPEECHSYNTHESIS TakayoshiYoshimura,KeiichiTokuda,TakashiMasuko,TakaoKobayashiandTadashiKitamura NagoyaInstituteofTechnology,Gokiso,Shouwa-ku,Nagoya,466-8555Japan ...
On the state for an excitation model in HMM-based speech synthesis This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based speech synthesizers. During the waveform generation part, mixed excitation is constructed by state-dependent filtering of pulse trains and...
This paper presents an evaluation of the contextual factors of HMM-based speech synthesis and coding systems. Two experimental setups are proposed that are based on successive context addition from phonetic to full-context. The aim was to investigate the impact of the individual contextual factors ...
speech units modeled by MSD-HMMs, which we call “average voice” models, then we adapt the average voice models to the tar- get speaker using the extended MLLR algorithm. 2. HMM-BASED SPEECH SYNTHESIS SYSTEM 2.1. System overview Ablockdiagramof the HMM-based TTS systemis shown in Fig.1...
以及范围更广的介绍,可参考Keiichi Tokuda, et al. Speech Synthesis Based on Hidden Markov Models....