Abstract:In this paper, as the mean vector of GMM parameters can represent the basic shapes of converted feature vectors, a novel mixed model comprised of GMM and ANN spectral conversion method is proposed to alleviate the over-smoothing problem by using ANN to transform the mean vector of GMM parameters. Not only static but also dynamic spectral features are used for approaching converted spectrum sequence in order to gain the continuous converted spectral. Moreover, as pitch is very important to voice conversion, F0 is also analyzed and transformed on the basis of spectral conversion. The performance of the proposed method is evaluated using subjective and objective tests, and the results show that the proposed method can obtain a better speech quality than the earlier voice conversion system based on conventional GMM method.