메뉴 건너뛰기




Volumn , Issue , 2011, Pages 2765-2768

Intonation conversion from neutral to expressive speech

Author keywords

DCT; Expressivity; F0 modeling; GMM; Multi level dynamic features; Prosody conversion

Indexed keywords

CONVERSION METHODS; DCT; DISCRETE COSINE TRANSFORM COEFFICIENTS; DYNAMIC FEATURES; EMOTIONAL SPEECH; EXPERIMENTAL EVALUATION; EXPRESSIVE SPEECH; EXPRESSIVITY; F0 CONTOURS; F0 MODEL; GAUSSIAN MIXTURE MODEL; GMM; PROSODIC FEATURES; TEMPORAL CORRELATIONS;

EID: 84865747520     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (50)

References (12)
  • 1
    • 84889917205 scopus 로고    scopus 로고
    • Intonation modelling and adaptation for emotional prosody generation
    • Inanoglu, Z., Young, S., "Intonation Modelling and Adaptation for Emotional Prosody Generation", ACII, 2005
    • (2005) ACII
    • Inanoglu, Z.1    Young, S.2
  • 2
    • 85009177437 scopus 로고    scopus 로고
    • Modeling of various speaking styles and emotions for HMM Based Speech Synthesis
    • Yamagishi, J., Onishi, K., Masuko, T., Kobayashi, T., "Modeling of various speaking styles and emotions for HMM Based Speech Synthesis", Eurospeech, 2003
    • (2003) Eurospeech
    • Yamagishi, J.1    Onishi, K.2    Masuko, T.3    Kobayashi, T.4
  • 3
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to- speech synthesis
    • Kain, A., Macon, M., "Spectral Voice Conversion for Text-to- Speech Synthesis", ICASSP, 1998.
    • (1998) ICASSP
    • Kain, A.1    MacOn, M.2
  • 4
    • 34547520011 scopus 로고    scopus 로고
    • A novel method for pitch prediction in voice conversion
    • Helander, E., Nurminen, J.,"A Novel Method for Pitch Prediction in Voice Conversion", ICASSP, 2007.
    • (2007) ICASSP
    • Helander, E.1    Nurminen, J.2
  • 5
    • 34047263010 scopus 로고    scopus 로고
    • Prosody conversion from neutral speech to emotional speech
    • Tao, J., Yongguo, K., and Li, A. "Prosody Conversion from Neutral Speech to Emotional Speech", IEEE Trans. Audio, Speech and Lang Proc., vol.14:1145-1153, 2006
    • (2006) IEEE Trans. Audio, Speech and Lang Proc. , vol.14 , pp. 1145-1153
    • Tao, J.1    Yongguo, K.2    Li, A.3
  • 6
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Toda, T., Black, A.W., Tokuda, K., "Voice Conversion Based on Maximum Likelihood Estimation of Spectral Parameter Trajectory", IEEE Trans. Audio, Speech and Lang Proc., 2007
    • (2007) IEEE Trans. Audio, Speech and Lang Proc.
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 7
    • 84867194192 scopus 로고    scopus 로고
    • Multilevel parametric-base F0 model for speech synthesis
    • Latorre, J., Akamine, M., "Multilevel parametric-base F0 model for speech synthesis", Interspeech 2008.
    • (2008) Interspeech
    • Latorre, J.1    Akamine, M.2
  • 8
    • 77955722263 scopus 로고    scopus 로고
    • Hierarchical prosody conversion using regression-based clustering for emotional synthesis
    • Wu, C.H., Hsia, C.C, Lee, C.H.,"Hierarchical Prosody Conversion Using Regression-Based Clustering for Emotional Synthesis", IEEE Trans. Audio, Speech and Lang Proc., 2010.
    • (2010) IEEE Trans. Audio, Speech and Lang Proc.
    • Wu, C.H.1    Hsia, C.C.2    Lee, C.H.3
  • 9
    • 85089106384 scopus 로고    scopus 로고
    • Estimating phrase curves in the general superpositional intonation model
    • J. van Santen and T. Mishra, "Estimating phrase curves in the general superpositional intonation model," SSW5, 2004.
    • (2004) SSW5
    • Van Santen, J.1    Mishra, T.2
  • 10
    • 84924023612 scopus 로고    scopus 로고
    • Automatic phoneme segmentation with relaxed textual constraints
    • P. Lanchantin, X. Rodet, and C. Veaux, "Automatic Phoneme Segmentation with Relaxed Textual Constraints," LREC 2007.
    • (2007) LREC
    • Lanchantin, P.1    Rodet, X.2    Veaux, C.3
  • 11
    • 52449117078 scopus 로고    scopus 로고
    • A sawtooth waveform inspired pitch estimator for speech and music
    • Camachao, A., Harris, J.G., "A sawtooth waveform inspired pitch estimator for speech and music," JASA, 124, pp. 1638-1652, 2008.
    • (2008) JASA , vol.124 , pp. 1638-1652
    • Camachao, A.1    Harris, J.G.2
  • 12
    • 84872712056 scopus 로고    scopus 로고
    • Shape-invariant speech transformation with the phase vocoder
    • Roebel, A., "Shape-invariant speech transformation with the phase vocoder," Interspeech, 2010.
    • (2010) Interspeech
    • Roebel, A.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.