


Volume 2, 2012, Pages 1630-1633

Analysis on the importance of short-term speech parameterizations for emotional statistical parametric speech synthesis

Author keywords

Expressive speech synthesis; Speech synthesis; Statistical parametric speech synthesis

Indexed keywords

CLASSIFICATION PERFORMANCE; EMOTION CLASSIFICATION; EMOTION IDENTIFICATIONS; EXCITATION PARAMETERS; EXPRESSIVE SPEECH SYNTHESIS; GAUSSIAN MIXTURE MODEL (GMMS); HIDDEN MARKOV MODEL (HMM); STATISTICAL PARAMETRIC SPEECH SYNTHESIS

EID: 84878387086     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 2

References (16)
  • 1
    • 67651002140
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, pp. 1039-1064, Nov. 2009.
    • (2009) Speech Communication , vol.51 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.3
  • 2
    • 0032595183
    • Modeling of the glottal flow derivative waveform with application to speaker identification
    • M. D. Plumpe, T. F. Quatieri, and D. A. Reynolds, "Modeling of the glottal flow derivative waveform with application to speaker identification," IEEE Trans. on Speech and Audio Processing, vol. 7, pp. 569-586, Sept. 1999.
    • (1999) IEEE Trans. on Speech and Audio Processing , vol.7 , pp. 569-586
    • Plumpe, M.D.1    Quatieri, T.F.2    Reynolds, D.A.3
  • 3
    • 70450163450
    • Comparison of multiple voice source parameters in different phonation types
    • M. Airas and P. Alku, "Comparison of multiple voice source parameters in different phonation types," in Proc. of Interspeech, pp. 1410-1413, 2007.
    • (2007) Proc. of Interspeech , pp. 1410-1413
    • Airas, M.1    Alku, P.2
  • 4
    • 33644694381
    • Emotions in vowel segments of continuous speech: Analysis of the glottal flow using the normalized amplitude quotient
    • M. Airas and P. Alku, "Emotions in vowel segments of continuous speech: Analysis of the glottal flow using the normalized amplitude quotient," Phonetica, vol. 63, no. 1, pp. 26-46, 2006.
    • (2006) Phonetica , vol.63 , Issue.1 , pp. 26-46
    • Airas, M.1    Alku, P.2
  • 5
    • 84865709194
    • Clustering expressive speech styles in audiobooks using glottal source parameters
    • E. Szekely, J. P. Cabral, P. Cahill, and J. Carson-Berndsen, "Clustering expressive speech styles in audiobooks using glottal source parameters," in Proc. of Interspeech, pp. 2409-2412, 2011.
    • (2011) Proc. of Interspeech , pp. 2409-2412
    • Szekely, E.1    Cabral, J.P.2    Cahill, P.3    Carson-Berndsen, J.4
  • 6
    • 33947684811
    • A four-parameter model of the glottal flow
    • G. Fant, J. Liljencrants, and Q. Lin, "A four-parameter model of the glottal flow," STL-QPSR, vol. 26, no. 4, pp. 001-013, 1985.
    • (1985) STL-QPSR , vol.26 , Issue.4 , pp. 001-013
    • Fant, G.1    Liljencrants, J.2    Lin, Q.3
  • 7
    • 0036339929
    • Normalized amplitude and quotient for parameterization of the glottal flow
    • P. Alku and T. Backstrom, "Normalized amplitude and quotient for parameterization of the glottal flow," Journal of Acoust. Society of America, vol. 112, pp. 701-710, Aug. 2002.
    • (2002) Journal of Acoust. Society of America , vol.112 , pp. 701-710
    • Alku, P.1    Backstrom, T.2
  • 8
    • 79959855615
    • Cluster analysis of differential spectral envelopes on emotional speech
    • G. Salvi, F. Tesser, E. Zovato, and P. Cosi, "Cluster analysis of differential spectral envelopes on emotional speech," in Proc. of Interspeech, pp. 322-325, 2010.
    • (2010) Proc. of Interspeech , pp. 322-325
    • Salvi, G.1    Tesser, F.2    Zovato, E.3    Cosi, P.4
  • 9
    • 67650486451
    • Multimodal signals: Cognitive and algorithmic issues
    • A. Přibilová and J. Přibil, "Multimodal signals: Cognitive and algorithmic issues," ch. Spectrum Modification for Emotional Speech Synthesis, pp. 232-241, Berlin, Heidelberg: Springer-Verlag, 2009.
    • (2009) Ch. Spectrum Modification for Emotional Speech Synthesis , pp. 232-241
    • Pribilova, A.1    Pribil, J.2
  • 11
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. on Speech and Audio Processing, vol. 3, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. on Speech and Audio Processing , vol.3 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 12
    • 85131821539
    • Mel-generalized cepstral analysis - a unified approach to speech spectral estimation
    • K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generalized cepstral analysis - a unified approach to speech spectral estimation," in Proc. of ICSLP, pp. 1043-1046, 1994.
    • (1994) Proc. of ICSLP , pp. 1043-1046
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Imai, S.4
  • 13
    • 84867616957
    • Complex cepstrum as phase information for statistical parametric speech synthesis
    • R. Maia, M. Akamine, and M. F. J. Gales, "Complex cepstrum as phase information for statistical parametric speech synthesis," in Proc. of ICASSP, pp. 4581-4584, 2012.
    • (2012) Proc. of ICASSP , pp. 4581-4584
    • Maia, R.1    Akamine, M.2    Gales, M.F.J.3
  • 14
    • 0002450185
    • Efficient representation of short-time phase based on group delay
    • H. Banno, J. Lu, S. Nakamura, K. Shikano, and H. Kawahara, "Efficient representation of short-time phase based on group delay," in Proc. of ICASSP, pp. 861-864, 1998.
    • (1998) Proc. of ICASSP , pp. 861-864
    • Banno, H.1    Lu, J.2    Nakamura, S.3    Shikano, K.4    Kawahara, H.5
  • 15
    • 84874199000
    • Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
    • H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT," in Proc. of MAVEBA, pp. 13-18, 2001.
    • (2001) Proc. of MAVEBA , pp. 13-18
    • Kawahara, H.1    Estill, J.2    Fujimura, O.3
  • 16
    • 84865777002
    • The CSTR/EMIME HTS system for the Blizzard Challenge 2010
    • J. Yamagishi and O. Watts, "The CSTR/EMIME HTS system for the Blizzard Challenge 2010," in Proc. Blizzard Challenge Workshop, 2010.
    • (2010) Proc
    • Yamagishi, J.1    Watts, O.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.