SCOPUS 정보 검색 플랫폼

13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012

Volumn 2, Issue , 2012, Pages 1630-1633

Analysis on the importance of short-term speech parameterizations for emotional statistical parametric speech synthesis

Author keywords

Expressive speech synthesis; Speech synthesis; Statistical parametric speech synthesis

Indexed keywords

CLASSIFICATION PERFORMANCE; EMOTION CLASSIFICATION; EMOTION IDENTIFICATIONS; EXCITATION PARAMETERS; EXPRESSIVE SPEECH SYNTHESIS; GAUSSIAN MIXTURE MODEL (GMMS); HIDDEN MARKOV MODEL(HMM); STATISTICAL PARAMETRIC SPEECH SYNTHESIS;

GROUP DELAY; HIDDEN MARKOV MODELS; PARAMETERIZATION; SPEECH PROCESSING;

SPEECH SYNTHESIS;

EID: 84878387086 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (2)

References (16)

1
- 67651002140
- Statistical parametric speech synthesis
- Nov
- H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, pp. 1039-1064, Nov. 2009.
- (2009) Speech Communication , vol.51 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.³

2
- 0032595183
- Modeling of the glottal flow derivative waveform with application to speaker identification
- Sept
- M. D. Plumpe, T. F. Quatieri, and D. A. Reynolds, "Modeling of the glottal flow derivative waveform with application to speaker identification, " IEEE Trans. on Speech and Audio Processing, vol. 7, pp. 569-586, Sept. 1999.
- (1999) IEEE Trans. on Speech and Audio Processing , vol.7 , pp. 569-586
- Plumpe, M.D.¹ Quatieri, T.F.² Reynolds, D.A.³

3
- 70450163450
- Comparison of multiple voice source parameters in different phonation types
- M. Airas and P. Alku, "Comparison of multiple voice source parameters in different phonation types, " in Proc. of Interspeech, pp. 1410-1413, 2007.
- (2007) Proc. of Interspeech , pp. 1410-1413
- Airas, M.¹ Alku, P.²

4
- 33644694381
- Emotions in vowel segments of continuous speech: Analysis of the glottal flow using the normalized amplitude quotient
- M. Airas and P. Alku, "Emotions in vowel segments of continuous speech: Analysis of the glottal flow using the normalized amplitude quotient, " Phonetica, vol. 63, no. 1, pp. 26-46, 2006.
- (2006) Phonetica , vol.63 , Issue.1 , pp. 26-46
- Airas, M.¹ Alku, P.²

5
- 84865709194
- Clustering expressive speech styles in audiobooks using glottal source parameteres
- E. Szekely, J. P. Cabral, P. Cahill, and J. Carson-Berndsen, "Clustering expressive speech styles in audiobooks using glottal source parameteres, " in Proc. of Interspeech, pp. 2409-2412, 2011.
- (2011) Proc. of Interspeech , pp. 2409-2412
- Szekely, E.¹ Cabral, J.P.² Cahill, P.³ Carson-Berndsen, J.⁴

6
- 33947684811
- A four-parameter model of the glottal flow
- G. Fant, J. Liljencrants, and Q. Lin, "A four-parameter model of the glottal flow, " STL-QPSR, vol. 26, no. 4, pp. 001-013, 1985.
- (1985) STL-QPSR , vol.26 , Issue.4 , pp. 001-013
- Fant, G.¹ Liljencrants, J.² Lin, Q.³

7
- 0036339929
- Normalized amplitude and quotient for parameterization of the glottal flow
- Aug
- P. Alku and T. Backstrom, "Normalized amplitude and quotient for parameterization of the glottal flow, " Journal of Acoust. Society of America, vol. 112, pp. 701-710, Aug. 2002.
- (2002) Journal of Acoust. Society of America , vol.112 , pp. 701-710
- Alku, P.¹ Backstrom, T.²

8
- 79959855615
- Cluster analysis of differential spectral envelopes on emotional speech
- G. Salvi, F. Tesser, E. Zovato, and P. Cosi, "Cluster analysis of differential spectral envelopes on emotional speech, " in Proc. of Interspeech, pp. 322-325, 2010.
- (2010) Proc. of Interspeech , pp. 322-325
- Salvi, G.¹ Tesser, F.² Zovato, E.³ Cosi, P.⁴

9
- 67650486451
- Multimodal signals: Cognitive and algorithmic issues
- Berlin, Heidelberg: Springer-Verlag
- A. P?ribilova and J. P?ribil, "Multimodal signals: Cognitive and algorithmic issues, " ch. Spectrum Modification for Emotional Speech Synthesis, pp. 232-241, Berlin, Heidelberg: Springer-Verlag, 2009.
- (2009) Ch. Spectrum Modification for Emotional Speech Synthesis , pp. 232-241
- Pribilova, A.¹ Pribil, J.²

10
- 60849097547
- Normalized mutual information feature selection
- Feb
- P. A. Estevez, M. Tesmer, C. A. Perez, and J. M. Zurada, "Normalized mutual information feature selection, " IEEE Trans. on Neural Networks, vol. 20, pp. 189-201, Feb. 2009.
- (2009) IEEE Trans. on Neural Networks , vol.20 , pp. 189-201
- Estevez, P.A.¹ Tesmer, M.² Perez, C.A.³ Zurada, J.M.⁴

11
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Jan
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models, " IEEE Trans. on Speech and Audio Processing, vol. 3, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. on Speech and Audio Processing , vol.3 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

12
- 85131821539
- Melgeneralized cepstral analysis-A unified approach to speech spectral estimation
- K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Melgeneralized cepstral analysis-a unified approach to speech spectral estimation, " in Proc. of ICSLP, pp. 1043-1046, 1994.
- (1994) Proc. of ICSLP , pp. 1043-1046
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

13
- 84867616957
- Complex cepstrum as phase information for statistical parametric speech synthesis
- R. Maia, M. Akamine, and M. F. J. Gales, "Complex cepstrum as phase information for statistical parametric speech synthesis, " in Proc. of ICASSP, pp. 4581-4584, 2012.
- (2012) Proc. of ICASSP , pp. 4581-4584
- Maia, R.¹ Akamine, M.² Gales, M.F.J.³

14
- 0002450185
- Efficient representation of short-time phase based on group delay
- H. Banno, J. Lu, S. Nakamura, K. Shikano, and H. Kawahara, "Efficient representation of short-time phase based on group delay, " in Proc. of ICASSP, pp. 861-864, 1998.
- (1998) Proc. of ICASSP , pp. 861-864
- Banno, H.¹ Lu, J.² Nakamura, S.³ Shikano, K.⁴ Kawahara, H.⁵

15
- 84874199000
- Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system straight
- H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT, " in Proc. of MAVEBA, pp. 13-18, 2001.
- (2001) Proc. of MAVEBA , pp. 13-18
- Kawahara, H.¹ Estill, J.² Fujimura, O.³

16
- 84865777002
- The cstr/emime hts system for the blizzard challenge 2010
- Blizzard Challenge Workshop
- J. Yamagishi and O. Watts, "The CSTR/EMIME HTS system for the Blizzard Challenge 2010, " in Proc. Blizzard Challenge Workshop, 2010
- (2010) Proc
- Yamagishi, J.¹ Watts, O.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.