SCOPUS 정보 검색 플랫폼

Volumn 56, Issue 9, 2007, Pages 1245-1254

Conversion function clustering and selection using linguistic and spectral information for emotional voice conversion

(3) Hsia, Chi Chun a Wu, Chung Hsien a Wu, Jian Qi a

a NATIONAL CHENG KUNG UNIVERSITY (Taiwan)

Author keywords

Emotional text to speech synthesis; Emotional voice conversion; Function clustering and selection; Gaussian mixture bigram model; Linguistic feature

Indexed keywords

DATABASE SYSTEMS; LINGUISTICS; SPEECH ANALYSIS; STATISTICAL TESTS;

EMOTIONAL VOICE CONVERSION; FUNCTION CLUSTERING; FUNCTION SELECTION; GAUSSIAN MIXTURE BIGRAM MODEL;

SPEECH SYNTHESIS;

EID: 34548216761 PISSN: 00189340 EISSN: None Source Type: Journal
DOI: 10.1109/TC.2007.1079 Document Type: Article

Times cited : (24)

References (26)

1
- 0027447292
- Towards the Simulation of Emotion in Synthetic Speech: A Review of the Literature on Human Vocal Emotion
- I.R. Murray and J.L. Arnott, "Towards the Simulation of Emotion in Synthetic Speech: A Review of the Literature on Human Vocal Emotion," J. Acoustic Soc. Am., vol. 93, no. 2, pp. 1097-1108, 1993.
- (1993) J. Acoustic Soc. Am , vol.93 , Issue.2 , pp. 1097-1108
- Murray, I.R.¹ Arnott, J.L.²

2
- 84971539709
- Emotional Speech Synthesis-A Review
- M. Schro"der, "Emotional Speech Synthesis-A Review," Proc. European Conf. Speech Comm. and Technology (EUROSPEECH '01 , vol. 1, pp. 561-564, 2001.
- (2001) Proc. European Conf. Speech Comm. and Technology (EUROSPEECH '01 , vol.1 , pp. 561-564
- Schro"der, M.¹

3
- 0037380318
- A Corpus-Based Speech Synthesis System with Emotion
- A. Iida, F. Higuchi, N. Campbell, and M. Yasumura, "A Corpus-Based Speech Synthesis System with Emotion," Speech Comm., vol. 40, nos. 1-2, pp. 161-187, 2003.
- (2003) Speech Comm , vol.40 , Issue.1-2 , pp. 161-187
- Iida, A.¹ Higuchi, F.² Campbell, N.³ Yasumura, M.⁴

4
- 84876497245
- GMM-Based Voice Conversion Applied to Emotional Speech Synthesis
- H. Kawanami, Y. Iwami, T. Toda, H. Saruwatari, and K. Shikano, "GMM-Based Voice Conversion Applied to Emotional Speech Synthesis," Proc. European Conf. Speech Comm. and Technology (EUROSPEECH '03), pp. 2401-2404, 2003.
- (2003) Proc. European Conf. Speech Comm. and Technology (EUROSPEECH '03) , pp. 2401-2404
- Kawanami, H.¹ Iwami, Y.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

5
- 0027252189
- Application of the Analysis of Glottal Excitation of Stressed Speech to Speaking Style Modification
- Apr
- K.E. Cummings and M.A. Clements, "Application of the Analysis of Glottal Excitation of Stressed Speech to Speaking Style Modification," Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '93), vol. 2, pp. 207-210, Apr. 1993.
- (1993) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '93) , vol.2 , pp. 207-210
- Cummings, K.E.¹ Clements, M.A.²

6
- 0023739214
- Voice Conversion through Vector Quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice Conversion through Vector Quantization," Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '98), pp. 655-658, 1988.
- (1988) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '98) , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

7
- 0032026483
- Continuous Probabilistic Transform for Voice Conversion
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous Probabilistic Transform for Voice Conversion," IEEE Trans. Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

8
- 0031623661
- Spectral Voice Conversion for Text-to-Speech Synthesis
- A. Kain and M.W. Macon, "Spectral Voice Conversion for Text-to-Speech Synthesis," Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '98), 1998.
- (1998) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '98)
- Kain, A.¹ Macon, M.W.²

9
- 33646779506
- Spectral Conversion Based on Maximum Likelihood Estimation Considering Global Variance of Converted Parameter
- Mar
- T. Toda, A.W. Black, and K. Tokuda, "Spectral Conversion Based on Maximum Likelihood Estimation Considering Global Variance of Converted Parameter," Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05), vol. 1, pp. 9-12, Mar. 2005.
- (2005) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05) , vol.1 , pp. 9-12
- Toda, T.¹ Black, A.W.² Tokuda, K.³

10
- 84994241109
- Including Dynamic and Phonetic Information in Voice Conversion Systems
- H. Duxans, A. Bonafonte, A. Kain, and J. van Santen, "Including Dynamic and Phonetic Information in Voice Conversion Systems," Proc. Int'l Conf. Speech and Language Processing (ICSLP '04), pp. 5-8, 2004.
- (2004) Proc. Int'l Conf. Speech and Language Processing (ICSLP '04) , pp. 5-8
- Duxans, H.¹ Bonafonte, A.² Kain, A.³ van Santen, J.⁴

11
- 85135141647
- Hidden Markov Model Based Voice Conversion Using Dynamic Characteristics of Speaker
- Sept
- E.K. Kim, S. Lee, and Y.H. Oh, "Hidden Markov Model Based Voice Conversion Using Dynamic Characteristics of Speaker," Proc. European Conf. Speech Comm. and Technology (EUROSPEECH '97), vol. 5, pp. 2519-2522, Sept. 1997.
- (1997) Proc. European Conf. Speech Comm. and Technology (EUROSPEECH '97) , vol.5 , pp. 2519-2522
- Kim, E.K.¹ Lee, S.² Oh, Y.H.³

12
- 34047247202
- Voice Conversion Using Duration-Embedded Bi-HMMs for Expressive Speech Synthesis
- C.H. Wu, C.C. Hsia, T.H. Liu, and J.F. Wang, "Voice Conversion Using Duration-Embedded Bi-HMMs for Expressive Speech Synthesis," IEEE Trans. Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1109-1116, 2006.
- (2006) IEEE Trans. Audio, Speech, and Language Processing , vol.14 , Issue.4 , pp. 1109-1116
- Wu, C.H.¹ Hsia, C.C.² Liu, T.H.³ Wang, J.F.⁴

13
- 0036497598
- Discriminative Training of Gaussian Mixture Bigram Models with Application to Chinese Dialect Identification
- W.H. Tsai and W.W. Chang, "Discriminative Training of Gaussian Mixture Bigram Models with Application to Chinese Dialect Identification," Speech Comm., vol. 36, no. 3-4, pp. 317-326, 2002.
- (2002) Speech Comm , vol.36 , Issue.3-4 , pp. 317-326
- Tsai, W.H.¹ Chang, W.W.²

14
- 0001927585
- On Information and Sufficiency
- Mar
- S. Kullback and R.A. Leibler, "On Information and Sufficiency," Annals of Math. Statistics, vol. 22, no. 1, pp. 79-86, Mar. 1951.
- (1951) Annals of Math. Statistics , vol.22 , Issue.1 , pp. 79-86
- Kullback, S.¹ Leibler, R.A.²

15
- 0035478985
- Automatic Generation of Synthesis Units and Prosodic Information for Chinese Concatenative Synthesis
- C.H. Wu and J.H. Chen, "Automatic Generation of Synthesis Units and Prosodic Information for Chinese Concatenative Synthesis," Speech Comm., vol. 35, nos. 3-4, pp. 219-237, 2001.
- (2001) Speech Comm , vol.35 , Issue.3-4 , pp. 219-237
- Wu, C.H.¹ Chen, J.H.²

16
- 0030677481
- Speech Representation and Transformation Using Adaptive Interpolation of Weighted Spectrum: Vocoder Revisited
- H. Kawahara, "Speech Representation and Transformation Using Adaptive Interpolation of Weighted Spectrum: Vocoder Revisited," Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '97), vol. 2, pp. 1303-1306, 1997.
- (1997) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '97) , vol.2 , pp. 1303-1306
- Kawahara, H.¹

17
- 0032673049
- Restructuring Speech Representations Using a Pitch Adaptive Time-Frequency-Based F0 Extraction: Possible Role of a Repetitive Structure in Sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring Speech Representations Using a Pitch Adaptive Time-Frequency-Based F0 Extraction: Possible Role of a Repetitive Structure in Sounds," Speech Comm., vol. 27, nos. 3-4, pp. 187-207, 1999.
- (1999) Speech Comm , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigné, A.³

18
- 0002629270
- Maximum Likelihood from Incomplete Data via the EM Algorithm
- A.P. Dempster, N.M. Laird, and D.B. Rubin, "Maximum Likelihood from Incomplete Data via the EM Algorithm," J. Royal Statistical Soc. B vol. 39, pp. 1-38, 1977.
- (1977) J. Royal Statistical Soc. B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

19
- 0003612818
- MIT Press
- C.D. Manning and H. Schutze, Foundations of Statistical Natural Language Processing. MIT Press, 1999.
- (1999) Foundations of Statistical Natural Language Processing
- Manning, C.D.¹ Schutze, H.²

20
- 0032073761
- An RNN-Based Prosodic Information Synthesis for Mandarin Text-to-Speech
- S.H. Chen, S.H. Hwang, and Y.R. Wang, "An RNN-Based Prosodic Information Synthesis for Mandarin Text-to-Speech," IEEE Trans. Speech and Audio Processing, vol. 6, no. 3, pp. 226-239, 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.3 , pp. 226-239
- Chen, S.H.¹ Hwang, S.H.² Wang, Y.R.³

21
- 29144503308
- Part-of-Speech (POS) Analysis on Chinese Language
- Inst. of Information Science Academia Sinica
- L.L. Chang et al., "Part-of-Speech (POS) Analysis on Chinese Language," technical report, Inst. of Information Science Academia Sinica, 1989.
- (1989) technical report
- Chang, L.L.¹

22
- 2942538615
- Recovery of False Rejection Using Statistical Partial Pattern Trees for Sentence Verification
- C.H. Wu and Y.J. Chen, "Recovery of False Rejection Using Statistical Partial Pattern Trees for Sentence Verification," Speech Comm., vol. 43, pp. 71-88, 2004.
- (2004) Speech Comm , vol.43 , pp. 71-88
- Wu, C.H.¹ Chen, Y.J.²

23
- 34047263010
- Prosody Conversion from Neutral Speech to Emotional Speech
- J. Tao, Y. Kang, and A. Li, "Prosody Conversion from Neutral Speech to Emotional Speech," IEEE Trans. Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1145-1154, 2006.
- (2006) IEEE Trans. Audio, Speech, and Language Processing , vol.14 , Issue.4 , pp. 1145-1154
- Tao, J.¹ Kang, Y.² Li, A.³

24
- 21844454654
- The Determination, Analysis, and Synthesis of Fundamental Frequency,
- PhD dissertation, Northwestern Univ
- X. Sun, "The Determination, Analysis, and Synthesis of Fundamental Frequency," PhD dissertation, Northwestern Univ., 2002.
- (2002)
- Sun, X.¹

25
- 0000873069
- A Method for the Solution of Certain Problems in Least Squares
- K. Levenberg, "A Method for the Solution of Certain Problems in Least Squares," Quarterly Applied Math., vol. 2, pp. 164-168, 1944.
- (1944) Quarterly Applied Math , vol.2 , pp. 164-168
- Levenberg, K.¹

26
- 0004116974
- W.B. Sauders
- S. Shott, "Statistics for Health Professionals," W.B. Sauders, 1990.
- (1990) Statistics for Health Professionals
- Shott, S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.