SCOPUS 정보 검색 플랫폼

Cognitive Computation

Volumn 6, Issue 4, 2014, Pages 928-939

GMM-Based Evaluation of Emotional Style Transformation in Czech and Slovak

(2) Přibil, Jiří a Přibilová, Anna b

a INSTITUTE OF MEASUREMENT SCIENCE (Slovakia)

b Institute of Electronics and Photonics (Slovakia)

Author keywords

Emotional speech transformation; GMM based emotion classification; Spectral and prosodic features of speech

Indexed keywords

BENCHMARKING; CLASSIFICATION (OF INFORMATION); SPEECH ANALYSIS; SPEECH SYNTHESIS; SUPPORT VECTOR MACHINES;

CLASSIFICATION ACCURACY; EMOTION CLASSIFICATION; EMOTIONAL SPEECH; FEED BACK INFORMATION; GAUSSIAN MIXTURE MODEL (GMMS); PROSODIC FEATURES; STATISTICAL EVALUATION; TEXT-TO-SPEECH SYSTEM;

QUALITY CONTROL;

EID: 84916205035 PISSN: 18669956 EISSN: 18669964 Source Type: Journal
DOI: 10.1007/s12559-014-9283-y Document Type: Article

Times cited : (10)

References (35)

1
- 84874556693
- Biometric applications related to human beings: there is life beyond security
- Faundez-Zanuy M, Hussain A, Mekyska J, Sesa-Nogueras E, Monte-Moreno E, Esposito A, Chetouani M, Garre-Olmo J, Abel A, Smékal Z, Lopez-de-Ipiña K. Biometric applications related to human beings: there is life beyond security. Cognit Comput. 2013;5(1):136–51.
- (2013) Cognit Comput , vol.5 , Issue.1 , pp. 136-151
- Faundez-Zanuy, M.¹ Hussain, A.² Mekyska, J.³ Sesa-Nogueras, E.⁴ Monte-Moreno, E.⁵ Esposito, A.⁶ Chetouani, M.⁷ Garre-Olmo, J.⁸ Abel, A.⁹ Smékal, Z.¹⁰ Lopez-de-Ipiña, K.¹¹

2
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- Reynolds DA, Quatieri TF, Dunn RB. Speaker verification using adapted Gaussian mixture models. Digit Signal Proc. 2000;10(1–3):19–41.
- (2000) Digit Signal Proc , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

3
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Reynolds DA, Rose RC. Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process. 1995;3(1):72–83.
- (1995) IEEE Trans Speech Audio Process , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

4
- 84870448053
- Speaker-characterized emotion recognition using online and iterative speaker adaptation
- Kim J-B, Park J-S, Oh Y-H. Speaker-characterized emotion recognition using online and iterative speaker adaptation. Cognit Comput. 2012;4(4):398–408.
- (2012) Cognit Comput , vol.4 , Issue.4 , pp. 398-408
- Kim, J.-B.¹ Park, J.-S.² Oh, Y.-H.³

5
- 54549099008
- Investigation on LP-residual representations for speaker identification
- Chetouani M, Faundez-Zanuy M, Gas B, Zarader JL. Investigation on LP-residual representations for speaker identification. Pattern Recogn. 2009;42(3):487–94.
- (2009) Pattern Recogn , vol.42 , Issue.3 , pp. 487-494
- Chetouani, M.¹ Faundez-Zanuy, M.² Gas, B.³ Zarader, J.L.⁴

6
- 29044444825
- Support vector machines for speaker and language recognition
- Campbell WM, Campbell JP, Reynolds DA, Singer E, Torres-Carrasquillo PA. Support vector machines for speaker and language recognition. Comput Speech Lang. 2006;20(2–3):210–29.
- (2006) Comput Speech Lang , vol.20 , Issue.2-3 , pp. 210-229
- Campbell, W.M.¹ Campbell, J.P.² Reynolds, D.A.³ Singer, E.⁴ Torres-Carrasquillo, P.A.⁵

7
- 84890439886
- GFM-based methods for speaker identification
- PID: 23193244
- Bhardwaj S, Srivastava S, Hanmandlu M, Gupta JRP. GFM-based methods for speaker identification. IEEE Trans Cybern. 2013;43(3):1047–58.
- (2013) IEEE Trans Cybern , vol.43 , Issue.3 , pp. 1047-1058
- Bhardwaj, S.¹ Srivastava, S.² Hanmandlu, M.³ Gupta, J.R.P.⁴

8
- 78649328053
- Survey on speech emotion recognition: features, classification schemes, and databases
- Ayadi ME, Kamel MS, Karray F. Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn. 2011;44(3):572–87.
- (2011) Pattern Recogn , vol.44 , Issue.3 , pp. 572-587
- Ayadi, M.E.¹ Kamel, M.S.² Karray, F.³

9
- 84874444772
- Emotion recognition from spontaneous Slavic speech
- Atassi H, Esposito A, Smékal Z. Emotion recognition from spontaneous Slavic speech. In: Proceedings of the IEEE international conference on cognitive infocommunications; 2012. p. 389–94.
- In: Proceedings of the IEEE international conference on cognitive infocommunications , vol.2012 , pp. 389-394
- Atassi, H.¹ Esposito, A.² Smékal, Z.³

10
- 84876449733
- Emotion recognition improvement using normalized formant. supplementary features by hybrid of DTW-MLP-GMM model
- Gharavian D, Sheikhan M, Ashoftedel F. Emotion recognition improvement using normalized formant. supplementary features by hybrid of DTW-MLP-GMM model. Neural Comput Appl. 2013;22(6):1181–91.
- (2013) Neural Comput Appl , vol.22 , Issue.6 , pp. 1181-1191
- Gharavian, D.¹ Sheikhan, M.² Ashoftedel, F.³

11
- 84894634753
- Class-specific multiple classifiers scheme to recognize emotions from speech signals
- Milton A., Tamil Selvi S. Class-specific multiple classifiers scheme to recognize emotions from speech signals. Comput Speech Lang. 2013. doi:10.1016/j.csl.2013.08.004.
- (2013) Comput Speech Lang
- Milton, A.¹ Tamil Selvi, S.²

12
- 84884611357
- Compensating for speaker or lexical variabilities in speech for emotion recognition
- Mariooryad S, Busso C. Compensating for speaker or lexical variabilities in speech for emotion recognition. Speech Commun. 2014;57:1–12. doi:10.1016/j.specom.2013.07.
- (2014) Speech Commun , vol.57 , pp. 1-12
- Mariooryad, S.¹ Busso, C.²

13
- 77950029338
- Voice conversion by mapping the speaker-specific features using pitch synchronous approach
- Rao KS. Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Comput Speech Lang. 2010;24(3):474–94.
- (2010) Comput Speech Lang , vol.24 , Issue.3 , pp. 474-494
- Rao, K.S.¹

14
- 84902548006
- On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis
- Maia R, Akamine M. On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis. Comput Speech Lang. 2013. doi:10.1016/j.csl.2013.10.001.
- (2013) Comput Speech Lang
- Maia, R.¹ Akamine, M.²

15
- 67650486451
- Spectrum modification for emotional speech synthesis
- Esposito A, Hussain A, Marinaro M, Martone R, (eds), Springer, Berlin:
- Přibilová A, Přibil J. Spectrum modification for emotional speech synthesis. In: Esposito A, Hussain A, Marinaro M, Martone R, editors. Multimodal signals: cognitive and algorithmic issues. LNAI 5398. Berlin: Springer; 2009. p. 232–41.
- (2009) Multimodal signals: cognitive and algorithmic issues. LNAI 5398 , pp. 232-241
- Přibilová, A.¹ Přibil, J.²

16
- 77952039799
- Harmonic model for female voice emotional synthesis
- Fierrez J, Ortega-Garcia J, Esposito A, Drygajlo A, Faundez-Zanuy M, (eds), Springer, Berlin:
- Přibilová A, Přibil J. Harmonic model for female voice emotional synthesis. In: Fierrez J, Ortega-Garcia J, Esposito A, Drygajlo A, Faundez-Zanuy M, editors. Biometric ID management and multimodal communication. LNCS 5707. Berlin: Springer; 2009. p. 41–8.
- (2009) Biometric ID management and multimodal communication. LNCS 5707 , pp. 41-48
- Přibilová, A.¹ Přibil, J.²

17
- 84866939235
- New cepstral zero-pole vocal tract models for TTS synthesis
- Vích R, Přibil J, Smékal Z. New cepstral zero-pole vocal tract models for TTS synthesis. In: Proceedings of IEEE Region 8 EUROCON’2001; 2001, vol. 2, p. 458–62.
- In: Proceedings of IEEE Region 8 EUROCON’2001; 2001 , vol.2 , pp. 458-462
- Vích, R.¹ Přibil, J.² Smékal, Z.³

18
- 0037384712
- Vocal communication of emotion: a review of research paradigms
- Scherer KR. Vocal communication of emotion: a review of research paradigms. Speech Commun. 2003;40(1–2):227–56.
- (2003) Speech Commun , vol.40 , Issue.1-2 , pp. 227-256
- Scherer, K.R.¹

19
- 80052706006
- Statistical analysis of complementary spectral features of emotional speech in Czech and Slovak
- Habernal I, Matoušek V, (eds), Springer, Berlin:
- Přibil J, Přibilová A. Statistical analysis of complementary spectral features of emotional speech in Czech and Slovak. In: Habernal I, Matoušek V, editors. Text, speech and dialogue. LNAI 6836. Berlin: Springer; 2011. p. 299–306.
- (2011) Text, speech and dialogue. LNAI 6836 , pp. 299-306
- Přibil, J.¹ Přibilová, A.²

20
- 80051635711
- Comparison of spectral and prosodic parameters of male and female emotional speech in Czech and Slovak
- Přibil J, Přibilová A. Comparison of spectral and prosodic parameters of male and female emotional speech in Czech and Slovak. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP); 2011, p. 4720–3.
- In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP) , vol.2011 , pp. 4720-4723
- Přibil, J.¹ Přibilová, A.²

21
- 84867336595
- Automatic speaker age and gender recognition using acoustic and prosodic level information fusion
- Li M, Han KJ, Narayan S. Automatic speaker age and gender recognition using acoustic and prosodic level information fusion. Comput Speech Lang. 2013;27(1):151–67.
- (2013) Comput Speech Lang , vol.27 , Issue.1 , pp. 151-167
- Li, M.¹ Han, K.J.² Narayan, S.³

22
- 84887051130
- Evaluation of influence of spectral and prosodic features on GMM classification of Czech and Slovak emotional speech
- Přibil J, Přibilová A. Evaluation of influence of spectral and prosodic features on GMM classification of Czech and Slovak emotional speech. EURASIP J Audio Speech Music Process. 2013;2013(8):1–22.
- (2013) EURASIP J Audio Speech Music Process , vol.2013 , Issue.8 , pp. 1-22
- Přibil, J.¹ Přibilová, A.²

23
- 82955173836
- Influence of visual stimuli on evaluation of converted emotional speech by listening tests
- Esposito A, Vinciarelli A, Vicsi K, Pelachaud C, Nijholt A, (eds), Springer, Berlin:
- Přibil J, Přibilová A. Influence of visual stimuli on evaluation of converted emotional speech by listening tests. In: Esposito A, Vinciarelli A, Vicsi K, Pelachaud C, Nijholt A, editors. Analysis of verbal and nonverbal communication and enactment. LNCS 6800. Berlin: Springer; 2011. p. 378–92.
- (2011) Analysis of verbal and nonverbal communication and enactment. LNCS 6800 , pp. 378-392
- Přibil, J.¹ Přibilová, A.²

24
- 57349126313
- Inter-coder agreement for computational linguistics
- Artstein R, Poesio M. Inter-coder agreement for computational linguistics. Comput Linguist. 2008;4:555–96. doi:10.1162/coli.07-034-R2.
- (2008) Comput Linguist , vol.4 , pp. 555-596
- Artstein, R.¹ Poesio, M.²

25
- 84902364868
- Siegert I, Böck R, Wendemuth A. Inter-rater reliability for emotion annotation in human-computer interaction—comparison and methodological improvements. J Multimodal User Interfaces Special Issue From Multimodal Analysis to Real-Time Interactions with Virtual Agents, doi:, Springer, 2013 (online)
- Siegert I, Böck R, Wendemuth A. Inter-rater reliability for emotion annotation in human-computer interaction—comparison and methodological improvements. J Multimodal User Interfaces Special Issue From Multimodal Analysis to Real-Time Interactions with Virtual Agents, doi:10.1007/s12193-013-0129-9, Springer, 2013 (online).

26
- 33745202280
- Burkhardt F, Paeschke A, Rolfes M, Sendlmeier W, Weiss B. A database of German emotional speech. In Proceedings of INTERSPEECH 2005, Lisbon, Portugal, p. 1517–1520.
- (2005) Lisbon, Portugal , pp. 1517-1520
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeier, W.⁴

27
- 67650474760
- Recognition of emotions in german speech using Gaussian Mixture models
- Esposito A, Hussain A, Marinaro M, Martone R, (eds), Springer, Berlin:
- Vondra M, Vích R. Recognition of emotions in german speech using Gaussian Mixture models. In: Esposito A, Hussain A, Marinaro M, Martone R, editors. Multimodal signals: cognitive and algorithmic issues. LNAI 5398. Berlin: Springer; 2009. p. 256–63.
- (2009) Multimodal signals: cognitive and algorithmic issues. LNAI 5398 , pp. 256-263
- Vondra, M.¹ Vích, R.²

28
- 77956401353
- Class-level spectral features for emotion recognition
- PID: 23794771
- Bitouk D, Verma R, Nenkova A. Class-level spectral features for emotion recognition. Speech Commun. 2010;52:613–25.
- (2010) Speech Commun , vol.52 , pp. 613-625
- Bitouk, D.¹ Verma, R.² Nenkova, A.³

29
- 84886500247
- Class-specific GMM based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines
- Dileep AD, Sekhar CC. Class-specific GMM based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines. Speech Commun. 2014;57:126–43.
- (2014) Speech Commun , vol.57 , pp. 126-143
- Dileep, A.D.¹ Sekhar, C.C.²

30
- 84887669226
- Novel approach in speaker identification using SVM and GMM
- Bourouba H, Korba CA, Djemili R. Novel approach in speaker identification using SVM and GMM. Control Eng Appl Inform. 2013;15(3):87–95.
- (2013) Control Eng Appl Inform , vol.15 , Issue.3 , pp. 87-95
- Bourouba, H.¹ Korba, C.A.² Djemili, R.³

31
- 84864723353
- Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema
- Kotti M, Paternò F. Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema. Int J Speech Technol. 2012;15:131–50. doi:10.1007/s10772-012-9127-7.
- (2012) Int J Speech Technol , vol.15 , pp. 131-150
- Kotti, M.¹ Paternò, F.²

32
- 80053925819
- Cross-corpus acoustic emotion recognition: variances and strategies
- Schuller B, Vlasenko B, Eyben F, Wollmer M, Stuhlsatz A, Wendemuth A, Rigoll G. Cross-corpus acoustic emotion recognition: variances and strategies. IEEE Trans Affect Comput. 2010;1(2):119–31.
- (2010) IEEE Trans Affect Comput , vol.1 , Issue.2 , pp. 119-131
- Schuller, B.¹ Vlasenko, B.² Eyben, F.³ Wollmer, M.⁴ Stuhlsatz, A.⁵ Wendemuth, A.⁶ Rigoll, G.⁷

33
- 84916235222
- Nabney IT. Netlab Pattern Analysis Toolbox. Copyright (1996–2001). Retrieved 16 Feb 2012, from
- Nabney IT. Netlab Pattern Analysis Toolbox. Copyright (1996–2001). Retrieved 16 Feb 2012, from http://www.mathworks.com/matlabcentral/fileexchange/2654-netlab.

34
- 33947164164
- An evaluation of the robustness of existing supervised machine learning approaches to the classification of emotions in speech
- Shami M, Verhelst W. An evaluation of the robustness of existing supervised machine learning approaches to the classification of emotions in speech. Speech Commun. 2007;49:201–12.
- (2007) Speech Commun , vol.49 , pp. 201-212
- Shami, M.¹ Verhelst, W.²

35
- 84884955838
- SVM-based detection of misannotated words in read speech corpora
- Habernal I, Matoušek V, (eds), Springer, Berlin:
- Matoušek J, Tihelka D. SVM-based detection of misannotated words in read speech corpora. In: Habernal I, Matoušek V, editors. Text, speech, and dialogue. LNCS 8082. Berlin: Springer; 2013. p. 457–64.
- (2013) Text, speech, and dialogue. LNCS 8082 , pp. 457-464
- Matoušek, J.¹ Tihelka, D.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.