Volume 6, Issue 4, 2014, Pages 928-939

GMM-Based Evaluation of Emotional Style Transformation in Czech and Slovak

Author keywords

Emotional speech transformation; GMM based emotion classification; Spectral and prosodic features of speech

Indexed keywords

BENCHMARKING; CLASSIFICATION (OF INFORMATION); SPEECH ANALYSIS; SPEECH SYNTHESIS; SUPPORT VECTOR MACHINES

EID: 84916205035     PISSN: 18669956     EISSN: 18669964     Source Type: Journal    
DOI: 10.1007/s12559-014-9283-y     Document Type: Article
Times cited: 10

References (35)
  • 2
    • 0033884858
    • Speaker verification using adapted Gaussian mixture models
    • Reynolds DA, Quatieri TF, Dunn RB. Speaker verification using adapted Gaussian mixture models. Digit Signal Proc. 2000;10(1–3):19–41.
    • (2000) Digit Signal Proc , vol.10 , Issue.1-3 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 3
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Reynolds DA, Rose RC. Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans Speech Audio Process. 1995;3(1):72–83.
    • (1995) IEEE Trans Speech Audio Process , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 4
    • 84870448053
    • Speaker-characterized emotion recognition using online and iterative speaker adaptation
    • Kim J-B, Park J-S, Oh Y-H. Speaker-characterized emotion recognition using online and iterative speaker adaptation. Cognit Comput. 2012;4(4):398–408.
    • (2012) Cognit Comput , vol.4 , Issue.4 , pp. 398-408
    • Kim, J.-B.1    Park, J.-S.2    Oh, Y.-H.3
  • 5
    • 54549099008
    • Investigation on LP-residual representations for speaker identification
    • Chetouani M, Faundez-Zanuy M, Gas B, Zarader JL. Investigation on LP-residual representations for speaker identification. Pattern Recogn. 2009;42(3):487–94.
    • (2009) Pattern Recogn , vol.42 , Issue.3 , pp. 487-494
    • Chetouani, M.1    Faundez-Zanuy, M.2    Gas, B.3    Zarader, J.L.4
  • 8
    • 78649328053
    • Survey on speech emotion recognition: features, classification schemes, and databases
    • Ayadi ME, Kamel MS, Karray F. Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn. 2011;44(3):572–87.
    • (2011) Pattern Recogn , vol.44 , Issue.3 , pp. 572-587
    • Ayadi, M.E.1    Kamel, M.S.2    Karray, F.3
  • 10
    • 84876449733
    • Emotion recognition improvement using normalized formant supplementary features by hybrid of DTW-MLP-GMM model
    • Gharavian D, Sheikhan M, Ashoftedel F. Emotion recognition improvement using normalized formant supplementary features by hybrid of DTW-MLP-GMM model. Neural Comput Appl. 2013;22(6):1181–91.
    • (2013) Neural Comput Appl , vol.22 , Issue.6 , pp. 1181-1191
    • Gharavian, D.1    Sheikhan, M.2    Ashoftedel, F.3
  • 11
    • 84894634753
    • Class-specific multiple classifiers scheme to recognize emotions from speech signals
    • Milton A., Tamil Selvi S. Class-specific multiple classifiers scheme to recognize emotions from speech signals. Comput Speech Lang. 2013. doi:10.1016/j.csl.2013.08.004.
    • (2013) Comput Speech Lang
    • Milton, A.1    Tamil Selvi, S.2
  • 12
    • 84884611357
    • Compensating for speaker or lexical variabilities in speech for emotion recognition
    • Mariooryad S, Busso C. Compensating for speaker or lexical variabilities in speech for emotion recognition. Speech Commun. 2014;57:1–12. doi:10.1016/j.specom.2013.07.
    • (2014) Speech Commun , vol.57 , pp. 1-12
    • Mariooryad, S.1    Busso, C.2
  • 13
    • 77950029338
    • Voice conversion by mapping the speaker-specific features using pitch synchronous approach
    • Rao KS. Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Comput Speech Lang. 2010;24(3):474–94.
    • (2010) Comput Speech Lang , vol.24 , Issue.3 , pp. 474-494
    • Rao, K.S.1
  • 14
    • 84902548006
    • On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis
    • Maia R, Akamine M. On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis. Comput Speech Lang. 2013. doi:10.1016/j.csl.2013.10.001.
    • (2013) Comput Speech Lang
    • Maia, R.1    Akamine, M.2
  • 15
    • 67650486451
    • Spectrum modification for emotional speech synthesis
    • Esposito A, Hussain A, Marinaro M, Martone R (eds). Berlin: Springer
    • Přibilová A, Přibil J. Spectrum modification for emotional speech synthesis. In: Esposito A, Hussain A, Marinaro M, Martone R, editors. Multimodal signals: cognitive and algorithmic issues. LNAI 5398. Berlin: Springer; 2009. p. 232–41.
    • (2009) Multimodal signals: cognitive and algorithmic issues. LNAI 5398 , pp. 232-241
    • Přibilová, A.1    Přibil, J.2
  • 16
    • 77952039799
    • Harmonic model for female voice emotional synthesis
    • Fierrez J, Ortega-Garcia J, Esposito A, Drygajlo A, Faundez-Zanuy M (eds). Berlin: Springer
    • Přibilová A, Přibil J. Harmonic model for female voice emotional synthesis. In: Fierrez J, Ortega-Garcia J, Esposito A, Drygajlo A, Faundez-Zanuy M, editors. Biometric ID management and multimodal communication. LNCS 5707. Berlin: Springer; 2009. p. 41–8.
    • (2009) Biometric ID management and multimodal communication. LNCS 5707 , pp. 41-48
    • Přibilová, A.1    Přibil, J.2
  • 18
    • 0037384712
    • Vocal communication of emotion: a review of research paradigms
    • Scherer KR. Vocal communication of emotion: a review of research paradigms. Speech Commun. 2003;40(1–2):227–56.
    • (2003) Speech Commun , vol.40 , Issue.1-2 , pp. 227-256
    • Scherer, K.R.1
  • 19
    • 80052706006
    • Statistical analysis of complementary spectral features of emotional speech in Czech and Slovak
    • Habernal I, Matoušek V (eds). Berlin: Springer
    • Přibil J, Přibilová A. Statistical analysis of complementary spectral features of emotional speech in Czech and Slovak. In: Habernal I, Matoušek V, editors. Text, speech and dialogue. LNAI 6836. Berlin: Springer; 2011. p. 299–306.
    • (2011) Text, speech and dialogue. LNAI 6836 , pp. 299-306
    • Přibil, J.1    Přibilová, A.2
  • 21
    • 84867336595
    • Automatic speaker age and gender recognition using acoustic and prosodic level information fusion
    • Li M, Han KJ, Narayanan S. Automatic speaker age and gender recognition using acoustic and prosodic level information fusion. Comput Speech Lang. 2013;27(1):151–67.
    • (2013) Comput Speech Lang , vol.27 , Issue.1 , pp. 151-167
    • Li, M.1    Han, K.J.2    Narayanan, S.3
  • 22
    • 84887051130
    • Evaluation of influence of spectral and prosodic features on GMM classification of Czech and Slovak emotional speech
    • Přibil J, Přibilová A. Evaluation of influence of spectral and prosodic features on GMM classification of Czech and Slovak emotional speech. EURASIP J Audio Speech Music Process. 2013;2013(8):1–22.
    • (2013) EURASIP J Audio Speech Music Process , vol.2013 , Issue.8 , pp. 1-22
    • Přibil, J.1    Přibilová, A.2
  • 23
    • 82955173836
    • Influence of visual stimuli on evaluation of converted emotional speech by listening tests
    • Esposito A, Vinciarelli A, Vicsi K, Pelachaud C, Nijholt A (eds). Berlin: Springer
    • Přibil J, Přibilová A. Influence of visual stimuli on evaluation of converted emotional speech by listening tests. In: Esposito A, Vinciarelli A, Vicsi K, Pelachaud C, Nijholt A, editors. Analysis of verbal and nonverbal communication and enactment. LNCS 6800. Berlin: Springer; 2011. p. 378–92.
    • (2011) Analysis of verbal and nonverbal communication and enactment. LNCS 6800 , pp. 378-392
    • Přibil, J.1    Přibilová, A.2
  • 24
    • 57349126313
    • Inter-coder agreement for computational linguistics
    • Artstein R, Poesio M. Inter-coder agreement for computational linguistics. Comput Linguist. 2008;4:555–96. doi:10.1162/coli.07-034-R2.
    • (2008) Comput Linguist , vol.4 , pp. 555-596
    • Artstein, R.1    Poesio, M.2
  • 25
    • 84902364868
    • Inter-rater reliability for emotion annotation in human-computer interaction: comparison and methodological improvements
    • Siegert I, Böck R, Wendemuth A. Inter-rater reliability for emotion annotation in human-computer interaction—comparison and methodological improvements. J Multimodal User Interfaces Special Issue From Multimodal Analysis to Real-Time Interactions with Virtual Agents, doi:10.1007/s12193-013-0129-9, Springer, 2013 (online).
  • 27
    • 67650474760
    • Recognition of emotions in German speech using Gaussian Mixture models
    • Esposito A, Hussain A, Marinaro M, Martone R (eds). Berlin: Springer
    • Vondra M, Vích R. Recognition of emotions in German speech using Gaussian Mixture models. In: Esposito A, Hussain A, Marinaro M, Martone R, editors. Multimodal signals: cognitive and algorithmic issues. LNAI 5398. Berlin: Springer; 2009. p. 256–63.
    • (2009) Multimodal signals: cognitive and algorithmic issues. LNAI 5398 , pp. 256-263
    • Vondra, M.1    Vích, R.2
  • 28
    • 77956401353
    • Class-level spectral features for emotion recognition
    • PID: 23794771
    • Bitouk D, Verma R, Nenkova A. Class-level spectral features for emotion recognition. Speech Commun. 2010;52:613–25.
    • (2010) Speech Commun , vol.52 , pp. 613-625
    • Bitouk, D.1    Verma, R.2    Nenkova, A.3
  • 29
    • 84886500247
    • Class-specific GMM based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines
    • Dileep AD, Sekhar CC. Class-specific GMM based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines. Speech Commun. 2014;57:126–43.
    • (2014) Speech Commun , vol.57 , pp. 126-143
    • Dileep, A.D.1    Sekhar, C.C.2
  • 30
    • 84887669226
    • Novel approach in speaker identification using SVM and GMM
    • Bourouba H, Korba CA, Djemili R. Novel approach in speaker identification using SVM and GMM. Control Eng Appl Inform. 2013;15(3):87–95.
    • (2013) Control Eng Appl Inform , vol.15 , Issue.3 , pp. 87-95
    • Bourouba, H.1    Korba, C.A.2    Djemili, R.3
  • 31
    • 84864723353
    • Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema
    • Kotti M, Paternò F. Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema. Int J Speech Technol. 2012;15:131–50. doi:10.1007/s10772-012-9127-7.
    • (2012) Int J Speech Technol , vol.15 , pp. 131-150
    • Kotti, M.1    Paternò, F.2
  • 33
    • 84916235222
    • Netlab Pattern Analysis Toolbox
    • Nabney IT. Netlab Pattern Analysis Toolbox. Copyright (1996–2001). Retrieved 16 Feb 2012, from http://www.mathworks.com/matlabcentral/fileexchange/2654-netlab.
  • 34
    • 33947164164
    • An evaluation of the robustness of existing supervised machine learning approaches to the classification of emotions in speech
    • Shami M, Verhelst W. An evaluation of the robustness of existing supervised machine learning approaches to the classification of emotions in speech. Speech Commun. 2007;49:201–12.
    • (2007) Speech Commun , vol.49 , pp. 201-212
    • Shami, M.1    Verhelst, W.2
  • 35
    • 84884955838
    • SVM-based detection of misannotated words in read speech corpora
    • Habernal I, Matoušek V (eds). Berlin: Springer
    • Matoušek J, Tihelka D. SVM-based detection of misannotated words in read speech corpora. In: Habernal I, Matoušek V, editors. Text, speech, and dialogue. LNCS 8082. Berlin: Springer; 2013. p. 457–64.
    • (2013) Text, speech, and dialogue. LNCS 8082 , pp. 457-464
    • Matoušek, J.1    Tihelka, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.