메뉴 건너뛰기




Volumn 53, Issue 9-10, 2011, Pages 1172-1185

Application of speaker- and language identification state-of-the-art techniques for emotion recognition

Author keywords

Emotion recognition; Gaussian mixture models; Intersession variability compensation; Maximum mutual information; Score level fusion

Indexed keywords

EMOTION RECOGNITION; GAUSSIAN MIXTURE MODEL; INTERSESSION VARIABILITY; MAXIMUM-MUTUAL-INFORMATION; SCORE-LEVEL FUSION;

EID: 79960848738     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2011.01.007     Document Type: Article
Times cited : (61)

References (33)
  • 5
    • 29044433376 scopus 로고    scopus 로고
    • Application-independent evaluation of speaker detection
    • DOI 10.1016/j.csl.2005.08.001, PII S0885230805000483, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
    • N. Brümmer, and J. du Preez Application-independent evaluation of speaker detection Comput. Speech Lang. 20 2-3 2006 230 275 (Pubitemid 41787538)
    • (2006) Computer Speech and Language , vol.20 , Issue.2-3 SPEC. ISS. , pp. 230-275
    • Brummer, N.1    Du Preez, J.2
  • 8
    • 33745224873 scopus 로고
    • Vocal tract normalization in speech recognition: Compensating for systematic speaker variability
    • J. Cohen, T. Kamm, and A. Andreou Vocal tract normalization in speech recognition: Compensating for systematic speaker variability J. Acoust. Soc. Amer. 97 1995 3246
    • (1995) J. Acoust. Soc. Amer. , vol.97 , pp. 3246
    • Cohen, J.1    Kamm, T.2    Andreou, A.3
  • 9
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. Davis, and P. Mermelstein Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Trans. Audio, Speech Lang Process. 28 pp. 1-4 1980 357 366 (Pubitemid 11464930)
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis Steven, B.1    Mermelstein Paul2
  • 11
    • 67649524984 scopus 로고    scopus 로고
    • Performance analysis of spectral and prosodic features and their fusion for emotion recognition in speech
    • SLT 2008. IEEE
    • Gaurav, M.; 2008. Performance analysis of spectral and prosodic features and their fusion for emotion recognition in speech. In: Spoken Language Technology Workshop, 2008. SLT 2008. IEEE, pp. 313-316.
    • (2008) Spoken Language Technology Workshop, 2008 , pp. 313-316
    • Gaurav, M.1
  • 13
    • 79960837924 scopus 로고    scopus 로고
    • Discriminative training and channel compensation for acoustic language recognition
    • Hubeika, V.; Burget, L.; Matejka, P.; Schwarz, P.; 2008. Discriminative training and channel compensation for acoustic language recognition. In: Proceedings of Interspeech, 1990-9772.
    • (2008) Proceedings of Interspeech , pp. 1990-9772
    • Hubeika, V.1    Burget, L.2    Matejka, P.3    Schwarz, P.4
  • 15
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen, and H. Li An overview of text-independent speaker recognition: From features to supervectors Speech Commun. 52 1 2010 12 40
    • (2010) Speech Commun. , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 16
    • 67649543737 scopus 로고    scopus 로고
    • Contour modeling of prosodic and acoustic features for speaker recognition
    • IEEE
    • Kockmann, M.; Burget, L.; 2008. Contour modeling of prosodic and acoustic features for speaker recognition. In: Spoken Language Technology Workshop, SLT 2008. IEEE, pp. 45-48.
    • (2008) Spoken Language Technology Workshop, SLT 2008 , pp. 45-48
    • Kockmann, M.1    Burget, L.2
  • 17
    • 70450177653 scopus 로고    scopus 로고
    • Brno University of Technology System for Interspeech 2009 Emotion Challenge
    • Brighton
    • Kockmann, M.; Burget, L.; Cernocky, J.; 2009. Brno University of technology system for interspeech 2009 emotion challenge. In: Proceedings of Interspeech, Brighton, pp. 348-351.
    • (2009) Proceedings of Interspeech , pp. 348-351
    • Kockmann, M.1    Burget, L.2    Cernocky, J.3
  • 19
    • 34548833109 scopus 로고    scopus 로고
    • Brno University of Technology system for NIST 2005 language recognition evaluation
    • Matejka, P.; Burget, L.; Schwarz, P.; Cernocky, J.; 2006. Brno University of Technology system for NIST 2005 language recognition evaluation. In: Proceedings of Odyssey.
    • (2006) Proceedings of Odyssey
    • Matejka, P.1    Burget, L.2    Schwarz, P.3    Cernocky, J.4
  • 23
    • 78149472083 scopus 로고    scopus 로고
    • Emotion recognition in the noise applying large acoustic feature sets
    • Dresden
    • Schuller, B.; Arsic, D.; Wallhoff, F.; Rigoll, G.; 2006. Emotion recognition in the noise applying large acoustic feature sets. Speech Prosody, Dresden.
    • (2006) Speech Prosody
    • Schuller, B.1    Arsic, D.2    Wallhoff, F.3    Rigoll, G.4
  • 26
    • 33947620115 scopus 로고    scopus 로고
    • Hierarchical structures of neural networks for phoneme recognition
    • Toulouse
    • Schwarz, P.; Matejka, P.; Cernocky, J.; 2006. Hierarchical structures of neural networks for phoneme recognition. In: Proceedings of ICASSP 2006, Toulouse, pp. 325-328.
    • (2006) Proceedings of ICASSP 2006 , pp. 325-328
    • Schwarz, P.1    Matejka, P.2    Cernocky, J.3
  • 28
    • 70450188723 scopus 로고    scopus 로고
    • Does session variability compensation in speaker recognition model intrinsic variation under mismatched conditions?
    • E. Shriberg, S. Kajarekar, and N. Scheffer Does session variability compensation in speaker recognition model intrinsic variation under mismatched conditions? Interspeech Brighton 2009
    • (2009) Interspeech Brighton
    • Shriberg, E.1    Kajarekar, S.2    Scheffer, N.3
  • 29
    • 79952014572 scopus 로고    scopus 로고
    • Automatic classification of emotion-related user states in spontaneous children's speech
    • Bd. 28, ISBN 978-3-8325-2145-5, 1-260 (January)
    • Steidl, S.; 2009. Automatic classification of emotion-related user states in spontaneous children's speech. Studien zur Mustererkennung, Bd. 28, ISBN 978-3-8325-2145-5, 1-260 (January).
    • (2009) Studien Zur Mustererkennung
    • Steidl, S.1
  • 32
    • 56149115138 scopus 로고    scopus 로고
    • Combining frame and turn-level information for robust recognition of emotions within speech
    • Vlasenko, B.; Schuller, B.; Wendemuth, A.; Rigoll, G.; 2007. Combining frame and turn-level information for robust recognition of emotions within speech. In: Proceedings of Interspeech, pp. 2249-2252.
    • (2007) Proceedings of Interspeech , pp. 2249-2252
    • Vlasenko, B.1    Schuller, B.2    Wendemuth, A.3    Rigoll, G.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.