SCOPUS Information Search Platform

Volume 53, Issue 9-10, 2011, Pages 1172-1185

Application of speaker- and language identification state-of-the-art techniques for emotion recognition

Authors (3): Kockmann, Marcel (a); Burget, Lukáš (a); Černocký, Jan "Honza" (a)

(a) Brno University of Technology (Czech Republic)

Author keywords

Emotion recognition; Gaussian mixture models; Intersession variability compensation; Maximum mutual information; Score level fusion

Indexed keywords

EMOTION RECOGNITION; GAUSSIAN MIXTURE MODEL; INTERSESSION VARIABILITY; MAXIMUM-MUTUAL-INFORMATION; SCORE-LEVEL FUSION; EXPERIMENTS; FEATURE EXTRACTION; GAUSSIAN DISTRIBUTION; SPEECH ANALYSIS; SPEECH RECOGNITION

EID: 79960848738 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2011.01.007 Document Type: Article

Times cited: 61

References (33)

1
- 34547505647
- Combining efforts for improving automatic classification of emotional user states
- Batliner, A.; Steidl, S.; Schuller, B.; Seppi, D.; Laskowski, K.; Vogt, T.; Devillers, L.; Vidrascu, L.; Amir, N.; Kessous, L.; 2006. Combining efforts for improving automatic classification of emotional user states. In: Proceedings of IS-LTC, pp. 240-245.
- (2006) Proceedings of IS-LTC , pp. 240-245
- Batliner, A.¹ Steidl, S.² Schuller, B.³ Seppi, D.⁴ Laskowski, K.⁵ Vogt, T.⁶ Devillers, L.⁷ Vidrascu, L.⁸ Amir, N.⁹ Kessous, L.¹⁰

2
- 33846516584
- Bishop, C.; 2006. Pattern recognition and machine learning.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.¹

3
- 34547496857
- Spescom DataVoice NIST 2004 system description
- Toledo, Spain, June. 2004
- Brümmer, N.; 2004. Spescom DataVoice NIST 2004 system description. In: Proceedings NIST Speaker Recognition Evaluation 2004, Toledo, Spain, June. 2004.
- (2004) Proceedings NIST Speaker Recognition Evaluation 2004
- Brümmer, N.¹

4
- 51449086024
- Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006
- N. Brümmer, L. Burget, J. Cernocky, O. Glembek, F. Grezl, M. Karafiat, D.A. van Leeuwen, P. Matejka, P. Schwarz, and A. Strasheim Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006 IEEE Trans. Audio, Speech Lang. Process. 15 7 2007 2072 2084
- (2007) IEEE Trans. Audio, Speech Lang. Process. , vol.15 , Issue.7 , pp. 2072-2084
- Brümmer, N.¹ Burget, L.² Cernocky, J.³ Glembek, O.⁴ Grezl, F.⁵ Karafiat, M.⁶ Van Leeuwen, D.A.⁷ Matejka, P.⁸ Schwarz, P.⁹ Strasheim, A.¹⁰

5
- 29044433376
- Application-independent evaluation of speaker detection
- DOI 10.1016/j.csl.2005.08.001, PII S0885230805000483, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
- N. Brümmer, and J. du Preez Application-independent evaluation of speaker detection Comput. Speech Lang. 20 2-3 2006 230 275 (Pubitemid 41787538)
- (2006) Computer Speech and Language , vol.20 , Issue.2-3 SPEC. ISS. , pp. 230-275
- Brummer, N.¹ Du Preez, J.²

6
- 58349102016
- Analysis of feature extraction and channel compensation in a GMM speaker recognition system
- L. Burget, P. Matejka, P. Schwarz, O. Glembek, and J. Cernocky Analysis of feature extraction and channel compensation in a GMM speaker recognition system IEEE Trans. Audio, Speech, Lang. Process. 15 7 2007 1979 1986
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 1979-1986
- Burget, L.¹ Matejka, P.² Schwarz, P.³ Glembek, O.⁴ Cernocky, J.⁵

7
- 33745202280
- A database of German emotional speech
- Burkhardt, F.; Paeschke, A.; Rolfes, M.; Sendlmeier, W.; Weiss, B.; 2005. A database of German emotional speech. In: Ninth European Conference on Speech Communication and Technology.
- (2005) Ninth European Conference on Speech Communication and Technology
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeier, W.⁴ Weiss, B.⁵

8
- 33745224873
- Vocal tract normalization in speech recognition: Compensating for systematic speaker variability
- J. Cohen, T. Kamm, and A. Andreou Vocal tract normalization in speech recognition: Compensating for systematic speaker variability J. Acoust. Soc. Amer. 97 1995 3246
- (1995) J. Acoust. Soc. Amer. , vol.97 , pp. 3246
- Cohen, J.¹ Kamm, T.² Andreou, A.³

9
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S. Davis, and P. Mermelstein Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Trans. Acoust., Speech, Signal Process. 28 4 1980 357 366 (Pubitemid 11464930)
- (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis Steven, B.¹ Mermelstein Paul²

10
- 80051639925
- Front-end factor analysis for speaker verification
- N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet Front-end factor analysis for speaker verification IEEE Trans. Audio, Speech Lang. Process. 2009 1 23
- (2009) IEEE Trans. Audio, Speech Lang. Process. , pp. 1-23
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

11
- 67649524984
- Performance analysis of spectral and prosodic features and their fusion for emotion recognition in speech
- SLT 2008. IEEE
- Gaurav, M.; 2008. Performance analysis of spectral and prosodic features and their fusion for emotion recognition in speech. In: Spoken Language Technology Workshop, 2008. SLT 2008. IEEE, pp. 313-316.
- (2008) Spoken Language Technology Workshop, 2008 , pp. 313-316
- Gaurav, M.¹

12
- 0028517164
- RASTA processing of speech
- H. Hermansky, and N. Morgan RASTA processing of speech IEEE Trans. Speech Audio Process. 2 4 1994 578 589
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

13
- 79960837924
- Discriminative training and channel compensation for acoustic language recognition
- Hubeika, V.; Burget, L.; Matejka, P.; Schwarz, P.; 2008. Discriminative training and channel compensation for acoustic language recognition. In: Proceedings of Interspeech, ISSN 1990-9772.
- (2008) Proceedings of Interspeech , ISSN 1990-9772
- Hubeika, V.¹ Burget, L.² Matejka, P.³ Schwarz, P.⁴

14
- 58349106697
- A study of inter-speaker variability in speaker verification
- P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel A study of inter-speaker variability in speaker verification IEEE Trans. Audio, Speech, Lang. Process. 16 5 2008 980 988
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.5 , pp. 980-988
- Kenny, P.¹ Ouellet, P.² Dehak, N.³ Gupta, V.⁴ Dumouchel, P.⁵

15
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- T. Kinnunen, and H. Li An overview of text-independent speaker recognition: From features to supervectors Speech Commun. 52 1 2010 12 40
- (2010) Speech Commun. , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

16
- 67649543737
- Contour modeling of prosodic and acoustic features for speaker recognition
- IEEE
- Kockmann, M.; Burget, L.; 2008. Contour modeling of prosodic and acoustic features for speaker recognition. In: Spoken Language Technology Workshop, SLT 2008. IEEE, pp. 45-48.
- (2008) Spoken Language Technology Workshop, SLT 2008 , pp. 45-48
- Kockmann, M.¹ Burget, L.²

17
- 70450177653
- Brno University of Technology System for Interspeech 2009 Emotion Challenge
- Brighton
- Kockmann, M.; Burget, L.; Cernocky, J.; 2009. Brno University of Technology system for Interspeech 2009 Emotion Challenge. In: Proceedings of Interspeech, Brighton, pp. 348-351.
- (2009) Proceedings of Interspeech , pp. 348-351
- Kockmann, M.¹ Burget, L.² Cernocky, J.³

18
- 84867212876
- BUT language recognition system for NIST 2007 evaluations
- Matejka, P.; Burget, L.; Glembek, O.; Schwarz, P.; Hubeika, V.; Fapso, M.; Mikolov, T.; Plchot, O.; Cernocky, J.; 2008. BUT language recognition system for NIST 2007 evaluations. In: Proceedings of Interspeech.
- (2008) Proceedings of Interspeech
- Matejka, P.¹ Burget, L.² Glembek, O.³ Schwarz, P.⁴ Hubeika, V.⁵ Fapso, M.⁶ Mikolov, T.⁷ Plchot, O.⁸ Cernocky, J.⁹

19
- 34548833109
- Brno University of Technology system for NIST 2005 language recognition evaluation
- Matejka, P.; Burget, L.; Schwarz, P.; Cernocky, J.; 2006. Brno University of Technology system for NIST 2005 language recognition evaluation. In: Proceedings of Odyssey.
- (2006) Proceedings of Odyssey
- Matejka, P.¹ Burget, L.² Schwarz, P.³ Cernocky, J.⁴

20
- 34250628706
- NIST
- NIST, 2005. The 2005 NIST language recognition evaluation plan, pp. 1-6.
- (2005) The 2005 NIST Language Recognition Evaluation Plan , pp. 1-6

21
- 4544265717
- Ph.D thesis, Cambridge University Engineering Department, 2003
- Povey, D.; 2003. Discriminative Training for Large Vocabulary Speech Recognition. Ph.D thesis, Cambridge University Engineering Department, 2003, pp. 1-172.
- (2003) Discriminative Training for Large Vocabulary Speech Recognition , pp. 1-172
- Povey, D.¹

22
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- DOI 10.1006/dspr.1999.0361
- D. Reynolds, T. Quatieri, and R. Dunn Speaker verification using adapted Gaussian Mixture Models Digital Signal Process. 10 1-3 2000 19 41 (Pubitemid 30592166)
- (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

23
- 78149472083
- Emotion recognition in the noise applying large acoustic feature sets
- Dresden
- Schuller, B.; Arsic, D.; Wallhoff, F.; Rigoll, G.; 2006. Emotion recognition in the noise applying large acoustic feature sets. Speech Prosody, Dresden.
- (2006) Speech Prosody
- Schuller, B.¹ Arsic, D.² Wallhoff, F.³ Rigoll, G.⁴

24
- 48249094713
- The relevance of feature type for the automatic classification of emotional user states: Low level descriptors and functionals
- June
- B. Schuller, A. Batliner, D. Seppi, S. Steidl, T. Vogt, J. Wagner, L. Devillers, L. Vidrascu, N. Amir, L. Kessous, and V. Aharonson The relevance of feature type for the automatic classification of emotional user states: Low level descriptors and functionals INTERSPEECH 2007 2007 1 4 June
- (2007) INTERSPEECH 2007 , pp. 1-4
- Schuller, B.¹ Batliner, A.² Seppi, D.³ Steidl, S.⁴ Vogt, T.⁵ Wagner, J.⁶ Devillers, L.⁷ Vidrascu, L.⁸ Amir, N.⁹ Kessous, L.¹⁰ Aharonson, V.¹¹

25
- 84920277540
- The INTERSPEECH 2009 Emotion Challenge
- Feb
- Schuller, B.; Steidl, S.; Batliner, A.; Feb 2009. The INTERSPEECH 2009 Emotion Challenge. In: Proceedings of Interspeech, Brighton, pp. 1-4.
- (2009) Proceedings of Interspeech, Brighton , pp. 1-4
- Schuller, B.¹ Steidl, S.² Batliner, A.³

26
- 33947620115
- Hierarchical structures of neural networks for phoneme recognition
- Toulouse
- Schwarz, P.; Matejka, P.; Cernocky, J.; 2006. Hierarchical structures of neural networks for phoneme recognition. In: Proceedings of ICASSP 2006, Toulouse, pp. 325-328.
- (2006) Proceedings of ICASSP 2006 , pp. 325-328
- Schwarz, P.¹ Matejka, P.² Cernocky, J.³

27
- 84867226105
- Patterns, prototypes, performance: Classifying emotional user states
- Seppi, D.; Batliner, A.; Schuller, B.; Steidl, S.; Vogt, T.; Wagner, J.; Devillers, L.; Vidrascu, L.; Amir, N.; Aharonson, V.; 2008. Patterns, prototypes, performance: classifying emotional user states. In: Proceedings of Interspeech.
- (2008) Proceedings of Interspeech
- Seppi, D.¹ Batliner, A.² Schuller, B.³ Steidl, S.⁴ Vogt, T.⁵ Wagner, J.⁶ Devillers, L.⁷ Vidrascu, L.⁸ Amir, N.⁹ Aharonson, V.¹⁰

28
- 70450188723
- Does session variability compensation in speaker recognition model intrinsic variation under mismatched conditions?
- E. Shriberg, S. Kajarekar, and N. Scheffer Does session variability compensation in speaker recognition model intrinsic variation under mismatched conditions? Interspeech Brighton 2009
- (2009) Interspeech Brighton
- Shriberg, E.¹ Kajarekar, S.² Scheffer, N.³

29
- 79952014572
- Automatic classification of emotion-related user states in spontaneous children's speech
- Bd. 28, ISBN 978-3-8325-2145-5, 1-260 (January)
- Steidl, S.; 2009. Automatic classification of emotion-related user states in spontaneous children's speech. Studien zur Mustererkennung, Bd. 28, ISBN 978-3-8325-2145-5, 1-260 (January).
- (2009) Studien Zur Mustererkennung
- Steidl, S.¹

30
- 85009275225
- Approaches to language identification using Gaussian Mixture Models and shifted delta cepstral features
- Torres-Carrasquillo, P.; Singer, E.; Kohler, M.; Greene, R.; Reynolds, D.; Deller Jr., J.R.; 2002. Approaches to language identification using Gaussian Mixture Models and shifted delta cepstral features. In: Seventh International Conference on Spoken Language Processing.
- (2002) Seventh International Conference on Spoken Language Processing
- Torres-Carrasquillo, P.¹ Singer, E.² Kohler, M.³ Greene, R.⁴ Reynolds, D.⁵ Deller Jr., J.R.⁶

31
- 4544315904
- A state of the art review on emotional speech databases
- Ververidis, D.; Kotropoulos, C.; 2003. A state of the art review on emotional speech databases. In: Proceedings of 1st Richmedia Conference, pp. 109-119.
- (2003) Proceedings of 1st Richmedia Conference , pp. 109-119
- Ververidis, D.¹ Kotropoulos, C.²

32
- 56149115138
- Combining frame and turn-level information for robust recognition of emotions within speech
- Vlasenko, B.; Schuller, B.; Wendemuth, A.; Rigoll, G.; 2007. Combining frame and turn-level information for robust recognition of emotions within speech. In: Proceedings of Interspeech, pp. 2249-2252.
- (2007) Proceedings of Interspeech , pp. 2249-2252
- Vlasenko, B.¹ Schuller, B.² Wendemuth, A.³ Rigoll, G.⁴

33
- 60749097551
- Young, S.; Evermann, G.; Gales, M.; Kershaw, D.; Moore, G.; Odell, J.; Ollason, D.; Povey, D.; Valtchev, V.; Woodland, P.; 2006. The HTK Book version 3.4.
- (2006) The HTK Book Version 3.4
- Young, S.¹ Evermann, G.² Gales, M.³ Kershaw, D.⁴ Moore, G.⁵ Odell, J.⁶ Ollason, D.⁷ Povey, D.⁸ Valtchev, V.⁹ Woodland, P.¹⁰

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.