SCOPUS Information Retrieval Platform

International Journal of Speech Technology

Volume 14, Issue 2, 2011, Pages 77-87

Affective speaker state analysis in the presence of reverberation

(1) Schuller, Björn a

a TECHNICAL UNIVERSITY OF MUNICH (Germany)

Author keywords

Affective computing; Model adaptation; Reverberation; Speaker classification

Indexed keywords

ACOUSTIC ENVIRONMENT; AFFECTIVE COMPUTING; AFFECTIVE SPEECH; CLASSIFICATION SYSTEM; DATA SETS; FEATURE TYPES; MODEL ADAPTATION; PUBLIC ROOMS; SPEAKER ADAPTATION; SPEAKER CLASSIFICATION; SPEECH DATA; STATE ANALYSIS; IMPULSE RESPONSE; REVERBERATION

EID: 80052565773 PISSN: 13812416 EISSN: 15728110 Source Type: Journal
DOI: 10.1007/s10772-011-9090-8 Document Type: Article

Times cited: (13)

References (45)

1
- 21544466181
- ASR for emotional speech: Clarifying the issues and enhancing performance
- DOI 10.1016/j.neunet.2005.03.008, PII S0893608005000419, Emotion and Brain
- Athanaselis, T., Bakamidis, S., Dologlou, I., Cowie, R., Douglas-Cowie, E., & Cox, C. (2005). ASR for emotional speech: clarifying the issues and enhancing performance. Neural Networks, 18, 437-444. (Pubitemid 40922650)
- (2005) Neural Networks , vol.18 , Issue.4 , pp. 437-444
- Athanaselis, T.¹ Bakamidis, S.² Dologlou, I.³ Cowie, R.⁴ Douglas-Cowie, E.⁵ Cox, C.⁶

2
- 34547505647
- Combining efforts for improving automatic classification of emotional user states
- Ljubljana
- Batliner, A., Steidl, S., Schuller, B., Seppi, D., Laskowski, K., Vogt, T., Devillers, L., Vidrascu, L., Amir, N., Kessous, L., & Aharonson, V. (2006). Combining efforts for improving automatic classification of emotional user states. In Proc. IS-LTC 2006 (pp. 240-245), Ljubljana.
- (2006) Proc. IS-LTC 2006 , pp. 240-245
- Batliner, A.¹ Steidl, S.² Schuller, B.³ Seppi, D.⁴ Laskowski, K.⁵ Vogt, T.⁶ Devillers, L.⁷ Vidrascu, L.⁸ Amir, N.⁹ Kessous, L.¹⁰ Aharonson, V.¹¹

3
- 77955412961
- Whodunnit-searching for the most important feature types signalling emotional user states in speech
- Batliner, A., Steidl, S., Schuller, B., Seppi, D., Vogt, T., Wagner, J., Devillers, L., Vidrascu, L., Aharonson, V., & Amir, N. (2011). Whodunnit-searching for the most important feature types signalling emotional user states in speech. Computer Speech and Language, 25, 4-28.
- (2011) Computer Speech and Language , vol.25 , pp. 4-28
- Batliner, A.¹ Steidl, S.² Schuller, B.³ Seppi, D.⁴ Vogt, T.⁵ Wagner, J.⁶ Devillers, L.⁷ Vidrascu, L.⁸ Aharonson, V.⁹ Amir, N.¹⁰

4
- 33745202280
- A database of German emotional speech
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., & Weiss, B. (2005). A database of German emotional speech. In Proc. Interspeech (pp. 1517-1520), Lisbon. (Pubitemid 43908362)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 1517-1520
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeier, W.⁴ Weiss, B.⁵

5
- 85031495833
- A new audacity feature: Room objective acoustical parameters calculation module
- Campanini, S., & Farina, A. (2009). A new audacity feature: room objective acoustical parameters calculation module. In Proc. Linux audio conference.
- (2009) Proc. Linux audio conference
- Campanini, S.¹ Farina, A.²

6
- 84898877448
- Semantic audio-visual data fusion for automatic emotion recognition
- Eurosis
- Datcu, D., & Rothkrantz, L. J. M. (2008). Semantic audio-visual data fusion for automatic emotion recognition. In Proc. Euromedia 2008, Eurosis.
- (2008) Proc. Euromedia 2008
- Datcu, D.¹ Rothkrantz, L.J.M.²

7
- 38049036813
- On the necessity and feasibility of detecting a driver's emotional state while driving
- A. Paiva, R. Prada, & R. W. Picard (Eds.), Berlin: Springer
- Grimm, M., Kroschel, K., Harris, H., Nass, C., Schuller, B., Rigoll, G., & Moosmayr, T. (2007). On the necessity and feasibility of detecting a driver's emotional state while driving. In A. Paiva, R. Prada, & R. W. Picard (Eds.), Affective computing and intelligent interaction (pp. 126-138). Berlin: Springer.
- (2007) Affective Computing and Intelligent Interaction , pp. 126-138
- Grimm, M.¹ Kroschel, K.² Harris, H.³ Nass, C.⁴ Schuller, B.⁵ Rigoll, G.⁶ Moosmayr, T.⁷

8
- 33646055191
- Using artificially reverberated training data in distant-talking ASR
- Text, speech and dialogue Berlin: Springer
- Haderlein, T., Nöth, E., Herbordt, W., Kellermann, W., & Niemann, H. (2005). Using artificially reverberated training data in distant-talking ASR. In LNCS: Vol. 3658. Text, speech and dialogue (pp. 226-233). Berlin: Springer.
- (2005) LNCS , vol.3658 , pp. 226-233
- Haderlein, T.¹ Nöth, E.² Herbordt, W.³ Kellermann, W.⁴ Niemann, H.⁵

9
- 0004060921
- PhD thesis Hamilton, Waikato University, Department of Computer Science
- Hall, M. A. (1998). Correlation-based feature selection for machine learning. PhD thesis, Hamilton, Waikato University, Department of Computer Science.
- (1998) Correlation-Based Feature Selection for Machine Learning
- Hall, M.A.¹

10
- 70350291535
- Vocal emotion recognition in five native languages of Assam using new wavelet features
- Kandali, A. B., Routray, A., & Basu, T. K. (2009). Vocal emotion recognition in five native languages of Assam using new wavelet features. International Journal of Speech Technology, 12, 1-13.
- (2009) International Journal of Speech Technology , vol.12 , pp. 1-13
- Kandali, A.B.¹ Routray, A.² Basu, T.K.³

11
- 33746628988
- Robust emotion recognition feature, frequency range of meaningful signal
- DOI 10.1109/ROMAN.2005.1513856, 1513856, 14th IEEE Workshop on Robot and Human Interactive Communication, RO-MAN 2005
- Kim, E. H., Hyun, K. H., & Kwak, Y. K. (2005). Robust emotion recognition feature, frequency range of meaningful signal. In Proc. IEEE international workshop on robots and human interactive communication (RO-MAN) (pp. 667-671), Nashville, USA. (Pubitemid 44144459)
- (2005) Proceedings - IEEE International Workshop on Robot and Human Interactive Communication , vol.2005 , pp. 667-671
- Kim, E.H.¹ Hyun, K.H.² Kwak, Y.K.³

12
- 33748848302
- Robust feature extraction for mobile-based speech emotion recognition system
- Berlin: Springer
- Lee, K. K., Cho, Y. H., & Park, K. S. (2006). Robust feature extraction for mobile-based speech emotion recognition system. In Lecture notes in control and information sciences. Intelligent computing in signal processing and pattern recognition (pp. 470-477). Berlin: Springer.
- (2006) Lecture Notes in Control and Information Sciences. Intelligent Computing in Signal Processing and Pattern Recognition , pp. 470-477
- Lee, K.K.¹ Cho, Y.H.² Park, K.S.³

13
- 33947615772
- Robust estimation of voice quality parameters under real world disturbances
- Toulouse
- Lugger, M., Yang, B., & Wokurek, W. (2006). Robust estimation of voice quality parameters under real world disturbances. In Proc. ICASSP (pp. 1097-1100), Toulouse.
- (2006) Proc. ICASSP , pp. 1097-1100
- Lugger, M.¹ Yang, B.² Wokurek, W.³

14
- 57549115990
- Bimodal person-dependent emotion recognition comparison of feature level and decision level information fusion
- New York: ACM
- Mansoorizadeh, M., & Charkari, N. M. (2008). Bimodal person-dependent emotion recognition comparison of feature level and decision level information fusion. In Proc. 1st international conference on pervasive technologies related to assistive environments (pp. 1-4). New York: ACM.
- (2008) Proc. 1st International Conference on Pervasive Technologies Related to Assistive Environments , pp. 1-4
- Mansoorizadeh, M.¹ Charkari, N.M.²

15
- 84922798491
- The eNTERFACE'05 audio-visual emotion database
- Martin, O., Kotsia, I., Macq, B., & Pitas, I. (2006). The eNTERFACE'05 audio-visual emotion database. In Proc. IEEE workshop on multimedia database management.
- (2006) Proc. IEEE Workshop on Multimedia Database Management
- Martin, O.¹ Kotsia, I.² Macq, B.³ Pitas, I.⁴

16
- 60349106688
- Combined speech-emotion recognition for spoken human-computer interfaces
- Dubai, United Emirates. New York: IEEE Press
- Meng, H., Pittermann, J., Pittermann, A., & Minker, W. (2007). Combined speech-emotion recognition for spoken human-computer interfaces. In Proc. international conference on signal processing and communications (pp. 1179-1182), Dubai, United Emirates. New York: IEEE Press.
- (2007) Proc. International Conference on Signal Processing and Communications , pp. 1179-1182
- Meng, H.¹ Pittermann, J.² Pittermann, A.³ Minker, W.⁴

17
- 34547507403
- Speech dereverberation
- EURASIP
- Naylor, P. A., & Gaubitch, N. D. (2005). Speech dereverberation. In Proc. 2005 international workshop on acoustic echo and noise control, EURASIP.
- (2005) Proc. 2005 International Workshop on Acoustic Echo and Noise Control
- Naylor, P.A.¹ Gaubitch, N.D.²

18
- 80051618981
- London: Springer
- Naylor, P., & Gaubitch, N. D. (2010). Speech dereverberation. London: Springer.
- (2010) Speech Dereverberation
- Naylor, P.¹ Gaubitch, N.D.²

19
- 0018494073
- Invertibility of a room impulse response
- Neely, S. T., & Allen, J. B. (1979). Invertibility of a room impulse response. Journal of the Acoustical Society of America, 66, 165-169.
- (1979) Journal of the Acoustical Society of America , vol.66 , pp. 165-169
- Neely, S.T.¹ Allen, J.B.²

20
- 59049092031
- Evidence theory-based multimodal emotion recognition
- Berlin: Springer
- Paleari, M., Benmokhtar, R., & Huet, B. (2008). Evidence theory-based multimodal emotion recognition. In Proc. 15th international multimedia modeling conference on advances in multimedia modeling (pp. 435-446). Berlin: Springer.
- (2008) Proc. 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling , pp. 435-446
- Paleari, M.¹ Benmokhtar, R.² Huet, B.³

21
- 0028210639
- Intelligibility of conversational and clear speech in noise and reverberation for listeners with normal and impaired hearing
- Payton, K. L., Uchanski, R. M., & Braida, L. D. (1994). Intelligibility of conversational and clear speech in noise and reverberation for listeners with normal and impaired hearing. Journal of the Acoustical Society of America, 95, 1581-1592. (Pubitemid 24085759)
- (1994) Journal of the Acoustical Society of America , vol.95 , Issue.3 , pp. 1581-1592
- Payton, K.L.¹ Uchanski, R.M.² Braida, L.D.³

22
- 77950021520
- Emotion recognition and adaptation in spoken dialogue systems
- Pittermann, J., Pittermann, A., & Minker, W. (2010). Emotion recognition and adaptation in spoken dialogue systems. International Journal of Speech Technology, 13, 49-60.
- (2010) International Journal of Speech Technology , vol.13 , pp. 49-60
- Pittermann, J.¹ Pittermann, A.² Minker, W.³

23
- 79952896080
- Speaker recognition under stressed condition
- Raja, G. S., & Dandapat, S. (2010). Speaker recognition under stressed condition. International Journal of Speech Technology, 13, 141-161.
- (2010) International Journal of Speech Technology , vol.13 , pp. 141-161
- Raja, G.S.¹ Dandapat, S.²

24
- 70450168321
- Towards responsive sensitive artificial listeners
- Bellagio
- Schröder, M., Cowie, R., Heylen, D., Pantic, M., Pelachaud, C., & Schuller, B. (2008). Towards responsive sensitive artificial listeners. In Proc. 4th international workshop on human-computer conversation, Bellagio.
- (2008) Proc. 4th International Workshop on Human-Computer Conversation
- Schröder, M.¹ Cowie, R.² Heylen, D.³ Pantic, M.⁴ Pelachaud, C.⁵ Schuller, B.⁶

25
- 33646758175
- Metaclassifiers in acoustic and linguistic feature fusion-based affect recognition
- Philadelphia
- Schuller, B., Jiménez Villar, R., Rigoll, G., & Lang, M. (2005). Metaclassifiers in acoustic and linguistic feature fusion-based affect recognition. In Proc. ICASSP (Vol. I, pp. 325-328), Philadelphia.
- (2005) Proc. ICASSP , vol.1 , pp. 325-328
- Schuller, B.¹ Jiménez Villar, R.² Rigoll, G.³ Lang, M.⁴

26
- 78149472083
- Emotion recognition in the noise applying large acoustic feature sets
- Dresden
- Schuller, B., Arsić, D., Wallhoff, F., & Rigoll, G. (2006a). Emotion recognition in the noise applying large acoustic feature sets. In Proc. speech prosody 2006, Dresden.
- (2006) Proc. Speech Prosody 2006
- Schuller, B.¹ Arsić, D.² Wallhoff, F.³ Rigoll, G.⁴

27
- 44949160056
- Recognition of interest in human conversational speech
- Pittsburgh
- Schuller, B., Köhler, N., Müller, R., & Rigoll, G. (2006b). Recognition of interest in human conversational speech. In Proc. interspeech (pp. 793-796), Pittsburgh.
- (2006) Proc. Interspeech , pp. 793-796
- Schuller, B.¹ Köhler, N.² Müller, R.³ Rigoll, G.⁴

28
- 34247624725
- Evolutionary feature generation in speech emotion recognition
- DOI 10.1109/ICME.2006.262500, 4036522, 2006 IEEE International Conference on Multimedia and Expo, ICME 2006 - Proceedings
- Schuller, B., Reiter, S., & Rigoll, G. (2006c). Evolutionary feature generation in speech emotion recognition. In Proc. international conference on multimedia and Expo ICME 2006 (pp. 5-8), Toronto, Canada. (Pubitemid 46679640)
- (2006) 2006 IEEE International Conference on Multimedia and Expo, ICME 2006 - Proceedings , vol.2006 , pp. 5-8
- Schuller, B.¹ Reiter, S.² Rigoll, G.³

29
- 34547549142
- Towards more reality in the recognition of emotional speech
- Honolulu
- Schuller, B., Seppi, D., Batliner, A., Meier, A., & Steidl, S. (2007). Towards more reality in the recognition of emotional speech. In Proc. ICASSP (pp. 941-944), Honolulu.
- (2007) Proc. ICASSP , pp. 941-944
- Schuller, B.¹ Seppi, D.² Batliner, A.³ Meier, A.⁴ Steidl, S.⁵

30
- 84867198846
- Detection of security related affect and behaviour in passenger transport
- Brisbane
- Schuller, B., Wimmer, M., Arsic, D., Moosmayr, T., & Rigoll, G. (2008a). Detection of security related affect and behaviour in passenger transport. In Proc. interspeech (pp. 265-268), Brisbane.
- (2008) Proc. Interspeech , pp. 265-268
- Schuller, B.¹ Wimmer, M.² Arsic, D.³ Moosmayr, T.⁴ Rigoll, G.⁵

31
- 51449104640
- Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space
- Las Vegas
- Schuller, B., Wimmer, M., Mösenlechner, L., Kern, C., Arsic, D., & Rigoll, G. (2008b). Brute-forcing hierarchical functionals for paralinguistics: a waste of feature space. In Proc. ICASSP (pp. 4501-4504), Las Vegas.
- (2008) Proc. ICASSP , pp. 4501-4504
- Schuller, B.¹ Wimmer, M.² Mösenlechner, L.³ Kern, C.⁴ Arsic, D.⁵ Rigoll, G.⁶

32
- 70349292240
- Being bored? Recognising natural interest by extensive audiovisual integration for real-life application
- Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior
- Schuller, B., Müller, R., Eyben, F., Gast, J., Hörnler, B., Wöllmer, M., Rigoll, G., Höthker, A., & Konosu, H. (2009). Being bored? Recognising natural interest by extensive audiovisual integration for real-life application. Image and Vision Computing Journal, 27, 1760-1774. Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior.
- (2009) Image and Vision Computing Journal , vol.27 , pp. 1760-1774
- Schuller, B.¹ Müller, R.² Eyben, F.³ Gast, J.⁴ Hörnler, B.⁵ Wöllmer, M.⁶ Rigoll, G.⁷ Höthker, A.⁸ Konosu, H.⁹

33
- 79954999224
- The INTERSPEECH 2010 paralinguistic challenge
- Makuhari, Japan
- Schuller, B., Steidl, S., Batliner, A., Burkhardt, F., Devillers, L., Müller, C., & Narayanan, S. (2010a). The INTERSPEECH 2010 paralinguistic challenge. In Proc. INTERSPEECH 2010 (pp. 2794-2797), Makuhari, Japan.
- (2010) Proc. INTERSPEECH 2010 , pp. 2794-2797
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Burkhardt, F.⁴ Devillers, L.⁵ Müller, C.⁶ Narayanan, S.⁷

34
- 80052606383
- Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge
- Schuller, B., Steidl, S., Batliner, A., & Seppi, D. (2010b). Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge. Speech Communication. Special issue on "Sensing emotion and affect-facing realism in speech processing".
- (2010) Speech Communication. Special Issue on "Sensing Emotion and Affect-Facing Realism in Speech Processing"
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Seppi, D.⁴

35
- 80053925819
- Cross-corpus acoustic emotion recognition: Variances and strategies
- Schuller, B., Vlasenko, B., Eyben, F., Wöllmer, M., Stuhlsatz, A., Wendemuth, A., & Rigoll, G. (2010c). Cross-corpus acoustic emotion recognition: variances and strategies. IEEE Transactions on Affective Computing, 1.
- (2010) IEEE Transactions on Affective Computing , vol.1
- Schuller, B.¹ Vlasenko, B.² Eyben, F.³ Wöllmer, M.⁴ Stuhlsatz, A.⁵ Wendemuth, A.⁶ Rigoll, G.⁷

36
- 80052565871
- A cognitive science reasoning in recognition of emotions in audio-visual speech
- Slavova, V., Verhelst, W., & Sahli, H. (2008). A cognitive science reasoning in recognition of emotions in audio-visual speech. International Journal Information Technologies and Knowledge, 2, 324-334.
- (2008) International Journal Information Technologies and Knowledge , vol.2 , pp. 324-334
- Slavova, V.¹ Verhelst, W.² Sahli, H.³

37
- 77951457701
- On the impact of children's emotional speech on acoustic and language models
- Steidl, S., Batliner, A., Seppi, D., & Schuller, B. (2010). On the impact of children's emotional speech on acoustic and language models. EURASIP Journal on Audio, Speech, and Music Processing, 2010, 783954.
- (2010) EURASIP Journal on Audio, Speech, and Music Processing, 2010 , pp. 783954
- Steidl, S.¹ Batliner, A.² Seppi, D.³ Schuller, B.⁴

38
- 78149484045
- Speech emotion analysis in noisy real world environment
- Istanbul, Turkey
- Tawari, A., & Trivedi, M. (2010). Speech emotion analysis in noisy real world environment. In Proc. ICPR (pp. 4605-4608), Istanbul, Turkey.
- (2010) Proc. ICPR , pp. 4605-4608
- Tawari, A.¹ Trivedi, M.²

39
- 84862624179
- Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collection
- Florence
- Ververidis, D., & Kotropoulos, C. (2006). Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collection. In Proc. European signal processing conf. (EUSIPCO 2006), Florence.
- (2006) Proc. European Signal Processing Conf. (EUSIPCO 2006)
- Ververidis, D.¹ Kotropoulos, C.²

40
- 0003957032
- (2nd edn.). San Francisco: Morgan Kaufmann
- Witten, I. H., & Frank, E. (2005). Data mining: practical machine learning tools and techniques (2nd edn.). San Francisco: Morgan Kaufmann.
- (2005) Data Mining: Practical Machine Learning Tools and Techniques
- Witten, I.H.¹ Frank, E.²

41
- 70349203870
- Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks
- Taipei, Taiwan
- Wöllmer, M., Eyben, F., Keshet, J., Graves, A., Schuller, B., & Rigoll, G. (2009). Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks. In Proc. of ICASSP (pp. 3949-3952), Taipei, Taiwan.
- (2009) Proc. of ICASSP , pp. 3949-3952
- Wöllmer, M.¹ Eyben, F.² Keshet, J.³ Graves, A.⁴ Schuller, B.⁵ Rigoll, G.⁶

42
- 38049049176
- A study of speech emotion recognition and its application to mobile services
- Berlin: Springer
- Yoon, W. J., Cho, Y. H., & Park, K. S. (2007). A study of speech emotion recognition and its application to mobile services. In Lecture notes in computer science. Ubiquitous intelligence and computing (pp. 758-766). Berlin: Springer.
- (2007) Lecture Notes in Computer Science. Ubiquitous Intelligence and Computing , pp. 758-766
- Yoon, W.J.¹ Cho, Y.H.² Park, K.S.³

43
- 34247604548
- Emotion recognition from noisy speech
- DOI 10.1109/ICME.2006.262865, 4036934, 2006 IEEE International Conference on Multimedia and Expo, ICME 2006 - Proceedings
- You, M., Chen, C., Bu, J., Liu, J., & Tao, J. (2006). Emotion recognition from noisy speech. In Proc. ICME (pp. 1653-1656), Toronto. (Pubitemid 46680050)
- (2006) 2006 IEEE International Conference on Multimedia and Expo, ICME 2006 - Proceedings , vol.2006 , pp. 1653-1656
- You, M.¹ Chen, C.² Bu, J.³ Liu, J.⁴ Tao, J.⁵

44
- 67650220433
- Manifolds based emotion recognition in speech
- You, M., Chen, C., Bu, J., Liu, J., & Tao, J. (2007). Manifolds based emotion recognition in speech. Computational Linguistics and Chinese Language Processing, 12, 49-64.
- (2007) Computational Linguistics and Chinese Language Processing , vol.12 , pp. 49-64
- You, M.¹ Chen, C.² Bu, J.³ Liu, J.⁴ Tao, J.⁵

45
- 57149144228
- A survey of affect recognition methods: Audio, visual, and spontaneous expressions
- Zeng, Z., Pantic, M., Roisman, G. I., & Huang, T. S. (2009). A survey of affect recognition methods: audio, visual, and spontaneous expressions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(1), 39-58.
- (2009) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.31 , Issue.1 , pp. 39-58
- Zeng, Z.¹ Pantic, M.² Roisman, G.I.³ Huang, T.S.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.