메뉴 건너뛰기




Volumn 14, Issue 2, 2011, Pages 77-87

Affective speaker state analysis in the presence of reverberation

Author keywords

Affective computing; Model adaptation; Reverberation; Speaker classification

Indexed keywords

ACOUSTIC ENVIRONMENT; AFFECTIVE COMPUTING; AFFECTIVE SPEECH; CLASSIFICATION SYSTEM; DATA SETS; FEATURE TYPES; MODEL ADAPTATION; PUBLIC ROOMS; SPEAKER ADAPTATION; SPEAKER CLASSIFICATION; SPEECH DATA; STATE ANALYSIS;

EID: 80052565773     PISSN: 13812416     EISSN: 15728110     Source Type: Journal    
DOI: 10.1007/s10772-011-9090-8     Document Type: Article
Times cited : (13)

References (45)
  • 1
    • 21544466181 scopus 로고    scopus 로고
    • ASR for emotional speech: Clarifying the issues and enhancing performance
    • DOI 10.1016/j.neunet.2005.03.008, PII S0893608005000419, Emotion and Brain
    • Athanaselis, T., Bakamidis, S., Dologlu, I., Cowie, R., Douglas-Cowie, E., & Cox, C. (2005). ASR for emotional speech: clarifying the issues and enhancing performance. Neural Networks, 18, 437-444. (Pubitemid 40922650)
    • (2005) Neural Networks , vol.18 , Issue.4 , pp. 437-444
    • Athanaselis, T.1    Bakamidis, S.2    Dologlou, I.3    Cowie, R.4    Douglas-Cowie, E.5    Cox, C.6
  • 5
    • 85031495833 scopus 로고    scopus 로고
    • A new audacity feature: Room objective acoustical parameters calculation module
    • Campanini, S., & Farina, A. (2009). A new audacity feature: room objective acoustical parameters calculation module. In Proc. Linux audio conference.
    • (2009) Proc. Linux audio conference
    • Campanini, S.1    Farina, A.2
  • 6
    • 84898877448 scopus 로고    scopus 로고
    • Semantic audio-visual data fusion for automatic emotion recognition
    • Eurosis
    • Datcu, D., & Rothkrantz, L. J. M. (2008). Semantic audio-visual data fusion for automatic emotion recognition. In Proc. Euromedia 2008, Eurosis.
    • (2008) Proc. Euromedia 2008
    • Datcu, D.1    Rothkrantz, L.J.M.2
  • 7
    • 38049036813 scopus 로고    scopus 로고
    • On the necessity and feasibility of detecting a driver's emotional state while driving
    • A. Paiva, R. Prada, & R. W. Picard (Eds.), Berlin: Springer
    • Grimm, M., Kroschel, K., Harris, H., Nass, C., Schuller, B., Rigoll, G., & Moosmayr, T. (2007). On the necessity and feasibility of detecting a driver's emotional state while driving. In A. Paiva, R. Prada, & R. W. Picard (Eds.), Affective computing and intelligent interaction (pp. 126-138). Berlin: Springer.
    • (2007) Affective Computing and Intelligent Interaction , pp. 126-138
    • Grimm, M.1    Kroschel, K.2    Harris, H.3    Nass, C.4    Schuller, B.5    Rigoll, G.6    Moosmayr, T.7
  • 8
    • 33646055191 scopus 로고    scopus 로고
    • Using artificially reverberated training data in distanttalking ASR
    • Text, speech and dialogue Berlin: Springer
    • Haderlein, T., Nöth, E., Herbordt,W., Kellermann,W., & Niemann, H. (2005). Using artificially reverberated training data in distanttalking ASR. In LNCS: Vol. 3658. Text, speech and dialogue (pp. 226-233). Berlin: Springer.
    • (2005) LNCS , vol.3658 , pp. 226-233
    • Haderlein, T.1    Nöth, E.2    Herbordt, W.3    Kellermann, W.4    Niemann, H.5
  • 10
    • 70350291535 scopus 로고    scopus 로고
    • Vocal emotion recognition in five native languages of assam using new wavelet features
    • Kandali, A. B., Routray, A., & Basu, T. K. (2009). Vocal emotion recognition in five native languages of assam using new wavelet features. International Journal of Speech Technology, 12, 1-13.
    • (2009) International Journal of Speech Technology , vol.12 , pp. 1-13
    • Kandali, A.B.1    Routray, A.2    Basu, T.K.3
  • 11
    • 33746628988 scopus 로고    scopus 로고
    • Robust emotion recognition feature, frequency range of meaningful signal
    • DOI 10.1109/ROMAN.2005.1513856, 1513856, 14th IEEE Workshop on Robot and Human Interactive Communication, RO-MAN 2005
    • Kim, E. H., Hyun, K. H.,& Kwak, Y. K. (2005). Robust emotion recognition feature, frequency range of meaningful signal. In Proc. IEEE international workshop on robots and human interactive communication (RO-MAN) (pp. 667-671), Nashville, USA. (Pubitemid 44144459)
    • (2005) Proceedings - IEEE International Workshop on Robot and Human Interactive Communication , vol.2005 , pp. 667-671
    • Kim, E.H.1    Hyun, K.H.2    Kwak, Y.K.3
  • 13
    • 33947615772 scopus 로고    scopus 로고
    • Robust estimation of voice quality parameters under real world disturbances
    • Toulouse
    • Lugger, M., Yang, B., & Wokurek, W. (2006). Robust estimation of voice quality parameters under real world disturbances. In Proc. ICASSP (pp. 1097-1100), Toulouse.
    • (2006) Proc. ICASSP , pp. 1097-1100
    • Lugger, M.1    Yang, B.2    Wokurek, W.3
  • 21
    • 0028210639 scopus 로고
    • Intelligibility of conversational and clear speech in noise and reverberation for listeners with normal and impaired hearing
    • Payton, K. L., Uchanski, R. M., & Braida, L. D. (1994). Intelligibility of conversational and clear speech in noise and reverberation for listeners with normal and impaired hearing. Journal of the Acoustical Society of America, 95, 1581-1592. (Pubitemid 24085759)
    • (1994) Journal of the Acoustical Society of America , vol.95 , Issue.3 , pp. 1581-1592
    • Payton, K.L.1    Uchanski, R.M.2    Braida, L.D.3
  • 25
    • 33646758175 scopus 로고    scopus 로고
    • Metaclassifiers in acoustic and linguistic feature fusion-based affect recognition
    • Philadelphia
    • Schuller, B., Jiménez Villar, R., Rigoll, G., & Lang, M. (2005). Metaclassifiers in acoustic and linguistic feature fusion-based affect recognition. In Proc. ICASSP (Vol. I, pp. 325-328), Philadelphia.
    • (2005) Proc. ICASSP , vol.1 , pp. 325-328
    • Schuller, B.1    Jiménez Villar, R.2    Rigoll, G.3    Lang, M.4
  • 27
    • 44949160056 scopus 로고    scopus 로고
    • Recognition of interest in human conversational speech
    • Pittsburgh
    • Schuller, B., Köhler, N., Müller, R., & Rigoll, G. (2006b). Recognition of interest in human conversational speech. In Proc. interspeech (pp. 793-796), Pittsburgh.
    • (2006) Proc. Interspeech , pp. 793-796
    • Schuller, B.1    Köhler, N.2    Müller, R.3    Rigoll, G.4
  • 29
    • 34547549142 scopus 로고    scopus 로고
    • Towards more reality in the recognition of emotional speech
    • Honolulu
    • Schuller, B., Seppi, D., Batliner, A., Meier, A., & Steidl, S. (2007). Towards more reality in the recognition of emotional speech. In Proc. ICASSP (pp. 941-944), Honolulu.
    • (2007) Proc. ICASSP , pp. 941-944
    • Schuller, B.1    Seppi, D.2    Batliner, A.3    Meier, A.4    Steidl, S.5
  • 30
    • 84867198846 scopus 로고    scopus 로고
    • Detection of security related affect and behaviour in passenger transport
    • Brisbane
    • Schuller, B., Wimmer, M., Arsic, D., Moosmayr, T., & Rigoll, G. (2008a). Detection of security related affect and behaviour in passenger transport. In Proc. interspeech (pp. 265-268), Brisbane.
    • (2008) Proc. Interspeech , pp. 265-268
    • Schuller, B.1    Wimmer, M.2    Arsic, D.3    Moosmayr, T.4    Rigoll, G.5
  • 31
    • 51449104640 scopus 로고    scopus 로고
    • Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space
    • Las Vegas
    • Schuller, B., Wimmer, M., Mösenlechner, L., Kern, C., Arsic, D., & Rigoll, G. (2008b). Brute-forcing hierarchical functionals for paralinguistics: a waste of feature space. In Proc. ICASSP (pp. 4501-4504), Las Vegas.
    • (2008) Proc. ICASSP , pp. 4501-4504
    • Schuller, B.1    Wimmer, M.2    Mösenlechner, L.3    Kern, C.4    Arsic, D.5    Rigoll, G.6
  • 32
    • 70349292240 scopus 로고    scopus 로고
    • Being bored? Recognising natural interest by extensive audiovisual integration for real-life application
    • Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior
    • Schuller, B., Müller, R., Eyben, F., Gast, J., Hörnler, B., Wöllmer, M., Rigoll, G., Höthker, A., & Konosu, H. (2009). Being bored? Recognising natural interest by extensive audiovisual integration for real-life application. Image and Vision Computing Journal, 27, 1760-1774. Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior.
    • (2009) Image and Vision Computing Journal , vol.27 , pp. 1760-1774
    • Schuller, B.1    Müller, R.2    Eyben, F.3    Gast, J.4    Hörnler, B.5    Wöllmer, M.6    Rigoll, G.7    Höthker, A.8    Konosu, H.9
  • 38
    • 78149484045 scopus 로고    scopus 로고
    • Speech emotion analysis in noisy real world environment
    • Istanbul, Turkey
    • Tawari, A., & Trivedi, M. (2010). Speech emotion analysis in noisy real world environment. In Proc. ICPR (pp. 4605-4608), Istanbul, Turkey.
    • (2010) Proc. ICPR , pp. 4605-4608
    • Tawari, A.1    Trivedi, M.2
  • 39
    • 84862624179 scopus 로고    scopus 로고
    • Fast sequential floating forward selection applied to emotional speech features estimated on des and susas data collection
    • Florence
    • Ververidis, D., & Kotropoulos, C. (2006). Fast sequential floating forward selection applied to emotional speech features estimated on des and susas data collection. In Proc. European signal processing conf. (EUSIPCO 2006), Florence.
    • (2006) Proc. European Signal Processing Conf. (EUSIPCO 2006)
    • Ververidis, D.1    Kotropoulos, C.2
  • 41
    • 70349203870 scopus 로고    scopus 로고
    • Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks
    • Taipei, Taiwan
    • Wöllmer, M., Eyben, F., Keshet, J., Graves, A., Schuller, B., & Rigoll, G. (2009). Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks. In Proc. of ICASSP (pp. 3949-3952), Taipei, Taiwan.
    • (2009) Proc. of ICASSP , pp. 3949-3952
    • Wöllmer, M.1    Eyben, F.2    Keshet, J.3    Graves, A.4    Schuller, B.5    Rigoll, G.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.