메뉴 건너뛰기




Volumn 27, Issue 1, 2013, Pages 135-150

Speaker state recognition using an HMM-based feature extraction method

Author keywords

Emotion recognition; Hidden Markov Models; Intoxication recognition; Model adaptation; Universal Background Model

Indexed keywords

ACOUSTIC FEATURES; ADAPTATION SCHEME; DETECTION TASKS; EMOTION RECOGNITION; FEATURE EXTRACTION METHODS; GAUSSIAN MIXTURE MODEL; INTOXICATION RECOGNITION; MODEL ADAPTATION; STATE RECOGNITION; STATE-OF-THE-ART SYSTEM; UNIVERSAL BACKGROUND MODEL;

EID: 84867328373     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2012.01.007     Document Type: Article
Times cited : (21)

References (41)
  • 1
    • 38749108114 scopus 로고    scopus 로고
    • Private emotions versus social interaction: A data-driven approach towards analysing emotion in speech
    • Batliner A.; Steidl S.; Hacker C.; and Nöth E. Private emotions versus social interaction: a data-driven approach towards analysing emotion in speech User Model. User-Adapted Interact. 18 2008 175 206
    • (2008) User Model. User-Adapted Interact. , vol.18 , pp. 175-206
    • Batliner, A.1    Steidl, S.2    Hacker, C.3    Nöth, E.4
  • 3
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • Baum L.E.; Petrie T.; Soules G.; and Weiss N. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains Ann. Math. Stat. 41 1970 164 171
    • (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
    • Baum, L.E.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 4
    • 79959826869 scopus 로고    scopus 로고
    • Age and gender recognition based on multiple systems - Early vs. late fusion
    • Bocklet T.; Stemmer G.; Zeißler V.; and Nöth E. Age and gender recognition based on multiple systems - early vs. late fusion INTERSPEECH 2010 2830 2833
    • (2010) INTERSPEECH , pp. 2830-2833
    • Bocklet, T.1    Stemmer, G.2    Zeißler, V.3    Nöth, E.4
  • 5
    • 33947660079 scopus 로고    scopus 로고
    • Discriminative training techniques for acoustic language identification
    • Burget L.; Matějka P.; and Černocký J. Discriminative training techniques for acoustic language identification Proceedings of ICASSP 2006 2006 209 212
    • (2006) Proceedings of ICASSP 2006 , pp. 209-212
    • Burget, L.1    Matějka, P.2    Černocký, J.3
  • 6
    • 65249116503 scopus 로고    scopus 로고
    • Analysis of emotionally salient aspects of fundamental frequency for emotion detection
    • Busso C.; Lee S.; and Narayanan S. Analysis of emotionally salient aspects of fundamental frequency for emotion detection IEEE Trans. Audio Speech Lang. Proc. 17 2009 582 596
    • (2009) IEEE Trans. Audio Speech Lang. Proc. , vol.17 , pp. 582-596
    • Busso, C.1    Lee, S.2    Narayanan, S.3
  • 8
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S.; and Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Trans. Acoust. Speech Signal Proc. 28 1980 357 366
    • (1980) IEEE Trans. Acoust. Speech Signal Proc. , vol.28 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 12
    • 78650977476 scopus 로고    scopus 로고
    • OpenSMILE - The munich versatile and fast open-source audio feature extractor
    • Florence, Italy
    • Eyben F.; Wöllmer M.; and Schuller B. openSMILE - the munich versatile and fast open-source audio feature extractor Proc. ACM Multimedia (MM) Florence, Italy 2010 1459 1462
    • (2010) Proc. ACM Multimedia (MM) , pp. 1459-1462
    • Eyben, F.1    Wöllmer, M.2    Schuller, B.3
  • 13
    • 79959854220 scopus 로고    scopus 로고
    • Automatic speaker age and gender recognition in the car for tailoring dialog and mobile services
    • Feld M.; Burkhardt F.; and Müller C. Automatic speaker age and gender recognition in the car for tailoring dialog and mobile services INTERSPEECH, ISCA 2010 2834 2837
    • (2010) INTERSPEECH, ISCA , pp. 2834-2837
    • Feld, M.1    Burkhardt, F.2    Müller, C.3
  • 14
    • 84865710144 scopus 로고    scopus 로고
    • University of ljubljana system for interspeech 2011 speaker state challenge
    • Gajšek R.; Dobrišek S.; and Mihelič F. University of ljubljana system for interspeech 2011 speaker state challenge INTERSPEECH 2011, ISCA 2011 3297 3300
    • (2011) INTERSPEECH 2011, ISCA , pp. 3297-3300
    • Gajšek, R.1    Dobrišek, S.2    Mihelič, F.3
  • 15
    • 78149489907 scopus 로고    scopus 로고
    • Multi-modal emotion recognition using canonical correlations and acoustic features
    • IEEE Computer Society
    • Gajšek R.; Štruc V.; and Mihelič F. Multi-modal emotion recognition using canonical correlations and acoustic features Int. Conf. on Pattern Recognition 2010 IEEE Computer Society 2010 4133 4136
    • (2010) Int. Conf. on Pattern Recognition 2010 , pp. 4133-4136
    • Gajšek, R.1    Štruc, V.2    Mihelič, F.3
  • 16
    • 79959823933 scopus 로고    scopus 로고
    • Gender and affect recognition based on GMM and GMM-UBM modeling with relevance MAP estimation
    • Gajšek R.; Žibert J.; Justin T.; Štruc V.; Vesnicer B.; and Mihelič F. Gender and affect recognition based on GMM and GMM-UBM modeling with relevance MAP estimation INTERSPEECH-2010 2010 2810 2813
    • (2010) INTERSPEECH-2010 , pp. 2810-2813
    • Gajšek, R.1    Žibert, J.2    Justin, T.3    Štruc, V.4    Vesnicer, B.5    Mihelič, F.6
  • 19
    • 70450177653 scopus 로고    scopus 로고
    • Brno University of Technology System for Interspeech 2009 Emotion Challenge
    • Kockmann M.; Burget L.; and Černocký J. Brno University of Technology System for Interspeech 2009 Emotion Challenge Proc. INTERSPEECH 2009, ISCA 2009 348 351
    • (2009) Proc. INTERSPEECH 2009, ISCA , pp. 348-351
    • Kockmann, M.1    Burget, L.2    Černocký, J.3
  • 20
    • 79959829347 scopus 로고    scopus 로고
    • Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge
    • Kockmann M.; Burget L.; and Černocký J. Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge Proc. INTERSPEECH 2010, ISCA 2010 2822 2825
    • (2010) Proc. INTERSPEECH 2010, ISCA , pp. 2822-2825
    • Kockmann, M.1    Burget, L.2    Černocký, J.3
  • 22
    • 0018918171 scopus 로고
    • An algorithm for vector quantizer design
    • Linde Y.; Buzo A.; and Gray R. An algorithm for vector quantizer design IEEE Trans. Commun. 28 1980 84 95
    • (1980) IEEE Trans. Commun. , vol.28 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.3
  • 24
    • 0346586663 scopus 로고    scopus 로고
    • Synthetic minority over-sampling technique
    • Nitesh V.; and Chawla E.A. Synthetic minority over-sampling technique J. Artif. Intel. Res. 16 2002 321 357
    • (2002) J. Artif. Intel. Res. , vol.16 , pp. 321-357
    • Nitesh, V.1    Chawla, E.A.2
  • 25
    • 0003120218 scopus 로고    scopus 로고
    • Fast training of support vector machines using sequential minimal optimization
    • Schoelkopf B. Burges C. Smola A. MIT Cambridge, MA, USA
    • Platt J.C. Fast training of support vector machines using sequential minimal optimization Schoelkopf B. Burges C. Smola A. Advances in Kernel Methods - Support Vector Learning 1999 MIT Cambridge, MA, USA 185 208
    • (1999) Advances in Kernel Methods - Support Vector Learning , pp. 185-208
    • Platt, J.C.1
  • 27
    • 0033884858 scopus 로고    scopus 로고
    • Speaker Verification Using Adapted Gaussian Mixture Models
    • Reynolds D.A.; Quatieri T.F.; and Dunn R.B. Speaker Verification Using Adapted Gaussian Mixture Models Digit. Signal Proc. 10 2000 19 41
    • (2000) Digit. Signal Proc. , vol.10 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 28
    • 84865734881 scopus 로고    scopus 로고
    • Alcohol language corpus: The first public corpus of alcoholized german speech
    • Schiel F.; Heinrich C.; and Barfüsser S. Alcohol language corpus: the first public corpus of alcoholized german speech Lang. Resourc. Eval. 2011 1 19
    • (2011) Lang. Resourc. Eval. , pp. 1-19
    • Schiel, F.1    Heinrich, C.2    Barfüsser, S.3
  • 30
    • 79960846940 scopus 로고    scopus 로고
    • Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge
    • in press
    • Schuller, B.; Batliner, A.; Steidl, S.; Seppi, D.; 2011. Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge. Speech Communication, in press.
    • (2011) Speech Communication
    • Schuller, B.1    Batliner, A.2    Steidl, S.3    Seppi, D.4
  • 31
    • 84867329448 scopus 로고    scopus 로고
    • 'Mister D.J.; Cheer me up!': Musical and textual features for automatic mood classification
    • Taylor & Francis
    • Schuller B.; Hage C.; Schuller D.; and Rigoll G. 'Mister D.J.; Cheer me up!': musical and textual features for automatic mood classification J. New Music Res. (JNMR) 38 4 2009 Taylor & Francis
    • (2009) J. New Music Res. (JNMR) , vol.38 , Issue.4
    • Schuller, B.1    Hage, C.2    Schuller, D.3    Rigoll, G.4
  • 32
    • 70450206416 scopus 로고    scopus 로고
    • The INTERSPEECH 2009 emotion challenge
    • ISCA, Brighton, UK
    • Schuller B.; Steidl S.; and Batliner A. The INTERSPEECH 2009 emotion challenge INTERSPEECH 2009 ISCA, Brighton, UK 2009 312 315
    • (2009) INTERSPEECH 2009 , pp. 312-315
    • Schuller, B.1    Steidl, S.2    Batliner, A.3
  • 36
    • 85053783447 scopus 로고    scopus 로고
    • The role of prosody in affective speech, linguistic insights, studies in language and communication
    • Schuller B.; Wöllmer M.; Eyben F.; and Rigoll G. The role of prosody in affective speech, linguistic insights, studies in language and communication J. New Music Res. 97 2009 285 307
    • (2009) J. New Music Res. , vol.97 , pp. 285-307
    • Schuller, B.1    Wöllmer, M.2    Eyben, F.3    Rigoll, G.4
  • 39
    • 70450176954 scopus 로고    scopus 로고
    • Processing affected speech within human-machine interaction
    • Brighton, UK, ISCA
    • Vlasenko B.; and Wendemuth A. Processing affected speech within human-machine interaction Proc. INTERSPEECH 2009 Brighton, UK, ISCA 2009 2039 2042
    • (2009) Proc. INTERSPEECH 2009 , pp. 2039-2042
    • Vlasenko, B.1    Wendemuth, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.