메뉴 건너뛰기




Volumn 20, Issue 2, 2012, Pages 585-598

Loss-scaled large-margin Gaussian mixture models for speech emotion classification

Author keywords

Gaussian mixture models (GMMs); margin scaling; speech emotion classification; Watson and Tellegen's model

Indexed keywords

CLASSIFICATION ACCURACY; CONSTRAINED OPTIMIZATION PROBLEMS; DISCRIMINANT FUNCTIONS; DISTANCE METRICS; EMOTION CLASSIFICATION; EMOTION MODELING; EMOTION MODELS; GAUSSIAN MIXTURE MODEL; GAUSSIAN MIXTURE MODELS; GENERALIZATION ABILITY; LEARNING FRAMEWORKS; LOSS FUNCTIONS; MARGIN SCALING; MAXIMUM MUTUAL INFORMATION; PARAMETER SET; SEMI-DEFINITE PROGRAMMING; SPEECH EMOTION CLASSIFICATION; TELLEGEN; TESTING DATA; TRAINING DATA SETS;

EID: 83655164697     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2011.2162405     Document Type: Article
Times cited : (44)

References (55)
  • 7
    • 34547951152 scopus 로고    scopus 로고
    • Bi-modal emotion recognition from expressive face and body gestures
    • DOI 10.1016/j.jnca.2006.09.007, PII S1084804506000774
    • H. Gunes and M. Piccardi, "Bi-modal emotion recognition from expressive face and body gestures" J. Netw. Comput. Applicat., vol. 30, no. 4, pp. 1334-1345, 2007. (Pubitemid 47263518)
    • (2007) Journal of Network and Computer Applications , vol.30 , Issue.4 , pp. 1334-1345
    • Gunes, H.1    Piccardi, M.2
  • 8
  • 9
    • 9444222731 scopus 로고    scopus 로고
    • Emotion recognition using bio-sensors: First steps towards an automatic system
    • Affective Dialogue Systems
    • A. Haag, S. Goronzy, P. Schaich, and J. Williams, "Emotion recognition using bio-sensors: First steps towards an automatic system" Lecture Note Comput. Sci., vol. 3068, pp. 36-48, 2004. (Pubitemid 38851666)
    • (2004) Lecture Note Comput. Sci. , vol.3068 , pp. 36-48
    • Haag, A.1    Goronzy, S.2    Schaich, P.3    Williams, J.4
  • 10
    • 50049092345 scopus 로고    scopus 로고
    • Fast and accurate sequential floating forward feature selection with the Bayes classifier applied to speech emotion recognition
    • D. Ververidis and C. Kotropoulos, "Fast and accurate sequential floating forward feature selection with the Bayes classifier applied to speech emotion recognition" Signal Process., vol. 88, no. 12, pp. 2869-3014, 2008.
    • (2008) Signal Process. , vol.88 , Issue.12 , pp. 2869-3014
    • Ververidis, D.1    Kotropoulos, C.2
  • 11
    • 85115260483 scopus 로고
    • Floating search method for feature selection with nonmonotonic criterion functions
    • P. Pudil, F. Ferri, J. Novovicova, and J. Kittler, "Floating search method for feature selection with nonmonotonic criterion functions" Pattern Recognit., vol. 2, pp. 279-283, 1994.
    • (1994) Pattern Recognit. , vol.2 , pp. 279-283
    • Pudil, P.1    Ferri, F.2    Novovicova, J.3    Kittler, J.4
  • 12
    • 34547958553 scopus 로고    scopus 로고
    • Multistyle classification of speech under stress using feature subset selection based on genetic algorithms
    • DOI 10.1016/j.specom.2007.04.012, PII S0167639307000830
    • S. Casale, A. Russo, and S. Serrano, "Multistyle classification of speech under stress using feature subset selection based on genetic algorithms" Speech Commun., vol. 49, pp. 801-810, 2007. (Pubitemid 47268572)
    • (2007) Speech Communication , vol.49 , Issue.10-11 , pp. 801-810
    • Casale, S.1    Russo, A.2    Serrano, S.3
  • 14
    • 84979947871 scopus 로고    scopus 로고
    • Automatic speech classification to five emotional states based on gender information
    • D. Ververidis and C. Kotropoulos, "Automatic speech classification to five emotional states based on gender information" in Proc. Eur. Signal Process. Conf., 2004, pp. 341-344.
    • (2004) Proc. Eur. Signal Process. Conf. , pp. 341-344
    • Ververidis, D.1    Kotropoulos, C.2
  • 15
    • 70450136545 scopus 로고    scopus 로고
    • An incremental analysis of different feature groups in speaker independent emotion recognition
    • M. Lugger and B. Yang, "An incremental analysis of different feature groups in speaker independent emotion recognition" in Proc. Int. Congr. Phonet. Sci., 2007, pp. 2149-2152.
    • (2007) Proc. Int. Congr. Phonet. Sci. , pp. 2149-2152
    • Lugger, M.1    Yang, B.2
  • 17
    • 28444440262 scopus 로고    scopus 로고
    • Speech emotion recognition based on HMM and SVM
    • Y. Lin and G. Wei, "Speech emotion recognition based on HMM and SVM" in Proc. IEEE Int. Conf. Mach. Learn. Cybern., 2005, vol. 8, pp. 18-21.
    • (2005) Proc. IEEE Int. Conf. Mach. Learn. Cybern. , vol.8 , pp. 18-21
    • Lin, Y.1    Wei, G.2
  • 18
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains" IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 23
    • 24944537843 scopus 로고    scopus 로고
    • Large margin methods for structured and interdependent output variables
    • I. Tsochantaridis, T. Joachims, and T. Hofmann, "Large margin methods for structured and interdependent output variables" J. Mach. Learn. Res., vol. 6, pp. 1453-1484, 2005.
    • (2005) J. Mach. Learn. Res. , vol.6 , pp. 1453-1484
    • Tsochantaridis, I.1    Joachims, T.2    Hofmann, T.3
  • 25
    • 56449091292 scopus 로고    scopus 로고
    • Modified MMI/ MPE: A direct evaluation of the margin in speech recognition
    • G. Heigold, T. Deselaers, R. Schlüter, and H. Ney, "Modified MMI/ MPE: A direct evaluation of the margin in speech recognition" in Proc. Int. Conf. Mach. Learn., 2008, pp. 384-391.
    • (2008) Proc. Int. Conf. Mach. Learn. , pp. 384-391
    • Heigold, G.1    Deselaers, T.2    R. Schlüter3    Ney, H.4
  • 26
    • 84864038630 scopus 로고    scopus 로고
    • Large margin hidden Markov models for automatic speech recognition
    • F. Sha and L. K. Saul, "Large margin hidden Markov models for automatic speech recognition" Neural Inf. Process. Syst., vol. 19, pp. 1249-1256, 2007.
    • (2007) Neural Inf. Process. Syst. , vol.19 , pp. 1249-1256
    • Sha, F.1    Saul, L.K.2
  • 27
    • 56449089882 scopus 로고    scopus 로고
    • Accurate max-margin training for structured output spaces
    • S. Sarawagi and R. Gupta, "Accurate max-margin training for structured output spaces" in Proc. Int. Conf. Mach. Learn., 2008, pp. 888-895.
    • (2008) Proc. Int. Conf. Mach. Learn. , pp. 888-895
    • Sarawagi, S.1    Gupta, R.2
  • 28
    • 0141827752 scopus 로고    scopus 로고
    • On the dimensional and hierarchical structure of affect
    • A. Tellegen, D. Watson, and L. Clark, "On the dimensional and hierarchical structure of affect" Psychol. Sci., vol. 10, no. 4, pp. 297-303, 1999.
    • (1999) Psychol. Sci. , vol.10 , Issue.4 , pp. 297-303
    • Tellegen, A.1    Watson, D.2    Clark, L.3
  • 29
    • 0022115623 scopus 로고
    • Toward a consensual structure of mood
    • D. Watson and A. Tellegen, "Toward a consensual structure of mood" Psychol. Bull., vol. 98, no. 2, pp. 219-35, 1985.
    • (1985) Psychol. Bull. , vol.98 , Issue.2 , pp. 219-35
    • Watson, D.1    Tellegen, A.2
  • 30
    • 0024023344 scopus 로고
    • Development and validation of brief measures of positive and negative affect: The PANAS scales
    • D.Watson, L. A. Clark, and A. Tellegen, "Development and validation of brief measures of positive and negative affect: The PANAS scales" J. Personal. Soc. Psychol., vol. 54, no. 2, pp. 1063-1070, 1988.
    • (1988) J. Personal. Soc. Psychol. , vol.54 , Issue.2 , pp. 1063-1070
    • Watson, D.1    Clark, L.A.2    Tellegen, A.3
  • 31
    • 70349192899 scopus 로고    scopus 로고
    • Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion model
    • S. Yun and C. D. Yoo, "Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion model" in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2009, pp. 4169-4172.
    • (2009) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 4169-4172
    • Yun, S.1    Yoo, C.D.2
  • 32
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.s
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition" Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 34
    • 2442503633 scopus 로고    scopus 로고
    • A discriminative training algorithm for hidden Markov models
    • May
    • A. B. Yishai and D. Burshtein, "A discriminative training algorithm for hidden Markov models" IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 204-217, May 2004.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.3 , pp. 204-217
    • Yishai, A.B.1    Burshtein, D.2
  • 35
    • 0028412908 scopus 로고
    • High-performance connected digit recognition using maximum mutual information estimation
    • Apr.
    • Y. Normandin, R. Cardin, and R. D. Mori, "High-performance connected digit recognition using maximum mutual information estimation" IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 299-311, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 299-311
    • Normandin, Y.1    Cardin, R.2    Mori, R.D.3
  • 36
    • 0020796537 scopus 로고
    • A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional vesus conditional maximum likelihood
    • A. Nadas, "A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional vesus conditional maximum likelihood" IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-31, no. 4, pp. 814-817, Aug. 1983. (Pubitemid 14455162)
    • (1983) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-31 , Issue.4 , pp. 814-817
    • Nadas, A.1
  • 37
    • 0010442827 scopus 로고    scopus 로고
    • On the algorithmic implementation of multiclass kernel-based vector machines
    • K. Crammer and Y. Singer, "On the algorithmic implementation of multiclass kernel-based vector machines" J. Mach. Learn. Res., vol. 2, pp. 265-292, 2001.
    • (2001) J. Mach. Learn. Res. , vol.2 , pp. 265-292
    • Crammer, K.1    Singer, Y.2
  • 39
    • 44849092926 scopus 로고    scopus 로고
    • A compact semidefinite programming (SDP) formulation for large margin estimation of HMMs in speech recognition
    • Y. Yin and H. Jiang, "A compact semidefinite programming (SDP) formulation for large margin estimation of HMMs in speech recognition" in Proc. IEEE Workshop Autom. Speech Recognit. Understand., 2007, pp. 312-317.
    • (2007) Proc. IEEE Workshop Autom. Speech Recognit. Understand. , pp. 312-317
    • Yin, Y.1    Jiang, H.2
  • 40
    • 44849126208 scopus 로고    scopus 로고
    • Solving large-margin hidden Markov model estimation via semidefinite programming
    • Nov.
    • X. Li and H. Jiang, "Solving large-margin hidden Markov model estimation via semidefinite programming" IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2383-2392, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2383-2392
    • Li, X.1    Jiang, H.2
  • 41
    • 44249088324 scopus 로고    scopus 로고
    • Algorithm 875: DSDP5-Software for semidefinite programming
    • S. J. Benson and Y. Ye, "Algorithm 875: DSDP5-Software for semidefinite programming" Res. J. Assoc. for Comput. Machinery Math. Software, vol. 34, no. 3, pp. 16:1-16:20, 2008.
    • (2008) Res. J. Assoc. for Comput. Machinery Math. Software , vol.34 , Issue.3 , pp. 161-1620
    • Benson, S.J.1    Ye, Y.2
  • 43
    • 85089273681 scopus 로고    scopus 로고
    • Getting started with susas: A speech under simulated and actual stress database
    • J. Hansen and S. Bou-Ghazale, "Getting started with susas: A speech under simulated and actual stress database" in Proc. Eur. Conf. Speech Commun. Technol., 1997, vol. 4, pp. 1743-1746.
    • (1997) Proc. Eur. Conf. Speech Commun. Technol. , vol.4 , pp. 1743-1746
    • Hansen, J.1    Bou-Ghazale, S.2
  • 47
    • 2342561037 scopus 로고    scopus 로고
    • Phoneme recognition using ica-based feature extraction and transformation
    • O. Kwon and T. Lee, "Phoneme recognition using ica-based feature extraction and transformation" Signal Process., vol. 84, no. 6, pp. 1005-1019, 2004.
    • (2004) Signal Process. , vol.84 , Issue.6 , pp. 1005-1019
    • Kwon, O.1    Lee, T.2
  • 51
    • 83655204455 scopus 로고    scopus 로고
    • Analysis of robustness of attributes selection applied to speech emotion recognition
    • S. Casale, A. Russo, and S. Serrano, "Analysis of robustness of attributes selection applied to speech emotion recognition" in Proc. Eur. Signal Process. Conf., 2010, pp. 1174-1178.
    • (2010) Proc. Eur. Signal Process. Conf. , pp. 1174-1178
    • Casale, S.1    Russo, A.2    Serrano, S.3
  • 53
    • 34547940048 scopus 로고    scopus 로고
    • Primitives-based evaluation and estimation of emotions in speech
    • DOI 10.1016/j.specom.2007.01.010, PII S0167639307000040
    • M. Grimm, K. Kroschel, E. Mower, and S. Narayanan, "Primitives- based evaluation and estimation of emotions in speech" Speech Commun., vol. 49, no. 10-11, pp. 787-800, 2007. (Pubitemid 47268568)
    • (2007) Speech Communication , vol.49 , Issue.10-11 , pp. 787-800
    • Grimm, M.1    Kroschel, K.2    Mower, E.3    Narayanan, S.4
  • 54
    • 0347269187 scopus 로고    scopus 로고
    • Noise adaptive speech recognition based on sequential noise parameter estimation
    • K. Yao, K. K. Paliwal, and S. Nakamura, "Noise adaptive speech recognition based on sequential noise parameter estimation" Speech Commun., vol. 42, no. 1, pp. 5-23, 2004.
    • (2004) Speech Commun. , vol.42 , Issue.1 , pp. 5-23
    • Yao, K.1    Paliwal, K.K.2    Nakamura, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.