메뉴 건너뛰기




Volumn 16, Issue 2, 2013, Pages 133-141

Gender-dependent emotion recognition based on HMMs and SPHMMs

Author keywords

Emotion recognition; Gender recognition; Hidden Markov models; Mel frequency cepstral coefficients; Suprasegmental hidden Markov models

Indexed keywords

EMOTION IDENTIFICATIONS; EMOTION RECOGNITION; GENDER RECOGNITION; HIDDEN MARKOV MODELS (HMMS); MEL-FREQUENCY CEPSTRAL COEFFICIENTS; SUBJECTIVE ASSESSMENTS; SUPRASEGMENTAL HIDDEN MARKOV MODELS; TWO-STAGE APPROACHES;

EID: 84882867741     PISSN: 13812416     EISSN: 15728110     Source Type: Journal    
DOI: 10.1007/s10772-012-9170-4     Document Type: Article
Times cited : (23)

References (28)
  • 2
    • 0037382560 scopus 로고    scopus 로고
    • Emotions, speech and the ASR framework
    • 1006.68943
    • Bosch, L. T. (2003). Emotions, speech and the ASR framework. Speech Communication, 40(1-2), 213-225.
    • (2003) Speech Communication , vol.40 , Issue.1-2 , pp. 213-225
    • Bosch, L.T.1
  • 3
    • 0034229795 scopus 로고    scopus 로고
    • A comparative study of traditional and newly proposed features for recognition of speech under stress
    • 10.1109/89.848224
    • Bou-Ghazale, S. E.; Hansen, J. H. L. (2000). A comparative study of traditional and newly proposed features for recognition of speech under stress. IEEE Transactions on Speech and Audio Processing, 8(4), 429-442.
    • (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.4 , pp. 429-442
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 4
    • 34547958553 scopus 로고    scopus 로고
    • Multistyle classification of speech under stress using feature subset selection based on genetic algorithms
    • 10.1016/j.specom.2007.04.012
    • Casale, S.; Russo, A.; Serrano, S. (2007). Multistyle classification of speech under stress using feature subset selection based on genetic algorithms. Speech Communication, 49(10-11), 801-810.
    • (2007) Speech Communication , vol.49 , Issue.10-11 , pp. 801-810
    • Casale, S.1    Russo, A.2    Serrano, S.3
  • 6
    • 70449360175 scopus 로고    scopus 로고
    • Modulation spectral features for robust far-field speaker identification
    • 10.1109/TASL.2009.2023679
    • Falk, T. H.; Chan, W. Y. (2010). Modulation spectral features for robust far-field speaker identification. IEEE Transactions on Audio, Speech, and Language Processing, 18(1), 90-100.
    • (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.1 , pp. 90-100
    • Falk, T.H.1    Chan, W.Y.2
  • 7
    • 21544458365 scopus 로고    scopus 로고
    • Emotion recognition in human-computer interaction
    • 10.1016/j.neunet.2005.03.006 Special issue
    • Fragopanagos, N.; Taylor, J. G. (2005). Emotion recognition in human-computer interaction. Neural Networks, 18, 389-405 (Special issue)
    • (2005) Neural Networks , vol.18 , pp. 389-405
    • Fragopanagos, N.1    Taylor, J.G.2
  • 8
    • 0030196359 scopus 로고    scopus 로고
    • Feature analysis and neural network-based classification of speech under stress
    • 10.1109/89.506935
    • Hansen, J. H. L.; Womack, B. (1996). Feature analysis and neural network-based classification of speech under stress. IEEE Transactions on Speech and Audio Processing, 4(4), 307-313.
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.4 , pp. 307-313
    • Hansen, J.H.L.1    Womack, B.2
  • 9
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • 10.1016/j.specom.2009.08.009
    • Kinnunen, T.; Li, H. (2010). An overview of text-independent speaker recognition: from features to supervectors. Speech Communication, 52(1), 12-40.
    • (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 12
    • 33846952503 scopus 로고    scopus 로고
    • Ensemble methods for spoken emotion recognition in call-centres
    • 10.1016/j.specom.2006.11.004
    • Morrison, D.; Wang, R.; De Silva, L. C. (2007). Ensemble methods for spoken emotion recognition in call-centres. Speech Communication, 49(2), 98-112.
    • (2007) Speech Communication , vol.49 , Issue.2 , pp. 98-112
    • Morrison, D.1    Wang, R.2    De Silva, L.C.3
  • 13
    • 0242721417 scopus 로고    scopus 로고
    • Speech emotion recognition using hidden Markov models
    • 10.1016/S0167-6393(03)00099-2
    • Nwe, T. L.; Foo, S. W.; De Silva, L. C. (2003). Speech emotion recognition using hidden Markov models. Speech Communication, 41(4), 603-623.
    • (2003) Speech Communication , vol.41 , Issue.4 , pp. 603-623
    • Nwe, T.L.1    Foo, S.W.2    De Silva, L.C.3
  • 14
    • 0038548330 scopus 로고    scopus 로고
    • The production and recognition of emotions in speech: Features and algorithms
    • 10.1016/S1071-5819(02)00141-6
    • Oudeyer, P. Y. (2003). The production and recognition of emotions in speech: features and algorithms. International Journal of Human-Computer Studies, 59, 157-183.
    • (2003) International Journal of Human-Computer Studies , vol.59 , pp. 157-183
    • Oudeyer, P.Y.1
  • 15
    • 0141815650 scopus 로고    scopus 로고
    • Emotion recognition and acoustic analysis from speech signal
    • Portland, Oregon, USA July 20-24 4
    • Park, C. H.; Sim, K. B. (2003). Emotion recognition and acoustic analysis from speech signal. In Proceedings of the international joint conference on neural networks, Portland, Oregon, USA, July 20-24 (Vol. 4, pp. 2594-2598).
    • (2003) Proceedings of the International Joint Conference on Neural Networks , pp. 2594-2598
    • Park, C.H.1    Sim, K.B.2
  • 17
    • 69849087531 scopus 로고    scopus 로고
    • Analysis and classification of speech signals by generalized fractal dimension features
    • 10.1016/j.specom.2009.06.005
    • Pitsikalis, V.; Maragos, P. (2009). Analysis and classification of speech signals by generalized fractal dimension features. Speech Communication, 51(12), 1206-1223.
    • (2009) Speech Communication , vol.51 , Issue.12 , pp. 1206-1223
    • Pitsikalis, V.1    Maragos, P.2
  • 19
    • 63649147868 scopus 로고    scopus 로고
    • Emotion recognition using Mel-frequency cepstral coefficients
    • 10.5715/jnlp.14.4-83
    • Sato, N.; Obuchi, Y. (2007). Emotion recognition using Mel-frequency cepstral coefficients. Journal of Natural Language Processing, 14(4), 83-96.
    • (2007) Journal of Natural Language Processing , vol.14 , Issue.4 , pp. 83-96
    • Sato, N.1    Obuchi, Y.2
  • 20
    • 47749098868 scopus 로고    scopus 로고
    • Speaker identification in the shouted environment using suprasegmental hidden Markov models
    • 1151.94408 10.1016/j.sigpro.2008.05.012
    • Shahin, I. (2008). Speaker identification in the shouted environment using suprasegmental hidden Markov models. Signal Processing Journal, 88(11), 2700-2708.
    • (2008) Signal Processing Journal , vol.88 , Issue.11 , pp. 2700-2708
    • Shahin, I.1
  • 22
    • 80052603818 scopus 로고    scopus 로고
    • Identifying speakers using their emotion cues
    • 10.1007/s10772-011-9089-1 10.1007/s10772-011-9089-1
    • Shahin, I. (2011a). Identifying speakers using their emotion cues. International Journal of Speech Technology, 14(2), 89-98. doi: 10.1007/s10772-011-9089-1.
    • (2011) International Journal of Speech Technology , vol.14 , Issue.2 , pp. 89-98
    • Shahin, I.1
  • 23
    • 80051700998 scopus 로고    scopus 로고
    • Analysis and investigation of emotion identification in biased emotional talking environments
    • 10.1049/iet-spr.2010.0059 10.1049/iet-spr.2010.0059
    • Shahin, I. (2011b). Analysis and investigation of emotion identification in biased emotional talking environments. IET Signal Processing Journal, 5(5), 461-470. doi: 10.1049/iet-spr.2010.0059.
    • (2011) IET Signal Processing Journal , vol.5 , Issue.5 , pp. 461-470
    • Shahin, I.1
  • 24
    • 79957832640 scopus 로고    scopus 로고
    • Speaker identification in each of the neutral and shouted talking environments based on gender-dependent approach using SPHMMs
    • 2809545
    • Shahin, I. (2011c). Speaker identification in each of the neutral and shouted talking environments based on gender-dependent approach using SPHMMs. International Journal of Computers & Applications, 33(1), 83-91.
    • (2011) International Journal of Computers & Applications , vol.33 , Issue.1 , pp. 83-91
    • Shahin, I.1
  • 25
    • 33746410556 scopus 로고    scopus 로고
    • Emotional speech recognition: Resources, features, and methods
    • 10.1016/j.specom.2006.04.003
    • Ververidis, D.; Kotropoulos, C. (2006). Emotional speech recognition: resources, features, and methods. Speech Communication, 48(9), 1162-1181.
    • (2006) Speech Communication , vol.48 , Issue.9 , pp. 1162-1181
    • Ververidis, D.1    Kotropoulos, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.