SCOPUS Information Search Platform

International Journal of Speech Technology

Volume 16, Issue 2, 2013, Pages 133-141

Gender-dependent emotion recognition based on HMMs and SPHMMs

University of Sharjah (United Arab Emirates)

Author keywords

Emotion recognition; Gender recognition; Hidden Markov models; Mel frequency cepstral coefficients; Suprasegmental hidden Markov models

Indexed keywords

EMOTION IDENTIFICATIONS; EMOTION RECOGNITION; GENDER RECOGNITION; HIDDEN MARKOV MODELS (HMMS); MEL-FREQUENCY CEPSTRAL COEFFICIENTS; SUBJECTIVE ASSESSMENTS; SUPRASEGMENTAL HIDDEN MARKOV MODELS; TWO-STAGE APPROACHES;

DATABASE SYSTEMS; HIDDEN MARKOV MODELS; SPEECH RECOGNITION;

SOCIAL SCIENCES;

EID: 84882867741
PISSN: 1381-2416
EISSN: 1572-8110
Source Type: Journal
DOI: 10.1007/s10772-012-9170-4
Document Type: Article

Times cited: 23

References (28)

1
- 33646772406
- Improving speech recognition performance through gender separation
- Dunedin, New Zealand
- Abdulla, W. H.; Kasabov, N. K. (2001). Improving speech recognition performance through gender separation. In Artificial neural networks and expert systems international conference (ANNES), Dunedin, New Zealand (pp. 218-222).
- (2001) Artificial Neural Networks and Expert Systems International Conference (ANNES) , pp. 218-222
- Abdulla, W.H.¹ Kasabov, N.K.²

2
- 0037382560
- Emotions, speech and the ASR framework
- 1006.68943
- Bosch, L. T. (2003). Emotions, speech and the ASR framework. Speech Communication, 40(1-2), 213-225.
- (2003) Speech Communication , vol.40 , Issue.1-2 , pp. 213-225
- Bosch, L.T.¹

3
- 0034229795
- A comparative study of traditional and newly proposed features for recognition of speech under stress
- 10.1109/89.848224
- Bou-Ghazale, S. E.; Hansen, J. H. L. (2000). A comparative study of traditional and newly proposed features for recognition of speech under stress. IEEE Transactions on Speech and Audio Processing, 8(4), 429-442.
- (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.4 , pp. 429-442
- Bou-Ghazale, S.E.¹ Hansen, J.H.L.²

4
- 34547958553
- Multistyle classification of speech under stress using feature subset selection based on genetic algorithms
- 10.1016/j.specom.2007.04.012
- Casale, S.; Russo, A.; Serrano, S. (2007). Multistyle classification of speech under stress using feature subset selection based on genetic algorithms. Speech Communication, 49(10-11), 801-810.
- (2007) Speech Communication , vol.49 , Issue.10-11 , pp. 801-810
- Casale, S.¹ Russo, A.² Serrano, S.³

5
- 85032751766
- Emotion recognition in human-computer interaction
- 10.1109/79.911197
- Cowie, R.; Douglas-Cowie, E.; Tsapatsoulis, N.; Votsis, G.; Kollias, S.; Fellenz, W.; Taylor, J. (2001). Emotion recognition in human-computer interaction. IEEE Signal Processing Magazine, 18(1), 32-80.
- (2001) IEEE Signal Processing Magazine , vol.18 , Issue.1 , pp. 32-80
- Cowie, R.¹ Douglas-Cowie, E.² Tsapatsoulis, N.³ Votsis, G.⁴ Kollias, S.⁵ Fellenz, W.⁶ Taylor, J.⁷

6
- 70449360175
- Modulation spectral features for robust far-field speaker identification
- 10.1109/TASL.2009.2023679
- Falk, T. H.; Chan, W. Y. (2010). Modulation spectral features for robust far-field speaker identification. IEEE Transactions on Audio, Speech, and Language Processing, 18(1), 90-100.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.1 , pp. 90-100
- Falk, T.H.¹ Chan, W.Y.²

7
- 21544458365
- Emotion recognition in human-computer interaction
- 10.1016/j.neunet.2005.03.006 (Special issue)
- Fragopanagos, N.; Taylor, J. G. (2005). Emotion recognition in human-computer interaction. Neural Networks, 18, 389-405 (Special issue)
- (2005) Neural Networks , vol.18 , pp. 389-405
- Fragopanagos, N.¹ Taylor, J.G.²

8
- 0030196359
- Feature analysis and neural network-based classification of speech under stress
- 10.1109/89.506935
- Hansen, J. H. L.; Womack, B. (1996). Feature analysis and neural network-based classification of speech under stress. IEEE Transactions on Speech and Audio Processing, 4(4), 307-313.
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.4 , pp. 307-313
- Hansen, J.H.L.¹ Womack, B.²

9
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- 10.1016/j.specom.2009.08.009
- Kinnunen, T.; Li, H. (2010). An overview of text-independent speaker recognition: from features to supervectors. Speech Communication, 52(1), 12-40.
- (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

10
- 85009223246
- Emotion recognition by speech signals
- Geneva, Switzerland, September 2003
- Kwon, O. W.; Chan, K.; Hao, J.; Lee, T. W. (2003). Emotion recognition by speech signals. In 8th European conference on speech communication and technology 2003, Geneva, Switzerland, September 2003 (pp. 125-128).
- (2003) 8th European Conference on Speech Communication and Technology 2003 , pp. 125-128
- Kwon, O.W.¹ Chan, K.² Hao, J.³ Lee, T.W.⁴

11
- 14644439843
- Towards detecting emotions in spoken dialogs
- 10.1109/TSA.2004.838534
- Lee, C. M.; Narayanan, S. S. (2005). Towards detecting emotions in spoken dialogs. IEEE Transactions on Speech and Audio Processing, 13(2), 293-303.
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.2 , pp. 293-303
- Lee, C.M.¹ Narayanan, S.S.²

12
- 33846952503
- Ensemble methods for spoken emotion recognition in call-centres
- 10.1016/j.specom.2006.11.004
- Morrison, D.; Wang, R.; De Silva, L. C. (2007). Ensemble methods for spoken emotion recognition in call-centres. Speech Communication, 49(2), 98-112.
- (2007) Speech Communication , vol.49 , Issue.2 , pp. 98-112
- Morrison, D.¹ Wang, R.² De Silva, L.C.³

13
- 0242721417
- Speech emotion recognition using hidden Markov models
- 10.1016/S0167-6393(03)00099-2
- Nwe, T. L.; Foo, S. W.; De Silva, L. C. (2003). Speech emotion recognition using hidden Markov models. Speech Communication, 41(4), 603-623.
- (2003) Speech Communication , vol.41 , Issue.4 , pp. 603-623
- Nwe, T.L.¹ Foo, S.W.² De Silva, L.C.³

14
- 0038548330
- The production and recognition of emotions in speech: Features and algorithms
- 10.1016/S1071-5819(02)00141-6
- Oudeyer, P. Y. (2003). The production and recognition of emotions in speech: features and algorithms. International Journal of Human-Computer Studies, 59, 157-183.
- (2003) International Journal of Human-Computer Studies , vol.59 , pp. 157-183
- Oudeyer, P.Y.¹

15
- 0141815650
- Emotion recognition and acoustic analysis from speech signal
- Portland, Oregon, USA, July 20-24, Vol. 4
- Park, C. H.; Sim, K. B. (2003). Emotion recognition and acoustic analysis from speech signal. In Proceedings of the international joint conference on neural networks, Portland, Oregon, USA, July 20-24 (Vol. 4, pp. 2594-2598).
- (2003) Proceedings of the International Joint Conference on Neural Networks , pp. 2594-2598
- Park, C.H.¹ Sim, K.B.²

16
- 85009080929
- Emotion recognition in speech signal: Experimental study, development, and application
- Petrushin, V. A. (2000). Emotion recognition in speech signal: experimental study, development, and application. In Proceedings of international conference on spoken language processing (ICSLP 2000).
- (2000) Proceedings of International Conference on Spoken Language Processing (ICSLP 2000)
- Petrushin, V.A.¹

17
- 69849087531
- Analysis and classification of speech signals by generalized fractal dimension features
- 10.1016/j.specom.2009.06.005
- Pitsikalis, V.; Maragos, P. (2009). Analysis and classification of speech signals by generalized fractal dimension features. Speech Communication, 51(12), 1206-1223.
- (2009) Speech Communication , vol.51 , Issue.12 , pp. 1206-1223
- Pitsikalis, V.¹ Maragos, P.²

18
- 4544352297
- Detecting emotions in speech
- Polzin, T. S.; Waibel, A. H. (1998). Detecting emotions in speech. In Cooperative multimodal communication, second international conference (CMC 1998).
- (1998) Cooperative Multimodal Communication, Second International Conference (CMC 1998)
- Polzin, T.S.¹ Waibel, A.H.²

19
- 63649147868
- Emotion recognition using Mel-frequency cepstral coefficients
- 10.5715/jnlp.14.4-83
- Sato, N.; Obuchi, Y. (2007). Emotion recognition using Mel-frequency cepstral coefficients. Journal of Natural Language Processing, 14(4), 83-96.
- (2007) Journal of Natural Language Processing , vol.14 , Issue.4 , pp. 83-96
- Sato, N.¹ Obuchi, Y.²

20
- 47749098868
- Speaker identification in the shouted environment using suprasegmental hidden Markov models
- 1151.94408 10.1016/j.sigpro.2008.05.012
- Shahin, I. (2008). Speaker identification in the shouted environment using suprasegmental hidden Markov models. Signal Processing Journal, 88(11), 2700-2708.
- (2008) Signal Processing Journal , vol.88 , Issue.11 , pp. 2700-2708
- Shahin, I.¹

21
- 76949108081
- Speaker identification in emotional environments
- 2595466
- Shahin, I. (2009). Speaker identification in emotional environments. Iranian Journal of Electrical and Computer Engineering, 8(1), 41-46.
- (2009) Iranian Journal of Electrical and Computer Engineering , vol.8 , Issue.1 , pp. 41-46
- Shahin, I.¹

22
- 80052603818
- Identifying speakers using their emotion cues
- 10.1007/s10772-011-9089-1
- Shahin, I. (2011a). Identifying speakers using their emotion cues. International Journal of Speech Technology, 14(2), 89-98. doi: 10.1007/s10772-011-9089-1.
- (2011) International Journal of Speech Technology , vol.14 , Issue.2 , pp. 89-98
- Shahin, I.¹

23
- 80051700998
- Analysis and investigation of emotion identification in biased emotional talking environments
- 10.1049/iet-spr.2010.0059
- Shahin, I. (2011b). Analysis and investigation of emotion identification in biased emotional talking environments. IET Signal Processing Journal, 5(5), 461-470. doi: 10.1049/iet-spr.2010.0059.
- (2011) IET Signal Processing Journal , vol.5 , Issue.5 , pp. 461-470
- Shahin, I.¹

24
- 79957832640
- Speaker identification in each of the neutral and shouted talking environments based on gender-dependent approach using SPHMMs
- 2809545
- Shahin, I. (2011c). Speaker identification in each of the neutral and shouted talking environments based on gender-dependent approach using SPHMMs. International Journal of Computers & Applications, 33(1), 83-91.
- (2011) International Journal of Computers & Applications , vol.33 , Issue.1 , pp. 83-91
- Shahin, I.¹

25
- 33746410556
- Emotional speech recognition: Resources, features, and methods
- 10.1016/j.specom.2006.04.003
- Ververidis, D.; Kotropoulos, C. (2006). Emotional speech recognition: resources, features, and methods. Speech Communication, 48(9), 1162-1181.
- (2006) Speech Communication , vol.48 , Issue.9 , pp. 1162-1181
- Ververidis, D.¹ Kotropoulos, C.²

26
- 85034230784
- Improving automatic emotion recognition from speech via gender differentiation
- Genoa, Italy
- Vogt, T.; Andre, E. (2006). Improving automatic emotion recognition from speech via gender differentiation. In Proceedings of language resources and evaluation conference (LREC 2006), Genoa, Italy.
- (2006) Proceedings of Language Resources and Evaluation Conference (LREC 2006)
- Vogt, T.¹ Andre, E.²

27
- 44949199375
- Study on speaker verification on emotional speech
- Wu, W.; Zheng, T. F.; Xu, M. X.; Bao, H. J. (2006). Study on speaker verification on emotional speech. In INTERSPEECH 2006: proceedings of international conference on spoken language processing (ICSLP) (pp. 2102-2105).
- (2006) INTERSPEECH 2006: Proceedings of International Conference on Spoken Language Processing (ICSLP) , pp. 2102-2105
- Wu, W.¹ Zheng, T.F.² Xu, M.X.³ Bao, H.J.⁴

28
- 0035278948
- Nonlinear feature based classification of speech under stress
- 10.1109/89.905995
- Zhou, G.; Hansen, J. H. L.; Kaiser, J. F. (2001). Nonlinear feature based classification of speech under stress. IEEE Transactions on Speech and Audio Processing, 9(3), 201-216.
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.3 , pp. 201-216
- Zhou, G.¹ Hansen, J.H.L.² Kaiser, J.F.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.