Volume 15, Issue 2, 2012, Pages 265-289

Emotion recognition from speech using source, system, and prosodic features

Author keywords

Emo DB; Emotion recognition; Excitation source features; Global prosodic features; Glottal closure instants; IITKGP SESC; Local prosodic features; Pitch synchronous analysis; System features; Zero frequency filter

Indexed keywords

EMO-DB; EMOTION RECOGNITION; EXCITATION SOURCES; GLOTTAL CLOSURE INSTANTS; IITKGP-SESC; PITCH SYNCHRONOUS ANALYSIS; PROSODIC FEATURES; SYSTEM FEATURES; ZERO FREQUENCY;

EID: 84864708818     PISSN: 1381-2416     EISSN: 1572-8110     Source Type: Journal
DOI: 10.1007/s10772-012-9139-3     Document Type: Article
Times cited: 77

References (74)
  • 1. Anjani, A. V. N. S. (2000). Autoassociative neural network models for processing degraded speech. Master's thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai 600 036, India.
  • 2. Atal, B. S. (1972). Automatic speaker recognition based on pitch contours. The Journal of the Acoustical Society of America, 52(6), 1687-1697.
  • 4. Banziger, T., & Scherer, K. R. (2005). The role of intonation in emotional expressions. Speech Communication, 46(3-4), 252-267. DOI: 10.1016/j.specom.2005.02.016.
  • 6. Bitouk, D., Verma, R., & Nenkova, A. (2010). Class-level spectral features for emotion recognition. Speech Communication, 52(7-8), 613-625.
  • 7. Burkhardt, F., & Sendlmeier, W. F. (2000). Verification of acoustical correlates of emotional speech using formant synthesis. In ITRW on speech and emotion, Newcastle, Northern Ireland, UK, Sept. 2000 (pp. 151-156).
  • 9. Cahn, J. E. (1990). The generation of affect in synthesized speech. In JAVIOS, Jul. 1990 (pp. 1-19).
  • 10. Cowie, R., & Cornelius, R. R. (2003). Describing the emotional states that are expressed in speech. Speech Communication, 40, 5-32.
  • 12. Dellaert, F., Polzin, T., & Waibel, A. (1996). Recognising emotions in speech. In ICSLP 96, Oct. 1996.
  • 16. Gupta, C. S. (2003). Significance of source features for speaker recognition. Master's thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai 600 036, India.
  • 17. Gupta, C. S., Prasanna, S. R. M., & Yegnanarayana, B. (2002). Autoassociative neural network models for online speaker verification using source features from vowels. In Int. joint conf. neural networks, Honolulu, Hawaii, USA, May 2002.
  • 18. Hao Kao, Y., & Shan Lee, L. (2006). Feature analysis for emotion recognition from Mandarin speech considering the special characteristics of Chinese language. In INTERSPEECH-ICSLP, Pittsburgh, Pennsylvania, Sept. 2006 (pp. 1814-1817).
  • 28. Kumar, K. S., Reddy, M. S. H., Murty, K. S. R., & Yegnanarayana, B. (2009). Analysis of laugh signals for detecting in continuous speech. In INTERSPEECH-09, Brighton, UK, September 6-10 (pp. 1591-1594).
  • 29. Lee, C. M., & Narayanan, S. S. (2005). Toward detecting emotions in spoken dialogs. IEEE Transactions on Speech and Audio Processing, 13(2), 293-303. DOI: 10.1109/TSA.2004.838534.
  • 32. Lugger, M., & Yang, B. (2007). The relevance of voice quality features in speaker independent emotion recognition. In ICASSP, Honolulu, Hawaii, USA, May 2007 (pp. IV17-IV20). New York: IEEE. DOI: 10.1109/ICASSP.2007.367152.
  • 35. Mohan, C. K., & Yegnanarayana, B. (2008). Classification of sport videos using edge-based features and autoassociative neural network models. Signal, Image and Video Processing, 4, 61-73.
  • 37. Murray, I. R., & Arnott, J. L. (1995). Implementation and testing of a system for producing emotion by rule in synthetic speech. Speech Communication, 16, 369-390.
  • 38. Murray, I. R., Arnott, J. L., & Rohwer, E. A. (1996). Emotional stress in synthetic speech: Progress and future directions. Speech Communication, 20, 85-91.
  • 41. Neiberg, D., Elenius, K., & Laskowski, K. (2006). Emotion recognition in spontaneous speech using GMMs. In INTERSPEECH-ICSLP, Pittsburgh, Pennsylvania, 17-19 September 2006 (pp. 809-812).
  • 42. Nwe, T. L., Foo, S. W., & Silva, L. C. D. (2003). Speech emotion recognition using hidden Markov models. Speech Communication, 41, 603-623.
  • 43. Oudeyer, P. Y. (2003). The production and recognition of emotions in speech: Features and algorithms. International Journal of Human-Computer Studies, 59, 157-183.
  • 44. Pao, T. L., Chen, Y. T., Yeh, J. H., & Liao, W. Y. (2005). Combining acoustic features for improved emotion recognition in Mandarin speech. In J. Tao, T. Tan, & R. Picard (Eds.), LNCS: ACII (pp. 279-285). Berlin, Heidelberg: Springer.
  • 45. Pao, T. L., Chen, Y. T., Yeh, J. H., Cheng, Y. M., & Chien, C. S. (2007). Feature combination for better differentiating anger from neutral in Mandarin emotional speech. In LNCS: Vol. 4738. ACII 2007. Berlin: Springer.
  • 49. Rao, K. S., Prasanna, S. R. M., & Yegnanarayana, B. (2007). Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. IEEE Signal Processing Letters, 14(10), 762-765. DOI: 10.1109/LSP.2007.896454.
  • 51. Reddy, K. S. (2004). Source and system features for speaker recognition. Master's thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai 600 036, India.
  • 52. Scherer, K. R. (2003). Vocal communication of emotion: A review of research paradigms. Speech Communication, 40, 227-256.
  • 55. Seshadri, G. P., & Yegnanarayana, B. (2009). Perceived loudness of speech based on the characteristics of glottal excitation source. The Journal of the Acoustical Society of America, 126, 2061-2071.
  • 59. Thevenaz, P., & Hugli, H. (1995). Usefulness of LPC residue in text-independent speaker verification. Speech Communication, 17, 145-157.
  • 61. Ververidis, D., Kotropoulos, C., & Pitas, I. (2004). Automatic emotional speech classification. In ICASSP (pp. I593-I596). New York: IEEE.
  • 66. Yegnanarayana, B., & Kishore, S. P. (2002). AANN: An alternative to GMM for pattern recognition. Neural Networks, 15(3), 459-469. DOI: 10.1016/S0893-6080(02)00019-9.
  • 72. Zeng, Y., Wu, H., & Gao, R. (2007). Pitch synchronous analysis method and Fisher criterion based speaker identification. In Third international conference on natural computation, Washington DC, USA (pp. 691-695). Washington: IEEE Computer Society.
  • 73. Zhang, S. (2008). Emotion recognition in Chinese natural speech by combining prosody and voice quality features. In Sun, et al. (Eds.), Lecture notes in computer science: Advances in neural networks (pp. 457-464). Berlin: Springer.
  • 74. Zhu, A., & Luo, Q. (2007). Study on speech emotion recognition system in E-learning. In J. Jacko (Ed.), LNCS: Human computer interaction, Part III, HCII (pp. 544-552). Berlin: Springer.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.