SCOPUS Information Search Platform

International Journal of Speech Technology

Volume 15, Issue 4, 2012, Pages 495-511

Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features

Authors (2): Koolagudi, Shashidhar G. (a); Krothapalli, Sreenivasa Rao (a)

(a) Indian Institute of Technology (India)

Author keywords

Consonant region; CV transition region; Emotion recognition; Pitch synchronous analysis; Spectral features; Vowel onset point; Vowel region

Indexed keywords

CONSONANT REGION; EMOTION RECOGNITION; PITCH SYNCHRONOUS ANALYSIS; SPECTRAL FEATURE; TRANSITION REGIONS; VOWEL ONSET POINT; VOWEL REGION;

LINGUISTICS;

CONTINUOUS SPEECH RECOGNITION;

EID: 84869495918 PISSN: 13812416 EISSN: 15728110 Source Type: Journal
DOI: 10.1007/s10772-012-9150-8 Document Type: Article

Times cited : (39)

References (60)

1
- 77956401353
- Class-level spectral features for emotion recognition
- Bitouk, D., Verma, R., & Nenkova, A. (2010). Class-level spectral features for emotion recognition. Speech Communication, 52, 613-625.
- (2010) Speech Communication , vol.52 , pp. 613-625
- Bitouk, D.¹ Verma, R.² Nenkova, A.³

2
- 70450177656
- Improving automatic emotion recognition from speech signals
- Bozkurt, E., Erzin, E., Erdem, C. E., & Erdem, A. T. (2009). Improving automatic emotion recognition from speech signals. In 10th annual conference of the international speech communication association (interspeech), Brighton, UK, 6-10 September 2009 (pp. 324-327).
- (2009) 10th Annual Conference of the International Speech Communication Association (Interspeech) , pp. 324-327
- Bozkurt, E.¹ Erzin, E.² Erdem, C.E.³ Erdem, A.T.⁴

3
- 47949107218
- A database of German emotional speech
- Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., & Weiss, B. (2005). A database of German emotional speech. In Interspeech.
- (2005) Interspeech.
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeier, W.⁴ Weiss, B.⁵

4
- 14944351245
- Analysis of emotion recognition using facial expressions, speech and multimodal information
- Busso, C., Deng, Z., Yildirim, S., Bulut, M., Lee, C. M., Kazemzadeh, A., Lee, S., Neumann, U., & Narayanan, S. (2004). Analysis of emotion recognition using facial expressions, speech and multimodal information. In ACM 6th international conference on multimodal interfaces (ICMI 2004), State College, PA, USA, October 2004.
- (2004) ACM 6th International Conference on Multimodal Interfaces (ICMI 2004)
- Busso, C.¹ Deng, Z.² Yildirim, S.³ Bulut, M.⁴ Lee, C.M.⁵ Kazemzadeh, A.⁶ Lee, S.⁷ Neumann, U.⁸ Narayanan, S.⁹

5
- 0442326756
- Recognition of noisy speech using dynamic spectral subband centroids
- Chen, J., Huang, Y. A., Li, Q., & Paliwal, K. K. (2004). Recognition of noisy speech using dynamic spectral subband centroids. IEEE Signal Processing Letters, 11, 258-261 (February 2004).
- (2004) IEEE Signal Processing Letters , vol.11 , pp. 258-261
- Chen, J.¹ Huang, Y.A.² Li, Q.³ Paliwal, K.K.⁴

6
- 0030353343
- Recognizing emotion in speech
- Dellaert, F., Polzin, T., & Waibel, A. (1996). Recognizing emotion in speech. In 4th international conference on spoken language processing, Philadelphia, PA, USA, October 1996 (pp. 1970-1973).
- (1996) 4th International Conference on Spoken Language Processing , pp. 1970-1973
- Dellaert, F.¹ Polzin, T.² Waibel, A.³

7
- 0003549017
- Principal component neural networks: theory and applications
- Diamantaras, K. I., & Kung, S. Y. (1996). Principal component neural networks: theory and applications. New York: Wiley.
- (1996) Principal Component Neural Networks: Theory and Applications
- Diamantaras, K.I.¹ Kung, S.Y.²

8
- 0003922190
- Pattern classification
- Duda, R. O., Hart, P. E., & Stork, D. G. (2004). Pattern classification (2nd ed.). Singapore: Wiley-Interscience.
- (2004) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

9
- 84869508610
- Detection of vowel onset points in continuous speech using auto-associative neural network models
- Gangashetty, S. V., Sekhar, C. C., & Yegnanarayana, B. (2004). Detection of vowel onset points in continuous speech using auto-associative neural network models. In INTERSPEECH. New York: IEEE Press.
- (2004) Interspeech
- Gangashetty, S.V.¹ Sekhar, C.C.² Yegnanarayana, B.³

10
- 33745477715
- Spotting multilingual consonant-vowel units of speech using neural network models
- Gangashetty, S. V., Sekhar, C. C., & Yegnanarayana, B. (2005). Spotting multilingual consonant-vowel units of speech using neural network models. In M. Faundez-Zanuy (Ed.), NOLISP (pp. 303-317). Berlin: Springer.
- (2005) NOLISP , pp. 303-317
- Gangashetty, S.V.¹ Sekhar, C.C.² Yegnanarayana, B.³

11
- 0036082789
- Autoassociative neural network models for online speaker verification using source features from vowels
- Gupta, C. S., Prasanna, S. R. M., & Yegnanarayana, B. (2002). Autoassociative neural network models for online speaker verification using source features from vowels. In Int. joint conf. neural networks, Honolulu, Hawaii, USA, May 2002.
- (2002) Int. Joint Conf. Neural Networks, Honolulu, Hawaii
- Gupta, C.S.¹ Prasanna, S.R.M.² Yegnanarayana, B.³

12
- 0003413187
- Neural networks: a comprehensive foundation
- Haykin, S. (1999). Neural networks: a comprehensive foundation. New Delhi: Pearson Education Asia.
- (1999) Neural Networks: A Comprehensive Foundation
- Haykin, S.¹

13
- 33749580033
- Robust recognition of emotion from speech
- Hoque, M. E., Yeasin, M., & Louwerse, M. M. (2006). Robust recognition of emotion from speech. In Lecture notes in computer science. Intelligent virtual agents (pp. 42-53). Berlin: Springer.
- (2006) Lecture Notes in Computer Science. , pp. 42-53
- Hoque, M.E.¹ Yeasin, M.² Louwerse, M.M.³

14
- 0033334228
- Analysis of autoassociative mapping neural networks
- Ikbal, M. S., Misra, H., & Yegnanarayana, B. (1999). Analysis of autoassociative mapping neural networks. In Int. joint conf. neural networks, USA (pp. 854-858).
- (1999) Int. Joint Conf. Neural Networks , pp. 854-858
- Ikbal, M.S.¹ Misra, H.² Yegnanarayana, B.³

15
- 77950073346
- Spoken emotion recognition through optimum-path forest classification using glottal features
- Iliev, A. I., Scordilis, M. S., Papa, J. P., & Falcão, A. X. (2010). Spoken emotion recognition through optimum-path forest classification using glottal features. Computer Speech and Language, 24(3), 445-460.
- (2010) Computer Speech and Language , vol.24 , Issue.3 , pp. 445-460
- Iliev, A.I.¹ Scordilis, M.S.² Papa, J.P.³ Falcão, A.X.⁴

16
- 70449726750
- Features extraction for speech emotion
- Kamaruddin, N., & Wahab, A. (2009). Features extraction for speech emotion. Journal of Computational Methods in Science and Engineering, 9(9), 1-12.
- (2009) Journal of Computational Methods in Science and Engineering , vol.9 , Issue.9 , pp. 1-12
- Kamaruddin, N.¹ Wahab, A.²

17
- 0034862114
- Online text-independent speaker verification system using autoassociative neural network models
- Kishore, S. P., & Yegnanarayana, B. (2001). Online text-independent speaker verification system using autoassociative neural network models. In Int. joint conf. neural networks (V2), Washington, USA, August 2001 (pp. 1548-1553).
- (2001) Int. Joint Conf. Neural Networks (V2) , pp. 1548-1553
- Kishore, S.P.¹ Yegnanarayana, B.²

18
- 77958499804
- Significance of excitation source information for speech analysis
- Kodukula, S. R. M. (2009). Significance of excitation source information for speech analysis. Ph.D. thesis, Dept. of Computer Science, IIT, Madras (March 2009).
- (2009) Significance of Excitation Source Information for Speech Analysis
- Kodukula, S.R.M.¹

19
- 76249109428
- Exploring speech features for classifying emotions along valence dimension
- Koolagudi, S. G., & Rao, K. S. (2009). Exploring speech features for classifying emotions along valence dimension. In Springer LNCS. The 3rd international conference on pattern recognition and machine intelligence (PReMI-09).
- (2009) Springer LNCS
- Koolagudi, S.G.¹ Rao, K.S.²

20
- 79953182985
- Two stage emotion recognition based on speaking rate
- Koolagudi, S. G., & Rao, K. S. (2011). Two stage emotion recognition based on speaking rate. International Journal of Speech Technology, 14, 35-48.
- (2011) International Journal of Speech Technology , vol.14 , pp. 35-48
- Koolagudi, S.G.¹ Rao, K.S.²

21
- 70349897091
- IITKGP-SESC: Speech database for emotion analysis
- Koolagudi, S. G., Maity, S., Kumar, V. A., Chakrabarti, S., & Rao, K. S. (2009). IITKGP-SESC: speech database for emotion analysis. In LNCS. Communications in computer and information science, August 2009. Berlin: Springer.
- (2009) LNCS. Communications in Computer and Information Science
- Koolagudi, S.G.¹ Maity, S.² Kumar, V.A.³ Chakrabarti, S.⁴ Rao, K.S.⁵

22
- 77956999825
- Emotion classification based on speaking rate
- Koolagudi, S. G., Ray, S., & Rao, K. S. (2010). Emotion classification based on speaking rate. In The 3rd international conference on contemporary computing.
- (2010) The 3rd International Conference on Contemporary Computing
- Koolagudi, S.G.¹ Ray, S.² Rao, K.S.³

23
- 85009223246
- Emotion recognition by speech signals
- Kwon, O., Chan, K., Hao, J., & Lee, T. (2003). Emotion recognition by speech signals. In Eurospeech, Geneva (pp. 125-128).
- (2003) Eurospeech, Geneva , pp. 125-128
- Kwon, O.¹ Chan, K.² Hao, J.³ Lee, T.⁴

24
- 14644439843
- Toward detecting emotions in spoken dialogs
- Lee, C. M., & Narayanan, S. (2005). Toward detecting emotions in spoken dialogs. IEEE Transactions on Speech and Audio Processing, 13, 293-303 (March 2005).
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , pp. 293-303
- Lee, C.M.¹ Narayanan, S.²

25
- 79959831679
- Significance of pitch synchronous analysis for speaker recognition using AANN models
- Mallidi, S. H. R., Prahallad, K., Gangashetty, S. V., & Yegnanarayana, B. (2010). Significance of pitch synchronous analysis for speaker recognition using AANN models. In INTERSPEECH-2010, Makuhari, Japan, September 2010.
- (2010) Interspeech-2010
- Mallidi, S.H.R.¹ Prahallad, K.² Gangashetty, S.V.³ Yegnanarayana, B.⁴

26
- 52949094265
- Extraction and representation of prosodic features for language and speaker recognition
- Mary, L., & Yegnanarayana, B. (2008). Extraction and representation of prosodic features for language and speaker recognition. Speech Communication, 50, 782-796 (April 2008).
- (2008) Speech Communication , vol.50 , pp. 782-796
- Mary, L.¹ Yegnanarayana, B.²

27
- 0003135459
- Approaching automatic recognition of emotion from voice: A rough benchmark
- McGilloway, S., Cowie, R., Douglas-Cowie, E., Gielen, S., Westerdijk, M., & Stroeve, S. (2000). Approaching automatic recognition of emotion from voice: a rough benchmark. In ISCA workshop on speech and emotion, Belfast.
- (2000) ISCA Workshop on Speech and Emotion
- McGilloway, S.¹ Cowie, R.² Douglas-Cowie, E.³ Gielen, S.⁴ Westerdijk, M.⁵ Stroeve, S.⁶

28
- 33847124004
- Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources
- Mubarak, O. M., Ambikairajah, E., & Epps, J. (2005). Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources. In 8th international symposium on signal processing and its applications, Sydney, Australia, August 2005.
- (2005) 8th International Symposium on Signal Processing and Its Applications
- Mubarak, O.M.¹ Ambikairajah, E.² Epps, J.³

29
- 65249091627
- Epoch extraction from speech signals
- Murty, K. S. R., & Yegnanarayana, B. (2008). Epoch extraction from speech signals. IEEE Transactions on Audio, Speech, and Language Processing, 16, 1602-1613.
- (2008) IEEE Transactions on Audio, Speech, and Language Processing , vol.16 , pp. 1602-1613
- Murty, K.S.R.¹ Yegnanarayana, B.²

30
- 0023715232
- Pitch synchronous analysis of hoarseness in running speech
- Muta, H., Baer, T., Wagatsuma, K., Muraoka, T., & Fukuda, H. (1988a). Pitch synchronous analysis of hoarseness in running speech. The Journal of the Acoustical Society of America, 84, 1292-1301.
- (1988) The Journal of the Acoustical Society of America , vol.84 , pp. 1292-1301
- Muta, H.¹ Baer, T.² Wagatsuma, K.³ Muraoka, T.⁴ Fukuda, H.⁵

31
- 84869488893
- A pitch-synchronous analysis of hoarseness in running speech
- Muta, H., Baer, T., Wagatsuma, K., Muraoka, T., & Fukuda, H. (1988b). A pitch-synchronous analysis of hoarseness in running speech. Status report on speech research SR-93/94, Haskins Laboratories.
- (1988) Status Report on Speech Research SR-93/94
- Muta, H.¹ Baer, T.² Wagatsuma, K.³ Muraoka, T.⁴ Fukuda, H.⁵

32
- 38749103707
- Emotion recognition in spontaneous speech using GMMs
- Neiberg, D., Elenius, K., & Laskowski, K. (2006). Emotion recognition in spontaneous speech using GMMs. In INTERSPEECH 2006 - ICSLP, Pittsburgh, Pennsylvania, 17-19 September 2006 (pp. 809-812).
- (2006) Interspeech 2006 - ICSLP , pp. 809-812
- Neiberg, D.¹ Elenius, K.² Laskowski, K.³

33
- 85016350179
- Emotion recognition in speech using neural networks
- Nicholson, J., Takahashi, K., & Nakatsu, R. (1999). Emotion recognition in speech using neural networks. In 6th international conference on neural information processing (ICONIP-99), Perth, WA, Australia, August 1999 (pp. 495-501).
- (1999) 6th International Conference on Neural Information Processing (ICONIP-99) , pp. 495-501
- Nicholson, J.¹ Takahashi, K.² Nakatsu, R.³

34
- 33646758219
- Combining acoustic features for improved emotion recognition in Mandarin speech
- Pao, T. L., Chen, Y. T., Yeh, J. H., & Liao, W. Y. (2005). Combining acoustic features for improved emotion recognition in Mandarin speech. In J. Tao, T. Tan & R. Picard (Eds.), LNCS. ACII (pp. 279-285). Berlin: Springer.
- (2005) LNCS. ACII , pp. 279-285
- Pao, T.L.¹ Chen, Y.T.² Yeh, J.H.³ Liao, W.Y.⁴

35
- 38049006375
- Feature combination for better differentiating anger from neutral in Mandarin emotional speech
- Pao, T. L., Chen, Y. T., Yeh, J. H., Cheng, Y. M., & Chien, C. S. (2007). Feature combination for better differentiating anger from neutral in Mandarin emotional speech. In LNCS: Vol. 4738. ACII 2007. Berlin: Springer.
- (2007) LNCS: ACII 2007 , vol.4738
- Pao, T.L.¹ Chen, Y.T.² Yeh, J.H.³ Cheng, Y.M.⁴ Chien, C.S.⁵

36
- 0033329296
- Emotion in speech: Recognition and application to call centers
- Petrushin, V. A. (1999). Emotion in speech: recognition and application to call centers. In Proceedings of the 1999 conference on artificial neural networks in engineering (ANNIE 99).
- (1999) Proceedings of the 1999 Conference on Artificial Neural Networks in Engineering (ANNIE 99)
- Petrushin, V.A.¹

37
- 1942512334
- Begin-end detection using vowel onset points
- Prasanna, S. R. M., Zachariah, J. M., & Yegnanarayana, B. (2003). Begin-end detection using vowel onset points. In Proceedings workshop on spoken language, TIFR Mumbai, India (January 2003).
- (2003) Proceedings Workshop on Spoken Language
- Prasanna, S.R.M.¹ Zachariah, J.M.² Yegnanarayana, B.³

38
- 33748443739
- Extraction of speaker-specific excitation information from linear prediction residual of speech
- Prasanna, S. R. M., Gupta, C. S., & Yegnanarayana, B. (2006). Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Communication, 48, 1243-1261.
- (2006) Speech Communication , vol.48 , pp. 1243-1261
- Prasanna, S.R.M.¹ Gupta, C.S.² Yegnanarayana, B.³

39
- 65249112285
- Vowel onset point detection using source, spectral peaks, and modulation spectrum energies
- Prasanna, S. R. M., Reddy, B. V. S., & Krishnamoorthy, P. (2009). Vowel onset point detection using source, spectral peaks, and modulation spectrum energies. IEEE Transactions on Audio, Speech, and Language Processing, 17, 556-565 (May 2009).
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , pp. 556-565
- Prasanna, S.R.M.¹ Reddy, B.V.S.² Krishnamoorthy, P.³

40
- 0004244302
- Fundamentals of speech recognition
- Rabiner, L. R., & Juang, B. H. (1993). Fundamentals of speech recognition. Englewood Cliffs: Prentice-Hall.
- (1993) Fundamentals of Speech Recognition.
- Rabiner, L.R.¹ Juang, B.H.²

41
- 77950029338
- Voice conversion by mapping the speaker-specific features using pitch synchronous approach
- Rao, K. S. (2010). Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Computer Speech and Language, 24, 474-494.
- (2010) Computer Speech and Language , vol.24 , pp. 474-494
- Rao, K.S.¹

42
- 79953168002
- Application of prosody models for developing speech systems in Indian languages
- Rao, K. S. (2011a). Application of prosody models for developing speech systems in Indian languages. International Journal of Speech Technology, 14, 19-33.
- (2011) International Journal of Speech Technology , vol.14 , pp. 19-33
- Rao, K.S.¹

43
- 84856289513
- Role of neural network models for developing speech systems
- Rao, K. S. (2011b). Role of neural network models for developing speech systems. Sadhana (Springer), 36, 783-836.
- (2011) Sadhana (Springer) , vol.36 , pp. 783-836
- Rao, K.S.¹

44
- 84869490965
- Identification of Hindi dialects and emotions using spectral and prosodic features of speech
- Rao, K. S., & Koolagudi, S. G. (2011). Identification of Hindi dialects and emotions using spectral and prosodic features of speech. Journal of Systemics, Cybernetics and Informatics, 9(4), 24-33.
- (2011) Journal of Systemics, Cybernetics and Informatics , vol.9 , Issue.4 , pp. 24-33
- Rao, K.S.¹ Koolagudi, S.G.²

45
- 34047248058
- Prosody modification using instants of significant excitation
- Rao, K. S., & Yegnanarayana, B. (2006). Prosody modification using instants of significant excitation. IEEE Transactions on Speech and Audio Processing, 14, 972-980 (May 2006).
- (2006) IEEE Transactions on Speech and Audio Processing , vol.14 , pp. 972-980
- Rao, K.S.¹ Yegnanarayana, B.²

46
- 69949159711
- Duration modification using glottal closure instants and vowel onset points
- Rao, K. S., & Yegnanarayana, B. (2009). Duration modification using glottal closure instants and vowel onset points. Speech Communication, 51, 1263-1269.
- (2009) Speech Communication , vol.51 , pp. 1263-1269
- Rao, K.S.¹ Yegnanarayana, B.²

47
- 84864576614
- Source and system features for speaker recognition
- Reddy, K. S. (2004). Source and system features for speaker recognition. Master's thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai 600 036, India.
- (2004) Master's Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras
- Reddy, K.S.¹

48
- 63049095964
- Keyword spotting using vowel onset point, vector quantization and hidden Markov modeling based techniques
- Reddy, B. V. S., Rao, K. V., & Prasanna, S. R. M. (2008). Keyword spotting using vowel onset point, vector quantization and hidden Markov modeling based techniques. In TENCON 2008 - 2008 IEEE region 10 conference, IIIT, Hyderabad. New York: IEEE Press.
- (2008) TENCON 2008 - 2008 IEEE Region 10 Conference, IIIT
- Reddy, B.V.S.¹ Rao, K.V.² Prasanna, S.R.M.³

49
- 4544316885
- Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture
- Schuller, B., Rigoll, G., & Lang, M. (2004). Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. In Proc. IEEE int. conf. acoust., speech, signal processing (pp. 577-580). New York: IEEE Press.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 577-580
- Schuller, B.¹ Rigoll, G.² Lang, M.³

50
- 77951101585
- Spectral analysis of speech under stress
- Sigmund, M. (2007). Spectral analysis of speech under stress. IJCSNS International Journal of Computer Science and Network Security, 7, 170-172.
- (2007) IJCSNS International Journal of Computer Science and Network Security , vol.7 , pp. 170-172
- Sigmund, M.¹

51
- 33746410556
- Emotional speech recognition: Resources, features, and methods
- Ververidis, D., & Kotropoulos, C. (2006). Emotional speech recognition: resources, features, and methods. Speech Communication, 48, 1162-1181.
- (2006) Speech Communication , vol.48 , pp. 1162-1181
- Ververidis, D.¹ Kotropoulos, C.²

52
- 4544247331
- Automatic emotional speech classification
- Ververidis, D., Kotropoulos, C., & Pitas, I. (2004). Automatic emotional speech classification. In ICASSP (pp. I593-I596). New York: IEEE Press.
- (2004) ICASSP , pp. I593-I596
- Ververidis, D.¹ Kotropoulos, C.² Pitas, I.³

53
- 84861577723
- Improved vowel onset point detection using epoch intervals
- Vuppala, A. K., Rao, K. S., & Chakrabarti, S. (2012a). Improved vowel onset point detection using epoch intervals. International Journal of Electronics and Communications. doi:10. 1016/j.aeue.2.
- (2012) International Journal of Electronics and Communications.
- Vuppala, A.K.¹ Rao, K.S.² Chakrabarti, S.³

54
- 84860875011
- Vowel onset point detection for low bit rate coded speech
- Vuppala, A. K., Yadav, J., Chakrabarti, S., & Rao, K. S. (2012b). Vowel onset point detection for low bit rate coded speech. IEEE Transactions on Audio, Speech, and Language Processing, 20, 1894-1903 (August 2012).
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , pp. 1894-1903
- Vuppala, A.K.¹ Yadav, J.² Chakrabarti, S.³ Rao, K.S.⁴

55
- 70449580752
- Automatic recognition of speech emotion using long-term spectro-temporal features
- Wu, S., Falk, T. H., & Chan, W. Y. (2009). Automatic recognition of speech emotion using long-term spectro-temporal features. In 16th international conference on digital signal processing, Santorini-Hellas, 5-7 July 2009 (pp. 1-6). New York: IEEE Press.
- (2009) 16th International Conference on Digital Signal Processing , pp. 1-6
- Wu, S.¹ Falk, T.H.² Chan, W.Y.³

56
- 0004312284
- Artificial neural networks
- Yegnanarayana, B. (1999). Artificial neural networks. New Delhi: Prentice-Hall.
- (1999) Artificial Neural Networks.
- Yegnanarayana, B.¹

57
- 0035989168
- AANN an alternative to GMM for pattern recognition
- Yegnanarayana, B., & Kishore, S. P. (2002). AANN an alternative to GMM for pattern recognition. Neural Networks, 15, 459-469.
- (2002) Neural Networks , vol.15 , pp. 459-469
- Yegnanarayana, B.¹ Kishore, S.P.²

58
- 0034856452
- Source and system features for speaker recognition using AANN models
- Yegnanarayana, B., Reddy, K. S., & Kishore, S. P. (2001a). Source and system features for speaker recognition using AANN models. In IEEE int. conf. acoust., speech, and signal processing, Salt Lake City, UT, May 2001.
- (2001) IEEE Int. Conf. Acoust., Speech, and Signal Processing
- Yegnanarayana, B.¹ Reddy, K.S.² Kishore, S.P.³

59
- 0034856452
- Source and system features for speaker recognition using AANN models
- Yegnanarayana, B., Reddy, K. S., & Kishore, S. P. (2001b). Source and system features for speaker recognition using AANN models. In Proc. IEEE int. conf. acoust., speech, signal processing, Salt Lake City, Utah, USA, May 2001 (pp. 409-412).
- (2001) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 409-412
- Yegnanarayana, B.¹ Reddy, K.S.² Kishore, S.P.³

60
- 38049009485
- Pitch synchronous analysis method and Fisher criterion based speaker identification
- Zeng, Y., Wu, H., & Gao, R. (2007). Pitch synchronous analysis method and Fisher criterion based speaker identification. In Third international conference on natural computation, Washington D.C., USA (pp. 691-695). Los Alamitos: IEEE Comput. Soc.
- (2007) Third International Conference on Natural Computation , pp. 691-695
- Zeng, Y.¹ Wu, H.² Gao, R.³

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.