-
1
-
-
77956401353
-
Class-level spectral features for emotion recognition
-
Bitouk, D., Verma, R., & Nenkova, A. (2010). Class-level spectral features for emotion recognition. Speech Communication, 52, 613-625.
-
(2010)
Speech Communication
, vol.52
, pp. 613-625
-
-
Bitouk, D.1
Verma, R.2
Nenkova, A.3
-
2
-
-
70450177656
-
Improving automatic emotion recognition from speech signals
-
Bozkurt, E., Erzin, E., Erdem, C. E., & Erdem, A. T. (2009). Improving automatic emotion recognition from speech signals. In 10th annual conference of the international speech communication association (interspeech), Brighton, UK, 6-10 September 2009 (pp. 324-327).
-
(2009)
10th Annual Conference of the International Speech Communication Association (Interspeech)
, pp. 324-327
-
-
Bozkurt, E.1
Erzin, E.2
Erdem, C.E.3
Erdem, A.T.4
-
3
-
-
47949107218
-
A database of German emotional speech
-
Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., & Weiss, B. (2005). A database of German emotional speech. In Interspeech.
-
(2005)
Interspeech.
-
-
Burkhardt, F.1
Paeschke, A.2
Rolfes, M.3
Sendlmeier, W.4
Weiss, B.5
-
4
-
-
14944351245
-
Analysis of emotion recognition using facial expressions, speech and multimodal information
-
Busso, C., Deng, Z., Yildirim, S., Bulut, M., Lee, C. M., Kazemzadeh, A., Lee, S., Neumann, U., & Narayanan, S. (2004). Analysis of emotion recognition using facial expressions, speech and multimodal information. In ACM 6th international conference on multimodal interfaces (ICMI 2004), State College, PA, USA, October 2004.
-
(2004)
ACM 6th International Conference on Multimodal Interfaces (ICMI 2004)
-
-
Busso, C.1
Deng, Z.2
Yildirim, S.3
Bulut, M.4
Lee, C.M.5
Kazemzadeh, A.6
Lee, S.7
Neumann, U.8
Narayanan, S.9
-
5
-
-
0442326756
-
Recognition of noisy speech using dynamic spectral subband centroids
-
Chen, J., Huang, Y. A., Li, Q., & Paliwal, K. K. (2004). Recognition of noisy speech using dynamic spectral subband centroids. IEEE Signal Processing Letters, 11, 258-261 (February 2004).
-
(2004)
IEEE Signal Processing Letters
, vol.11
, pp. 258-261
-
-
Chen, J.1
Huang, Y.A.2
Li, Q.3
Paliwal, K.K.4
-
6
-
-
0030353343
-
Recognizing emotion in speech
-
Dellaert, F., Polzin, T., & Waibel, A. (1996). Recognizing emotion in speech. In 4th international conference on spoken language processing, Philadelphia, PA, USA, October 1996 (pp. 1970-1973).
-
(1996)
4th International Conference on Spoken Language Processing
, pp. 1970-1973
-
-
Dellaert, F.1
Polzin, T.2
Waibel, A.3
-
9
-
-
84869508610
-
Detection of vowel onset points in continuous speech using auto-associative neural network models
-
Gangashetty, S. V., Sekhar, C. C., & Yegnanarayana, B. (2004). Detection of vowel onset points in continuous speech using auto-associative neural network models. In INTERSPEECH. New York: IEEE Press.
-
(2004)
Interspeech
-
-
Gangashetty, S.V.1
Sekhar, C.C.2
Yegnanarayana, B.3
-
10
-
-
33745477715
-
Spotting multilingual consonant-vowel units of speech using neural network models
-
Gangashetty, S. V., Sekhar, C. C., & Yegnanarayana, B. (2005). Spotting multilingual consonant-vowel units of speech using neural network models. In M. Faundez-Zanuy (Ed.), NOLISP (pp. 303-317). Berlin: Springer.
-
(2005)
NOLISP
, pp. 303-317
-
-
Gangashetty, S.V.1
Sekhar, C.C.2
Yegnanarayana, B.3
-
11
-
-
0036082789
-
Autoassociative neural network models for online speaker verification using source features from vowels
-
Gupta, C. S., Prasanna, S. R. M., & Yegnanarayana, B. (2002). Autoassociative neural network models for online speaker verification using source features from vowels. In Int. joint conf. neural networks, Honolulu, Hawaii, USA, May 2002.
-
(2002)
Int. Joint Conf. Neural Networks, Honolulu, Hawaii
-
-
Gupta, C.S.1
Prasanna, S.R.M.2
Yegnanarayana, B.3
-
13
-
-
33749580033
-
Robust recognition of emotion from speech
-
Hoque, M. E., Yeasin, M., & Louwerse, M. M. (2006). Robust recognition of emotion from speech. In Lecture notes in computer science. Intelligent virtual agents (pp. 42-53). Berlin: Springer.
-
(2006)
Lecture Notes in Computer Science.
, pp. 42-53
-
-
Hoque, M.E.1
Yeasin, M.2
Louwerse, M.M.3
-
14
-
-
0033334228
-
Analysis of autoassociative mapping neural networks
-
Ikbal, M. S., Misra, H., & Yegnanarayana, B. (1999). Analysis of autoassociative mapping neural networks. In Int. joint conf. neural networks, USA (pp. 854-858).
-
(1999)
Int. Joint Conf. Neural Networks
, pp. 854-858
-
-
Ikbal, M.S.1
Misra, H.2
Yegnanarayana, B.3
-
15
-
-
77950073346
-
Spoken emotion recognition through optimum-path forest classification using glottal features
-
Iliev, A. I., Scordilis, M. S., Papa, J. P., & Falcão, A. X. (2010). Spoken emotion recognition through optimum-path forest classification using glottal features. Computer Speech and Language, 24(3), 445-460.
-
(2010)
Computer Speech and Language
, vol.24
, Issue.3
, pp. 445-460
-
-
Iliev, A.I.1
Scordilis, M.S.2
Papa, J.P.3
Falcão, A.X.4
-
16
-
-
70449726750
-
Features extraction for speech emotion
-
Kamaruddin, N., & Wahab, A. (2009). Features extraction for speech emotion. Journal of Computational Methods in Science and Engineering, 9(9), 1-12.
-
(2009)
Journal of Computational Methods in Science and Engineering
, vol.9
, Issue.9
, pp. 1-12
-
-
Kamaruddin, N.1
Wahab, A.2
-
17
-
-
0034862114
-
Online text-independent speaker verification system using autoassociative neural network models
-
Kishore, S. P., & Yegnanarayana, B. (2001). Online text-independent speaker verification system using autoassociative neural network models. In Int. joint conf. neural networks (V2), Washington, USA, August 2001 (pp. 1548-1553).
-
(2001)
Int. Joint Conf. Neural Networks (V2)
, pp. 1548-1553
-
-
Kishore, S.P.1
Yegnanarayana, B.2
-
19
-
-
76249109428
-
Exploring speech features for classifying emotions along valence dimension
-
Koolagudi, S. G., & Rao, K. S. (2009). Exploring speech features for classifying emotions along valence dimension. In Springer LNCS. The 3rd international conference on pattern recognition and machine intelligence (PReMI-09).
-
(2009)
Springer LNCS
-
-
Koolagudi, S.G.1
Rao, K.S.2
-
21
-
-
70349897091
-
IITKGP-SESC: Speech database for emotion analysis
-
Koolagudi, S. G., Maity, S., Kumar, V. A., Chakrabarti, S., & Rao, K. S. (2009). IITKGP-SESC: speech database for emotion analysis. In LNCS. Communications in computer and information science, August 2009. Berlin: Springer.
-
(2009)
LNCS. Communications in Computer and Information Science
-
-
Koolagudi, S.G.1
Maity, S.2
Kumar, V.A.3
Chakrabarti, S.4
Rao, K.S.5
-
23
-
-
85009223246
-
Emotion recognition by speech signals
-
Kwon, O., Chan, K., Hao, J., & Lee, T. (2003). Emotion recognition by speech signals. In Eurospeech, Geneva (pp. 125-128).
-
(2003)
Eurospeech, Geneva
, pp. 125-128
-
-
Kwon, O.1
Chan, K.2
Hao, J.3
Lee, T.4
-
25
-
-
79959831679
-
Significance of pitch synchronous analysis for speaker recognition using AANN models
-
Mallidi, S. H. R., Prahallad, K., Gangashetty, S. V., & Yegnanarayana, B. (2010). Significance of pitch synchronous analysis for speaker recognition using AANN models. In INTERSPEECH-2010, Makuhari, Japan, September 2010.
-
(2010)
Interspeech-2010
-
-
Mallidi, S.H.R.1
Prahallad, K.2
Gangashetty, S.V.3
Yegnanarayana, B.4
-
26
-
-
52949094265
-
Extraction and representation of prosodic features for language and speaker recognition
-
Mary, L., & Yegnanarayana, B. (2008). Extraction and representation of prosodic features for language and speaker recognition. Speech Communication, 50, 782-796 (April 2008).
-
(2008)
Speech Communication
, vol.50
, pp. 782-796
-
-
Mary, L.1
Yegnanarayana, B.2
-
27
-
-
0003135459
-
Approaching automatic recognition of emotion from voice: A rough benchmark
-
McGilloway, S., Cowie, R., Douglas-Cowie, E., Gielen, S., Westerdijk, M., & Stroeve, S. (2000). Approaching automatic recognition of emotion from voice: a rough benchmark. In ISCA workshop on speech and emotion, Belfast.
-
(2000)
ISCA Workshop on Speech and Emotion
-
-
McGilloway, S.1
Cowie, R.2
Douglas-Cowie, E.3
Gielen, S.4
Westerdijk, M.5
Stroeve, S.6
-
28
-
-
33847124004
-
Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources
-
Mubarak, O. M., Ambikairajah, E., & Epps, J. (2005). Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources. In 8th international symposium on signal processing and its applications, Sydney, Australia, August 2005.
-
(2005)
8th International Symposium on Signal Processing and Its Applications
-
-
Mubarak, O.M.1
Ambikairajah, E.2
Epps, J.3
-
29
-
-
65249091627
-
Epoch extraction from speech signals
-
Murty, K. S. R., & Yegnanarayana, B. (2008). Epoch extraction from speech signals. IEEE Transactions on Audio, Speech, and Language Processing, 16, 1602-1613.
-
(2008)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.16
, pp. 1602-1613
-
-
Murty, K.S.R.1
Yegnanarayana, B.2
-
30
-
-
0023715232
-
Pitch synchronous analysis of hoarseness in running speech
-
Muta, H., Baer, T., Wagatsuma, K., Muraoka, T., & Fukuda, H. (1988a). Pitch synchronous analysis of hoarseness in running speech. The Journal of the Acoustical Society of America, 84, 1292-1301.
-
(1988)
The Journal of the Acoustical Society of America
, vol.84
, pp. 1292-1301
-
-
Muta, H.1
Baer, T.2
Wagatsuma, K.3
Muraoka, T.4
Fukuda, H.5
-
31
-
-
84869488893
-
A pitch-synchronous analysis of hoarseness in running speech
-
Muta, H., Baer, T., Wagatsuma, K., Muraoka, T., & Fukuda, H. (1988b). A pitch-synchronous analysis of hoarseness in running speech. Status report on speech research SR-93/94, Haskins Laboratories.
-
(1988)
Status Report on Speech Research SR-93/94
-
-
Muta, H.1
Baer, T.2
Wagatsuma, K.3
Muraoka, T.4
Fukuda, H.5
-
32
-
-
38749103707
-
Emotion recognition in spontaneous speech using GMMs
-
Neiberg, D., Elenius, K., & Laskowski, K. (2006). Emotion recognition in spontaneous speech using GMMs. In INTERSPEECH 2006 - ICSLP, Pittsburgh, Pennsylvania, 17-19 September 2006 (pp. 809-812).
-
(2006)
Interspeech 2006 - ICSLP
, pp. 809-812
-
-
Neiberg, D.1
Elenius, K.2
Laskowski, K.3
-
33
-
-
85016350179
-
Emotion recognition in speech using neural networks
-
Nicholson, J., Takahashi, K., & Nakatsu, R. (1999). Emotion recognition in speech using neural networks. In 6th international conference on neural information processing (ICONIP-99), Perth, WA, Australia, August 1999 (pp. 495-501).
-
(1999)
6th International Conference on Neural Information Processing (ICONIP-99)
, pp. 495-501
-
-
Nicholson, J.1
Takahashi, K.2
Nakatsu, R.3
-
34
-
-
33646758219
-
Combining acoustic features for improved emotion recognition in Mandarin speech
-
Pao, T. L., Chen, Y. T., Yeh, J. H., & Liao, W. Y. (2005). Combining acoustic features for improved emotion recognition in Mandarin speech. In J. Tao, T. Tan & R. Picard (Eds.), LNCS. ACII (pp. 279-285). Berlin: Springer.
-
(2005)
LNCS. ACII
, pp. 279-285
-
-
Pao, T.L.1
Chen, Y.T.2
Yeh, J.H.3
Liao, W.Y.4
-
35
-
-
38049006375
-
Feature combination for better differentiating anger from neutral in Mandarin emotional speech
-
Pao, T. L., Chen, Y. T., Yeh, J. H., Cheng, Y. M., & Chien, C. S. (2007). Feature combination for better differentiating anger from neutral in mandarin emotional speech. In LNCS: Vol. 4738. ACII 2007. Berlin: Springer.
-
(2007)
LNCS. ACII 2007
, vol.4738
-
-
Pao, T.L.1
Chen, Y.T.2
Yeh, J.H.3
Cheng, Y.M.4
Chien, C.S.5
-
37
-
-
1942512334
-
Begin-end detection using vowel onset points
-
Prasanna, S. R. M., Zachariah, J. M., & Yegnanarayana, B. (2003). Begin-end detection using vowel onset points. In Proceedings workshop on spoken language, TIFR Mumbai, India (January 2003).
-
(2003)
Proceedings Workshop on Spoken Language
-
-
Prasanna, S.R.M.1
Zachariah, J.M.2
Yegnanarayana, B.3
-
38
-
-
33748443739
-
Extraction of speaker-specific excitation information from linear prediction residual of speech
-
Prasanna, S. M., Gupta, C. S., & Yegnanarayana, B. (2006). Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Communication, 48, 1243-1261.
-
(2006)
Speech Communication
, vol.48
, pp. 1243-1261
-
-
Prasanna, S.M.1
Gupta, C.S.2
Yegnanarayana, B.3
-
39
-
-
65249112285
-
Vowel onset point detection using source, spectral peaks, and modulation spectrum energies
-
Prasanna, S. R. M., Reddy, B. V. S., & Krishnamoorthy, P. (2009). Vowel onset point detection using source, spectral peaks, and modulation spectrum energies. IEEE Transactions on Audio, Speech, and Language Processing, 17, 556-565 (May 2009).
-
(2009)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.17
, pp. 556-565
-
-
Prasanna, S.R.M.1
Reddy, B.V.S.2
Krishnamoorthy, P.3
-
41
-
-
77950029338
-
Voice conversion by mapping the speaker-specific features using pitch synchronous approach
-
Rao, K. S. (2010). Voice conversion by mapping the speaker-specific features using pitch synchronous approach. Computer Speech and Language, 24, 474-494.
-
(2010)
Computer Speech and Language
, vol.24
, pp. 474-494
-
-
Rao, K.S.1
-
42
-
-
79953168002
-
Application of prosody models for developing speech systems in Indian languages
-
Rao, K. S. (2011a). Application of prosody models for developing speech systems in Indian languages. International Journal of Speech Technology, 14, 19-33.
-
(2011)
International Journal of Speech Technology
, vol.14
, pp. 19-33
-
-
Rao, K.S.1
-
43
-
-
84856289513
-
Role of neural network models for developing speech systems
-
Rao, K. S. (2011b). Role of neural network models for developing speech systems. Sadhana (Springer), 36, 783-836.
-
(2011)
Sadhana (Springer)
, vol.36
, pp. 783-836
-
-
Rao, K.S.1
-
44
-
-
84869490965
-
Identification of Hindi dialects and emotions using spectral and prosodic features of speech
-
Rao, K. S., & Koolagudi, S. G. (2011). Identification of Hindi dialects and emotions using spectral and prosodic features of speech. Journal of Systemics, Cybernetics and Informatics, 9(4), 24-33.
-
(2011)
Journal of Systemics, Cybernetics and Informatics
, vol.9
, Issue.4
, pp. 24-33
-
-
Rao, K.S.1
Koolagudi, S.G.2
-
45
-
-
34047248058
-
Prosody modification using instants of significant excitation
-
Rao, K. S., & Yegnanarayana, B. (2006). Prosody modification using instants of significant excitation. IEEE Transactions on Speech and Audio Processing, 14, 972-980 (May 2006).
-
(2006)
IEEE Transactions on Speech and Audio Processing
, vol.14
, pp. 972-980
-
-
Rao, K.S.1
Yegnanarayana, B.2
-
46
-
-
69949159711
-
Duration modification using glottal closure instants and vowel onset points
-
Rao, K. S., & Yegnanarayana, B. (2009). Duration modification using glottal closure instants and vowel onset points. Speech Communication, 51, 1263-1269.
-
(2009)
Speech Communication
, vol.51
, pp. 1263-1269
-
-
Rao, K.S.1
Yegnanarayana, B.2
-
47
-
-
84864576614
-
Source and system features for speaker recognition
-
Reddy, K. S. (2004). Source and system features for speaker recognition. MS thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai 600 036, India.
-
(2004)
MS Thesis, Department of Computer Science and Engineering
-
-
Reddy, K.S.1
-
48
-
-
63049095964
-
Keyword spotting using vowel onset point, vector quantization and hidden Markov modeling based techniques
-
Reddy, B. V. S., Rao, K. V., & Prasanna, S. R. M. (2008). Keyword spotting using vowel onset point, vector quantization and hidden Markov modeling based techniques. In TENCON 2008 - 2008 IEEE region 10 conference, IIIT, Hyderabad. New York: IEEE Press.
-
(2008)
TENCON 2008 - 2008 IEEE Region 10 Conference, IIIT
-
-
Reddy, B.V.S.1
Rao, K.V.2
Prasanna, S.R.M.3
-
49
-
-
4544316885
-
Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture
-
Schuller, B., Rigoll, G., & Lang, M. (2004). Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. In Proc. IEEE int. conf. acoust., speech, signal processing (pp. 577-580). New York: IEEE Press.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing
, pp. 577-580
-
-
Schuller, B.1
Rigoll, G.2
Lang, M.3
-
51
-
-
33746410556
-
Emotional speech recognition: Resources, features, and methods
-
Ververidis, D., & Kotropoulos, C. (2006). Emotional speech recognition: resources, features, and methods. Speech Communication, 48, 1162-1181.
-
(2006)
Speech Communication
, vol.48
, pp. 1162-1181
-
-
Ververidis, D.1
Kotropoulos, C.2
-
52
-
-
4544247331
-
Automatic emotional speech classification
-
Ververidis, D., Kotropoulos, C., & Pitas, I. (2004). Automatic emotional speech classification. In ICASSP (pp. I593-I596). New York: IEEE Press.
-
(2004)
ICASSP
-
-
Ververidis, D.1
Kotropoulos, C.2
Pitas, I.3
-
54
-
-
84860875011
-
Vowel onset point detection for low bit rate coded speech
-
Vuppala, A. K., Yadav, J., Chakrabarti, S., & Rao, K. S. (2012b). Vowel onset point detection for low bit rate coded speech. IEEE Transactions on Audio, Speech, and Language Processing, 20, 1894-1903 (August 2012).
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, pp. 1894-1903
-
-
Vuppala, A.K.1
Yadav, J.2
Chakrabarti, S.3
Rao, K.S.4
-
55
-
-
70449580752
-
Automatic recognition of speech emotion using long-term spectro-temporal features
-
Wu, S., Falk, T. H., & Chan, W. Y. (2009). Automatic recognition of speech emotion using long-term spectro-temporal features. In 16th international conference on digital signal processing, Santorini-Hellas, 5-7 July 2009 (pp. 1-6). New York: IEEE Press.
-
(2009)
16th International Conference on Digital Signal Processing
, pp. 1-6
-
-
Wu, S.1
Falk, T.H.2
Chan, W.Y.3
-
57
-
-
0035989168
-
AANN an alternative to GMM for pattern recognition
-
Yegnanarayana, B., & Kishore, S. P. (2002). AANN an alternative to GMM for pattern recognition. Neural Networks, 15, 459-469.
-
(2002)
Neural Networks
, vol.15
, pp. 459-469
-
-
Yegnanarayana, B.1
Kishore, S.P.2
-
58
-
-
0034856452
-
Source and system features for speaker recognition using AANN models
-
Yegnanarayana, B., Reddy, K. S., & Kishore, S. P. (2001a). Source and system features for speaker recognition using AANN models. In IEEE int. conf. acoust., speech, and signal processing, Salt Lake City, UT, May 2001.
-
(2001)
IEEE Int. Conf. Acoust., Speech, and Signal Processing
-
-
Yegnanarayana, B.1
Reddy, K.S.2
Kishore, S.P.3
-
59
-
-
0034856452
-
Source and system features for speaker recognition using AANN models
-
Yegnanarayana, B., Reddy, K. S., & Kishore, S. P. (2001b). Source and system features for speaker recognition using AANN models. In Proc. IEEE int. conf. acoust., speech, signal processing, Salt Lake City, Utah, USA, May 2001 (pp. 409-412).
-
(2001)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing
, pp. 409-412
-
-
Yegnanarayana, B.1
Reddy, K.S.2
Kishore, S.P.3
-
60
-
-
38049009485
-
Pitch synchronous analysis method and Fisher criterion based speaker identification
-
Zeng, Y., Wu, H., & Gao, R. (2007). Pitch synchronous analysis method and Fisher criterion based speaker identification. In Third international conference on natural computation, Washington D.C., USA (pp. 691-695). Los Alamitos: IEEE Comput. Soc.
-
(2007)
Third International Conference on Natural Computation
, pp. 691-695
-
-
Zeng, Y.1
Wu, H.2
Gao, R.3