-
1
-
-
0033335618
-
Modeling pronunciation variation for ASR: a survey of the literature
-
Strik H, Cucchiarini C (1999) Modeling pronunciation variation for ASR: a survey of the literature. Speech Commun 29: 225-246.
-
(1999)
Speech Commun
, vol.29
, pp. 225-246
-
-
Strik, H.1
Cucchiarini, C.2
-
3
-
-
70349205575
-
Emotional speech recognition based on style estimation and adaptation with multiple-regression HMM
-
Ijima Y, Tachibana M, Nose T, Kobayashi T (2009) Emotional speech recognition based on style estimation and adaptation with multiple-regression HMM. Proceedings of IEEE international conference on acoustic, speech and signal processing, pp 4157-4160.
-
(2009)
Proceedings of IEEE International Conference on Acoustic, Speech and Signal Processing
, pp. 4157-4160
-
-
Ijima, Y.1
Tachibana, M.2
Nose, T.3
Kobayashi, T.4
-
4
-
-
33746410556
-
Emotional speech recognition: resources, features, and methods
-
Ververidis D, Kotropoulos C (2006) Emotional speech recognition: resources, features, and methods. Speech Commun 48: 1162-1181.
-
(2006)
Speech Commun
, vol.48
, pp. 1162-1181
-
-
Ververidis, D.1
Kotropoulos, C.2
-
5
-
-
34547549142
-
Towards more reality in the recognition of emotional speech
-
Schuller B, Seppi D, Batliner A, Maier A, Steidl S (2007) Towards more reality in the recognition of emotional speech. Proceedings of IEEE international conference on acoustic, speech and signal processing, vol 4, pp 941-944.
-
(2007)
Proceedings of IEEE international conference on acoustic, speech and signal processing
, vol.4
, pp. 941-944
-
-
Schuller, B.1
Seppi, D.2
Batliner, A.3
Maier, A.4
Steidl, S.5
-
6
-
-
70349193703
-
Emotion recognition from speech: putting ASR in the loop
-
Schuller B, Batliner A, Steidl S, Seppi D (2009) Emotion recognition from speech: putting ASR in the loop. Proceedings of IEEE international conference on acoustic, speech and signal processing, pp 4585-4588.
-
(2009)
Proceedings of IEEE international conference on acoustic, speech and signal processing
, pp. 4585-4588
-
-
Schuller, B.1
Batliner, A.2
Steidl, S.3
Seppi, D.4
-
8
-
-
21544466181
-
ASR for emotional speech: clarifying the issues and enhancing performance
-
Athanaselis T, Bakamidis S, Dologlou I, Cowie R, Douglas-Cowie E, Cox C (2005) ASR for emotional speech: clarifying the issues and enhancing performance. J Neural Netw 18: 437-444.
-
(2005)
J Neural Netw
, vol.18
, pp. 437-444
-
-
Athanaselis, T.1
Bakamidis, S.2
Dologlou, I.3
Cowie, R.4
Douglas-Cowie, E.5
Cox, C.6
-
11
-
-
34547941599
-
Automatic speech recognition and speech variability: a review
-
Benzeghiba M, De Mori R, Deroo O, Dupont S, Erbes T, Jouvet D, Fissore L, Laface P, Mertins A, Ris C, Rose R, Tyagi V, Wellekens C (2007) Automatic speech recognition and speech variability: a review. Speech Commun 49: 763-786.
-
(2007)
Speech Commun
, vol.49
, pp. 763-786
-
-
Benzeghiba, M.1
de Mori, R.2
Deroo, O.3
Dupont, S.4
Erbes, T.5
Jouvet, D.6
Fissore, L.7
Laface, P.8
Mertins, A.9
Ris, C.10
Rose, R.11
Tyagi, V.12
Wellekens, C.13
-
16
-
-
84864948871
-
Pitch in emotional speech and emotional speech recognition using pitch frequency
-
Gharavian D, Sheikhan M, Janipour M (2010) Pitch in emotional speech and emotional speech recognition using pitch frequency. Majlesi J Electr Eng 4(1): 19-24.
-
(2010)
Majlesi J Electr Eng
, vol.4
, Issue.1
, pp. 19-24
-
-
Gharavian, D.1
Sheikhan, M.2
Janipour, M.3
-
17
-
-
0037382560
-
Emotions, speech and the ASR framework
-
Bosch LT (2003) Emotions, speech and the ASR framework. Speech Commun 40: 213-225.
-
(2003)
Speech Commun
, vol.40
, pp. 213-225
-
-
Bosch, L.T.1
-
18
-
-
79955539267
-
Contextual invariant-integration features for improved speaker-independent speech recognition
-
doi: 10.1016/j.specom.2011.02.002 Article in Press
-
Müller F, Mertins A (2011) Contextual invariant-integration features for improved speaker-independent speech recognition. Speech Commun. doi: 10.1016/j.specom.2011.02.002 Article in Press.
-
(2011)
Speech Commun
-
-
-
21
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
Gales MJF (1998) Maximum likelihood linear transformations for HMM-based speech recognition. Comput Speech Lang 12: 75-98.
-
(1998)
Comput Speech Lang
, vol.12
, pp. 75-98
-
-
Gales, M.J.F.1
-
22
-
-
3042820894
-
Automatic recognition of spontaneous speech for access to multilingual oral history archives
-
Byrne W, Doermann D, Franz M, Gustman S, Hajič J, Oard D, Picheny M, Psutka J, Ramabhadran B, Soergel D, Ward T, Zhu W-J (2004) Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE Trans Speech Audio Process 12: 420-435.
-
(2004)
IEEE Trans Speech Audio Process
, vol.12
, pp. 420-435
-
-
Byrne, W.1
Doermann, D.2
Franz, M.3
Gustman, S.4
Hajič, J.5
Oard, D.6
Picheny, M.7
Psutka, J.8
Ramabhadran, B.9
Soergel, D.10
Ward, T.11
Zhu, W.-J.12
-
28
-
-
26844479120
-
Warped discrete cosine transform-based noisy speech enhancement
-
Chang J-H (2005) Warped discrete cosine transform-based noisy speech enhancement. IEEE Trans Circuits Syst II 52: 535-539.
-
(2005)
IEEE Trans Circuits Syst II
, vol.52
, pp. 535-539
-
-
Chang, J.-H.1
-
29
-
-
44949157762
-
Frequency warping by linear transformation of standard MFCC
-
Panchapagesan S (2006) Frequency warping by linear transformation of standard MFCC. Proceedings of interspeech, pp 397-400.
-
(2006)
Proceedings of Interspeech
, pp. 397-400
-
-
Panchapagesan, S.1
-
31
-
-
77955423547
-
Fiction support for realistic portrayals of fear-type emotional manifestations
-
Clavel C, Vasilescu I, Devillers L (2011) Fiction support for realistic portrayals of fear-type emotional manifestations. Comput Speech Lang 25: 63-83.
-
(2011)
Comput Speech Lang
, vol.25
, pp. 63-83
-
-
Clavel, C.1
Vasilescu, I.2
Devillers, L.3
-
32
-
-
33646197299
-
The speech database of Farsi spoken language
-
Bijankhan M, Sheikhzadegan J, Roohani MR, Samareh Y, Lucas C, Tebiani M (1994) The speech database of Farsi spoken language. Proceedings of Australian international conference on speech science and technology, pp 826-831.
-
(1994)
Proceedings of Australian International Conference on Speech Science and Technology
, pp. 826-831
-
-
Bijankhan, M.1
Sheikhzadegan, J.2
Roohani, M.R.3
Samareh, Y.4
Lucas, C.5
Tebiani, M.6
-
33
-
-
0003822743
-
-
Cambridge University, Cambridge
-
Young SJ, Evermann G, Kershaw D, Moore G, Odell J, Ollason D, Povey D, Valtchev V, Woodland V (2002) The HTK book (Ver. 3.2). Cambridge University, Cambridge.
-
(2002)
The HTK Book (Ver. 3.2)
, pp. 2
-
-
Young, S.J.1
Evermann, G.2
Kershaw, D.3
Moore, G.4
Odell, J.5
Ollason, D.6
Povey, D.7
Valtchev, V.8
Woodland, V.9
-
34
-
-
0016049328
-
An algorithm for formant extraction using linear prediction spectra
-
McCandless SS (1974) An algorithm for formant extraction using linear prediction spectra. IEEE Trans Acoustics Speech Signal Process 22: 135-141.
-
(1974)
IEEE Trans Acoustics Speech Signal Process
, vol.22
, pp. 135-141
-
-
McCandless, S.S.1