-
1
-
-
34547941599
-
Automatic speech recognition and speech variability: A review
-
Benzeghiba, M., Mori, R.D., Deroo, O., Dupont, S., Erbes, T., Jouvet, D., Fissore, L., Laface, P., Mertins, A., Ris, C., Rose, R., Tyagi, V., Wellekens, C.: Automatic speech recognition and speech variability: a review. Speech Communication 49(10-11), 763-786 (2007)
-
(2007)
Speech Communication
, vol.49
, Issue.10-11
, pp. 763-786
-
-
Benzeghiba, M.1
Mori, R.D.2
Deroo, O.3
Dupont, S.4
Erbes, T.5
Jouvet, D.6
Fissore, L.7
Laface, P.8
Mertins, A.9
Ris, C.10
Rose, R.11
Tyagi, V.12
Wellekens, C.13
-
2
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
Gales, M.J.F.: Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language 12(2), 75-98 (1998)
-
(1998)
Computer Speech and Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.F.1
-
3
-
-
27644522706
-
Vocal tract normalization equals linear transformation in cepstral space
-
ausgedruckt
-
Pitz, M., Ney, H.: Vocal tract normalization equals linear transformation in cepstral space. IEEE Trans. Speech and Audio Processing 13(5 Part 2), 930-944 (2005) (ausgedruckt)
-
(2005)
IEEE Trans. Speech and Audio Processing
, vol.13
, Issue.5 PART 2
, pp. 930-944
-
-
Pitz, M.1
Ney, H.2
-
4
-
-
0036753897
-
Speaker adaptive modeling by vocal tract normalization
-
Welling, L., Ney, H., Kanthak, S.: Speaker adaptive modeling by vocal tract normalization. IEEE Trans. Speech and Audio Processing 10(6), 415-426 (2002)
-
(2002)
IEEE Trans. Speech and Audio Processing
, vol.10
, Issue.6
, pp. 415-426
-
-
Welling, L.1
Ney, H.2
Kanthak, S.3
-
5
-
-
0031647824
-
A frequency warping approach to speaker normalization
-
Lee, L., Rose, R.C.: A frequency warping approach to speaker normalization. IEEE Trans. Speech and Audio Processing 6(1), 49-60 (1998)
-
(1998)
IEEE Trans. Speech and Audio Processing
, vol.6
, Issue.1
, pp. 49-60
-
-
Lee, L.1
Rose, R.C.2
-
6
-
-
0032761999
-
Scale transform in speech analysis
-
Umesh, S., Cohen, L., Marinovic, N., Nelson, D.J.: Scale transform in speech analysis. IEEE Trans. Speech and Audio Processing 7, 40-45 (1999)
-
(1999)
IEEE Trans. Speech and Audio Processing
, vol.7
, pp. 40-45
-
-
Umesh, S.1
Cohen, L.2
Marinovic, N.3
Nelson, D.J.4
-
7
-
-
33947666117
-
Frequency-warping invariant features for automatic speech recognition
-
Mertins, A., Rademacher, J.: Frequency-warping invariant features for automatic speech recognition. In: Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Toulouse, France, May 2006, vol. V, pp. 1025-1028 (2006)
-
(2006)
Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Toulouse, France, May 2006
, vol.5
, pp. 1025-1028
-
-
Mertins, A.1
Rademacher, J.2
-
8
-
-
44949218505
-
Improved warping-invariant features for automatic speech recognition
-
Rademacher, J., Wächter, M., Mertins, A.: Improved warping-invariant features for automatic speech recognition. In: Proc. Int. Conf. Spoken Language Processing (Interspeech 2006 - ICSLP), Pittsburgh, PA, USA, September 2006, pp. 1499-1502 (2006)
-
(2006)
Proc. Int. Conf. Spoken Language Processing (Interspeech 2006 - ICSLP), Pittsburgh, PA, USA, September 2006
, pp. 1499-1502
-
-
Rademacher, J.1
Wächter, M.2
Mertins, A.3
-
9
-
-
70450166695
-
Low-dimensional, auditory feature vectors that improve vocal-tract-length normalization in automatic speech recognition
-
Monaghan, J.J., Feldbauer, C., Walters, T.C., Patterson, R.D.: Low-dimensional, auditory feature vectors that improve vocal-tract-length normalization in automatic speech recognition. The Journal of the Acoustical Society of America 123(5), 3066-3066 (2008)
-
(2008)
The Journal of the Acoustical Society of America
, vol.123
, Issue.5
, pp. 3066-3066
-
-
Monaghan, J.J.1
Feldbauer, C.2
Walters, T.C.3
Patterson, R.D.4
-
10
-
-
0002163712
-
Invariant features in pattern recognition - Fundamentals and applications
-
John Wiley & Sons, Chichester
-
Burkhardt, H., Siggelkow, S.: Invariant features in pattern recognition - fundamentals and applications. In: Nonlinear Model-Based Image/Video Processing and Analysis, pp. 269-307. John Wiley & Sons, Chichester (2001)
-
(2001)
Nonlinear Model-Based Image/Video Processing and Analysis
, pp. 269-307
-
-
Burkhardt, H.1
Siggelkow, S.2
-
11
-
-
0017480678
-
A class of translation invariant transforms
-
Wagh, M., Kanetkar, S.: A class of translation invariant transforms. IEEE Trans. Acoustics, Speech, and Signal Processing 25(2), 203-205 (1977)
-
(1977)
IEEE Trans. Acoustics, Speech, and Signal Processing
, vol.25
, Issue.2
, pp. 203-205
-
-
Wagh, M.1
Kanetkar, S.2
-
12
-
-
0019075787
-
On invariant sets of a certain class of fast translation-invariant transforms
-
Burkhardt, H., Müller, X.: On invariant sets of a certain class of fast translation-invariant transforms. IEEE Trans. Acoustic, Speech, and Signal Processing 28(5), 517-523 (1980)
-
(1980)
IEEE Trans. Acoustic, Speech, and Signal Processing
, vol.28
, Issue.5
, pp. 517-523
-
-
Burkhardt, H.1
Müller, X.2
-
13
-
-
84975559454
-
Modified rapid transform
-
Fang, M., Häusler, G.: Modified rapid transform. Applied Optics 28(6), 1257-1262 (1989)
-
(1989)
Applied Optics
, vol.28
, Issue.6
, pp. 1257-1262
-
-
Fang, M.1
Häusler, G.2
-
14
-
-
0014551188
-
A transformation with invariance under cyclic permutation for applications in pattern recognition
-
Reitboeck, H., Brody, T.P.: A transformation with invariance under cyclic permutation for applications in pattern recognition. Inf. & Control. 15, 130-154 (1969)
-
(1969)
Inf. & Control
, vol.15
, pp. 130-154
-
-
Reitboeck, H.1
Brody, T.P.2
-
15
-
-
0015723408
-
Machine recognition of printed chinese characters via transformation algorithms
-
Wang, P.P., Shiau, R.C.: Machine recognition of printed chinese characters via transformation algorithms. Pattern Recognition 5(4), 303-321 (1973)
-
(1973)
Pattern Recognition
, vol.5
, Issue.4
, pp. 303-321
-
-
Wang, P.P.1
Shiau, R.C.2
-
16
-
-
77951495322
-
Use of Invertible Rapid Transform in Motion Analysis
-
Gamec, J., Turan, J.: Use of Invertible Rapid Transform in Motion Analysis. Radioengineering 5(4), 21-27 (1996)
-
(1996)
Radioengineering
, vol.5
, Issue.4
, pp. 21-27
-
-
Gamec, J.1
Turan, J.2
-
17
-
-
0027680376
-
Multiscale fourier descriptors for classifying semivowels in spectrograms
-
Pinkowski, B.: Multiscale fourier descriptors for classifying semivowels in spectrograms. Pattern Recognition 26(10), 1593-1602 (1993)
-
(1993)
Pattern Recognition
, vol.26
, Issue.10
, pp. 1593-1602
-
-
Pinkowski, B.1
-
18
-
-
84962875934
-
Multiple time resolutions for derivatives ofMel-frequency cepstral coefficients
-
Stemmer, G., Hacker, C., Noth, E., Niemann, H.: Multiple time resolutions for derivatives ofMel-frequency cepstral coefficients. In: IEEEWorkshop on Automatic Speech Recognition and Understanding, December 2001, pp. 37-40 (2001)
-
(2001)
IEEEWorkshop on Automatic Speech Recognition and Understanding, December 2001
, pp. 37-40
-
-
Stemmer, G.1
Hacker, C.2
Noth, E.3
Niemann, H.4
-
19
-
-
4544303183
-
Speech discrimination based on multiscale spectro-temporal modulations
-
Mesgarani, N., Shamma, S., Slaney, M.: Speech discrimination based on multiscale spectro-temporal modulations. In: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, May 2004, vol. 1, pp. I-601-I-604 (2004)
-
(2004)
Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, May 2004
, vol.1
-
-
Mesgarani, N.1
Shamma, S.2
Slaney, M.3
-
20
-
-
4544298320
-
Audio segmentation based on multi-scale audio classification
-
Zhang, Y., Zhou, J.: Audio segmentation based on multi-scale audio classification. In: IEEE Int. Con. Acoustics, Speech, and Signal Processing, May 2004, vol. 4, pp. iv-349-iv-352 (2004)
-
(2004)
IEEE Int. Con. Acoustics, Speech, and Signal Processing, May 2004
, vol.4
-
-
Zhang, Y.1
Zhou, J.2
-
21
-
-
24344458137
-
Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
-
Peng, H., Long, F., Ding, C.: Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Analysis and Machine Intelligence 27(8), 1226-1238 (2005)
-
(2005)
IEEE Trans. Pattern Analysis and Machine Intelligence
, vol.27
, Issue.8
, pp. 1226-1238
-
-
Peng, H.1
Long, F.2
Ding, C.3
-
22
-
-
0024768209
-
Speaker-independent phone recognition using hidden Markov models
-
Lee, K.F., Hon, H.W.: Speaker-independent phone recognition using hidden Markov models. IEEE Trans. Acoustics, Speech and Signal Processing 37(11), 1641-1648 (1989)
-
(1989)
IEEE Trans. Acoustics, Speech and Signal Processing
, vol.37
, Issue.11
, pp. 1641-1648
-
-
Lee, K.F.1
Hon, H.W.2
-
23
-
-
0003571976
-
-
Cambridge University Engineering Department, Cambridge
-
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X.A., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK version 3.4). Cambridge University Engineering Department, Cambridge (2006)
-
(2006)
The HTK Book (For HTK Version 3.4)
-
-
Young, S.1
Evermann, G.2
Gales, M.3
Hain, T.4
Kershaw, D.5
Liu, X.A.6
Moore, G.7
Odell, J.8
Ollason, D.9
Povey, D.10
Valtchev, V.11
Woodland, P.12
-
24
-
-
0034227088
-
Auditory images: How complex sounds are represented in the auditory system
-
Patterson, R.D.: Auditory images: How complex sounds are represented in the auditory system. Journal-Acoustical Society of Japan (E) 21(4), 183-190 (2000)
-
(2000)
Journal-Acoustical Society of Japan (E)
, vol.21
, Issue.4
, pp. 183-190
-
-
Patterson, R.D.1
-
25
-
-
29744445786
-
-
Springer, Heidelberg
-
Bacon, S., Fay, R., Popper, A.: Compression: from cochlea to cochlear implants. Springer, Heidelberg (2004)
-
(2004)
Compression: From Cochlea to Cochlear Implants
-
-
Bacon, S.1
Fay, R.2
Popper, A.3
|