SCOPUS Information Search Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 5933 LNAI, 2010, Pages 111-119

Robust features for speaker-independent speech recognition based on a certain class of translation-invariant transformations

(2) Müller, Florian (a); Mertins, Alfred (a)

(a) University of Lübeck (Germany)

Author keywords

Speaker independency; Speech recognition; Translation invariance

Indexed keywords

AUTOMATIC SPEECH RECOGNITION SYSTEM; INDEX SPACE; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; RECOGNITION RATES; SPEAKER-INDEPENDENT SPEECH RECOGNITION; SPECTRAL EFFECTS; SUB-BANDS; TRANSLATION INVARIANCE; TRANSLATION INVARIANTS; VOCAL TRACT LENGTH NORMALIZATION; VOCAL TRACT LENGTHS;

FILTER BANKS; MATHEMATICAL TRANSFORMATIONS; REMELTING; SPEECH PROCESSING;

SPEECH RECOGNITION;

EID: 77951480608 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-11509-7_15 Document Type: Conference Paper

Times cited : (4)

References (25)

1
- 34547941599
- Automatic speech recognition and speech variability: A review
- Benzeghiba, M., Mori, R.D., Deroo, O., Dupont, S., Erbes, T., Jouvet, D., Fissore, L., Laface, P., Mertins, A., Ris, C., Rose, R., Tyagi, V., Wellekens, C.: Automatic speech recognition and speech variability: a review. Speech Communication 49(10-11), 763-786 (2007)
- (2007) Speech Communication , vol.49 , Issue.10-11 , pp. 763-786
- Benzeghiba, M.¹ Mori, R.D.² Deroo, O.³ Dupont, S.⁴ Erbes, T.⁵ Jouvet, D.⁶ Fissore, L.⁷ Laface, P.⁸ Mertins, A.⁹ Ris, C.¹⁰ Rose, R.¹¹ Tyagi, V.¹² Wellekens, C.¹³

2
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- Gales, M.J.F.: Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language 12(2), 75-98 (1998)
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

3
- 27644522706
- Vocal tract normalization equals linear transformation in cepstral space
- Pitz, M., Ney, H.: Vocal tract normalization equals linear transformation in cepstral space. IEEE Trans. Speech and Audio Processing 13(5 Part 2), 930-944 (2005)
- (2005) IEEE Trans. Speech and Audio Processing , vol.13 , Issue.5 PART 2 , pp. 930-944
- Pitz, M.¹ Ney, H.²

4
- 0036753897
- Speaker adaptive modeling by vocal tract normalization
- Welling, L., Ney, H., Kanthak, S.: Speaker adaptive modeling by vocal tract normalization. IEEE Trans. Speech and Audio Processing 10(6), 415-426 (2002)
- (2002) IEEE Trans. Speech and Audio Processing , vol.10 , Issue.6 , pp. 415-426
- Welling, L.¹ Ney, H.² Kanthak, S.³

5
- 0031647824
- A frequency warping approach to speaker normalization
- Lee, L., Rose, R.C.: A frequency warping approach to speaker normalization. IEEE Trans. Speech and Audio Processing 6(1), 49-60 (1998)
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.1 , pp. 49-60
- Lee, L.¹ Rose, R.C.²

6
- 0032761999
- Scale transform in speech analysis
- Umesh, S., Cohen, L., Marinovic, N., Nelson, D.J.: Scale transform in speech analysis. IEEE Trans. Speech and Audio Processing 7, 40-45 (1999)
- (1999) IEEE Trans. Speech and Audio Processing , vol.7 , pp. 40-45
- Umesh, S.¹ Cohen, L.² Marinovic, N.³ Nelson, D.J.⁴

7
- 33947666117
- Frequency-warping invariant features for automatic speech recognition
- Mertins, A., Rademacher, J.: Frequency-warping invariant features for automatic speech recognition. In: Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Toulouse, France, May 2006, vol. V, pp. 1025-1028 (2006)
- (2006) Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Toulouse, France, May 2006 , vol.5 , pp. 1025-1028
- Mertins, A.¹ Rademacher, J.²

8
- 44949218505
- Improved warping-invariant features for automatic speech recognition
- Rademacher, J., Wächter, M., Mertins, A.: Improved warping-invariant features for automatic speech recognition. In: Proc. Int. Conf. Spoken Language Processing (Interspeech 2006 - ICSLP), Pittsburgh, PA, USA, September 2006, pp. 1499-1502 (2006)
- (2006) Proc. Int. Conf. Spoken Language Processing (Interspeech 2006 - ICSLP), Pittsburgh, PA, USA, September 2006 , pp. 1499-1502
- Rademacher, J.¹ Wächter, M.² Mertins, A.³

9
- 70450166695
- Low-dimensional, auditory feature vectors that improve vocal-tract-length normalization in automatic speech recognition
- Monaghan, J.J., Feldbauer, C., Walters, T.C., Patterson, R.D.: Low-dimensional, auditory feature vectors that improve vocal-tract-length normalization in automatic speech recognition. The Journal of the Acoustical Society of America 123(5), 3066-3066 (2008)
- (2008) The Journal of the Acoustical Society of America , vol.123 , Issue.5 , pp. 3066-3066
- Monaghan, J.J.¹ Feldbauer, C.² Walters, T.C.³ Patterson, R.D.⁴

10
- 0002163712
- Invariant features in pattern recognition - Fundamentals and applications
- John Wiley & Sons, Chichester
- Burkhardt, H., Siggelkow, S.: Invariant features in pattern recognition - fundamentals and applications. In: Nonlinear Model-Based Image/Video Processing and Analysis, pp. 269-307. John Wiley & Sons, Chichester (2001)
- (2001) Nonlinear Model-Based Image/Video Processing and Analysis , pp. 269-307
- Burkhardt, H.¹ Siggelkow, S.²

11
- 0017480678
- A class of translation invariant transforms
- Wagh, M., Kanetkar, S.: A class of translation invariant transforms. IEEE Trans. Acoustics, Speech, and Signal Processing 25(2), 203-205 (1977)
- (1977) IEEE Trans. Acoustics, Speech, and Signal Processing , vol.25 , Issue.2 , pp. 203-205
- Wagh, M.¹ Kanetkar, S.²

12
- 0019075787
- On invariant sets of a certain class of fast translation-invariant transforms
- Burkhardt, H., Müller, X.: On invariant sets of a certain class of fast translation-invariant transforms. IEEE Trans. Acoustics, Speech, and Signal Processing 28(5), 517-523 (1980)
- (1980) IEEE Trans. Acoustics, Speech, and Signal Processing , vol.28 , Issue.5 , pp. 517-523
- Burkhardt, H.¹ Müller, X.²

13
- 84975559454
- Modified rapid transform
- Fang, M., Häusler, G.: Modified rapid transform. Applied Optics 28(6), 1257-1262 (1989)
- (1989) Applied Optics , vol.28 , Issue.6 , pp. 1257-1262
- Fang, M.¹ Häusler, G.²

14
- 0014551188
- A transformation with invariance under cyclic permutation for applications in pattern recognition
- Reitboeck, H., Brody, T.P.: A transformation with invariance under cyclic permutation for applications in pattern recognition. Inf. & Control. 15, 130-154 (1969)
- (1969) Inf. & Control , vol.15 , pp. 130-154
- Reitboeck, H.¹ Brody, T.P.²

15
- 0015723408
- Machine recognition of printed Chinese characters via transformation algorithms
- Wang, P.P., Shiau, R.C.: Machine recognition of printed Chinese characters via transformation algorithms. Pattern Recognition 5(4), 303-321 (1973)
- (1973) Pattern Recognition , vol.5 , Issue.4 , pp. 303-321
- Wang, P.P.¹ Shiau, R.C.²

16
- 77951495322
- Use of Invertible Rapid Transform in Motion Analysis
- Gamec, J., Turan, J.: Use of Invertible Rapid Transform in Motion Analysis. Radioengineering 5(4), 21-27 (1996)
- (1996) Radioengineering , vol.5 , Issue.4 , pp. 21-27
- Gamec, J.¹ Turan, J.²

17
- 0027680376
- Multiscale Fourier descriptors for classifying semivowels in spectrograms
- Pinkowski, B.: Multiscale Fourier descriptors for classifying semivowels in spectrograms. Pattern Recognition 26(10), 1593-1602 (1993)
- (1993) Pattern Recognition , vol.26 , Issue.10 , pp. 1593-1602
- Pinkowski, B.¹

18
- 84962875934
- Multiple time resolutions for derivatives of Mel-frequency cepstral coefficients
- Stemmer, G., Hacker, C., Noth, E., Niemann, H.: Multiple time resolutions for derivatives of Mel-frequency cepstral coefficients. In: IEEE Workshop on Automatic Speech Recognition and Understanding, December 2001, pp. 37-40 (2001)
- (2001) IEEE Workshop on Automatic Speech Recognition and Understanding, December 2001 , pp. 37-40
- Stemmer, G.¹ Hacker, C.² Noth, E.³ Niemann, H.⁴

19
- 4544303183
- Speech discrimination based on multiscale spectro-temporal modulations
- Mesgarani, N., Shamma, S., Slaney, M.: Speech discrimination based on multiscale spectro-temporal modulations. In: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, May 2004, vol. 1, pp. I-601-I-604 (2004)
- (2004) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, May 2004 , vol.1
- Mesgarani, N.¹ Shamma, S.² Slaney, M.³

20
- 4544298320
- Audio segmentation based on multi-scale audio classification
- Zhang, Y., Zhou, J.: Audio segmentation based on multi-scale audio classification. In: IEEE Int. Conf. Acoustics, Speech, and Signal Processing, May 2004, vol. 4, pp. iv-349-iv-352 (2004)
- (2004) IEEE Int. Conf. Acoustics, Speech, and Signal Processing, May 2004 , vol.4
- Zhang, Y.¹ Zhou, J.²

21
- 24344458137
- Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
- Peng, H., Long, F., Ding, C.: Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Analysis and Machine Intelligence 27(8), 1226-1238 (2005)
- (2005) IEEE Trans. Pattern Analysis and Machine Intelligence , vol.27 , Issue.8 , pp. 1226-1238
- Peng, H.¹ Long, F.² Ding, C.³

22
- 0024768209
- Speaker-independent phone recognition using hidden Markov models
- Lee, K.F., Hon, H.W.: Speaker-independent phone recognition using hidden Markov models. IEEE Trans. Acoustics, Speech and Signal Processing 37(11), 1641-1648 (1989)
- (1989) IEEE Trans. Acoustics, Speech and Signal Processing , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.F.¹ Hon, H.W.²

23
- 0003571976
- Cambridge University Engineering Department, Cambridge
- Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X.A., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK version 3.4). Cambridge University Engineering Department, Cambridge (2006)
- (2006) The HTK Book (For HTK Version 3.4)
- Young, S.¹ Evermann, G.² Gales, M.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.A.⁶ Moore, G.⁷ Odell, J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.¹²

24
- 0034227088
- Auditory images: How complex sounds are represented in the auditory system
- Patterson, R.D.: Auditory images: How complex sounds are represented in the auditory system. Journal-Acoustical Society of Japan (E) 21(4), 183-190 (2000)
- (2000) Journal-Acoustical Society of Japan (E) , vol.21 , Issue.4 , pp. 183-190
- Patterson, R.D.¹

25
- 29744445786
- Springer, Heidelberg
- Bacon, S., Fay, R., Popper, A.: Compression: from cochlea to cochlear implants. Springer, Heidelberg (2004)
- (2004) Compression: From Cochlea to Cochlear Implants
- Bacon, S.¹ Fay, R.² Popper, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.