메뉴 건너뛰기




Volumn 5933 LNAI, Issue , 2010, Pages 111-119

Robust features for speaker-independent speech recognition based on a certain class of translation-invariant transformations

Author keywords

Speaker independency; Speech recognition; Translation invariance

Indexed keywords

AUTOMATIC SPEECH RECOGNITION SYSTEM; INDEX SPACE; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; RECOGNITION RATES; SPEAKER-INDEPENDENT SPEECH RECOGNITION; SPECTRAL EFFECTS; SUB-BANDS; TRANSLATION INVARIANCE; TRANSLATION INVARIANTS; VOCAL TRACT LENGTH NORMALIZATION; VOCAL TRACT LENGTHS;

EID: 77951480608     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-11509-7_15     Document Type: Conference Paper
Times cited : (4)

References (25)
  • 2
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Gales, M.J.F.: Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language 12(2), 75-98 (1998)
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 3
    • 27644522706 scopus 로고    scopus 로고
    • Vocal tract normalization equals linear transformation in cepstral space
    • ausgedruckt
    • Pitz, M., Ney, H.: Vocal tract normalization equals linear transformation in cepstral space. IEEE Trans. Speech and Audio Processing 13(5 Part 2), 930-944 (2005) (ausgedruckt)
    • (2005) IEEE Trans. Speech and Audio Processing , vol.13 , Issue.5 PART 2 , pp. 930-944
    • Pitz, M.1    Ney, H.2
  • 5
    • 0031647824 scopus 로고    scopus 로고
    • A frequency warping approach to speaker normalization
    • Lee, L., Rose, R.C.: A frequency warping approach to speaker normalization. IEEE Trans. Speech and Audio Processing 6(1), 49-60 (1998)
    • (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.1 , pp. 49-60
    • Lee, L.1    Rose, R.C.2
  • 9
    • 70450166695 scopus 로고    scopus 로고
    • Low-dimensional, auditory feature vectors that improve vocal-tract-length normalization in automatic speech recognition
    • Monaghan, J.J., Feldbauer, C., Walters, T.C., Patterson, R.D.: Low-dimensional, auditory feature vectors that improve vocal-tract-length normalization in automatic speech recognition. The Journal of the Acoustical Society of America 123(5), 3066-3066 (2008)
    • (2008) The Journal of the Acoustical Society of America , vol.123 , Issue.5 , pp. 3066-3066
    • Monaghan, J.J.1    Feldbauer, C.2    Walters, T.C.3    Patterson, R.D.4
  • 10
    • 0002163712 scopus 로고    scopus 로고
    • Invariant features in pattern recognition - Fundamentals and applications
    • John Wiley & Sons, Chichester
    • Burkhardt, H., Siggelkow, S.: Invariant features in pattern recognition - fundamentals and applications. In: Nonlinear Model-Based Image/Video Processing and Analysis, pp. 269-307. John Wiley & Sons, Chichester (2001)
    • (2001) Nonlinear Model-Based Image/Video Processing and Analysis , pp. 269-307
    • Burkhardt, H.1    Siggelkow, S.2
  • 12
    • 0019075787 scopus 로고
    • On invariant sets of a certain class of fast translation-invariant transforms
    • Burkhardt, H., Müller, X.: On invariant sets of a certain class of fast translation-invariant transforms. IEEE Trans. Acoustic, Speech, and Signal Processing 28(5), 517-523 (1980)
    • (1980) IEEE Trans. Acoustic, Speech, and Signal Processing , vol.28 , Issue.5 , pp. 517-523
    • Burkhardt, H.1    Müller, X.2
  • 13
    • 84975559454 scopus 로고
    • Modified rapid transform
    • Fang, M., Häusler, G.: Modified rapid transform. Applied Optics 28(6), 1257-1262 (1989)
    • (1989) Applied Optics , vol.28 , Issue.6 , pp. 1257-1262
    • Fang, M.1    Häusler, G.2
  • 14
    • 0014551188 scopus 로고
    • A transformation with invariance under cyclic permutation for applications in pattern recognition
    • Reitboeck, H., Brody, T.P.: A transformation with invariance under cyclic permutation for applications in pattern recognition. Inf. & Control. 15, 130-154 (1969)
    • (1969) Inf. & Control , vol.15 , pp. 130-154
    • Reitboeck, H.1    Brody, T.P.2
  • 15
    • 0015723408 scopus 로고
    • Machine recognition of printed chinese characters via transformation algorithms
    • Wang, P.P., Shiau, R.C.: Machine recognition of printed chinese characters via transformation algorithms. Pattern Recognition 5(4), 303-321 (1973)
    • (1973) Pattern Recognition , vol.5 , Issue.4 , pp. 303-321
    • Wang, P.P.1    Shiau, R.C.2
  • 16
    • 77951495322 scopus 로고    scopus 로고
    • Use of Invertible Rapid Transform in Motion Analysis
    • Gamec, J., Turan, J.: Use of Invertible Rapid Transform in Motion Analysis. Radioengineering 5(4), 21-27 (1996)
    • (1996) Radioengineering , vol.5 , Issue.4 , pp. 21-27
    • Gamec, J.1    Turan, J.2
  • 17
    • 0027680376 scopus 로고
    • Multiscale fourier descriptors for classifying semivowels in spectrograms
    • Pinkowski, B.: Multiscale fourier descriptors for classifying semivowels in spectrograms. Pattern Recognition 26(10), 1593-1602 (1993)
    • (1993) Pattern Recognition , vol.26 , Issue.10 , pp. 1593-1602
    • Pinkowski, B.1
  • 21
    • 24344458137 scopus 로고    scopus 로고
    • Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
    • Peng, H., Long, F., Ding, C.: Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Analysis and Machine Intelligence 27(8), 1226-1238 (2005)
    • (2005) IEEE Trans. Pattern Analysis and Machine Intelligence , vol.27 , Issue.8 , pp. 1226-1238
    • Peng, H.1    Long, F.2    Ding, C.3
  • 22
    • 0024768209 scopus 로고
    • Speaker-independent phone recognition using hidden Markov models
    • Lee, K.F., Hon, H.W.: Speaker-independent phone recognition using hidden Markov models. IEEE Trans. Acoustics, Speech and Signal Processing 37(11), 1641-1648 (1989)
    • (1989) IEEE Trans. Acoustics, Speech and Signal Processing , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.F.1    Hon, H.W.2
  • 24
    • 0034227088 scopus 로고    scopus 로고
    • Auditory images: How complex sounds are represented in the auditory system
    • Patterson, R.D.: Auditory images: How complex sounds are represented in the auditory system. Journal-Acoustical Society of Japan (E) 21(4), 183-190 (2000)
    • (2000) Journal-Acoustical Society of Japan (E) , vol.21 , Issue.4 , pp. 183-190
    • Patterson, R.D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.