메뉴 건너뛰기




Volumn 14, Issue 3, 2006, Pages 808-831

Optimization of temporal filters for constructing robust features in speech recognition

Author keywords

Linear discriminant analysis (LDA); Minimum classification error (MCE); Principal component analysis (PCA); Speech recognition; Temporal filters

Indexed keywords

CEPSTRAL MEAN AND VARIANCE NORMALIZATION (CMVN); LINEAR DISCRIMINANT ANALYSIS (LDA); MINIMUM CLASSIFICATION ERROR (MCE); TEMPORAL FILTERS;

EID: 34047247200     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.857801     Document Type: Article
Times cited : (63)

References (35)
  • 1
    • 0022883703 scopus 로고
    • Noise compensation for speech recognition using probabilistic models
    • J. N. Holmes and N. C. Sedgwick, "Noise compensation for speech recognition using probabilistic models," in Proc. ICASSP, 1986.
    • (1986) Proc. ICASSP
    • Holmes, J.N.1    Sedgwick, N.C.2
  • 2
    • 84944816135 scopus 로고
    • A digital filterbank for spectral matching
    • D. H. Klatt, "A digital filterbank for spectral matching," in Proc. ICASSP, 1979, pp. 573-576.
    • (1979) Proc. ICASSP , pp. 573-576
    • Klatt, D.H.1
  • 3
    • 0023739211 scopus 로고
    • Speech recognition using noise-adaptive prototypes
    • A. Nadas, D. Nahamoo, and M. Picheny, "Speech recognition using noise-adaptive prototypes," in Proc. ICASSP, 1988, pp. 517-520.
    • (1988) Proc. ICASSP , pp. 517-520
    • Nadas, A.1    Nahamoo, D.2    Picheny, M.3
  • 4
    • 0025681008 scopus 로고
    • Hidden Markov model decomposition of speech and noise
    • A. P. Varga and R. K. Moore, "Hidden Markov model decomposition of speech and noise," in Proc. ICASSP, 1990, pp. 845-848.
    • (1990) Proc. ICASSP , pp. 845-848
    • Varga, A.P.1    Moore, R.K.2
  • 5
    • 0026384952 scopus 로고
    • An hypothesized Wiener filtering approach to noisy speech recognition
    • A. D. Berstein and I. D. Shallom, "An hypothesized Wiener filtering approach to noisy speech recognition," in Proc. ICASSP, 1991, pp. 913-916.
    • (1991) Proc. ICASSP , pp. 913-916
    • Berstein, A.D.1    Shallom, I.D.2
  • 6
    • 0006936809 scopus 로고
    • Hidden Markov model state-based cepstral noise compensation
    • V. L. Beattie and S. J. Young, "Hidden Markov model state-based cepstral noise compensation," in Proc. ICSLP, 1992, pp. 519-522.
    • (1992) Proc. ICSLP , pp. 519-522
    • Beattie, V.L.1    Young, S.J.2
  • 7
    • 85009113852 scopus 로고    scopus 로고
    • HMM adaptation using vector Taylor series for noisy speech recognition
    • A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, 2000, pp. 869-872.
    • (2000) Proc. ICSLP , pp. 869-872
    • Acero, A.1    Deng, L.2    Kristjansson, T.3    Zhang, J.4
  • 8
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
    • C. J. Leggester and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs," Comput. Speech Lang., pp. 171-186, 1995.
    • (1995) Comput. Speech Lang , pp. 171-186
    • Leggester, C.J.1    Woodland, P.C.2
  • 9
    • 0030149866 scopus 로고    scopus 로고
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, pp. 190-202, 1996.
    • (1996) IEEE Trans. Acoust., Speech, Signal Processing , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 10
    • 0032140546 scopus 로고    scopus 로고
    • On stochastic feature and model compensation approaches to robust speech recognition
    • C.-H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Commun., vol. 25, pp. 29-47, 1998.
    • (1998) Speech Commun , vol.25 , pp. 29-47
    • Lee, C.-H.1
  • 11
    • 0032116601 scopus 로고    scopus 로고
    • Data-driven environmental compensation for speech recognition: A unified approach
    • P. J. Moreno, B. Raj, and R. M. Stern, "Data-driven environmental compensation for speech recognition: A unified approach," Speech Commun., vol. 24, pp. 267-285, 1998.
    • (1998) Speech Commun , vol.24 , pp. 267-285
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 12
    • 0027622731 scopus 로고
    • Cepstral parameter compensation for HMM recognition in noise
    • M. J. F. Gales and S. J. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Commun., vol. 12, pp. 231-239, 1993.
    • (1993) Speech Commun , vol.12 , pp. 231-239
    • Gales, M.J.F.1    Young, S.J.2
  • 13
    • 0029390135 scopus 로고
    • Robust speech recognition in additive and convolutional noise using parallel model combination
    • _, "Robust speech recognition in additive and convolutional noise using parallel model combination," Comput. Speech Lang., vol. 9, pp. 289-307, 1995.
    • (1995) Comput. Speech Lang , vol.9 , pp. 289-307
    • Gales, M.J.F.1    Young, S.J.2
  • 14
    • 0028996863 scopus 로고
    • A fast and flexible implementation of parallel model combination
    • _, "A fast and flexible implementation of parallel model combination," in Proc. ICASSP, 1995, pp. 131-136.
    • (1995) Proc. ICASSP , pp. 131-136
    • Gales, M.J.F.1    Young, S.J.2
  • 15
    • 85009112933 scopus 로고    scopus 로고
    • Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition
    • Y. C. Tam and B. Mak, "Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition," in Proc. ICSLP, 2000, pp. 313-316.
    • (2000) Proc. ICSLP , pp. 313-316
    • Tam, Y.C.1    Mak, B.2
  • 16
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech. Signal Process., vol. ASSP-27, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech. Signal Process , vol.ASSP-27 , pp. 113-120
    • Boll, S.F.1
  • 18
    • 0141699738 scopus 로고    scopus 로고
    • Log-domain speech feature enhancement using sequential MAP noise estimation and a phase-sensitive model of the acoustic environment
    • L. Deng, J. Droppo, and A. Acero, "Log-domain speech feature enhancement using sequential MAP noise estimation and a phase-sensitive model of the acoustic environment," in Proc. ICSLP, 2002, pp. 192-195.
    • (2002) Proc. ICSLP , pp. 192-195
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 19
    • 84892187452 scopus 로고    scopus 로고
    • Maximum likelihood modeling with Gaussian distributions for classification
    • R. A. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. ICASSP, 1998.
    • (1998) Proc. ICASSP
    • Gopinath, R.A.1
  • 20
    • 0036298776 scopus 로고    scopus 로고
    • Adaptation experiments on the spine database using the extended maximum likelihood linear transformation (EMLLT) model
    • R. A. Gopinath, V. Goel, K. Visweswariah, and P. Olsen, "Adaptation experiments on the spine database using the extended maximum likelihood linear transformation (EMLLT) model," in Proc. ICASSP, 2002.
    • (2002) Proc. ICASSP
    • Gopinath, R.A.1    Goel, V.2    Visweswariah, K.3    Olsen, P.4
  • 21
    • 0031146514 scopus 로고    scopus 로고
    • HMM-based speech recognition using state-dependent, discriminatively derived transforms on mel-warped DFT features
    • May
    • C. Rathinavalu and L. Deng, "HMM-based speech recognition using state-dependent, discriminatively derived transforms on mel-warped DFT features," IEEE Trans. Speech Audio Processing, pp. 243-256, May 1997.
    • (1997) IEEE Trans. Speech Audio Processing , pp. 243-256
    • Rathinavalu, C.1    Deng, L.2
  • 22
    • 0141590384 scopus 로고    scopus 로고
    • Discriminative training of auditory filters of different shapes for robust speech recognition
    • B. Mak, Y. C. Tam, and R. Hsiao, "Discriminative training of auditory filters of different shapes for robust speech recognition," in Proc. ICASSP, 2003. pp. 45-18.
    • (2003) Proc. ICASSP , pp. 45-18
    • Mak, B.1    Tam, Y.C.2    Hsiao, R.3
  • 24
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, no. 6, pp. 1304-1312, 1974.
    • (1974) J. Acoust. Soc. Amer , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.S.1
  • 25
    • 85135190755 scopus 로고    scopus 로고
    • Multiband and adaptation approaches to robust speech recognition
    • S. Tibrewala and H. Hermansky, "Multiband and adaptation approaches to robust speech recognition," in Proc. Eurospeech 97, 1997, pp. 2619-2622.
    • (1997) Proc. Eurospeech 97 , pp. 2619-2622
    • Tibrewala, S.1    Hermansky, H.2
  • 26
    • 0141699833 scopus 로고    scopus 로고
    • Noise robust HMM-based speech recognition using segmental cepstral feature vector normalization
    • Pont-a-Mousson, France
    • O. Viikki and K. Laurila, "Noise robust HMM-based speech recognition using segmental cepstral feature vector normalization," in ESCA NATO Workshop Robust Speech Recognition Unknown Communication Channels, Pont-a-Mousson, France, 1997, pp. 107-110.
    • (1997) ESCA NATO Workshop Robust Speech Recognition Unknown Communication Channels , pp. 107-110
    • Viikki, O.1    Laurila, K.2
  • 29
    • 0030374936 scopus 로고    scopus 로고
    • Data based filter design for RASTA-like channel normalization in ASR
    • C. Avendano, S. van Vuuren, and H. Hermansky, "Data based filter design for RASTA-like channel normalization in ASR," in Proc. ICSLP, 1996.
    • (1996) Proc. ICSLP
    • Avendano, C.1    van Vuuren, S.2    Hermansky, H.3
  • 30
    • 85017295162 scopus 로고    scopus 로고
    • Data-driven modulation filter design under adverse acoustic conditions and using phonetic and syllabic units
    • M. L. Shire, "Data-driven modulation filter design under adverse acoustic conditions and using phonetic and syllabic units," in Proc. Eurospeech, 1999.
    • (1999) Proc. Eurospeech
    • Shire, M.L.1
  • 31
    • 0027239233 scopus 로고
    • Improvements in connected digit recognition using linear discriminant analysis and mixture densities
    • R. Haeb-Umbach, D. Geller, and H. Ney, "Improvements in connected digit recognition using linear discriminant analysis and mixture densities," in Proc. ICASSP, 1993.
    • (1993) Proc. ICASSP
    • Haeb-Umbach, R.1    Geller, D.2    Ney, H.3
  • 32
    • 85009063569 scopus 로고    scopus 로고
    • Comparative analysis for data-driven temporal filters obtained via principal component analysis (PCA) and linear discriminant analysis (LDA) in speech recognition
    • J.-W. Hung, H.-M. Wang, and L.-S. Lee, "Comparative analysis for data-driven temporal filters obtained via principal component analysis (PCA) and linear discriminant analysis (LDA) in speech recognition," in Proc. Eurospeech, 2001.
    • (2001) Proc. Eurospeech
    • Hung, J.-W.1    Wang, H.-M.2    Lee, L.-S.3
  • 33
    • 17444450002 scopus 로고    scopus 로고
    • Data-driven temporal filters for robust features in speech recognition obtained via minimum classification error (MCE)
    • J.-W. Hung and L.-S. Lee, "Data-driven temporal filters for robust features in speech recognition obtained via minimum classification error (MCE)," in Proceedings of ICASSP, 2002.
    • (2002) Proceedings of ICASSP
    • Hung, J.-W.1    Lee, L.-S.2
  • 34
    • 34047247652 scopus 로고    scopus 로고
    • Available
    • [Online]. Available: http://rocling.iis.sinica.edu.tw/ROCLING/


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.