메뉴 건너뛰기




Volumn 15, Issue 1, 2007, Pages 224-234

Robust feature extraction for continuous speech recognition using the MVDR spectrum estimation method

Author keywords

Distortionless response; Minimum variance; Robust feature extraction for continuous speech recognition; Spectral analysis; Speech analysis

Indexed keywords

CLASS SEPARABILITY; DISTORTIONLESS RESPONSE; FEATURE EXTRACTION TECHNIQUES; FISHER LINEAR DISCRIMINANTS; HIGH NOISE; IN-CAR SPEECH RECOGNITION; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MINIMUM VARIANCE; MINIMUM VARIANCE DISTORTIONLESS RESPONSE; PERCEPTUAL INFORMATIONS; PERCEPTUAL LINEAR PREDICTIONS; ROBUST FEATURE EXTRACTION FOR CONTINUOUS SPEECH RECOGNITION; SPEAKER VARIABILITIES; SPECTRAL ANALYSIS; SPECTRUM ESTIMATIONS; STATISTICAL SIGNIFICANCE TESTS; WALL STREET JOURNALS;

EID: 53149126814     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.876776     Document Type: Article
Times cited : (50)

References (35)
  • 1
    • 0141517832 scopus 로고    scopus 로고
    • Spectral signal processing for ASR
    • Dec
    • M. Hunt, "Spectral signal processing for ASR," in Proc. ASRU, Dec. 1999, pp. 17-26.
    • (1999) Proc. ASRU , pp. 17-26
    • Hunt, M.1
  • 2
    • 85009129543 scopus 로고    scopus 로고
    • Perceptual harmonic cepstral coefficients as the front-end for speech recognition
    • L. Gu and K. Rose, "Perceptual harmonic cepstral coefficients as the front-end for speech recognition," in Proc. ICSLP, 2000, pp. 583-586.
    • (2000) Proc. ICSLP , pp. 583-586
    • Gu, L.1    Rose, K.2
  • 3
    • 0032677440 scopus 로고    scopus 로고
    • Frequency-domain spectral envelope estimation for low rate coding of speech
    • M. Jelinek and J. Adoul, "Frequency-domain spectral envelope estimation for low rate coding of speech," in Proc. ICASSP, pp. 253-256.
    • Proc. ICASSP , pp. 253-256
    • Jelinek, M.1    Adoul, J.2
  • 4
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 5
    • 0025041264 scopus 로고
    • Perceptual Linear Prediction (PLP) analysis of speech
    • H. Hermansky, "Perceptual Linear Prediction (PLP) analysis of speech," JASA, vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) JASA , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 6
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • Apr
    • J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, no. 4, pp. 561-580, Apr. 1975.
    • (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 7
    • 0014642808 scopus 로고
    • High-resolution frequency-wavenumber spectrum analysis
    • Aug
    • J. Capon, "High-resolution frequency-wavenumber spectrum analysis," Proc. IEEE, vol. 57, no. 8, pp. 1408-1418, Aug. 1969.
    • (1969) Proc. IEEE , vol.57 , Issue.8 , pp. 1408-1418
    • Capon, J.1
  • 8
    • 0000473547 scopus 로고    scopus 로고
    • All-pole modeling of speech based on the minimum variance distortionless response spectrum
    • May
    • M. N. Murthi and B. D. Rao, "All-pole modeling of speech based on the minimum variance distortionless response spectrum," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 221-239, May 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.3 , pp. 221-239
    • Murthi, M.N.1    Rao, B.D.2
  • 9
    • 0003807773 scopus 로고
    • Englewood Cliffs, NJ: Prentice- Hall
    • S. Haykin, Adaptive Filter Theory. Englewood Cliffs, NJ: Prentice- Hall, 1991.
    • (1991) Adaptive Filter Theory
    • Haykin, S.1
  • 11
    • 0022136506 scopus 로고
    • Fast MLM power spectrum estimation from uniformly spaced correlations
    • Oct
    • B. R. Musicus, "Fast MLM power spectrum estimation from uniformly spaced correlations," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 5, pp. 133-135, Oct. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-33 , Issue.5 , pp. 133-135
    • Musicus, B.R.1
  • 12
    • 0034842452 scopus 로고    scopus 로고
    • MVDR-based feature extraction for robust speech recognition
    • S. Dharanipragada and B. Rao, "MVDR-based feature extraction for robust speech recognition," in Proc. ICASSP, 2001, vol. 1, pp. 309-312.
    • (2001) Proc. ICASSP , vol.1 , pp. 309-312
    • Dharanipragada, S.1    Rao, B.2
  • 13
    • 0141702336 scopus 로고    scopus 로고
    • Perceptual MVDR-Based Cepstral Coefficients (PMCCs) for noise robust speech recognition
    • U. H. Yapanel and S. Dharanipragada, "Perceptual MVDR-Based Cepstral Coefficients (PMCCs) for noise robust speech recognition," in Proc. ICASSP, 2003, vol. 1, pp. 644-647.
    • (2003) Proc. ICASSP , vol.1 , pp. 644-647
    • Yapanel, U.H.1    Dharanipragada, S.2
  • 14
    • 84908144695 scopus 로고
    • The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short modified periodograms
    • Jun
    • P.Welch, "The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short modified periodograms," IEEE Trans. Audio Electroacoust., vol. AU-15, no. 2, pp. 70-76, Jun. 1967.
    • (1967) IEEE Trans. Audio Electroacoust , vol.AU-15 , Issue.2 , pp. 70-76
    • Welch, P.1
  • 16
    • 0013206596 scopus 로고    scopus 로고
    • Techniques for capturing temporal variations in speech signals with fixed-rate processing
    • S. Dharanipragada, R. A. Gopinath, and B. D. Rao, "Techniques for capturing temporal variations in speech signals with fixed-rate processing," in ICSLP, 1998, vol. 3, pp. 967-970.
    • (1998) ICSLP , vol.3 , pp. 967-970
    • Dharanipragada, S.1    Gopinath, R.A.2    Rao, B.D.3
  • 17
    • 0038036818 scopus 로고    scopus 로고
    • On robust Capon beamforming and diagonal loading
    • Jul
    • Z. W. J. Li and P. Stoica, "On robust Capon beamforming and diagonal loading," IEEE Trans. Signal Process., vol. 51, no. 7, pp. 1702-1715, Jul. 2003.
    • (2003) IEEE Trans. Signal Process , vol.51 , Issue.7 , pp. 1702-1715
    • Li, Z.W.J.1    Stoica, P.2
  • 18
    • 0022441644 scopus 로고
    • Maximum likelihood filters in spectral estimation
    • A. G. M. Lagunas, M. Santamaria, and A. Moreno, "Maximum likelihood filters in spectral estimation," Signal Process., vol. 10, pp. 19-34, 1986.
    • (1986) Signal Process , vol.10 , pp. 19-34
    • Lagunas, A.G.M.1    Santamaria, M.2    Moreno, A.3
  • 19
    • 0032645345 scopus 로고    scopus 로고
    • A new derivation of the APES filter
    • Aug
    • L. H. P. Stoica and J. Li, "A new derivation of the APES filter," IEEE Signal Process. Lett., vol. 6, no. 8, pp. 205-206, Aug. 1999.
    • (1999) IEEE Signal Process. Lett , vol.6 , Issue.8 , pp. 205-206
    • Stoica, L.H.P.1    Li, J.2
  • 23
    • 0032653634 scopus 로고    scopus 로고
    • Investigations on inter-speaker variability in the feature space
    • R. Haeb-Umbach, "Investigations on inter-speaker variability in the feature space," in Proc. ICASSP, 1999, pp. 397-400.
    • (1999) Proc. ICASSP , pp. 397-400
    • Haeb-Umbach, R.1
  • 24
    • 0024909979 scopus 로고
    • Some statistical issues in the comparison of speech recognition algorithms
    • L. Gillick and S. Cox, "Some statistical issues in the comparison of speech recognition algorithms," in Proc. ICASSP, 1988, pp. 532-535.
    • (1988) Proc. ICASSP , pp. 532-535
    • Gillick, L.1    Cox, S.2
  • 25
    • 0015600423 scopus 로고
    • The viterbi algorithm
    • Mar
    • G. D. Forney, "The viterbi algorithm," Proc. IEEE, vol. 61, no. 3, pp. 268-278, Mar. 1973.
    • (1973) Proc. IEEE , vol.61 , Issue.3 , pp. 268-278
    • Forney, G.D.1
  • 26
    • 0028996843 scopus 로고
    • Performance of the IBM large vocabulary continuous speech recognition system on the ARPA wall street journal task
    • L. R. Bahl et al., "Performance of the IBM large vocabulary continuous speech recognition system on the ARPA wall street journal task," in Proc. ICASSP, 1995, vol. 1, pp. 41-44.
    • (1995) Proc. ICASSP , vol.1 , pp. 41-44
    • Bahl, L.R.1
  • 27
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, pp. 461-464, 1978.
    • (1978) Ann. Statist , vol.6 , pp. 461-464
    • Schwarz, G.1
  • 28
    • 84875953283 scopus 로고    scopus 로고
    • Clustering via the Bayesian information criterion with applications in speech recognition
    • S. S. Chen and P. S. Gopalakrishnan, "Clustering via the Bayesian information criterion with applications in speech recognition," in Proc. ICASSP, 1998, vol. 2, pp. 645-648.
    • (1998) Proc. ICASSP , vol.2 , pp. 645-648
    • Chen, S.S.1    Gopalakrishnan, P.S.2
  • 29
    • 85009088984 scopus 로고    scopus 로고
    • Robust digit recognition in noisy environments: The IBM Aurora 2 system
    • G. Saon, J. Huerta, and E. E. Jan, "Robust digit recognition in noisy environments: the IBM Aurora 2 system," in Proc. Eurospeech, 2001, pp. 629-632.
    • (2001) Proc. Eurospeech , pp. 629-632
    • Saon, G.1    Huerta, J.2    Jan, E.E.3
  • 30
    • 0003454539 scopus 로고    scopus 로고
    • Maximum Likelihood Linear Transformations for HMM-Based Speech Recognition Cambridge Univ., Cambridge, U.K
    • Tech. Rep. TR 291
    • M. J. F. Gales, "Maximum Likelihood Linear Transformations for HMM-Based Speech Recognition" Cambridge Univ., Cambridge, U.K., Tech. Rep. TR 291, 1997.
    • (1997)
    • Gales, M.J.F.1
  • 31
    • 84892187452 scopus 로고    scopus 로고
    • Maximum likelihood modeling with gaussian distributions for classification
    • R. Gopinath, "Maximum likelihood modeling with gaussian distributions for classification," in Proc. ICASSP, 1998, pp. 661-664.
    • (1998) Proc. ICASSP , pp. 661-664
    • Gopinath, R.1
  • 32
    • 85009242725 scopus 로고    scopus 로고
    • Evaluation of a noise-robust DSR front-end on AURORA databases
    • D. Macho et al., "Evaluation of a noise-robust DSR front-end on AURORA databases," in Proc. ICSLP, 2002, pp. 17-20.
    • (2002) Proc. ICSLP , pp. 17-20
    • Macho, D.1
  • 33
    • 0141591620 scopus 로고    scopus 로고
    • Recent improvements in the CU Sonic ASR system for noisy speech: The SPINE task
    • B. Pellom and K. Hacioglu, "Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task," in Proc. ICASSP, 2003, vol. 1, pp. 4-7.
    • (2003) Proc. ICASSP , vol.1 , pp. 4-7
    • Pellom, B.1    Hacioglu, K.2
  • 34
    • 85009164727 scopus 로고    scopus 로고
    • Toward domain-independent conversational speech recognition
    • B. Kingsbury et al., "Toward domain-independent conversational speech recognition," in Proc. Eurospeech, 2003, pp. 1881-1884.
    • (2003) Proc. Eurospeech , pp. 1881-1884
    • Kingsbury, B.1
  • 35
    • 85032772258 scopus 로고    scopus 로고
    • Minimum variance distortionless response spectral estimation
    • Sep
    • M. Wolfel and J. McDonough, "Minimum variance distortionless response spectral estimation," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 117-126, Sep. 2005.
    • (2005) IEEE Signal Process. Mag , vol.22 , Issue.5 , pp. 117-126
    • Wolfel, M.1    McDonough, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.