SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 1, 2007, Pages 224-234

Robust feature extraction for continuous speech recognition using the MVDR spectrum estimation method

(3) Dharanipragada, Satya a Yapanel, Umit H b,c Rao, Bhaskar D d

a Citadel Investment Group (United States)

b UNIVERSITY OF COLORADO (United States)

c Infoture Inc (United States)

d UNIVERSITY OF CALIFORNIA (United States)

Author keywords

Distortionless response; Minimum variance; Robust feature extraction for continuous speech recognition; Spectral analysis; Speech analysis

Indexed keywords

CLASS SEPARABILITY; DISTORTIONLESS RESPONSE; FEATURE EXTRACTION TECHNIQUES; FISHER LINEAR DISCRIMINANTS; HIGH NOISE; IN-CAR SPEECH RECOGNITION; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MINIMUM VARIANCE; MINIMUM VARIANCE DISTORTIONLESS RESPONSE; PERCEPTUAL INFORMATIONS; PERCEPTUAL LINEAR PREDICTIONS; ROBUST FEATURE EXTRACTION FOR CONTINUOUS SPEECH RECOGNITION; SPEAKER VARIABILITIES; SPECTRAL ANALYSIS; SPECTRUM ESTIMATIONS; STATISTICAL SIGNIFICANCE TESTS; WALL STREET JOURNALS;

COMPUTATIONAL EFFICIENCY; CONTINUOUS SPEECH RECOGNITION; ESTIMATION; POWER SPECTRUM; SPECTRUM ANALYZERS; SPEECH ANALYSIS; SPEECH PROCESSING; STATISTICAL TESTS;

FEATURE EXTRACTION;

EID: 53149126814 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.876776 Document Type: Article

Times cited : (50)

References (35)

1
- 0141517832
- Spectral signal processing for ASR
- Dec
- M. Hunt, "Spectral signal processing for ASR," in Proc. ASRU, Dec. 1999, pp. 17-26.
- (1999) Proc. ASRU , pp. 17-26
- Hunt, M.¹

2
- 85009129543
- Perceptual harmonic cepstral coefficients as the front-end for speech recognition
- L. Gu and K. Rose, "Perceptual harmonic cepstral coefficients as the front-end for speech recognition," in Proc. ICSLP, 2000, pp. 583-586.
- (2000) Proc. ICSLP , pp. 583-586
- Gu, L.¹ Rose, K.²

3
- 0032677440
- Frequency-domain spectral envelope estimation for low rate coding of speech
- M. Jelinek and J. Adoul, "Frequency-domain spectral envelope estimation for low rate coding of speech," in Proc. ICASSP, pp. 253-256.
- Proc. ICASSP , pp. 253-256
- Jelinek, M.¹ Adoul, J.²

4
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

5
- 0025041264
- Perceptual Linear Prediction (PLP) analysis of speech
- H. Hermansky, "Perceptual Linear Prediction (PLP) analysis of speech," JASA, vol. 87, no. 4, pp. 1738-1752, 1990.
- (1990) JASA , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

6
- 0016495091
- Linear prediction: A tutorial review
- Apr
- J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, no. 4, pp. 561-580, Apr. 1975.
- (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 561-580
- Makhoul, J.¹

7
- 0014642808
- High-resolution frequency-wavenumber spectrum analysis
- Aug
- J. Capon, "High-resolution frequency-wavenumber spectrum analysis," Proc. IEEE, vol. 57, no. 8, pp. 1408-1418, Aug. 1969.
- (1969) Proc. IEEE , vol.57 , Issue.8 , pp. 1408-1418
- Capon, J.¹

8
- 0000473547
- All-pole modeling of speech based on the minimum variance distortionless response spectrum
- May
- M. N. Murthi and B. D. Rao, "All-pole modeling of speech based on the minimum variance distortionless response spectrum," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 221-239, May 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.3 , pp. 221-239
- Murthi, M.N.¹ Rao, B.D.²

9
- 0003807773
- Englewood Cliffs, NJ: Prentice- Hall
- S. Haykin, Adaptive Filter Theory. Englewood Cliffs, NJ: Prentice- Hall, 1991.
- (1991) Adaptive Filter Theory
- Haykin, S.¹

10
- 0003561344
- Englewood Cliffs, NJ: Prentice-Hall
- S. L. Marple, Jr., Digital Spectral AnalysisWith Applications. Englewood Cliffs, NJ: Prentice-Hall, 1987.
- (1987) Digital Spectral AnalysisWith Applications
- Marple Jr., S.L.¹

11
- 0022136506
- Fast MLM power spectrum estimation from uniformly spaced correlations
- Oct
- B. R. Musicus, "Fast MLM power spectrum estimation from uniformly spaced correlations," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 5, pp. 133-135, Oct. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-33 , Issue.5 , pp. 133-135
- Musicus, B.R.¹

12
- 0034842452
- MVDR-based feature extraction for robust speech recognition
- S. Dharanipragada and B. Rao, "MVDR-based feature extraction for robust speech recognition," in Proc. ICASSP, 2001, vol. 1, pp. 309-312.
- (2001) Proc. ICASSP , vol.1 , pp. 309-312
- Dharanipragada, S.¹ Rao, B.²

13
- 0141702336
- Perceptual MVDR-Based Cepstral Coefficients (PMCCs) for noise robust speech recognition
- U. H. Yapanel and S. Dharanipragada, "Perceptual MVDR-Based Cepstral Coefficients (PMCCs) for noise robust speech recognition," in Proc. ICASSP, 2003, vol. 1, pp. 644-647.
- (2003) Proc. ICASSP , vol.1 , pp. 644-647
- Yapanel, U.H.¹ Dharanipragada, S.²

14
- 84908144695
- The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short modified periodograms
- Jun
- P.Welch, "The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short modified periodograms," IEEE Trans. Audio Electroacoust., vol. AU-15, no. 2, pp. 70-76, Jun. 1967.
- (1967) IEEE Trans. Audio Electroacoust , vol.AU-15 , Issue.2 , pp. 70-76
- Welch, P.¹

15
- 0004243216
- Englewood Cliffs, NJ: Prentice-Hall
- P. Stoica and R. Moses, Spectral Analysis. Englewood Cliffs, NJ: Prentice-Hall, 1997.
- (1997) Spectral Analysis
- Stoica, P.¹ Moses, R.²

16
- 0013206596
- Techniques for capturing temporal variations in speech signals with fixed-rate processing
- S. Dharanipragada, R. A. Gopinath, and B. D. Rao, "Techniques for capturing temporal variations in speech signals with fixed-rate processing," in ICSLP, 1998, vol. 3, pp. 967-970.
- (1998) ICSLP , vol.3 , pp. 967-970
- Dharanipragada, S.¹ Gopinath, R.A.² Rao, B.D.³

17
- 0038036818
- On robust Capon beamforming and diagonal loading
- Jul
- Z. W. J. Li and P. Stoica, "On robust Capon beamforming and diagonal loading," IEEE Trans. Signal Process., vol. 51, no. 7, pp. 1702-1715, Jul. 2003.
- (2003) IEEE Trans. Signal Process , vol.51 , Issue.7 , pp. 1702-1715
- Li, Z.W.J.¹ Stoica, P.²

18
- 0022441644
- Maximum likelihood filters in spectral estimation
- A. G. M. Lagunas, M. Santamaria, and A. Moreno, "Maximum likelihood filters in spectral estimation," Signal Process., vol. 10, pp. 19-34, 1986.
- (1986) Signal Process , vol.10 , pp. 19-34
- Lagunas, A.G.M.¹ Santamaria, M.² Moreno, A.³

19
- 0032645345
- A new derivation of the APES filter
- Aug
- L. H. P. Stoica and J. Li, "A new derivation of the APES filter," IEEE Signal Process. Lett., vol. 6, no. 8, pp. 205-206, Aug. 1999.
- (1999) IEEE Signal Process. Lett , vol.6 , Issue.8 , pp. 205-206
- Stoica, L.H.P.¹ Li, J.²

20
- 0035630383
- A survey of spectral factorization methods
- A. Sayed and T. Kailath, "A survey of spectral factorization methods," Numerical Linear Algebra With Applications, vol. 8, pp. 467-496, 2001.
- (2001) Numerical Linear Algebra With Applications , vol.8 , pp. 467-496
- Sayed, A.¹ Kailath, T.²

21
- 0003513556
- Englewood Cliffs, NJ: Prentice-Hall
- A. V. Oppenheim and R. W. Schafer, Discrete-Time Signal Processing. Englewood Cliffs, NJ: Prentice-Hall, 1989.
- (1989) Discrete-Time Signal Processing
- Oppenheim, A.V.¹ Schafer, R.W.²

22
- 0003472470
- New York: Wiley
- R. O. Duda and P. E. Hart, Pattern Classification and Scene Analysis. New York: Wiley, 1993.
- (1993) Pattern Classification and Scene Analysis
- Duda, R.O.¹ Hart, P.E.²

23
- 0032653634
- Investigations on inter-speaker variability in the feature space
- R. Haeb-Umbach, "Investigations on inter-speaker variability in the feature space," in Proc. ICASSP, 1999, pp. 397-400.
- (1999) Proc. ICASSP , pp. 397-400
- Haeb-Umbach, R.¹

24
- 0024909979
- Some statistical issues in the comparison of speech recognition algorithms
- L. Gillick and S. Cox, "Some statistical issues in the comparison of speech recognition algorithms," in Proc. ICASSP, 1988, pp. 532-535.
- (1988) Proc. ICASSP , pp. 532-535
- Gillick, L.¹ Cox, S.²

25
- 0015600423
- The viterbi algorithm
- Mar
- G. D. Forney, "The viterbi algorithm," Proc. IEEE, vol. 61, no. 3, pp. 268-278, Mar. 1973.
- (1973) Proc. IEEE , vol.61 , Issue.3 , pp. 268-278
- Forney, G.D.¹

26
- 0028996843
- Performance of the IBM large vocabulary continuous speech recognition system on the ARPA wall street journal task
- L. R. Bahl et al., "Performance of the IBM large vocabulary continuous speech recognition system on the ARPA wall street journal task," in Proc. ICASSP, 1995, vol. 1, pp. 41-44.
- (1995) Proc. ICASSP , vol.1 , pp. 41-44
- Bahl, L.R.¹

27
- 0000120766
- Estimating the dimension of a model
- G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, pp. 461-464, 1978.
- (1978) Ann. Statist , vol.6 , pp. 461-464
- Schwarz, G.¹

28
- 84875953283
- Clustering via the Bayesian information criterion with applications in speech recognition
- S. S. Chen and P. S. Gopalakrishnan, "Clustering via the Bayesian information criterion with applications in speech recognition," in Proc. ICASSP, 1998, vol. 2, pp. 645-648.
- (1998) Proc. ICASSP , vol.2 , pp. 645-648
- Chen, S.S.¹ Gopalakrishnan, P.S.²

29
- 85009088984
- Robust digit recognition in noisy environments: The IBM Aurora 2 system
- G. Saon, J. Huerta, and E. E. Jan, "Robust digit recognition in noisy environments: the IBM Aurora 2 system," in Proc. Eurospeech, 2001, pp. 629-632.
- (2001) Proc. Eurospeech , pp. 629-632
- Saon, G.¹ Huerta, J.² Jan, E.E.³

30
- 0003454539
- Maximum Likelihood Linear Transformations for HMM-Based Speech Recognition Cambridge Univ., Cambridge, U.K
- Tech. Rep. TR 291
- M. J. F. Gales, "Maximum Likelihood Linear Transformations for HMM-Based Speech Recognition" Cambridge Univ., Cambridge, U.K., Tech. Rep. TR 291, 1997.
- (1997)
- Gales, M.J.F.¹

31
- 84892187452
- Maximum likelihood modeling with gaussian distributions for classification
- R. Gopinath, "Maximum likelihood modeling with gaussian distributions for classification," in Proc. ICASSP, 1998, pp. 661-664.
- (1998) Proc. ICASSP , pp. 661-664
- Gopinath, R.¹

32
- 85009242725
- Evaluation of a noise-robust DSR front-end on AURORA databases
- D. Macho et al., "Evaluation of a noise-robust DSR front-end on AURORA databases," in Proc. ICSLP, 2002, pp. 17-20.
- (2002) Proc. ICSLP , pp. 17-20
- Macho, D.¹

33
- 0141591620
- Recent improvements in the CU Sonic ASR system for noisy speech: The SPINE task
- B. Pellom and K. Hacioglu, "Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task," in Proc. ICASSP, 2003, vol. 1, pp. 4-7.
- (2003) Proc. ICASSP , vol.1 , pp. 4-7
- Pellom, B.¹ Hacioglu, K.²

34
- 85009164727
- Toward domain-independent conversational speech recognition
- B. Kingsbury et al., "Toward domain-independent conversational speech recognition," in Proc. Eurospeech, 2003, pp. 1881-1884.
- (2003) Proc. Eurospeech , pp. 1881-1884
- Kingsbury, B.¹

35
- 85032772258
- Minimum variance distortionless response spectral estimation
- Sep
- M. Wolfel and J. McDonough, "Minimum variance distortionless response spectral estimation," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 117-126, Sep. 2005.
- (2005) IEEE Signal Process. Mag , vol.22 , Issue.5 , pp. 117-126
- Wolfel, M.¹ McDonough, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.