SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 14, Issue 3, 2006, Pages 808-831

Optimization of temporal filters for constructing robust features in speech recognition

(2) Hung, Jeih Weih a,b Lee, Lin Shan a,c

a IEEE (Taiwan)

b NATIONAL CHI NAN UNIVERSITY (Taiwan)

c NATIONAL TAIWAN UNIVERSITY (Taiwan)

Author keywords

Linear discriminant analysis (LDA); Minimum classification error (MCE); Principal component analysis (PCA); Speech recognition; Temporal filters

Indexed keywords

CEPSTRAL MEAN AND VARIANCE NORMALIZATION (CMVN); LINEAR DISCRIMINANT ANALYSIS (LDA); MINIMUM CLASSIFICATION ERROR (MCE); TEMPORAL FILTERS;

DISCRIMINANT ANALYSIS; ERROR ANALYSIS; LINEAR ACCELERATORS; OPTIMIZATION; PRINCIPAL COMPONENT ANALYSIS; SPEECH RECOGNITION;

DIGITAL FILTERS;

EID: 34047247200 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.857801 Document Type: Article

Times cited : (63)

References (35)

1
- 0022883703
- Noise compensation for speech recognition using probabilistic models
- J. N. Holmes and N. C. Sedgwick, "Noise compensation for speech recognition using probabilistic models," in Proc. ICASSP, 1986.
- (1986) Proc. ICASSP
- Holmes, J.N.¹ Sedgwick, N.C.²

2
- 84944816135
- A digital filterbank for spectral matching
- D. H. Klatt, "A digital filterbank for spectral matching," in Proc. ICASSP, 1979, pp. 573-576.
- (1979) Proc. ICASSP , pp. 573-576
- Klatt, D.H.¹

3
- 0023739211
- Speech recognition using noise-adaptive prototypes
- A. Nadas, D. Nahamoo, and M. Picheny, "Speech recognition using noise-adaptive prototypes," in Proc. ICASSP, 1988, pp. 517-520.
- (1988) Proc. ICASSP , pp. 517-520
- Nadas, A.¹ Nahamoo, D.² Picheny, M.³

4
- 0025681008
- Hidden Markov model decomposition of speech and noise
- A. P. Varga and R. K. Moore, "Hidden Markov model decomposition of speech and noise," in Proc. ICASSP, 1990, pp. 845-848.
- (1990) Proc. ICASSP , pp. 845-848
- Varga, A.P.¹ Moore, R.K.²

5
- 0026384952
- An hypothesized Wiener filtering approach to noisy speech recognition
- A. D. Berstein and I. D. Shallom, "An hypothesized Wiener filtering approach to noisy speech recognition," in Proc. ICASSP, 1991, pp. 913-916.
- (1991) Proc. ICASSP , pp. 913-916
- Berstein, A.D.¹ Shallom, I.D.²

6
- 0006936809
- Hidden Markov model state-based cepstral noise compensation
- V. L. Beattie and S. J. Young, "Hidden Markov model state-based cepstral noise compensation," in Proc. ICSLP, 1992, pp. 519-522.
- (1992) Proc. ICSLP , pp. 519-522
- Beattie, V.L.¹ Young, S.J.²

7
- 85009113852
- HMM adaptation using vector Taylor series for noisy speech recognition
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, 2000, pp. 869-872.
- (2000) Proc. ICSLP , pp. 869-872
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

8
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs," Comput. Speech Lang., pp. 171-186, 1995.
- (1995) Comput. Speech Lang , pp. 171-186
- Leggetter, C.J.¹ Woodland, P.C.²

9
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, pp. 190-202, 1996.
- (1996) IEEE Trans. Acoust., Speech, Signal Processing , pp. 190-202
- Sankar, A.¹ Lee, C.-H.²

10
- 0032140546
- On stochastic feature and model compensation approaches to robust speech recognition
- C.-H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Commun., vol. 25, pp. 29-47, 1998.
- (1998) Speech Commun , vol.25 , pp. 29-47
- Lee, C.-H.¹

11
- 0032116601
- Data-driven environmental compensation for speech recognition: A unified approach
- P. J. Moreno, B. Raj, and R. M. Stern, "Data-driven environmental compensation for speech recognition: A unified approach," Speech Commun., vol. 24, pp. 267-285, 1998.
- (1998) Speech Commun , vol.24 , pp. 267-285
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

12
- 0027622731
- Cepstral parameter compensation for HMM recognition in noise
- M. J. F. Gales and S. J. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Commun., vol. 12, pp. 231-239, 1993.
- (1993) Speech Commun , vol.12 , pp. 231-239
- Gales, M.J.F.¹ Young, S.J.²

13
- 0029390135
- Robust speech recognition in additive and convolutional noise using parallel model combination
- M. J. F. Gales and S. J. Young, "Robust speech recognition in additive and convolutional noise using parallel model combination," Comput. Speech Lang., vol. 9, pp. 289-307, 1995.
- (1995) Comput. Speech Lang , vol.9 , pp. 289-307
- Gales, M.J.F.¹ Young, S.J.²

14
- 0028996863
- A fast and flexible implementation of parallel model combination
- M. J. F. Gales and S. J. Young, "A fast and flexible implementation of parallel model combination," in Proc. ICASSP, 1995, pp. 131-136.
- (1995) Proc. ICASSP , pp. 131-136
- Gales, M.J.F.¹ Young, S.J.²

15
- 85009112933
- Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition
- Y. C. Tam and B. Mak, "Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition," in Proc. ICSLP, 2000, pp. 313-316.
- (2000) Proc. ICSLP , pp. 313-316
- Tam, Y.C.¹ Mak, B.²

16
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-27 , pp. 113-120
- Boll, S.F.¹

17
- 0004319970
- A. Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition. Boston, MA: Kluwer, 1991.
- (1991) Acoustical and Environmental Robustness in Automatic Speech Recognition
- Acero, A.¹

18
- 0141699738
- Log-domain speech feature enhancement using sequential MAP noise estimation and a phase-sensitive model of the acoustic environment
- L. Deng, J. Droppo, and A. Acero, "Log-domain speech feature enhancement using sequential MAP noise estimation and a phase-sensitive model of the acoustic environment," in Proc. ICSLP, 2002, pp. 192-195.
- (2002) Proc. ICSLP , pp. 192-195
- Deng, L.¹ Droppo, J.² Acero, A.³

19
- 84892187452
- Maximum likelihood modeling with Gaussian distributions for classification
- R. A. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. ICASSP, 1998.
- (1998) Proc. ICASSP
- Gopinath, R.A.¹

20
- 0036298776
- Adaptation experiments on the spine database using the extended maximum likelihood linear transformation (EMLLT) model
- R. A. Gopinath, V. Goel, K. Visweswariah, and P. Olsen, "Adaptation experiments on the spine database using the extended maximum likelihood linear transformation (EMLLT) model," in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Gopinath, R.A.¹ Goel, V.² Visweswariah, K.³ Olsen, P.⁴

21
- 0031146514
- HMM-based speech recognition using state-dependent, discriminatively derived transforms on mel-warped DFT features
- C. Rathinavelu and L. Deng, "HMM-based speech recognition using state-dependent, discriminatively derived transforms on mel-warped DFT features," IEEE Trans. Speech Audio Processing, pp. 243-256, May 1997.
- (1997) IEEE Trans. Speech Audio Processing , pp. 243-256
- Rathinavelu, C.¹ Deng, L.²

22
- 0141590384
- Discriminative training of auditory filters of different shapes for robust speech recognition
- B. Mak, Y. C. Tam, and R. Hsiao, "Discriminative training of auditory filters of different shapes for robust speech recognition," in Proc. ICASSP, 2003, pp. 45-18.
- (2003) Proc. ICASSP , pp. 45-18
- Mak, B.¹ Tam, Y.C.² Hsiao, R.³

23
- 0036880074
- Distributed speech processing in MiPad's multimodal user interface
- L. Deng, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, Y. Wang, D. Jacoby, M. Mahajan, C. Chelba, and X. D. Huang, "Distributed speech processing in MiPad's multimodal user interface," IEEE Trans. Speech Audio Processing, pp. 605-619, Nov. 2002.
- (2002) IEEE Trans. Speech Audio Processing , pp. 605-619
- Deng, L.¹ Wang, K.² Acero, A.³ Hon, H.⁴ Droppo, J.⁵ Boulis, C.⁶ Wang, Y.⁷ Jacoby, D.⁸ Mahajan, M.⁹ Chelba, C.¹⁰ Huang, X.D.¹¹

24
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, no. 6, pp. 1304-1312, 1974.
- (1974) J. Acoust. Soc. Amer , vol.55 , Issue.6 , pp. 1304-1312
- Atal, B.S.¹

25
- 85135190755
- Multiband and adaptation approaches to robust speech recognition
- S. Tibrewala and H. Hermansky, "Multiband and adaptation approaches to robust speech recognition," in Proc. Eurospeech 97, 1997, pp. 2619-2622.
- (1997) Proc. Eurospeech 97 , pp. 2619-2622
- Tibrewala, S.¹ Hermansky, H.²

26
- 0141699833
- Noise robust HMM-based speech recognition using segmental cepstral feature vector normalization
- O. Viikki and K. Laurila, "Noise robust HMM-based speech recognition using segmental cepstral feature vector normalization," in ESCA NATO Workshop Robust Speech Recognition Unknown Communication Channels, Pont-a-Mousson, France, 1997, pp. 107-110.
- (1997) ESCA NATO Workshop Robust Speech Recognition Unknown Communication Channels , pp. 107-110
- Viikki, O.¹ Laurila, K.²

27
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Processing, vol. 2, pp. 578-589, 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

28
- 84947590142
- Data-driven design of RASTA-like filters
- S. van Vuuren and H. Hermansky, "Data-driven design of RASTA-like filters," in Proc. Eurospeech, 1997.
- (1997) Proc. Eurospeech
- van Vuuren, S.¹ Hermansky, H.²

29
- 0030374936
- Data based filter design for RASTA-like channel normalization in ASR
- C. Avendano, S. van Vuuren, and H. Hermansky, "Data based filter design for RASTA-like channel normalization in ASR," in Proc. ICSLP, 1996.
- (1996) Proc. ICSLP
- Avendano, C.¹ van Vuuren, S.² Hermansky, H.³

30
- 85017295162
- Data-driven modulation filter design under adverse acoustic conditions and using phonetic and syllabic units
- M. L. Shire, "Data-driven modulation filter design under adverse acoustic conditions and using phonetic and syllabic units," in Proc. Eurospeech, 1999.
- (1999) Proc. Eurospeech
- Shire, M.L.¹

31
- 0027239233
- Improvements in connected digit recognition using linear discriminant analysis and mixture densities
- R. Haeb-Umbach, D. Geller, and H. Ney, "Improvements in connected digit recognition using linear discriminant analysis and mixture densities," in Proc. ICASSP, 1993.
- (1993) Proc. ICASSP
- Haeb-Umbach, R.¹ Geller, D.² Ney, H.³

32
- 85009063569
- Comparative analysis for data-driven temporal filters obtained via principal component analysis (PCA) and linear discriminant analysis (LDA) in speech recognition
- J.-W. Hung, H.-M. Wang, and L.-S. Lee, "Comparative analysis for data-driven temporal filters obtained via principal component analysis (PCA) and linear discriminant analysis (LDA) in speech recognition," in Proc. Eurospeech, 2001.
- (2001) Proc. Eurospeech
- Hung, J.-W.¹ Wang, H.-M.² Lee, L.-S.³

33
- 17444450002
- Data-driven temporal filters for robust features in speech recognition obtained via minimum classification error (MCE)
- J.-W. Hung and L.-S. Lee, "Data-driven temporal filters for robust features in speech recognition obtained via minimum classification error (MCE)," in Proceedings of ICASSP, 2002.
- (2002) Proceedings of ICASSP
- Hung, J.-W.¹ Lee, L.-S.²

34
- 34047247652
- [Online]. Available: http://rocling.iis.sinica.edu.tw/ROCLING/

35
- 0004319968
- The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition
- A. P. Varga, H. J. M. Steeneken, M. Tomlinson, and D. Jones, "The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition," Tech. Rep. DRA Speech Research Unit, 1992.
- (1992) Tech. Rep. DRA Speech Research Unit
- Varga, A.P.¹ Steeneken, H.J.M.² Tomlinson, M.³ Jones, D.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.