SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 8, 2007, Pages 2431-2443

Noise condition-dependent training based on noise classification and SNR estimation

(4) Xu, Haitian a Dalsgaard, Paul b Tan, Zheng Hua b Lindberg, Børge b

a Toshiba Reseach Europe Ltd (United Kingdom)

Author keywords

Condition dependent training; Noise classification; Robust speech recognition; Robustness to unknown noise; Signal to noise ratio (SNR) estimation

Indexed keywords

CONDITION-DEPENDENT TRAINING; NOISE CLASSIFICATION; ROBUST SPEECH RECOGNITION; ROBUSTNESS TO UNKNOWN NOISE; SIGNAL-TO-NOISE RATIO (SNR) ESTIMATION;

ACOUSTIC INTENSITY; COMPUTATIONAL COMPLEXITY; ESTIMATION; HIDDEN MARKOV MODELS; MATERIALS HANDLING; METEORITES; REMELTING; SPEECH ANALYSIS; SPEECH RECOGNITION;

SIGNAL TO NOISE RATIO;

EID: 64349084660 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.906188 Document Type: Article

Times cited : (14)

References (27)

1
- 0346474799
- Adaptation techniques for ambience and microphone system
- S. Das, A. Nadas, D. Nahamoo, and M. Picheny, "Adaptation techniques for ambience and microphone system," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1994, vol. 1, pp. 21-23.
- (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 21-23
- Das, S.¹ Nadas, A.² Nahamoo, D.³ Picheny, M.⁴

2
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- R. P. Lippmann, E. A. Martin, and D. B. Paul, "Multi-style training for robust isolated-word speech recognition," in Proc. ICASSP'87, 1987, pp. 705-708.
- (1987) Proc. ICASSP'87 , pp. 705-708
- Lippmann, R.P.¹ Martin, E.A.² Paul, D.B.³

3
- 0029288202
- Speech recognition in noisy environments: A survey
- Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, 1995.
- (1995) Speech Commun , vol.16 , pp. 261-291
- Gong, Y.¹

4
- 0141702085
- Environmental sniffing: Noise knowledge estimation for robust speech systems
- Apr. 6-10
- M. Akbacak and J. H. L. Hansen, "Environmental sniffing: Noise knowledge estimation for robust speech systems," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 6-10, 2003, vol. 2, pp. 113-116.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 113-116
- Akbacak, M.¹ Hansen, J.H.L.²

5
- 77955915142
- Context awareness using environmental noise classification
- L. Ma, D. J. Smith, and B. P. Milner, "Context awareness using environmental noise classification," in Proc. Eurospeech'03, 2003, pp. 2237-2240.
- (2003) Proc. Eurospeech'03 , pp. 2237-2240
- Ma, L.¹ Smith, D.J.² Milner, B.P.³

6
- 84897584394
- Advances in acoustic noise tracking for robust in-vehicle speech systems
- H. Abut, J. H. L. Hansen, and K. Takeda, Eds. New York: Springer, ch. 10, pp
- M. Akbacak and J. H. L. Hansen, "Advances in acoustic noise tracking for robust in-vehicle speech systems," in Advances for In-Vehicle and Mobile Systems, H. Abut, J. H. L. Hansen, and K. Takeda, Eds. New York: Springer, 2007, ch. 10, pp. 109-122.
- (2007) Advances for In-Vehicle and Mobile Systems , pp. 109-122
- Akbacak, M.¹ Hansen, J.H.L.²

7
- 0030638031
- A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
- J. G. Fiscus, "A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)," in Proc. IEEE Workshop Autom. Speech Recognition Understanding, 1997, pp. 347-354.
- (1997) Proc. IEEE Workshop Autom. Speech Recognition Understanding , pp. 347-354
- Fiscus, J.G.¹

8
- 0141812649
- Speaker-independent spoken digit recognition in noisy environments using dynamic spectral features and neural networks
- T. Kitamura, S. Ando, and E. Hayahara, "Speaker-independent spoken digit recognition in noisy environments using dynamic spectral features and neural networks," in Proc. Int. Conf. Speech Lang. Process., 1992, vol. 1, pp. 699-702.
- (1992) Proc. Int. Conf. Speech Lang. Process , vol.1 , pp. 699-702
- Kitamura, T.¹ Ando, S.² Hayahara, E.³

9
- 33947676384
- Modeling variance variation in a variable parameter HMM framework for noise robust speech recognition
- X. Cui and Y. Gong, "Modeling variance variation in a variable parameter HMM framework for noise robust speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2006, vol. 1, pp. 1117-1120.
- (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 1117-1120
- Cui, X.¹ Gong, Y.²

10
- 4544334449
- A tree-structured clustering method integrating noise and SNR for piecewise linear-transformation- based noise adaptation
- Z. Zhang, T. Sugimura, and S. Furui, "A tree-structured clustering method integrating noise and SNR for piecewise linear-transformation- based noise adaptation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2004, vol. 1, pp. 981-984.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 981-984
- Zhang, Z.¹ Sugimura, T.² Furui, S.³

11
- 85009126862
- Evaluation of tree-structured piecewise- linear-transformation-based noise adaptation on Aurora 2 database
- Jeju, Korea
- Z. Zheng, T. Ohya, and S. Furui, "Evaluation of tree-structured piecewise- linear-transformation-based noise adaptation on Aurora 2 database," in Proc. Int. Conf. Spoken Lang. Process., Jeju, Korea, 2004, vol. 1, pp. 113-116.
- (2004) Proc. Int. Conf. Spoken Lang. Process , vol.1 , pp. 113-116
- Zheng, Z.¹ Ohya, T.² Furui, S.³

12
- 33745184458
- Robust speech recognition based on noise and SNR classification-A multiple-model framework
- Lisbon, Portugal, Sep
- H. Xu, Z.-H. Tan, P. Dalsgaard, and B. Lindberg, "Robust speech recognition based on noise and SNR classification-A multiple-model framework," in Proc. Interspeech 2005, Lisbon, Portugal, Sep. 2005, pp. 977-980.
- (2005) Proc. Interspeech 2005 , pp. 977-980
- Xu, H.¹ Tan, Z.-H.² Dalsgaard, P.³ Lindberg, B.⁴

13
- 33947697214
- Robust speech recognition from noise-type based feature compensation and model interpolation in a multiple model framework
- Toulouse, France, May
- H. Xu, Z.-H. Tan, P. Dalsgaard, and B. Lindberg, "Robust speech recognition from noise-type based feature compensation and model interpolation in a multiple model framework," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Toulouse, France, May 2006, pp. I-1141-I-1144.
- (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
- Xu, H.¹ Tan, Z.-H.² Dalsgaard, P.³ Lindberg, B.⁴

14
- 0032141206
- Cepstral domain segmental feature vector normalization for noise robust speech recognition
- O. Viikki and K. Laurila, "Cepstral domain segmental feature vector normalization for noise robust speech recognition," Speech Commun., vol. 25, pp. 133-147, 1998.
- (1998) Speech Commun , vol.25 , pp. 133-147
- Viikki, O.¹ Laurila, K.²

15
- 0442317754
- ETSI standard doc
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI standard doc, 2006.
- (2006) Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms

16
- 0030638179
- Jacobian adaptation of noisy speech models
- Dec
- S. Sagayama, Y.Yamaguchi, and S. Takahashi, "Jacobian adaptation of noisy speech models," in Proc. IEEE Workshop Autom. Speech Recognition Understanding, Dec. 1997, pp. 396-403.
- (1997) Proc. IEEE Workshop Autom. Speech Recognition Understanding , pp. 396-403
- Sagayama, S.¹ Yamaguchi, Y.² Takahashi, S.³

17
- 0029725764
- Deleted interpolation and density sharing for continuous hidden Markov models
- X. D. Huang, M.-Y. Hwang, J. Li, and M. Mahajan, "Deleted interpolation and density sharing for continuous hidden Markov models," in Proc. Int. Conf. Acoust., Speech, Signal Process., 1996, pp. 885-888.
- (1996) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 885-888
- Huang, X.D.¹ Hwang, M.-Y.² Li, J.³ Mahajan, M.⁴

18
- 65549153550
- Speech recognition in noisy environments,
- Ph.D. dissertation, Carnegie Mellon Univ, Pittsburgh, PA
- P. Moreno, "Speech recognition in noisy environments," Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA, 1996.
- (1996)
- Moreno, P.¹

19
- 0038669544
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Paris, France, Sep. 18-20
- H. G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR2000 (Autom. Speech Recognition: Challenges for the Next Millennium), Paris, France, Sep. 18-20, 2000, pp. 18-20.
- (2000) Proc. ISCA ITRW ASR2000 (Autom. Speech Recognition: Challenges for the Next Millennium) , pp. 18-20
- Hirsch, H.G.¹ Pearce, D.²

20
- 0003483593
- Eng. Dept. Speech Group and Entropic Research Lab. Inc, Cambridge Univ, Washington, DC
- S. Young, "HTK: Hidden Markov Model Toolkit V1.5," Eng. Dept. Speech Group and Entropic Research Lab. Inc., Cambridge Univ., Washington, DC, 1993.
- (1993) HTK: Hidden Markov Model Toolkit V1.5
- Young, S.¹

21
- 0019606509
- An improved endpoint detector for isolated word recognition
- Aug
- L. Lamel, L. Rabiner, A. Rosenberg, and J. Wilpon, "An improved endpoint detector for isolated word recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-29, no. 4, pp. 777-785, Aug. 1981.
- (1981) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-29 , Issue.4 , pp. 777-785
- Lamel, L.¹ Rabiner, L.² Rosenberg, A.³ Wilpon, J.⁴

22
- 85009063707
- Soft decisions in missing data techniques for robust automatic speech recognition
- Beijing, China
- J. Barker, L. Josifovski, M. Cooke, and P. Green, "Soft decisions in missing data techniques for robust automatic speech recognition," in Proc. ICSLP'00, Beijing, China, 2000, vol. 1, pp. 373-376.
- (2000) Proc. ICSLP'00 , vol.1 , pp. 373-376
- Barker, J.¹ Josifovski, L.² Cooke, M.³ Green, P.⁴

23
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- Jun
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, no. 3, pp. 267-285, Jun. 2001.
- (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

24
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- Jul
- R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 4, pp. 504-512, Jul. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.4 , pp. 504-512
- Martin, R.¹

25
- 64349093568
- Hypothesis Testing
- Online, Available
- E. W. Weisstein, "Hypothesis Testing," MathWorld, 2005 [Online]. Available: http://mathworld.wolfram.com/HypothesisTesting.html
- (2005) MathWorld
- Weisstein, E.W.¹

26
- 64349117672
- Beaverton, OR: Oregon Graduate Inst
- P. Heeman and D. Cole, SpeechDatCar: US English. Beaverton, OR: Oregon Graduate Inst., 2001.
- (2001) SpeechDatCar: US English
- Heeman, P.¹ Cole, D.²

27
- 85009242725
- Evaluation of a noise-robust DSR front-end on Aurora databases
- Denver, CO
- D. Marcho et al., "Evaluation of a noise-robust DSR front-end on Aurora databases," in Proc. ICSLP'02, Denver, CO, 2002, pp. 17-20.
- (2002) Proc. ICSLP'02 , pp. 17-20
- Marcho, D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.