SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 54, Issue 1, 2012, Pages 119-133

A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech

(3) Do, Cong Thanh a Pastor, Dominique b Goalic, André b

a IDIAP RESEARCH INSTITUTE (Switzerland)

b LAB STICC (France)

Author keywords

Aurora 2; Cochlear implant; HMM based ASR; Kullback Leibler divergence; Noise robust ASR; Spectrally reduced speech

Indexed keywords

AURORA 2; HMM-BASED ASR; KULLBACK LEIBLER DIVERGENCE; ROBUST ASR; SPECTRALLY REDUCED SPEECH;

COCHLEAR IMPLANTS; SPEECH ENHANCEMENT;

SPEECH RECOGNITION;

EID: 80052737228 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2011.07.006 Document Type: Article

Times cited : (9)

References (38)

1
- 0002215069
- On a measure of divergence between two statistical populations defined by their probability distributions
- A. Bhattacharyya On a measure of divergence between two statistical populations defined by their probability distributions Bull. Calcutta Math. Soc. 35 1943 99 109
- (1943) Bull. Calcutta Math. Soc. , vol.35 , pp. 99-109
- Bhattacharyya, A.¹

2
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- S.F. Boll Suppression of acoustic noise in speech using spectral subtraction IEEE Trans. Acoust. Speech Signal Process. 27 2 1979 113 120
- (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

3
- 85075910098
- Fundamentals of noise reduction
- J. Benesty, M.M. Sondhi, Y. Huang, Springer
- J. Chen, J. Benesty, Y. Huang, and E.J. Diethorn Fundamentals of noise reduction J. Benesty, M.M. Sondhi, Y. Huang, Springer Handbook of Speech Processing 2008 Springer 843 871
- (2008) Springer Handbook of Speech Processing , pp. 843-871
- Chen, J.¹ Benesty, J.² Huang, Y.³ Diethorn, E.J.⁴

4
- 0036226165
- Noise estimation by minima controlled recursive averaging for robust speech enhancement
- DOI 10.1109/97.988717, PII S1070990802024100
- I. Cohen, and B. Berdugo Noise estimation by minima controlled recursive averaging for robust speech enhancement IEEE Signal Process. Lett. 9 1 2002 12 15 (Pubitemid 34306628)
- (2002) IEEE Signal Processing Letters , vol.9 , Issue.1 , pp. 12-15
- Cohen, I.¹ Berdugo, B.²

5
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho Robust automatic speech recognition with missing and unreliable acoustic data Speech Comm. 34 3 2001 267 285 (Pubitemid 32284867)
- (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

6
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- A.P. Dempster, N. Laird, and D.B. Rubin Maximum likelihood from incomplete data via the EM algorithm J. Roy. Statist. Soc. B 39 1 1977 1 38
- (1977) J. Roy. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.² Rubin, D.B.³

7
- 77953696646
- On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR
- C.-T. Do, D. Pastor, and A. Goalic On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR IEEE Trans. Audio Speech Lang. Process. 18 5 2010 1065 1068
- (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.5 , pp. 1065-1068
- Do, C.-T.¹ Pastor, D.² Goalic, A.³

8
- 80052745242
- Corrélation entre les différences entre les taux de reconnaissance de la parole sur deux ensembles de test et celles des distributions de probabilité des vecteurs acoustiques de ces même ensembles
- May 25-28, Mons, Belgium
- Do, C.-T.; Pastor, D.; Goalic, A.; 2010b. Corrélation entre les différences entre les taux de reconnaissance de la parole sur deux ensembles de test et celles des distributions de probabilité des vecteurs acoustiques de ces même ensembles. In: Proceedings of JEP 2010 - Journées d'Etude sur la Parole, May 25-28, Mons, Belgium, pp. 49-52.
- (2010) Proceedings of JEP 2010 - Journées d'Etude sur la Parole , pp. 49-52
- Do, C.-T.¹ Pastor, D.² Goalic, A.³

9
- 0021645331
- Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
- Y. Ephraim, and D. Malah Speech enhancement using a minimum mean square error short-time spectral amplitude estimator IEEE Trans. Acoustics Speech Signal Process. 32 6 1984 1109 1121
- (1984) IEEE Trans. Acoustics Speech Signal Process. , vol.32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

10
- 0021892216
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
- Y. Ephraim, and D. Malah Speech enhancement using a minimum mean-square error log-spectral amplitude estimator IEEE Trans. Acoustics Speech Signal Process. 33 2 1985 443 445
- (1985) IEEE Trans. Acoustics Speech Signal Process. , vol.33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

11
- 80052727747
- Speaker-independent isolated word recognition using dynamic features of speech spectrum
- S. Furui Speaker-independent isolated word recognition using dynamic features of speech spectrum IEEE Trans. Acoust. Speech Signal Process. 32 4 1980 357 366
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.32 , Issue.4 , pp. 357-366
- Furui, S.¹

12
- 0003671941
- Ph.D. Thesis. Cambridge University
- Gales, M.; 1996. Model-based techniques for noise robust speech recognition. Ph.D. Thesis. Cambridge University.
- (1996) Model-based Techniques for Noise Robust Speech Recognition
- Gales, M.¹

13
- 0032139556
- Predictive model-based compensation schemes for robust speech recognition
- PII S0167639398000296
- M.J.F. Gales Predictive model-based compensation schemes for robust speech recognition Speech Comm. 25 1-3 1998 49 74 (Pubitemid 128413634)
- (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 49-74
- Gales, M.J.F.¹

14
- 0001596920
- Large-vocabulary continuous speech recognition: Advances and applications
- Gauvain, J.-L.; Lamel, L.; 2000. Large-vocabulary continuous speech recognition: advances and applications. In: Proceedings of the IEEE, vol. 88, no. 8, pp. 1181-1200.
- (2000) Proceedings of the IEEE , vol.88 , Issue.8 , pp. 1181-1200
- Gauvain, J.-L.¹ Lamel, L.²

15
- 0029288202
- Speech recognition in noisy environments: A survey
- Y. Gong Speech recognition in noisy environments: a survey Speech Comm. 16 3 1995 261 291
- (1995) Speech Comm. , vol.16 , Issue.3 , pp. 261-291
- Gong, Y.¹

16
- 33748559716
- Speech enhancement using temporal masking and fractional Bark gammatone filters
- December 8-10, Sydney, Australia, December
- Gunawan, T.S.; Ambikairajah, E.; 2004. Speech enhancement using temporal masking and fractional Bark gammatone filters. In: Proceedings of the 10th Australian International Conference on Speech Science & Technology, December 8-10, Sydney, Australia, December, pp. 420-425.
- (2004) Proceedings of the 10th Australian International Conference on Speech Science & Technology , pp. 420-425
- Gunawan, T.S.¹ Ambikairajah, E.²

17
- 0035510532
- Spectral subtraction using reduced delay convolution and adaptive averaging
- DOI 10.1109/89.966083, PII S1063667601096729
- H. Gustafsson, S. Nordholm, and I. Claesson Spectral subtraction using reduced delay convolution and adaptive averaging IEEE Speech Audio Process. 9 8 2001 799 807 (Pubitemid 33137932)
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.8 , pp. 799-807
- Gustafsson, H.¹ Nordholm, S.E.² Claesson, I.³

18
- 47949104834
- Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
- J.H.L. Hansen, V. Radhakrishnan, and K. Arehart Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system IEEE Trans. Audio Speech Lang. Process. 14 6 2006 2049 2063
- (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , Issue.6 , pp. 2049-2063
- Hansen, J.H.L.¹ Radhakrishnan, V.² Arehart, K.³

19
- 0028517164
- RASTA processing of speech
- H. Hermansky, and N. Morgan RASTA processing of speech IEEE Trans. Speech Audio Process. 2 4 1994 578 589
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

20
- 34547516258
- Approximating the Kullback Leibler divergence between Gaussian mixture models
- April 15-20, Hawaii, USA
- Hershey J.R.; Olsen, P.A.; 2007. Approximating the Kullback Leibler divergence between Gaussian mixture models. In: Proceedings of the IEEE ICASSP 2007, April 15-20, Hawaii, USA, vol. 4, pp. 317-324.
- (2007) Proceedings of the IEEE ICASSP 2007 , vol.4 , pp. 317-324
- Hershey, J.R.¹ Olsen, P.A.²

21
- 0038669544
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- September 18-20, Paris, France
- Hirsch, H.-G.; Pearce, D.; 2000. The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proceedings of the ISCA ASR2000: automatic speech recognition: Challenges for the new millenium, September 18-20, Paris, France.
- (2000) Proceedings of the ISCA ASR2000: Automatic Speech Recognition: Challenges for the New Millennium
- Hirsch, H.-G.¹ Pearce, D.²

22
- 0041591273
- A generalized subspace approach for enhancing speech corrupted by colored noise
- Y. Hu, and P. Loizou A generalized subspace approach for enhancing speech corrupted by colored noise IEEE Trans. Speech Audio Process. 11 4 2003 334 341
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.4 , pp. 334-341
- Hu, Y.¹ Loizou, P.²

23
- 0347337999
- Incorporating the human hearing properties in the signal subspace approach for speech enhancement
- F. Jabloun, and B. Champagne Incorporating the human hearing properties in the signal subspace approach for speech enhancement IEEE Trans. Speech Audio Process. 11 6 2003 700 708
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 700-708
- Jabloun, F.¹ Champagne, B.²

24
- 0032675721
- On speech coding in a perceptual domain
- March 15-19, Phoenix, AZ, USA
- Kubin, G.; Kleijn, W.B.; 1999. On speech coding in a perceptual domain. In: Proceedings of the IEEE ICASSP 1999, March 15-19, Phoenix, AZ, USA, vol. 1, pp. 205-208.
- (1999) Proceedings of the IEEE ICASSP 1999 , vol.1 , pp. 205-208
- Kubin, G.¹ Kleijn, W.B.²

25
- 0001927585
- On information and sufficiency
- S. Kullback, and R.A. Leibler On information and sufficiency Ann. Math. Statist. 22 1 1951 79 86
- (1951) Ann. Math. Statist. , vol.22 , Issue.1 , pp. 79-86
- Kullback, S.¹ Leibler, R.A.²

26
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C.J. Leggetter, and P.C. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Comput. Speech Lang. 9 2 1995 171 185
- (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

27
- 0002560960
- A database for speaker-independent digit recognition
- March 19-21, San Diego, USA
- Leonard, R.; 1984. A database for speaker-independent digit recognition. In: Proceedings of the IEEE ICASSP 1984, March 19-21, San Diego, USA, vol. 9, pp. 328-331.
- (1984) Proceedings of the IEEE ICASSP 1984 , vol.9 , pp. 328-331
- Leonard, R.¹

28
- 0032935343
- Introduction to cochlear implants
- DOI 10.1109/51.740962
- P. Loizou Introduction to cochlear implants IEEE Eng. Med. Biology Mag. 18 1 1999 32 42 (Pubitemid 29059005)
- (1999) IEEE Engineering in Medicine and Biology Magazine , vol.18 , Issue.1 , pp. 32-42
- Loizou, P.C.¹

29
- 34447100796
- CRC Boca Raton, FL
- P. Loizou Speech Enhancement: Theory and Practice 2007 CRC Boca Raton, FL
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.¹

30
- 0024766457
- A family of distortion measures based upon projection operation for robust speech recognition
- D. Mansour, and B.-H. Juang A family of distortion measures based upon projection operation for robust speech recognition IEEE Trans. Acoust. Speech Signal Process. 37 11 1989 1659 1671
- (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , Issue.11 , pp. 1659-1671
- Mansour, D.¹ Juang, B.-H.²

31
- 0020796537
- A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood
- A. Nadas A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood IEEE Trans. Acoust. Speech Signal Process. 31 4 1983 814 817 (Pubitemid 14455162)
- (1983) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-31 , Issue.4 , pp. 814-817
- Nadas Arthur¹

32
- 29444448046
- A noise-estimation algorithm for highly non-stationary environments
- DOI 10.1016/j.specom.2005.08.005, PII S0167639305002001
- S. Rangachari, and P. Loizou A noise-estimation algorithm for highly non-stationary environments Speech Commun. 48 2 2006 220 231 (Pubitemid 43012033)
- (2006) Speech Communication , vol.48 , Issue.2 , pp. 220-231
- Rangachari, S.¹ Loizou, P.C.²

33
- 33750344712
- Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition
- DOI 10.1016/j.specom.2006.08.003, PII S0167639306000914
- B.J. Shannon, and K.K. Paliwal Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition Speech Comm. 48 11 2006 1458 1485 (Pubitemid 44634773)
- (2006) Speech Communication , vol.48 , Issue.11 , pp. 1458-1485
- Shannon, B.J.¹ Paliwal, K.K.²

34
- 0028823541
- Speech recognition with primarily temporal cues
- R.V. Shannon, F.-G. Zeng, V. Kamath, J. Wygonski, and M. Ekelid Speech recognition with primarily temporal cues Science 270 5234 1995 303 304
- (1995) Science , vol.270 , Issue.5234 , pp. 303-304
- Shannon, R.V.¹ Zeng, F.-G.² Kamath, V.³ Wygonski, J.⁴ Ekelid, M.⁵

35
- 34047272127
- Average divergence distance as a statistical discrimination measure for hidden Markov models
- DOI 10.1109/TSA.2005.858059
- J. Silva, and S. Narayanan Average divergence distance as a statistical discrimination measure for hidden Markov models IEEE Trans. Audio Speech Lang. Process. 14 3 2006 890 906 (Pubitemid 46547651)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 890-906
- Silva, J.¹ Narayanan, S.²

36
- 65449173640
- Upper bound Kullback-Leibler divergence for transient hidden Markov models
- J. Silva, and S. Narayanan Upper bound Kullback-Leibler divergence for transient hidden Markov models IEEE Trans. Audio Speech Lang. Process. 56 9 2008 4176 4188
- (2008) IEEE Trans. Audio Speech Lang. Process. , vol.56 , Issue.9 , pp. 4176-4188
- Silva, J.¹ Narayanan, S.²

37
- 79960554941
- HMMs and related speech technologies
- J. Benesty, M.M. Sondhi, Y. Huang, Springer
- S. Young HMMs and related speech technologies J. Benesty, M.M. Sondhi, Y. Huang, Springer Handbook of Speech Processing 2008 Springer 539 557
- (2008) Springer Handbook of Speech Processing , pp. 539-557
- Young, S.¹

38
- 0003822743
- (for HTK version 3.4), Cambridge university engineering department
- Young, S.; Evermann, G.; Gales, M.; Hain, T.; Kershaw, D.; Liu, X.; Moore, G.; Odell, J.; Ollarson, D.; Povey, D.; Valtchev, V. Woodland, P.; 2006. The HTK book (for HTK version 3.4), Cambridge university engineering department.
- (2006) The HTK Book
- Young, S.¹ Evermann, G.² Gales, M.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.⁶ Moore, G.⁷ Odell, J.⁸ Ollarson, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.¹²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.