메뉴 건너뛰기




Volumn 54, Issue 1, 2012, Pages 119-133

A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech

Author keywords

Aurora 2; Cochlear implant; HMM based ASR; Kullback Leibler divergence; Noise robust ASR; Spectrally reduced speech

Indexed keywords

AURORA 2; HMM-BASED ASR; KULLBACK LEIBLER DIVERGENCE; ROBUST ASR; SPECTRALLY REDUCED SPEECH;

EID: 80052737228     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2011.07.006     Document Type: Article
Times cited : (9)

References (38)
  • 1
    • 0002215069 scopus 로고
    • On a measure of divergence between two statistical populations defined by their probability distributions
    • A. Bhattacharyya On a measure of divergence between two statistical populations defined by their probability distributions Bull. Calcutta Math. Soc. 35 1943 99 109
    • (1943) Bull. Calcutta Math. Soc. , vol.35 , pp. 99-109
    • Bhattacharyya, A.1
  • 2
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • S.F. Boll Suppression of acoustic noise in speech using spectral subtraction IEEE Trans. Acoust. Speech Signal Process. 27 2 1979 113 120
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 4
    • 0036226165 scopus 로고    scopus 로고
    • Noise estimation by minima controlled recursive averaging for robust speech enhancement
    • DOI 10.1109/97.988717, PII S1070990802024100
    • I. Cohen, and B. Berdugo Noise estimation by minima controlled recursive averaging for robust speech enhancement IEEE Signal Process. Lett. 9 1 2002 12 15 (Pubitemid 34306628)
    • (2002) IEEE Signal Processing Letters , vol.9 , Issue.1 , pp. 12-15
    • Cohen, I.1    Berdugo, B.2
  • 5
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho Robust automatic speech recognition with missing and unreliable acoustic data Speech Comm. 34 3 2001 267 285 (Pubitemid 32284867)
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 6
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the em algorithm
    • A.P. Dempster, N. Laird, and D.B. Rubin Maximum likelihood from incomplete data via the EM algorithm J. Roy. Statist. Soc. B 39 1 1977 1 38
    • (1977) J. Roy. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.2    Rubin, D.B.3
  • 7
    • 77953696646 scopus 로고    scopus 로고
    • On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR
    • C.-T. Do, D. Pastor, and A. Goalic On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR IEEE Trans. Audio Speech Lang. Process. 18 5 2010 1065 1068
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.5 , pp. 1065-1068
    • Do, C.-T.1    Pastor, D.2    Goalic, A.3
  • 8
    • 80052745242 scopus 로고    scopus 로고
    • Corrélation entre les différences entre les taux de reconnaissance de la parole sur deux ensembles de test et celles des distributions de probabilité des vecteurs acoustiques de ces même ensembles
    • May 25-28, Mons, Belgium
    • Do, C.-T.; Pastor, D.; Goalic, A.; 2010b. Corrélation entre les différences entre les taux de reconnaissance de la parole sur deux ensembles de test et celles des distributions de probabilité des vecteurs acoustiques de ces même ensembles. In: Proceedings of JEP 2010 - Journées d'Etude sur la Parole, May 25-28, Mons, Belgium, pp. 49-52.
    • (2010) Proceedings of JEP 2010 - Journées d'Etude sur la Parole , pp. 49-52
    • Do, C.-T.1    Pastor, D.2    Goalic, A.3
  • 9
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
    • Y. Ephraim, and D. Malah Speech enhancement using a minimum mean square error short-time spectral amplitude estimator IEEE Trans. Acoustics Speech Signal Process. 32 6 1984 1109 1121
    • (1984) IEEE Trans. Acoustics Speech Signal Process. , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 10
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Y. Ephraim, and D. Malah Speech enhancement using a minimum mean-square error log-spectral amplitude estimator IEEE Trans. Acoustics Speech Signal Process. 33 2 1985 443 445
    • (1985) IEEE Trans. Acoustics Speech Signal Process. , vol.33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 11
    • 80052727747 scopus 로고
    • Speaker-independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui Speaker-independent isolated word recognition using dynamic features of speech spectrum IEEE Trans. Acoust. Speech Signal Process. 32 4 1980 357 366
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.32 , Issue.4 , pp. 357-366
    • Furui, S.1
  • 13
    • 0032139556 scopus 로고    scopus 로고
    • Predictive model-based compensation schemes for robust speech recognition
    • PII S0167639398000296
    • M.J.F. Gales Predictive model-based compensation schemes for robust speech recognition Speech Comm. 25 1-3 1998 49 74 (Pubitemid 128413634)
    • (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 49-74
    • Gales, M.J.F.1
  • 14
    • 0001596920 scopus 로고    scopus 로고
    • Large-vocabulary continuous speech recognition: Advances and applications
    • Gauvain, J.-L.; Lamel, L.; 2000. Large-vocabulary continuous speech recognition: advances and applications. In: Proceedings of the IEEE, vol. 88, no. 8, pp. 1181-1200.
    • (2000) Proceedings of the IEEE , vol.88 , Issue.8 , pp. 1181-1200
    • Gauvain, J.-L.1    Lamel, L.2
  • 15
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Y. Gong Speech recognition in noisy environments: a survey Speech Comm. 16 3 1995 261 291
    • (1995) Speech Comm. , vol.16 , Issue.3 , pp. 261-291
    • Gong, Y.1
  • 17
    • 0035510532 scopus 로고    scopus 로고
    • Spectral subtraction using reduced delay convolution and adaptive averaging
    • DOI 10.1109/89.966083, PII S1063667601096729
    • H. Gustafsson, S. Nordholm, and I. Claesson Spectral subtraction using reduced delay convolution and adaptive averaging IEEE Speech Audio Process. 9 8 2001 799 807 (Pubitemid 33137932)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.8 , pp. 799-807
    • Gustafsson, H.1    Nordholm, S.E.2    Claesson, I.3
  • 18
    • 47949104834 scopus 로고    scopus 로고
    • Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
    • J.H.L. Hansen, V. Radhakrishnan, and K. Arehart Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system IEEE Trans. Audio Speech Lang. Process. 14 6 2006 2049 2063
    • (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , Issue.6 , pp. 2049-2063
    • Hansen, J.H.L.1    Radhakrishnan, V.2    Arehart, K.3
  • 20
    • 34547516258 scopus 로고    scopus 로고
    • Approximating the Kullback Leibler divergence between Gaussian mixture models
    • April 15-20, Hawaii, USA
    • Hershey J.R.; Olsen, P.A.; 2007. Approximating the Kullback Leibler divergence between Gaussian mixture models. In: Proceedings of the IEEE ICASSP 2007, April 15-20, Hawaii, USA, vol. 4, pp. 317-324.
    • (2007) Proceedings of the IEEE ICASSP 2007 , vol.4 , pp. 317-324
    • Hershey, J.R.1    Olsen, P.A.2
  • 22
    • 0041591273 scopus 로고    scopus 로고
    • A generalized subspace approach for enhancing speech corrupted by colored noise
    • Y. Hu, and P. Loizou A generalized subspace approach for enhancing speech corrupted by colored noise IEEE Trans. Speech Audio Process. 11 4 2003 334 341
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.4 , pp. 334-341
    • Hu, Y.1    Loizou, P.2
  • 23
    • 0347337999 scopus 로고    scopus 로고
    • Incorporating the human hearing properties in the signal subspace approach for speech enhancement
    • F. Jabloun, and B. Champagne Incorporating the human hearing properties in the signal subspace approach for speech enhancement IEEE Trans. Speech Audio Process. 11 6 2003 700 708
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 700-708
    • Jabloun, F.1    Champagne, B.2
  • 24
    • 0032675721 scopus 로고    scopus 로고
    • On speech coding in a perceptual domain
    • March 15-19, Phoenix, AZ, USA
    • Kubin, G.; Kleijn, W.B.; 1999. On speech coding in a perceptual domain. In: Proceedings of the IEEE ICASSP 1999, March 15-19, Phoenix, AZ, USA, vol. 1, pp. 205-208.
    • (1999) Proceedings of the IEEE ICASSP 1999 , vol.1 , pp. 205-208
    • Kubin, G.1    Kleijn, W.B.2
  • 25
  • 26
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter, and P.C. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Comput. Speech Lang. 9 2 1995 171 185
    • (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 27
    • 0002560960 scopus 로고
    • A database for speaker-independent digit recognition
    • March 19-21, San Diego, USA
    • Leonard, R.; 1984. A database for speaker-independent digit recognition. In: Proceedings of the IEEE ICASSP 1984, March 19-21, San Diego, USA, vol. 9, pp. 328-331.
    • (1984) Proceedings of the IEEE ICASSP 1984 , vol.9 , pp. 328-331
    • Leonard, R.1
  • 30
    • 0024766457 scopus 로고
    • A family of distortion measures based upon projection operation for robust speech recognition
    • D. Mansour, and B.-H. Juang A family of distortion measures based upon projection operation for robust speech recognition IEEE Trans. Acoust. Speech Signal Process. 37 11 1989 1659 1671
    • (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , Issue.11 , pp. 1659-1671
    • Mansour, D.1    Juang, B.-H.2
  • 31
    • 0020796537 scopus 로고
    • A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood
    • A. Nadas A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood IEEE Trans. Acoust. Speech Signal Process. 31 4 1983 814 817 (Pubitemid 14455162)
    • (1983) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-31 , Issue.4 , pp. 814-817
    • Nadas Arthur1
  • 32
    • 29444448046 scopus 로고    scopus 로고
    • A noise-estimation algorithm for highly non-stationary environments
    • DOI 10.1016/j.specom.2005.08.005, PII S0167639305002001
    • S. Rangachari, and P. Loizou A noise-estimation algorithm for highly non-stationary environments Speech Commun. 48 2 2006 220 231 (Pubitemid 43012033)
    • (2006) Speech Communication , vol.48 , Issue.2 , pp. 220-231
    • Rangachari, S.1    Loizou, P.C.2
  • 33
    • 33750344712 scopus 로고    scopus 로고
    • Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition
    • DOI 10.1016/j.specom.2006.08.003, PII S0167639306000914
    • B.J. Shannon, and K.K. Paliwal Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition Speech Comm. 48 11 2006 1458 1485 (Pubitemid 44634773)
    • (2006) Speech Communication , vol.48 , Issue.11 , pp. 1458-1485
    • Shannon, B.J.1    Paliwal, K.K.2
  • 35
    • 34047272127 scopus 로고    scopus 로고
    • Average divergence distance as a statistical discrimination measure for hidden Markov models
    • DOI 10.1109/TSA.2005.858059
    • J. Silva, and S. Narayanan Average divergence distance as a statistical discrimination measure for hidden Markov models IEEE Trans. Audio Speech Lang. Process. 14 3 2006 890 906 (Pubitemid 46547651)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 890-906
    • Silva, J.1    Narayanan, S.2
  • 36
    • 65449173640 scopus 로고    scopus 로고
    • Upper bound Kullback-Leibler divergence for transient hidden Markov models
    • J. Silva, and S. Narayanan Upper bound Kullback-Leibler divergence for transient hidden Markov models IEEE Trans. Audio Speech Lang. Process. 56 9 2008 4176 4188
    • (2008) IEEE Trans. Audio Speech Lang. Process. , vol.56 , Issue.9 , pp. 4176-4188
    • Silva, J.1    Narayanan, S.2
  • 37
    • 79960554941 scopus 로고    scopus 로고
    • HMMs and related speech technologies
    • J. Benesty, M.M. Sondhi, Y. Huang, Springer
    • S. Young HMMs and related speech technologies J. Benesty, M.M. Sondhi, Y. Huang, Springer Handbook of Speech Processing 2008 Springer 539 557
    • (2008) Springer Handbook of Speech Processing , pp. 539-557
    • Young, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.