메뉴 건너뛰기




Volumn 43, Issue 1-2, 2004, Pages 123-142

Techniques for handling convolutional distortion with 'missing data' automatic speech recognition

Author keywords

Missing data; Reverberation; Spectral distortion; Spectral normalisation; Speech recognition

Indexed keywords

AUTOMATION; CONVOLUTION; DATA REDUCTION; MARKOV PROCESSES; MATHEMATICAL MODELS; REVERBERATION; SPEECH ANALYSIS; SPEECH COMMUNICATION;

EID: 2942539074     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2004.02.005     Document Type: Article
Times cited : (59)

References (59)
  • 1
    • 0141702085 scopus 로고    scopus 로고
    • Environmental sniffing: Noise knowledge estimation for robust speech systems
    • Akbacak M., Hansen J.H.L. Environmental sniffing: noise knowledge estimation for robust speech systems. Proc. ICASSP-2003. II:2003;113-116.
    • (2003) Proc. ICASSP-2003 II , pp. 113-116
    • Akbacak, M.1    Hansen, J.H.L.2
  • 3
    • 2142812604 scopus 로고    scopus 로고
    • The perception of speech under adverse acoustic conditions
    • S. Greenberg, & W. Ainsworth. Springer-Verlag (Springer Handbook of Auditory Research)
    • Assmann P., Summerfield Q. The perception of speech under adverse acoustic conditions. Greenberg S., Ainsworth W. Speech Processing in the Auditory System (Springer Handbook of Auditory Research, Vol. 18). 2003;Springer-Verlag.
    • (2003) Speech Processing in the Auditory System , vol.18
    • Assmann, P.1    Summerfield, Q.2
  • 4
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • Atal B.S. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. J. Acoust. Soc. Am. 55:1974;1304-1312.
    • (1974) J. Acoust. Soc. Am. , vol.55 , pp. 1304-1312
    • Atal, B.S.1
  • 5
    • 85009096997 scopus 로고    scopus 로고
    • Decoding speech in the presence of other sound sources
    • Barker J., Cooke M.P., Ellis D.P.W. Decoding speech in the presence of other sound sources. Proc. ICSLP-2000. IV:2000;270-273.
    • (2000) Proc. ICSLP-2000 IV , pp. 270-273
    • Barker, J.1    Cooke, M.P.2    Ellis, D.P.W.3
  • 6
    • 85009063707 scopus 로고    scopus 로고
    • Soft decisions in missing data techniques for robust automatic speech recognition
    • Barker J., Josifovski L., Cooke M.P., Green P.D. Soft decisions in missing data techniques for robust automatic speech recognition. Proc. ICSLP-2000. I:2000;373-376.
    • (2000) Proc. ICSLP-2000 I , pp. 373-376
    • Barker, J.1    Josifovski, L.2    Cooke, M.P.3    Green, P.D.4
  • 7
    • 85009106519 scopus 로고    scopus 로고
    • Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
    • Barker J., Cooke M.P., Green P.D. Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise. Proc. Eurospeech-2001. 2001;213-217.
    • (2001) Proc. Eurospeech-2001 , pp. 213-217
    • Barker, J.1    Cooke, M.P.2    Green, P.D.3
  • 9
    • 0022479342 scopus 로고
    • Predictors of speech intelligibility in rooms
    • Bradley J.S. Predictors of speech intelligibility in rooms. J. Acoust. Soc. Am. 80:1986;837-845.
    • (1986) J. Acoust. Soc. Am. , vol.80 , pp. 837-845
    • Bradley, J.S.1
  • 11
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • Brown G.J., Cooke M.P. Computational auditory scene analysis. Comp. Speech Lang. 8:1994;297-336.
    • (1994) Comp. Speech Lang. , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 12
    • 0034850070 scopus 로고    scopus 로고
    • A neural oscillator sound separator for missing data speech recognition
    • Brown G.J., Barker J., Wang D.L. A neural oscillator sound separator for missing data speech recognition. Proc. IJCNN-2001. 2001;2907-2912.
    • (2001) Proc. IJCNN-2001 , pp. 2907-2912
    • Brown, G.J.1    Barker, J.2    Wang, D.L.3
  • 15
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Cooke M.P., Green P.D., Josifovski L., Vizinho A. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Comm. 34:2001;267-285.
    • (2001) Speech Comm. , vol.34 , pp. 267-285
    • Cooke, M.P.1    Green, P.D.2    Josifovski, L.3    Vizinho, A.4
  • 16
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S.P., Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. ASSP-28:1980;357-366.
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.ASSP-38 , pp. 357-366
    • Davis, S.P.1    Mermelstein, P.2
  • 17
    • 0036291376 scopus 로고    scopus 로고
    • Uncertainty decoding with SPLICE for noise robust speech recognition
    • Droppo J., Acero A., Deng L. Uncertainty decoding with SPLICE for noise robust speech recognition. Proc. ICASSP-2002. I:2002;57-60.
    • (2002) Proc. ICASSP-2002 I , pp. 57-60
    • Droppo, J.1    Acero, A.2    Deng, L.3
  • 18
    • 0027957839 scopus 로고
    • Effects of temporal envelope smearing on speech reception
    • Drullman R., Festen J.M., Plomp R. Effects of temporal envelope smearing on speech reception. J. Acoust. Soc. Amer. 95:1994;1053-1064.
    • (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 1053-1064
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 23
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Hermansky H. Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Am. 87:1990;1738-1752.
    • (1990) J. Acoust. Soc. Am. , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 24
    • 0032139768 scopus 로고    scopus 로고
    • Should recognisers have ears?
    • Hermansky H. Should recognisers have ears? Speech Comm. 25:1998;3-27.
    • (1998) Speech Comm. , vol.25 , pp. 3-27
    • Hermansky, H.1
  • 26
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • Hermansky H., Ellis D.P.W., Sharma S. Tandem connectionist feature extraction for conventional HMM systems. Proc. ICASSP-2000. III:2000;1635-1638.
    • (2000) Proc. ICASSP-2000 III , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 27
    • 0028996871 scopus 로고
    • Noise estimation techniques for robust speech recognition
    • Hirsch H.G., Erlicher C. Noise estimation techniques for robust speech recognition. Proc. ICASSP-1995. I:1995;153-156.
    • (1995) Proc. ICASSP-1995 I , pp. 153-156
    • Hirsch, H.G.1    Erlicher, C.2
  • 28
    • 84873312246 scopus 로고
    • A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria
    • Houtgast T., Steeneken H.J.M. A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria. J. Acoust. Soc. Am. 77:1985;1069-1077.
    • (1985) J. Acoust. Soc. Am. , vol.77 , pp. 1069-1077
    • Houtgast, T.1    Steeneken, H.J.M.2
  • 33
    • 0032676337 scopus 로고    scopus 로고
    • On the relative importance of various components of the modulation spectrum for automatic speech recognition
    • Kandera N., Arai T., Hermansky H., Pavel M. On the relative importance of various components of the modulation spectrum for automatic speech recognition. Speech Comm. 28:1999;43-55.
    • (1999) Speech Comm. , vol.28 , pp. 43-55
    • Kandera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 35
    • 0032136330 scopus 로고    scopus 로고
    • Robust speech recognition using the modulation spectrogram
    • Kingsbury B.E.D., Morgan N., Greenberg S. Robust speech recognition using the modulation spectrogram. Speech Comm. 25:1998;117-132.
    • (1998) Speech Comm. , vol.25 , pp. 117-132
    • Kingsbury, B.E.D.1    Morgan, N.2    Greenberg, S.3
  • 36
    • 0037211087 scopus 로고    scopus 로고
    • Sub-band SNR estimation using auditory feature processing
    • Kleinschmidt M., Hohmann V. Sub-band SNR estimation using auditory feature processing. Speech Comm. 39(1-2):2003;47-64.
    • (2003) Speech Comm. , vol.39 , Issue.1-2 , pp. 47-64
    • Kleinschmidt, M.1    Hohmann, V.2
  • 37
    • 0035308233 scopus 로고    scopus 로고
    • Classification of general audio data for content-based retrieval
    • Li D., Sethi I.K., Dimitrova N., McGee T. Classification of general audio data for content-based retrieval. Pattern Recognition Lett. 22:2001;533-544.
    • (2001) Pattern Recognition Lett. , vol.22 , pp. 533-544
    • Li, D.1    Sethi, I.K.2    Dimitrova, N.3    Mcgee, T.4
  • 38
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • Lippmann R.P. Speech recognition by machines and humans. Speech Comm. 22:1997;1-15.
    • (1997) Speech Comm. , vol.22 , pp. 1-15
    • Lippmann, R.P.1
  • 39
    • 0038422099 scopus 로고    scopus 로고
    • Single gauss model set-based data imputation method for complex ASR task
    • Luo Y., Du L. Single gauss model set-based data imputation method for complex ASR task. Proc. ISCAS 2003. II:2003;564-567.
    • (2003) Proc. ISCAS 2003 II , pp. 564-567
    • Luo, Y.1    Du, L.2
  • 41
    • 0001797537 scopus 로고
    • An efficient algorithm to estimate the instantaneous SNR of speech signals
    • Martin R. An efficient algorithm to estimate the instantaneous SNR of speech signals. Proc. Eurospeech-1993. 1993;37-40.
    • (1993) Proc. Eurospeech-1993 , pp. 37-40
    • Martin, R.1
  • 42
    • 2942626614 scopus 로고    scopus 로고
    • MATLAB release 13 reference manual. Natick, MA
    • Mathworks, Inc., 2003. MATLAB release 13 reference manual. Natick, MA.
    • (2003) Mathworks, Inc.
  • 44
    • 2942557838 scopus 로고    scopus 로고
    • Analysis of noise PDF transformation in secondary feature processing
    • IDIAP, Martigny, Switzerland
    • Morris, A.C., 2002. Analysis of noise PDF transformation in secondary feature processing. IDIAP Research Report 02-29, IDIAP, Martigny, Switzerland.
    • (2002) IDIAP Research Report , vol.2 , Issue.29
    • Morris, A.C.1
  • 45
    • 84892151303 scopus 로고    scopus 로고
    • Some solutions to the missing feature problem in the classification, with application to noise-robust ASR
    • Morris A.C., Cooke M.P., Green P.D. Some solutions to the missing feature problem in the classification, with application to noise-robust ASR. Proc. ICASSP-1998. II:1998;737-740.
    • (1998) Proc. ICASSP-1998 II , pp. 737-740
    • Morris, A.C.1    Cooke, M.P.2    Green, P.D.3
  • 46
    • 0020325263 scopus 로고
    • Monaural and binaural speech perception in reverberation for listeners of various ages
    • Nabelek A.K., Robinson P.K. Monaural and binaural speech perception in reverberation for listeners of various ages. J. Acoust. Soc. Amer. 71:1982;1242-1248.
    • (1982) J. Acoust. Soc. Amer. , vol.71 , pp. 1242-1248
    • Nabelek, A.K.1    Robinson, P.K.2
  • 47
    • 0032142014 scopus 로고    scopus 로고
    • Environmental conditions and acoustic transduction in hands-free speech recognition
    • Omologo M., Svaizer P., Matassoni M. Environmental conditions and acoustic transduction in hands-free speech recognition. Speech Comm. 25:1998;75-95.
    • (1998) Speech Comm. , vol.25 , pp. 75-95
    • Omologo, M.1    Svaizer, P.2    Matassoni, M.3
  • 49
    • 24444447717 scopus 로고    scopus 로고
    • A binaural auditory model for missing data speech recognition in noisy and reverberant conditions
    • Aalborg, 2nd September
    • Palomäki, K.J., Brown, G.J., Wang, D.L., 2001. A binaural auditory model for missing data speech recognition in noisy and reverberant conditions. In: Proc. CRAC Eurospeech-2001 satellite workshop, Aalborg, 2nd September.
    • (2001) Proc. CRAC Eurospeech-2001 Satellite Workshop
    • Palomäki, K.J.1    Brown, G.J.2    Wang, D.L.3
  • 50
  • 51
    • 84902042740 scopus 로고    scopus 로고
    • A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation
    • in press
    • Palomäki, K.J., Brown, G.J., Wang, D.L., in press. A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. Speech Comm.
    • Speech Comm
    • Palomäki, K.J.1    Brown, G.J.2    Wang, D.L.3
  • 53
    • 84987702417 scopus 로고    scopus 로고
    • The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Pearce D., Hirsch H.-G. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. Proc. ICSLP-2000. 4:2000;29-32.
    • (2000) Proc. ICSLP-2000 , vol.4 , pp. 29-32
    • Pearce, D.1    Hirsch, H.-G.2
  • 56
    • 85057633672 scopus 로고    scopus 로고
    • Reconstruction of missing features for robust speech recognition
    • in press
    • Raj, B., Seltzer, M.L., Stern, R.M., in press. Reconstruction of Missing Features for Robust Speech Recognition, Speech Comm.
    • Speech Comm
    • Raj, B.1    Seltzer, M.L.2    Stern, R.M.3
  • 57
    • 84881675408 scopus 로고
    • Cepstral channel normalisation techniques for HMM-based speaker verification
    • Rosenberg A.E., Lee C.-H., Soong F.K. Cepstral channel normalisation techniques for HMM-based speaker verification. Proc. ICSLP 94. 4:1994;1835-1838.
    • (1994) Proc. ICSLP 94 , vol.4 , pp. 1835-1838
    • Rosenberg, A.E.1    Lee, C.-H.2    Soong, F.K.3
  • 58
    • 2942623044 scopus 로고    scopus 로고
    • Step by step guide to using the speech training and recognition unified tool STRUT
    • STRUT Version 2.4, 1997. Step by step guide to using the speech training and recognition unified tool STRUT. Available from 〈www.tcts.fpms.ac.be/ asr/project/strut/〉.
    • (1997) STRUT Version 2.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.