메뉴 건너뛰기




Volumn 43, Issue 4 SPEC. ISS., 2004, Pages 361-378

A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation

Author keywords

Binaural model; Missing data; Precedence effect; Speech recognition

Indexed keywords

ACOUSTIC NOISE; COMPUTATIONAL METHODS; DECODING; MICROPHONES; REVERBERATION; SENSORY AIDS;

EID: 4644304197     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2004.03.005     Document Type: Article
Times cited : (90)

References (54)
  • 1
    • 0018455820 scopus 로고
    • Image method for efficiently simulating small-room acoustics
    • Allen, J.B., Berkley, D.A., 1979. Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Amer. 65 (4), 943-950.
    • (1979) J. Acoust. Soc. Amer. , vol.65 , Issue.4 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 2
    • 85009096997 scopus 로고    scopus 로고
    • Decoding speech in the presence of other sound sources
    • Barker, J., Cooke, M.P., Ellis, D.P.W., 2000a. Decoding speech in the presence of other sound sources. Proc. ICSLP 4, 270-273.
    • (2000) Proc. ICSLP , vol.4 , pp. 270-273
    • Barker, J.1    Cooke, M.P.2    Ellis, D.P.W.3
  • 3
    • 85009063707 scopus 로고    scopus 로고
    • Soft decisions in missing data techniques for robust automatic speech recognition
    • Barker, J., Josifovski, L., Cooke, M.P., Green, P.D., 2000b. Soft decisions in missing data techniques for robust automatic speech recognition. Proc. ICSLP 1, 373-376.
    • (2000) Proc. ICSLP , vol.1 , pp. 373-376
    • Barker, J.1    Josifovski, L.2    Cooke, M.P.3    Green, P.D.4
  • 5
    • 0002706411 scopus 로고
    • Modelling human sound-source localization and the cocktail-party-effect
    • Bodden, M., 1993. Modelling human sound-source localization and the cocktail-party-effect. Acta Acoust. 1, 43-55.
    • (1993) Acta Acoust. , vol.1 , pp. 43-55
    • Bodden, M.1
  • 7
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • Brown, G.J., Cooke, M.P., 1994. Computational auditory scene analysis. Comput. Speech Lang. 8, 297-336.
    • (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 8
    • 0034850070 scopus 로고    scopus 로고
    • A neural oscillator sound separator for missing data speech recognition
    • Brown, G.J., Wang, D.L., Barker, J., 2001. A neural oscillator sound separator for missing data speech recognition. Proc. IJCNN 4, 2907-2912.
    • (2001) Proc. IJCNN , vol.4 , pp. 2907-2912
    • Brown, G.J.1    Wang, D.L.2    Barker, J.3
  • 11
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Cooke, M.P., Green, P.D., Josifovski, L., Vizinho, A., 2001. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Comm. 34 (3), 267-285.
    • (2001) Speech Comm. , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.P.1    Green, P.D.2    Josifovski, L.3    Vizinho, A.4
  • 12
    • 0029127703 scopus 로고
    • Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay
    • Culling, J.F., Summerfield, Q., 1995. Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay. J. Acoust. Soc. Amer. 98 (2), 785-797.
    • (1995) J. Acoust. Soc. Amer. , vol.98 , Issue.2 , pp. 785-797
    • Culling, J.F.1    Summerfield, Q.2
  • 13
    • 0001698589 scopus 로고
    • Auditory grouping
    • Moore, B.C.J. (Ed.), Hearing. Academic, London
    • Darwin, C.J., Carlyon, R.P., 1995. Auditory grouping. In: The Handbook of Perception and Cognition. In: Moore, B.C.J. (Ed.), Hearing, Vol. 6. Academic, London, pp. 387-424.
    • (1995) The Handbook of Perception and Cognition , vol.6 , pp. 387-424
    • Darwin, C.J.1    Carlyon, R.P.2
  • 14
    • 0033144658 scopus 로고    scopus 로고
    • Auditory objects of attention: The role of interaural time differences
    • Darwin, C.J., Hukin, R.W., 1999. Auditory objects of attention: the role of interaural time differences. J. Exp. Psychol. Hum. Percept. Perform. 25 (3), 617-629.
    • (1999) J. Exp. Psychol. Hum. Percept. Perform. , vol.25 , Issue.3 , pp. 617-629
    • Darwin, C.J.1    Hukin, R.W.2
  • 15
    • 0033939839 scopus 로고    scopus 로고
    • Effects of reverberation on spatial, prosodic and vocal-tract size cues to selective attention
    • Darwin, C.J., Hukin, R.W., 2000. Effects of reverberation on spatial, prosodic and vocal-tract size cues to selective attention. J. Acoust. Soc. Amer. 108 (1), 335-342.
    • (2000) J. Acoust. Soc. Amer. , vol.108 , Issue.1 , pp. 335-342
    • Darwin, C.J.1    Hukin, R.W.2
  • 16
    • 0026882871 scopus 로고
    • Pitch extraction and separation of overlapping speech
    • Denbigh, P.N., Zhao, J., 1992. Pitch extraction and separation of overlapping speech. Speech Comm. 11, 119-125.
    • (1992) Speech Comm. , vol.11 , pp. 119-125
    • Denbigh, P.N.1    Zhao, J.2
  • 17
    • 0004089083 scopus 로고
    • HRTF measurements of a KEMAR dummy-head microphone
    • MIT Media Lab
    • Gardner, B., Martin, K.D., 1994. HRTF measurements of a KEMAR dummy-head microphone. Technical Report #280, MIT Media Lab. Available from: http://web.media.mit.edu/~kdm/hrtf.html.
    • (1994) Technical Report #280 , vol.280
    • Gardner, B.1    Martin, K.D.2
  • 18
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • Glasberg, B.R., Moore, B.C.J., 1990. Derivation of auditory filter shapes from notched-noise data. Hear. Res. 47 (1-2), 103-138.
    • (1990) Hear. Res. , vol.47 , Issue.1-2 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 19
    • 85024441206 scopus 로고    scopus 로고
    • A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition
    • 1999
    • Glotin, H., Berthommier, F., Tessier, E., 1999. A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition. In: Proc. EUROSPEECH, 1999, pp. 2351-2354.
    • (1999) Proc. EUROSPEECH , pp. 2351-2354
    • Glotin, H.1    Berthommier, F.2    Tessier, E.3
  • 21
    • 0033046283 scopus 로고    scopus 로고
    • Speech intelligibility and localization in a multi-source environment
    • Hawley, M.L., Litovsky, R.Y., Colburn, H.S., 1999. Speech intelligibility and localization in a multi-source environment. J. Acoust. Soc. Amer. 105 (6), 3436-3448.
    • (1999) J. Acoust. Soc. Amer. , vol.105 , Issue.6 , pp. 3436-3448
    • Hawley, M.L.1    Litovsky, R.Y.2    Colburn, H.S.3
  • 22
    • 0032139768 scopus 로고    scopus 로고
    • Should recognizers have ears?
    • Hermansky, H., 1998. Should recognizers have ears?. Speech Comm. 25 (1-3), 3-27.
    • (1998) Speech Comm. , vol.25 , Issue.1-3 , pp. 3-27
    • Hermansky, H.1
  • 25
    • 0029152044 scopus 로고
    • Effects of contralateral presentation and of interaural time differences in segregating a harmonic from a vowel
    • Hukin, R.W., Darwin, C.J., 1995. Effects of contralateral presentation and of interaural time differences in segregating a harmonic from a vowel. J. Acoust. Soc. Amer. 98 (3), 1380-1387.
    • (1995) J. Acoust. Soc. Amer. , vol.98 , Issue.3 , pp. 1380-1387
    • Hukin, R.W.1    Darwin, C.J.2
  • 27
    • 0032136330 scopus 로고    scopus 로고
    • Robust speech recognition using the modulation spectrogram
    • Kingsbury, B.E.D., Morgan, N., Greenberg, S., 1998. Robust speech recognition using the modulation spectrogram. Speech Comm. 25, 117-132.
    • (1998) Speech Comm. , vol.25 , pp. 117-132
    • Kingsbury, B.E.D.1    Morgan, N.2    Greenberg, S.3
  • 28
    • 85009080602 scopus 로고    scopus 로고
    • Improving simultaneous speech recognition in real room environments using overdetermined blind source separation
    • Koutras, A., Dermatas, E., Kokkinakis, O., 2001. Improving simultaneous speech recognition in real room environments using overdetermined blind source separation. Proc. EUROSPEECH 2, 1009-1012.
    • (2001) Proc. EUROSPEECH , vol.2 , pp. 1009-1012
    • Koutras, A.1    Dermatas, E.2    Kokkinakis, O.3
  • 29
    • 0002560960 scopus 로고
    • A database for speaker-independent digit recognition
    • Leonard, R.G., 1984. A database for speaker-independent digit recognition. Proc. ICASSP 3, 111-114.
    • (1984) Proc. ICASSP , vol.3 , pp. 111-114
    • Leonard, R.G.1
  • 30
    • 0023020142 scopus 로고
    • Extension of a binaural cross-correlation model by contralateral inhibition. II. The law of the first wave front
    • Lindemann, W., 1986. Extension of a binaural cross-correlation model by contralateral inhibition. II. The law of the first wave front. J. Acoust. Soc. Amer. 80 (6), 1623-1630.
    • (1986) J. Acoust. Soc. Amer. , vol.80 , Issue.6 , pp. 1623-1630
    • Lindemann, W.1
  • 32
    • 3042556011 scopus 로고    scopus 로고
    • Ph.D. Thesis. Publications in Telecommunications Software and Multimedia, Helsinki University of Technology
    • Lokki, T., 2002. Physically-based auralization. Ph.D. Thesis. Publications in Telecommunications Software and Multimedia, Helsinki University of Technology.
    • (2002) Physically-based Auralization
    • Lokki, T.1
  • 33
    • 0026220146 scopus 로고
    • A computer model of binaural localization for stereo imaging measurement
    • MacPherson, E.A., 1991. A computer model of binaural localization for stereo imaging measurement. J. Audio Eng. Soc. 39 (9), 604-622.
    • (1991) J. Audio Eng. Soc. , vol.39 , Issue.9 , pp. 604-622
    • MacPherson, E.A.1
  • 35
    • 0027003782 scopus 로고
    • Fundamentals of binaural technology
    • Møller, H., 1992. Fundamentals of binaural technology. Appl. Acoust. 36, 171-218.
    • (1992) Appl. Acoust. , vol.36 , pp. 171-218
    • Møller, H.1
  • 37
    • 0020325263 scopus 로고
    • Monaural and binaural speech perception in reverberation for listeners of various ages
    • Nabelek, A.K., Robinson, P.K., 1982. Monaural and binaural speech perception in reverberation for listeners of various ages. J. Acoust. Soc. Amer. 71 (5), 1242-1248.
    • (1982) J. Acoust. Soc. Amer. , vol.71 , Issue.5 , pp. 1242-1248
    • Nabelek, A.K.1    Robinson, P.K.2
  • 38
    • 0032633660 scopus 로고    scopus 로고
    • Listening to two simultaneous speeches
    • Okuno, H.G., Nakatani, T., Kawabata, T., 1999. Listening to two simultaneous speeches. Speech Comm. 27, 299-310.
    • (1999) Speech Comm. , vol.27 , pp. 299-310
    • Okuno, H.G.1    Nakatani, T.2    Kawabata, T.3
  • 39
    • 0036298106 scopus 로고    scopus 로고
    • Missing data speech recognition in reverberant conditions
    • Orlando, 13th-17th May
    • Palomäki, K.J., Brown, G.J., Barker, J., 2002. Missing data speech recognition in reverberant conditions. In: Proc. ICASSP, Orlando, 13th-17th May. pp. 65-68.
    • (2002) Proc. ICASSP , pp. 65-68
    • Palomäki, K.J.1    Brown, G.J.2    Barker, J.3
  • 40
    • 2942539074 scopus 로고    scopus 로고
    • Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
    • Palomäki, K.J., Brown, G.J., Barker, J., 2004. Techniques for handling convolutional distortion with 'missing data' automatic speech recognition. Speech Comm. 43, 123-142.
    • (2004) Speech Comm. , vol.43 , pp. 123-142
    • Palomäki, K.J.1    Brown, G.J.2    Barker, J.3
  • 43
    • 0141855412 scopus 로고    scopus 로고
    • Binaural tracking of multiple moving sources
    • Roman, N., Wang, D.L., 2003. Binaural tracking of multiple moving sources. In: Proc. ICASSP, pp. 149-152.
    • (2003) Proc. ICASSP , pp. 149-152
    • Roman, N.1    Wang, D.L.2
  • 44
    • 0036293705 scopus 로고    scopus 로고
    • Location-based sound segregation
    • Orlando, 13th-17th May
    • Roman, N., Wang, D.L., Brown, G.J., 2002. Location-based sound segregation. In: Proc. ICASSP, Orlando, 13th-17th May. pp. 1013-1016.
    • (2002) Proc. ICASSP , pp. 1013-1016
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 46
    • 85009135262 scopus 로고    scopus 로고
    • Calibration of microphone arrays for improved speech recognition
    • Seltzer, M.L., Raj, B., 2001. Calibration of microphone arrays for improved speech recognition. Proc. EUROSPEECH 2, 1005-1008.
    • (2001) Proc. EUROSPEECH , vol.2 , pp. 1005-1008
    • Seltzer, M.L.1    Raj, B.2
  • 47
    • 0026694566 scopus 로고
    • Across frequency integration in a model of lateralization
    • Shackleton, T.M., Meddis, R., Hewitt, M.J., 1992. Across frequency integration in a model of lateralization. J. Acoust. Soc. Amer. 91 (4), 2276-2279.
    • (1992) J. Acoust. Soc. Amer. , vol.91 , Issue.4 , pp. 2276-2279
    • Shackleton, T.M.1    Meddis, R.2    Hewitt, M.J.3
  • 48
    • 0035254668 scopus 로고    scopus 로고
    • A sound segregation algorithm for reverberant conditions
    • Shamsoddini, A., Denbigh, P.N., 2001. A sound segregation algorithm for reverberant conditions. Speech Comm. 33, 179-196.
    • (2001) Speech Comm. , vol.33 , pp. 179-196
    • Shamsoddini, A.1    Denbigh, P.N.2
  • 49
    • 0000938112 scopus 로고
    • Responding to one of two simultaneous messages
    • Spieth, W., Curtis, J.F., Webster, J.C., 1954. Responding to one of two simultaneous messages. J. Acoust. Soc. Amer. 26 (3), 391-396.
    • (1954) J. Acoust. Soc. Amer. , vol.26 , Issue.3 , pp. 391-396
    • Spieth, W.1    Curtis, J.F.2    Webster, J.C.3
  • 50
    • 0035280043 scopus 로고    scopus 로고
    • A comparison of auditory and blind separation techniques for speech segregation
    • van der Kouwe, A.J.W., Wang, D.L., Brown, G.J., 2001. A comparison of auditory and blind separation techniques for speech segregation. IEEE Trans. Speech Audio Process. 9, 189-195.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 189-195
    • Van Der Kouwe, A.J.W.1    Wang, D.L.2    Brown, G.J.3
  • 52
    • 0026351353 scopus 로고
    • Central, auditory mechanisms of perceptual compensation for spectral- Envelope distortion
    • Watkins, A.J., 1991. Central, auditory mechanisms of perceptual compensation for spectral- envelope distortion. J. Acoust. Soc. Amer. 90 (6), 2942-2955.
    • (1991) J. Acoust. Soc. Amer. , vol.90 , Issue.6 , pp. 2942-2955
    • Watkins, A.J.1
  • 53
    • 0002689429 scopus 로고    scopus 로고
    • The cocktail party problem: Forty years later
    • Gilkey, R.H., Anderson, T.R. (Eds.). Lawrence Erlbaum Associates, Mahwah, NJ
    • Yost, W.A., 1997. The cocktail party problem: Forty years later. In: Gilkey, R.H., Anderson, T.R. (Eds.), Binaural Hearing in Real and Virtual Environments. Lawrence Erlbaum Associates, Mahwah, NJ, pp. 329-348.
    • (1997) Binaural Hearing in Real and Virtual Environments , pp. 329-348
    • Yost, W.A.1
  • 54
    • 0002902279 scopus 로고
    • The precedence effect
    • Yost, W.A., Gourevitch, G. (Eds.). Springer-Verlag, New York
    • Zurek, P.M., 1987. The precedence effect. In: Yost, W.A., Gourevitch, G. (Eds.), Directional Hearing. Springer-Verlag, New York, pp. 85-105.
    • (1987) Directional Hearing , pp. 85-105
    • Zurek, P.M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.