SCOPUS Information Search Platform

Speech Communication

Volume 43, Issue 4 SPEC. ISS., 2004, Pages 361-378

A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation

(3) Palomäki, Kalle J a,b,c Brown, Guy J a Wang, DeLiang d

a UNIVERSITY OF SHEFFIELD (United Kingdom)

b AALTO UNIVERSITY (Finland)

c UNIVERSITY OF HELSINKI (Finland)

d OHIO STATE UNIVERSITY (United States)

Author keywords

Binaural model; Missing data; Precedence effect; Speech recognition

Indexed keywords

ACOUSTIC NOISE; COMPUTATIONAL METHODS; DECODING; MICROPHONES; REVERBERATION; SENSORY AIDS;

BINAURAL MODEL; MISSING DATA; PRECEDENCE EFFECT; SPEECH CHANNELS;

SPEECH RECOGNITION;

EID: 4644304197 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2004.03.005 Document Type: Article

Times cited : (90)

References (54)

1
- 0018455820
- Image method for efficiently simulating small-room acoustics
- Allen, J.B., Berkley, D.A., 1979. Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Amer. 65 (4), 943-950.
- (1979) J. Acoust. Soc. Amer. , vol.65 , Issue.4 , pp. 943-950
- Allen, J.B.¹ Berkley, D.A.²

2
- 85009096997
- Decoding speech in the presence of other sound sources
- Barker, J., Cooke, M.P., Ellis, D.P.W., 2000a. Decoding speech in the presence of other sound sources. Proc. ICSLP 4, 270-273.
- (2000) Proc. ICSLP , vol.4 , pp. 270-273
- Barker, J.¹ Cooke, M.P.² Ellis, D.P.W.³

3
- 85009063707
- Soft decisions in missing data techniques for robust automatic speech recognition
- Barker, J., Josifovski, L., Cooke, M.P., Green, P.D., 2000b. Soft decisions in missing data techniques for robust automatic speech recognition. Proc. ICSLP 1, 373-376.
- (2000) Proc. ICSLP , vol.1 , pp. 373-376
- Barker, J.¹ Josifovski, L.² Cooke, M.P.³ Green, P.D.⁴

4
- 0003742220
- (Revised Edition). MIT Press, Cambridge, MA
- Blauert, J., 1997. Spatial Hearing: The Psychophysics of Human Sound Localization (Revised Edition). MIT Press, Cambridge, MA.
- (1997) Spatial Hearing: The Psychophysics of Human Sound Localization
- Blauert, J.¹

5
- 0002706411
- Modelling human sound-source localization and the cocktail-party-effect
- Bodden, M., 1993. Modelling human sound-source localization and the cocktail-party-effect. Acta Acoust. 1, 43-55.
- (1993) Acta Acoust. , vol.1 , pp. 43-55
- Bodden, M.¹

6
- 0003684441
- MIT Press, Cambridge, MA
- Bregman, A.S., 1990. Auditory Scene Analysis. MIT Press, Cambridge, MA.
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

7
- 0028531926
- Computational auditory scene analysis
- Brown, G.J., Cooke, M.P., 1994. Computational auditory scene analysis. Comput. Speech Lang. 8, 297-336.
- (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.P.²

8
- 0034850070
- A neural oscillator sound separator for missing data speech recognition
- Brown, G.J., Wang, D.L., Barker, J., 2001. A neural oscillator sound separator for missing data speech recognition. Proc. IJCNN 4, 2907-2912.
- (2001) Proc. IJCNN , vol.4 , pp. 2907-2912
- Brown, G.J.¹ Wang, D.L.² Barker, J.³

9
- 0003733873
- Prentice Hall, Englewood Cliffs, NJ
- Cohen, L., 1994. Time-Frequency Analysis: Theory and Applications. Prentice Hall, Englewood Cliffs, NJ.
- (1994) Time-frequency Analysis: Theory and Applications
- Cohen, L.¹

10
- 0003479143
- Cambridge University Press, Cambridge, UK
- Cooke, M.P., 1993. Modelling Auditory Processing and Organization. Cambridge University Press, Cambridge, UK.
- (1993) Modelling Auditory Processing and Organization
- Cooke, M.P.¹

11
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- Cooke, M.P., Green, P.D., Josifovski, L., Vizinho, A., 2001. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Comm. 34 (3), 267-285.
- (2001) Speech Comm. , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.P.¹ Green, P.D.² Josifovski, L.³ Vizinho, A.⁴

12
- 0029127703
- Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay
- Culling, J.F., Summerfield, Q., 1995. Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay. J. Acoust. Soc. Amer. 98 (2), 785-797.
- (1995) J. Acoust. Soc. Amer. , vol.98 , Issue.2 , pp. 785-797
- Culling, J.F.¹ Summerfield, Q.²

13
- 0001698589
- Auditory grouping
- Moore, B.C.J. (Ed.), Hearing. Academic, London
- Darwin, C.J., Carlyon, R.P., 1995. Auditory grouping. In: The Handbook of Perception and Cognition. In: Moore, B.C.J. (Ed.), Hearing, Vol. 6. Academic, London, pp. 387-424.
- (1995) The Handbook of Perception and Cognition , vol.6 , pp. 387-424
- Darwin, C.J.¹ Carlyon, R.P.²

14
- 0033144658
- Auditory objects of attention: The role of interaural time differences
- Darwin, C.J., Hukin, R.W., 1999. Auditory objects of attention: the role of interaural time differences. J. Exp. Psychol. Hum. Percept. Perform. 25 (3), 617-629.
- (1999) J. Exp. Psychol. Hum. Percept. Perform. , vol.25 , Issue.3 , pp. 617-629
- Darwin, C.J.¹ Hukin, R.W.²

15
- 0033939839
- Effects of reverberation on spatial, prosodic and vocal-tract size cues to selective attention
- Darwin, C.J., Hukin, R.W., 2000. Effects of reverberation on spatial, prosodic and vocal-tract size cues to selective attention. J. Acoust. Soc. Amer. 108 (1), 335-342.
- (2000) J. Acoust. Soc. Amer. , vol.108 , Issue.1 , pp. 335-342
- Darwin, C.J.¹ Hukin, R.W.²

16
- 0026882871
- Pitch extraction and separation of overlapping speech
- Denbigh, P.N., Zhao, J., 1992. Pitch extraction and separation of overlapping speech. Speech Comm. 11, 119-125.
- (1992) Speech Comm. , vol.11 , pp. 119-125
- Denbigh, P.N.¹ Zhao, J.²

17
- 0004089083
- HRTF measurements of a KEMAR dummy-head microphone
- MIT Media Lab
- Gardner, B., Martin, K.D., 1994. HRTF measurements of a KEMAR dummy-head microphone. Technical Report #280, MIT Media Lab. Available from: http://web.media.mit.edu/~kdm/hrtf.html.
- (1994) Technical Report #280 , vol.280
- Gardner, B.¹ Martin, K.D.²

18
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- Glasberg, B.R., Moore, B.C.J., 1990. Derivation of auditory filter shapes from notched-noise data. Hear. Res. 47 (1-2), 103-138.
- (1990) Hear. Res. , vol.47 , Issue.1-2 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

19
- 85024441206
- A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition
- 1999
- Glotin, H., Berthommier, F., Tessier, E., 1999. A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition. In: Proc. EUROSPEECH, 1999, pp. 2351-2354.
- (1999) Proc. EUROSPEECH , pp. 2351-2354
- Glotin, H.¹ Berthommier, F.² Tessier, E.³

20
- 0008094967
- Brooks/Cole Publishing, Pacific Grove, CA
- Hall, D.E., 1991. Musical Acoustics, second ed. Brooks/Cole Publishing, Pacific Grove, CA.
- (1991) Musical Acoustics, Second Ed.
- Hall, D.E.¹

21
- 0033046283
- Speech intelligibility and localization in a multi-source environment
- Hawley, M.L., Litovsky, R.Y., Colburn, H.S., 1999. Speech intelligibility and localization in a multi-source environment. J. Acoust. Soc. Amer. 105 (6), 3436-3448.
- (1999) J. Acoust. Soc. Amer. , vol.105 , Issue.6 , pp. 3436-3448
- Hawley, M.L.¹ Litovsky, R.Y.² Colburn, H.S.³

22
- 0032139768
- Should recognizers have ears?
- Hermansky, H., 1998. Should recognizers have ears? Speech Comm. 25 (1-3), 3-27.
- (1998) Speech Comm. , vol.25 , Issue.1-3 , pp. 3-27
- Hermansky, H.¹

23
- 0028517164
- RASTA processing of speech
- Hermansky, H., Morgan, N., 1994. RASTA processing of speech. IEEE Trans. Speech Audio Process. 2 (4), 578-589.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

24
- 0031343112
- Modeling of reflections and air absorption in acoustical spaces: A digital filter design approach
- New Paltz, NY
- Huopaniemi, J., Savioja, L., Karjalainen, M., 1997. Modeling of reflections and air absorption in acoustical spaces: a digital filter design approach. In: Proc. IEEE Workshop on Applications of Signal Processing to Acoustics and Audio, New Paltz, NY.
- (1997) Proc. IEEE Workshop on Applications of Signal Processing to Acoustics and Audio
- Huopaniemi, J.¹ Savioja, L.² Karjalainen, M.³

25
- 0029152044
- Effects of contralateral presentation and of interaural time differences in segregating a harmonic from a vowel
- Hukin, R.W., Darwin, C.J., 1995. Effects of contralateral presentation and of interaural time differences in segregating a harmonic from a vowel. J. Acoust. Soc. Amer. 98 (3), 1380-1387.
- (1995) J. Acoust. Soc. Amer. , vol.98 , Issue.3 , pp. 1380-1387
- Hukin, R.W.¹ Darwin, C.J.²

26
- 0003434858
- Ph.D. Thesis. University of California, Berkeley
- Kingsbury, B.E.D., 1998. Perceptually inspired signal-processing strategies for robust speech recognition in reverberant environments. Ph.D. Thesis. University of California, Berkeley.
- (1998) Perceptually Inspired Signal-processing Strategies for Robust Speech Recognition in Reverberant Environments
- Kingsbury, B.E.D.¹

27
- 0032136330
- Robust speech recognition using the modulation spectrogram
- Kingsbury, B.E.D., Morgan, N., Greenberg, S., 1998. Robust speech recognition using the modulation spectrogram. Speech Comm. 25, 117-132.
- (1998) Speech Comm. , vol.25 , pp. 117-132
- Kingsbury, B.E.D.¹ Morgan, N.² Greenberg, S.³

28
- 85009080602
- Improving simultaneous speech recognition in real room environments using overdetermined blind source separation
- Koutras, A., Dermatas, E., Kokkinakis, O., 2001. Improving simultaneous speech recognition in real room environments using overdetermined blind source separation. Proc. EUROSPEECH 2, 1009-1012.
- (2001) Proc. EUROSPEECH , vol.2 , pp. 1009-1012
- Koutras, A.¹ Dermatas, E.² Kokkinakis, O.³

29
- 0002560960
- A database for speaker-independent digit recognition
- Leonard, R.G., 1984. A database for speaker-independent digit recognition. Proc. ICASSP 3, 111-114.
- (1984) Proc. ICASSP , vol.3 , pp. 111-114
- Leonard, R.G.¹

30
- 0023020142
- Extension of a binaural cross-correlation model by contralateral inhibition. II. The law of the first wave front
- Lindemann, W., 1986. Extension of a binaural cross-correlation model by contralateral inhibition. II. The law of the first wave front. J. Acoust. Soc. Amer. 80 (6), 1623-1630.
- (1986) J. Acoust. Soc. Amer. , vol.80 , Issue.6 , pp. 1623-1630
- Lindemann, W.¹

31
- 0032845228
- The precedence effect
- Litovsky, R.Y., Colburn, S.H., Yost, W.A., Guzman, S.J., 1999. The precedence effect. J. Acoust. Soc. Amer. 106 (4), 1633-1654.
- (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.4 , pp. 1633-1654
- Litovsky, R.Y.¹ Colburn, S.H.² Yost, W.A.³ Guzman, S.J.⁴

32
- 3042556011
- Ph.D. Thesis. Publications in Telecommunications Software and Multimedia, Helsinki University of Technology
- Lokki, T., 2002. Physically-based auralization. Ph.D. Thesis. Publications in Telecommunications Software and Multimedia, Helsinki University of Technology.
- (2002) Physically-based Auralization
- Lokki, T.¹

33
- 0026220146
- A computer model of binaural localization for stereo imaging measurement
- MacPherson, E.A., 1991. A computer model of binaural localization for stereo imaging measurement. J. Audio Eng. Soc. 39 (9), 604-622.
- (1991) J. Audio Eng. Soc. , vol.39 , Issue.9 , pp. 604-622
- MacPherson, E.A.¹

34
- 0031365313
- Echo suppression in a computational model of the precedence effect
- New Paltz, NY
- Martin, K.D., 1997. Echo suppression in a computational model of the precedence effect. In: Proc. IEEE Workshop on Applications of Signal Processing to Acoustics and Audio, New Paltz, NY.
- (1997) Proc. IEEE Workshop on Applications of Signal Processing to Acoustics and Audio
- Martin, K.D.¹

35
- 0027003782
- Fundamentals of binaural technology
- Møller, H., 1992. Fundamentals of binaural technology. Appl. Acoust. 36, 171-218.
- (1992) Appl. Acoust. , vol.36 , pp. 171-218
- Møller, H.¹

36
- 0003789815
- Academic Press, New York
- Moore, B.C.J., 1997. An Introduction to the Psychology of Hearing, fourth ed. Academic Press, New York.
- (1997) An Introduction to the Psychology of Hearing, Fourth Ed.
- Moore, B.C.J.¹

37
- 0020325263
- Monaural and binaural speech perception in reverberation for listeners of various ages
- Nabelek, A.K., Robinson, P.K., 1982. Monaural and binaural speech perception in reverberation for listeners of various ages. J. Acoust. Soc. Amer. 71 (5), 1242-1248.
- (1982) J. Acoust. Soc. Amer. , vol.71 , Issue.5 , pp. 1242-1248
- Nabelek, A.K.¹ Robinson, P.K.²

38
- 0032633660
- Listening to two simultaneous speeches
- Okuno, H.G., Nakatani, T., Kawabata, T., 1999. Listening to two simultaneous speeches. Speech Comm. 27, 299-310.
- (1999) Speech Comm. , vol.27 , pp. 299-310
- Okuno, H.G.¹ Nakatani, T.² Kawabata, T.³

39
- 0036298106
- Missing data speech recognition in reverberant conditions
- Orlando, 13th-17th May
- Palomäki, K.J., Brown, G.J., Barker, J., 2002. Missing data speech recognition in reverberant conditions. In: Proc. ICASSP, Orlando, 13th-17th May. pp. 65-68.
- (2002) Proc. ICASSP , pp. 65-68
- Palomäki, K.J.¹ Brown, G.J.² Barker, J.³

40
- 2942539074
- Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
- Palomäki, K.J., Brown, G.J., Barker, J., 2004. Techniques for handling convolutional distortion with 'missing data' automatic speech recognition. Speech Comm. 43, 123-142.
- (2004) Speech Comm. , vol.43 , pp. 123-142
- Palomäki, K.J.¹ Brown, G.J.² Barker, J.³

41
- 84892260089
- A binaural model for missing data speech recognition in noisy and reverberant conditions
- Aalborg
- Palomäki, K.J., Brown, G.J., Wang, D.L., 2001. A binaural model for missing data speech recognition in noisy and reverberant conditions. In: Proc. Workshop on Consistent and Reliable Acoustic Cues for Sound Analysis (CRAC), Aalborg.
- (2001) Proc. Workshop on Consistent and Reliable Acoustic Cues for Sound Analysis (CRAC)
- Palomäki, K.J.¹ Brown, G.J.² Wang, D.L.³

42
- 0003548690
- Applied Psychology Unit, Cambridge
- Patterson, R.D., Nimmo-Smith, I., Holdsworth, J., Rice, P., 1988. APU report 2341: an efficient auditory filterbank based on the gammatone function. Applied Psychology Unit, Cambridge.
- (1988) APU Report 2341: An Efficient Auditory Filterbank Based on the Gammatone Function
- Patterson, R.D.¹ Nimmo-Smith, I.² Holdsworth, J.³ Rice, P.⁴

43
- 0141855412
- Binaural tracking of multiple moving sources
- Roman, N., Wang, D.L., 2003. Binaural tracking of multiple moving sources. In: Proc. ICASSP, pp. 149-152.
- (2003) Proc. ICASSP , pp. 149-152
- Roman, N.¹ Wang, D.L.²

44
- 0036293705
- Location-based sound segregation
- Orlando, 13th-17th May
- Roman, N., Wang, D.L., Brown, G.J., 2002. Location-based sound segregation. In: Proc. ICASSP, Orlando, 13th-17th May. pp. 1013-1016.
- (2002) Proc. ICASSP , pp. 1013-1016
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

45
- 0003444613
- Lawrence Erlbaum Associates, Mahwah, NJ
- Rosenthal, D.F., Okuno, H.G., 1998. Computational Auditory Scene Analysis. Lawrence Erlbaum Associates, Mahwah, NJ.
- (1998) Computational Auditory Scene Analysis
- Rosenthal, D.F.¹ Okuno, H.G.²

46
- 85009135262
- Calibration of microphone arrays for improved speech recognition
- Seltzer, M.L., Raj, B., 2001. Calibration of microphone arrays for improved speech recognition. Proc. EUROSPEECH 2, 1005-1008.
- (2001) Proc. EUROSPEECH , vol.2 , pp. 1005-1008
- Seltzer, M.L.¹ Raj, B.²

47
- 0026694566
- Across frequency integration in a model of lateralization
- Shackleton, T.M., Meddis, R., Hewitt, M.J., 1992. Across frequency integration in a model of lateralization. J. Acoust. Soc. Amer. 91 (4), 2276-2279.
- (1992) J. Acoust. Soc. Amer. , vol.91 , Issue.4 , pp. 2276-2279
- Shackleton, T.M.¹ Meddis, R.² Hewitt, M.J.³

48
- 0035254668
- A sound segregation algorithm for reverberant conditions
- Shamsoddini, A., Denbigh, P.N., 2001. A sound segregation algorithm for reverberant conditions. Speech Comm. 33, 179-196.
- (2001) Speech Comm. , vol.33 , pp. 179-196
- Shamsoddini, A.¹ Denbigh, P.N.²

49
- 0000938112
- Responding to one of two simultaneous messages
- Spieth, W., Curtis, J.F., Webster, J.C., 1954. Responding to one of two simultaneous messages. J. Acoust. Soc. Amer. 26 (3), 391-396.
- (1954) J. Acoust. Soc. Amer. , vol.26 , Issue.3 , pp. 391-396
- Spieth, W.¹ Curtis, J.F.² Webster, J.C.³

50
- 0035280043
- A comparison of auditory and blind separation techniques for speech segregation
- van der Kouwe, A.J.W., Wang, D.L., Brown, G.J., 2001. A comparison of auditory and blind separation techniques for speech segregation. IEEE Trans. Speech Audio Process. 9, 189-195.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 189-195
- Van Der Kouwe, A.J.W.¹ Wang, D.L.² Brown, G.J.³

51
- 84924843200
- The precedence effect in sound localization
- Wallach, H., Neumann, E.B., Rosenzweig, M.R., 1949. The precedence effect in sound localization. Amer. J. Psychol. 52, 315-336.
- (1949) Amer. J. Psychol. , vol.52 , pp. 315-336
- Wallach, H.¹ Neumann, E.B.² Rosenzweig, M.R.³

52
- 0026351353
- Central, auditory mechanisms of perceptual compensation for spectral-envelope distortion
- Watkins, A.J., 1991. Central, auditory mechanisms of perceptual compensation for spectral-envelope distortion. J. Acoust. Soc. Amer. 90 (6), 2942-2955.
- (1991) J. Acoust. Soc. Amer. , vol.90 , Issue.6 , pp. 2942-2955
- Watkins, A.J.¹

53
- 0002689429
- The cocktail party problem: Forty years later
- Gilkey, R.H., Anderson, T.R. (Eds.). Lawrence Erlbaum Associates, Mahwah, NJ
- Yost, W.A., 1997. The cocktail party problem: Forty years later. In: Gilkey, R.H., Anderson, T.R. (Eds.), Binaural Hearing in Real and Virtual Environments. Lawrence Erlbaum Associates, Mahwah, NJ, pp. 329-348.
- (1997) Binaural Hearing in Real and Virtual Environments , pp. 329-348
- Yost, W.A.¹

54
- 0002902279
- The precedence effect
- Yost, W.A., Gourevitch, G. (Eds.). Springer-Verlag, New York
- Zurek, P.M., 1987. The precedence effect. In: Yost, W.A., Gourevitch, G. (Eds.), Directional Hearing. Springer-Verlag, New York, pp. 85-105.
- (1987) Directional Hearing , pp. 85-105
- Zurek, P.M.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.