SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 7, 2012, Pages 2016-2030

A binaural scene analyzer for joint localization and recognition of speakers in the presence of interfering noise sources and reverberation

(3) May, Tobias a Van De Par, Steven b Kohlrausch, Armin c

a UNIVERSITY OF OLDENBURG (Germany)

b PHILIPS RESEARCH LABORATORIES (Netherlands)

c EINDHOVEN UNIVERSITY OF TECHNOLOGY (Netherlands)

Author keywords

Automatic speaker recognition; binaural processing; computational auditory scene analysis (CASA); mask estimation; missing data

Indexed keywords

AUTOMATIC SPEAKER RECOGNITION; BINARY MASKS; BINAURAL LOCALIZATION; BINAURAL PROCESSING; BUILDING BLOCKES; COCKTAIL PARTY; COMPUTATIONAL AUDITORY SCENE ANALYSIS; MISSING DATA; NOISE SOURCE; PRIORI KNOWLEDGE; SOUND SOURCE; SOURCE DETECTION; SPEAKER IDENTIFICATION; SPEAKER RECOGNITION; SPEECH DETECTION; STATE OF THE ART; TARGET SPEAKER;

ACOUSTIC GENERATORS; REVERBERATION; SPEECH PROCESSING;

SPEECH RECOGNITION;

EID: 84861514871 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2012.2193391 Document Type: Article

Times cited : (74)

References (55)

1
- 80052339383
- Some experiments on the recognition of speech, with one and two ears
- Sep.
- E. C. Cherry, "Some experiments on the recognition of speech, with one and two ears," J. Acoust. Soc. Amer., vol. 25, no. 5, pp. 975-979, Sep. 1953.
- (1953) J. Acoust. Soc. Amer. , vol.25 , Issue.5 , pp. 975-979
- Cherry, E.C.¹

2
- 0039334758
- The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions
- A. W. Bronkhorst, "The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions," Acustica, vol. 86, pp. 117-128, 2000. (Pubitemid 34103984)
- (2000) Acta Acustica united with Acustica , vol.86 , Issue.1 , pp. 117-128
- Bronkhorst, A.W.¹

3
- 0003684441
- Cambridge, MA: MIT Press
- A. S. Bregman, Auditory Scene Analysis: The Perceptual Organization of Sound. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis: The Perceptual Organization of Sound
- Bregman, A.S.¹

4
- 82255178542
- Eds. Hoboken, NJ: Wiley
- Computational Auditory Scene Analysis: Principles, Algorithms and Applications, D. L. Wang and G. Brown, Eds. Hoboken, NJ: Wiley, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms and Applications
- Wang, D.L.¹ Brown, G.²

5
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, pp. 267-285, 2001. (Pubitemid 32284867)
- (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

6
- 51449083412
- Robust speaker identification using combined feature selection and missing data recognition
- Las Vegas, NV
- D. Pullella, M. Kühne, and R. Togneri, "Robust speaker identification using combined feature selection and missing data recognition," in Proc. ICASSP, Las Vegas, NV, 2008, pp. 4833-4836.
- (2008) Proc. ICASSP , pp. 4833-4836
- Pullella, D.¹ Kühne, M.² Togneri, R.³

7
- 81155132367
- Noise-robust speaker recognition combining missing data techniques and universal background modeling
- Speech, Lang. Process., Jan.
- T. May, S. van de Par, and A.Kohlrausch, "Noise-robust speaker recognition combining missing data techniques and universal background modeling," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 108-121, Jan. 2012.
- (2012) IEEE Trans. Audio , vol.20 , Issue.1 , pp. 108-121
- May, T.¹ Van De Par, S.² Kohlrausch, A.³

8
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- DOI 10.1121/1.2363929
- D. S. Brungart, P. S. Chang, B. D. Simpson, and D. L. Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Amer., vol. 120, no. 6, pp. 4007-4018, Dec. 2006. (Pubitemid 44888096)
- (2006) Journal of the Acoustical Society of America , vol.120 , Issue.6 , pp. 4007-4018
- Brungart, D.S.¹ Chang, P.S.² Simpson, B.D.³ Wang, D.⁴

9
- 34547627836
- Factors influencing glimpsing of speech in noise
- DOI 10.1121/1.2749454
- N. Li and P. C. Loizou, "Factors influencing glimpsing of speech in noise," J. Acoust. Soc. Amer., vol. 122, no. 2, pp. 1165-1172, Aug. 2007. (Pubitemid 47205513)
- (2007) Journal of the Acoustical Society of America , vol.122 , Issue.2 , pp. 1165-1172
- Li, N.¹ Loizou, P.C.²

10
- 64649103540
- Speech intelligibility in background noise with ideal binary time-frequency masking
- Apr.
- D. L. Wang, U. Kjems, M. S. Pedersen, and J. B. Boldt, "Speech intelligibility in background noise with ideal binary time-frequency masking," J. Acoust. Soc. Amer., vol. 125, no. 4, pp. 2336-2347, Apr. 2009.
- (2009) J. Acoust. Soc. Amer. , vol.125 , Issue.4 , pp. 2336-2347
- Wang, D.L.¹ Kjems, U.² Pedersen, M.S.³ Boldt, J.B.⁴

11
- 84892233308
- On ideal binary masks as the computational goal of auditory scene analysis
- P. Divenyi, Ed. Norwell, MA: Kluwer,ch. 12
- D. L.Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer, 2005, ch. 12, pp. 181-197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

12
- 1042299913
- The benefit of binaural hearing in a cocktail party: Effect of location and type of interferer
- DOI 10.1121/1.1639908
- M. L. Hawley and R. Y. Litovsky, "The benefit of binaural hearing in a cocktail party: Effect of location and type of interferer," J. Acoust. Soc. Amer., vol. 115, no. 2, pp. 833-843, Feb. 2004. (Pubitemid 38200670)
- (2004) Journal of the Acoustical Society of America , vol.115 , Issue.2 , pp. 833-843
- Hawley, M.L.¹ Litovsky, R.Y.² Culling, J.F.³

13
- 70349187511
- Detection and localization of speech in the presence of competing speech signals
- London, U.K., Jun.
- B. D. Simpson, D. S. Brungart, N. Iyer, R. H. Gilkey, and J. T. Hamil, "Detection and localization of speech in the presence of competing speech signals," in Proc. ICAD, London, U.K., Jun. 2006, pp. 129-133.
- (2006) Proc. ICAD , pp. 129-133
- Simpson, B.D.¹ Brungart, D.S.² Iyer, N.³ Gilkey, R.H.⁴ Hamil, J.T.⁵

14
- 4644304197
- A binaural processor for missing data speech recognition in the presence of noise and smallroom reverberation
- K. J. Palomäki, G. J. Brown, and D. L. Wang, "A binaural processor for missing data speech recognition in the presence of noise and smallroom reverberation," Speech Commun., vol. 43, no. 4, pp. 361-378, 2004.
- (2004) Speech Commun. , vol.43 , Issue.4 , pp. 361-378
- Palomäki, K.J.¹ Brown, G.J.² Wang, D.L.³

15
- 33744971131
- Mask estimation for missing data speech recognition based on statistics of binaural interaction
- DOI 10.1109/TSA.2005.860354
- S. Harding, J. Barker, and G. Brown, "Mask estimation for missing data speech recognition based on statistics of binaural interaction," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 58-67, Jan. 2006. (Pubitemid 43863453)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 58-67
- Harding, S.¹ Barker, J.² Brown, G.J.³

16
- 0142026377
- Speech segregation based on sound localization
- DOI 10.1121/1.1610463
- N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, no. 4, pp. 2236-2252, Oct. 2003. (Pubitemid 37266649)
- (2003) Journal of the Acoustical Society of America , vol.114 , Issue.4 , pp. 2236-2252
- Roman, N.¹ Wang, D.² Brown, G.J.³

17
- 70349210869
- A speech fragment approach to localising multiple speakers in reverberant environments
- Apr., Taipei, Taiwan
- H. Christensen, N. Ma, S. N. Wrigley, and J. Barker, "A speech fragment approach to localising multiple speakers in reverberant environments," in Proc. ICASSP, Taipei, Taiwan, Apr. 2009, pp. 4593-4596.
- (2009) Proc. ICASSP , pp. 4593-4596
- Christensen, H.¹ Ma, N.² Wrigley, S.N.³ Barker, J.⁴

18
- 77957729908
- A probabilistic model for robust localization based on a binaural auditory front-end
- Jan.
- T. May, S. van de Par, and A. Kohlrausch, "A probabilistic model for robust localization based on a binaural auditory front-end," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 1, pp. 1-13, Jan. 2011.
- (2011) IEEE Trans. Audio Speech, Lang. Process. , vol.19 , Issue.1 , pp. 1-13
- May, T.¹ Van De Par, S.² Kohlrausch, A.³

19
- 77955697785
- Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization
- Sep.
- J. Woodruff and D. L.Wang, "Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1856-1866, Sep. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1856-1866
- Woodruff, J.¹ Wang, D.L.²

20
- 29244442934
- The advantage of knowing where to listen
- DOI 10.1121/1.2109187
- G. Kidd, Jr., T. L. Arbogast, C. R. Mason, and F. J. Gallun, "The advantage of knowing where to listen," J. Acoust. Soc. Amer., vol. 188, no. 6, pp. 3804-3815, Dec. 2005. (Pubitemid 41820881)
- (2005) Journal of the Acoustical Society of America , vol.118 , Issue.6 , pp. 3804-3815
- Kidd Jr., G.¹ Arbogast, T.L.² Mason, C.R.³ Gallun, F.J.⁴

21
- 77950410207
- Speech localization in a multitalker mixture
- Mar.
- N. Kopčo, V. Best, and S. Carlile, "Speech localization in a multitalker mixture," J. Acoust. Soc. Amer., vol. 127, no. 3, pp. 1450-1457, Mar. 2010.
- (2010) J. Acoust. Soc. Amer. , vol.127 , Issue.3 , pp. 1450-1457
- Kopčo, N.¹ Best, V.² Carlile, S.³

22
- 2942539074
- Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
- Jun.
- K. J. Palomäki, G. J. Brown, and J. P. Barker, "Techniques for handling convolutional distortion with 'missing data' automatic speech recognition," Speech Commun., vol. 43, no. 1-2, pp. 123-142, Jun. 2004.
- (2004) Speech Commun. , vol.43 , Issue.1-2 , pp. 123-142
- Palomäki, K.J.¹ Brown, G.J.² Barker, J.P.³

23
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- DOI 10.1016/0378-5955(90)90170-T
- B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol. 47, no. 1-2, pp. 103-138, Aug. 1990. (Pubitemid 20244652)
- (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

24
- 0028531926
- Computational auditory scene analysis
- Oct.
- G. J. Brown and M. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, no. 4, pp. 297-336, Oct. 1994.
- (1994) Comput. Speech Lang. , vol.8 , Issue.4 , pp. 297-336
- Brown, G.J.¹ Cooke, M.²

25
- 0023681706
- Lateralization of complex binaural stimuli: A weighted-image model
- Jul.
- R. M. Stern, A. S. Zeiberg, and C. Trahiotis, "Lateralization of complex binaural stimuli: A weighted-image model," J. Acoust. Soc. Amer., vol. 84, no. 1, pp. 156-165, Jul. 1988.
- (1988) J. Acoust. Soc. Amer. , vol.84 , Issue.1 , pp. 156-165
- Stern, R.M.¹ Zeiberg, A.S.² Trahiotis, C.³

26
- 0026694566
- Across frequency integration in a model of lateralization
- Apr.
- T. M. Shackleton, R. Meddis, and M. J. Hewitt, "Across frequency integration in a model of lateralization," J. Acoust. Soc. Amer., vol. 91, no. 4, pp. 2276-2279, Apr. 1992.
- (1992) J. Acoust. Soc. Amer. , vol.91 , Issue.4 , pp. 2276-2279
- Shackleton, T.M.¹ Meddis, R.² Hewitt, M.J.³

27
- 83455170710
- Binaural detection of speech sources in complex acoustic scenes
- NewPaltz, NY, Oct.
- T. May, S. van de Par, and A.Kohlrausch, "Binaural detection of speech sources in complex acoustic scenes," in Proc.WASPAA, NewPaltz, NY, Oct. 2011, pp. 241-244.
- (2011) Proc.WASPAA , pp. 241-244
- May, T.¹ Van De Par, S.² Kohlrausch, A.³

28
- 85032752225
- Missing-feature approaches in speech recognition
- DOI 10.1109/MSP.2005.1511828
- B. Raj and R. M. Stern, "Missing-feature approaches in speech recognition," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 101-116, Sep. 2005. (Pubitemid 41488524)
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 101-116
- Raj, B.¹ Stern, R.M.²

29
- 85135252448
- Missing features detection and handling for robust speaker verification
- Budapest, Hungary, Sep.
- M. El-Maliki and A. Drygajlo, "Missing features detection and handling for robust speaker verification," in Proc. Eurospeech, Budapest, Hungary, Sep. 1999, pp. 975-978.
- (1999) Proc. Eurospeech , pp. 975-978
- El-Maliki, M.¹ Drygajlo, A.²

30
- 34547539772
- accessed on 12th Oct., [Online]. Available
- M. Cooke and T.-W. Lee, "Speech separation and recognition competition," 2006, accessed on 12th Oct. 2010, [Online]. Available: http://staffwww.dcs.shef.ac.uk/people/M.Cooke/SpeechSeparation Challenge.htm
- (2006) Speech Separation and Recognition Competition
- Cooke, M.¹ Lee, T.-W.²

31
- 0004319968
- The NOISEX-92 study on the effect of additive noise on automatic speaker recognition
- Defence Research Agency, Malvern, U.K., Tech. Rep.
- A. P. Varga, H. J. M. Steeneken, M. Tomlinson, and D. Jones, "The NOISEX-92 study on the effect of additive noise on automatic speaker recognition," Speech Research Unit, Defence Research Agency, Malvern, U.K., 1992, Tech. Rep..
- (1992) Speech Research Unit
- Varga, A.P.¹ Steeneken, H.J.M.² Tomlinson, M.³ Jones, D.⁴

32
- 0020102027
- Least squares quantization in PCM
- Mar.
- S. Lloyd, "Least squares quantization in PCM," IEEE Trans. Inf. Theory., vol. 28, no. 2, pp. 129-137, Mar. 1982.
- (1982) IEEE Trans. Inf. Theory. , vol.28 , Issue.2 , pp. 129-137
- Lloyd, S.¹

33
- 0002629270
- Maximum likelihood estimation from incomplete data via the EM algorithm
- A. Dempster,N. Laird, and D. Rubin, "Maximum likelihood estimation from incomplete data via the EM algorithm," J. R. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

34
- 33644661135
- A glimpsing model of speech perception in noise
- DOI 10.1121/1.2166600
- M. Cooke, "A glimpsing model of speech perception in noise," J. Acoust. Soc. Amer., vol. 199, no. 3, pp. 1562-1573, Mar. 2006. (Pubitemid 43326025)
- (2006) Journal of the Acoustical Society of America , vol.119 , Issue.3 , pp. 1562-1573
- Cooke, M.¹

35
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- DOI 10.1006/dspr.1999.0361
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, 2000. (Pubitemid 30592166)
- (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

36
- 0034896976
- Spatial unmasking of nearby speech sources in a simulated anechoic environment
- DOI 10.1121/1.1386633
- B. G. Shinn-Cunningham, J. Schickler, N. Kopčo, and R. Litovsky, "Spatial unmasking of nearby speech sources in a simulated anechoic environment," J. Acoust. Soc. Amer., vol. 110, no. 2, pp. 1118-1129, Aug. 2001. (Pubitemid 32734745)
- (2001) Journal of the Acoustical Society of America , vol.110 , Issue.2 , pp. 1118-1129
- Shinn-Cunningham, B.G.¹ Schickler, J.² Kopco, N.³ Litovsky, R.⁴

37
- 0004089083
- HRTF measurements of a KEMAR dummy-head microphone
- Tech. Rep.
- W. G. Gardner and K. D. Martin, "HRTF measurements of a KEMAR dummy-head microphone," MIT Media Lab, Perceptual Computing, Tech. Rep. #280, 1994.
- (1994) MIT Media Lab, Perceptual Computing , vol.280
- Gardner, W.G.¹ Martin, K.D.²

38
- 0018455820
- Image method for efficiently simulating small-room acoustics
- Apr.
- J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol. 65, no. 4, pp. 943-950, Apr. 1979.
- (1979) J. Acoust. Soc. Amer. , vol.65 , Issue.4 , pp. 943-950
- Allen, J.B.¹ Berkley, D.A.²

39
- 70349471167
- A fast and accurate "shoebox" room acoustics simulator
- Apr.
- S. M. Schimmel, M. F. Müller, and N. Dillier, "A fast and accurate "shoebox" room acoustics simulator," in Proc. ICASSP, Taipei, Taiwan, Apr. 2009, pp. 241-244.
- (2009) Proc. ICASSP, Taipei, Taiwan , pp. 241-244
- Schimmel, S.M.¹ Müller, M.F.² Dillier, N.³

40
- 84861505891
- American National Standard Specification for Sound Level Meters, ANSI/ ASA S1.4-1983 (R2001
- American National Standard Specification for Sound Level Meters, ANSI/ASA S1.4-1983 (R2001), Amer. Nat. Stand. Inst., 1983.
- (1983) Amer. Nat. Stand. Inst.

41
- 36049044257
- accessed on 11th October 2011 [Online]. Available:
- D. P. W. Ellis, PLP and RASTA (and MFCC, and Inversion) in Matlab, 2005, accessed on 11th October 2011 [Online]. Available: http://www.ee.columbia.edu/ ~dpwe/resources/matlab/rastamat
- (2005) PLP and RASTA (and MFCC and Inversion) in Matlab
- Ellis, D.P.W.¹

42
- 0024035182
- On the use of instantaneous and transitional spectral information in speaker recognition
- Jun.
- F. K. Soong and A. E. Rosenberg, "On the use of instantaneous and transitional spectral information in speaker recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 6, pp. 871-879, Jun. 1988.
- (1988) IEEE Trans. Acoust., Speech, Signal Process. , vol.36 , Issue.6 , pp. 871-879
- Soong, F.K.¹ Rosenberg, A.E.²

43
- 85079234583
- On the limitations of cepstral features in noise
- Apr.
- J. P. Openshaw and J. S. Mason, "On the limitations of cepstral features in noise," in Proc. ICASSP, Adelaide, South Australia, Australia, Apr. 1994, pp. 49-52.
- (1994) Proc. ICASSP, Adelaide, South Australia, Australia , pp. 49-52
- Openshaw, J.P.¹ Mason, J.S.²

44
- 85135190755
- Multi-band and adaptation approaches to robust speech recognition
- Sep., Rhodes, Greece
- S. Tibrewala and H. Hermansky, "Multi-band and adaptation approaches to robust speech recognition," in Proc. Eurospeech, Rhodes, Greece, Sep. 1997, pp. 2619-2622.
- (1997) Proc. Eurospeech , pp. 2619-2622
- Tibrewala, S.¹ Hermansky, H.²

45
- 0028996871
- Noise estimation techniques for robust speech recognition
- Detroit, MI
- H. G. Hirsch and C. Ehrlicher, "Noise estimation techniques for robust speech recognition," in Proc. ICASSP, Detroit, MI, 1995, vol. 1, pp. 153-156.
- (1995) Proc. ICASSP , vol.1 , pp. 153-156
- Hirsch, H.G.¹ Ehrlicher, C.²

46
- 0021892216
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
- Apr.
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443-445, Apr. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

47
- 78149476938
- Signal-to-signal ratio independent speaker identification for co-channel speech signals
- Istanbul, Turkey, Aug.
- R. Saeidi, P. Mowlaee, T. Kinnunen, Z.-H. Tan, M. G. Christensen, S. H. Jensen, and P. Fränti, "Signal-to-signal ratio independent speaker identification for co-channel speech signals," in Proc. ICPR, Istanbul, Turkey, Aug. 2010, pp. 4565-4568.
- (2010) Proc. ICPR , pp. 4565-4568
- Saeidi, R.¹ Mowlaee, P.² Kinnunen, T.³ Tan, Z.-H.⁴ Christensen, M.G.⁵ Jensen, S.H.⁶ Fränti, P.⁷

48
- 33646023117
- An introduction to ROC analysis
- Jun.
- T. Fawcett, "An introduction to ROC analysis," Pattern Recog. Lett., vol. 27, no. 8, pp. 861-874, Jun. 2006.
- (2006) Pattern Recog. Lett. , vol.27 , Issue.8 , pp. 861-874
- Fawcett, T.¹

49
- 70349093614
- An algorithm that improves speech intelligibility in noise for normal-hearing listeners
- Sep.
- G. Kim, Y. Lu, Y. Hu, and P. C. Loizou, "An algorithm that improves speech intelligibility in noise for normal-hearing listeners," J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1486-1494, Sep. 2009.
- (2009) J. Acoust. Soc. Amer. , vol.126 , Issue.3 , pp. 1486-1494
- Kim, G.¹ Lu, Y.² Hu, Y.³ Loizou, P.C.⁴

50
- 33745004174
- Effect of source location and listener location on ILD cues in a reverberant room
- A. Ihlefeld and B. G. Shinn-Cunningham, "Effect of source location and listener location on ILD cues in a reverberant room," J. Acoust. Soc. Amer., vol. 115, no. 5, p. 2598, 2004.
- (2004) J. Acoust. Soc. Amer. , vol.115 , Issue.5 , pp. 2598
- Ihlefeld, A.¹ Shinn-Cunningham, B.G.²

51
- 18744392833
- Localizing nearby sound sources in a classroom: Binaural room impulse responses
- DOI 10.1121/1.1872572
- B. G. Shinn-Cunningham, N. Kopčo, and T. J. Martin, "Localizing nearby sound sources in a classroom: Binaural room impulse responses," J. Acoust. Soc. Amer., vol. 117, no. 5, pp. 3100-3115, May 2005. (Pubitemid 40675172)
- (2005) Journal of the Acoustical Society of America , vol.117 , Issue.5 , pp. 3100-3115
- Shinn-Cunningham, B.G.¹ Kopco, N.² Martin, T.J.³

52
- 33750311718
- Binary and ratio time-frequency masks for robust speech recognition
- DOI 10.1016/j.specom.2006.09.003, PII S0167639306001129
- S. Srinivasan, N. Roman, and D. L. Wang, "Binary and ratio time-frequency masks for robust speech recognition," Speech Commun., vol. 48, no. 11, pp. 1486-1501, Nov. 2006. (Pubitemid 44634774)
- (2006) Speech Communication , vol.48 , Issue.11 , pp. 1486-1501
- Srinivasan, S.¹ Roman, N.² Wang, D.³

53
- 85009145345
- Observations on overlap: Findings and implications for automatic processing of multi-party conversation
- Sep., Aalborg, Denmark
- E. Shriberg, A. Stolcke, and D. Baron, "Observations on overlap: Findings and implications for automatic processing of multi-party conversation," in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001, pp. 1359-1362.
- (2001) Proc. Eurospeech , pp. 1359-1362
- Shriberg, E.¹ Stolcke, A.² Baron, D.³

54
- 33947653296
- Recognition of reverberant speech using full cepstral features and spectral missing data
- Toulouse, France
- K. J. Palomäki, G. J. Brown, and J. P. Barker, "Recognition of reverberant speech using full cepstral features and spectral missing data," in Proc. ICASSP, Toulouse, France, 2006, pp. 289-292.
- (2006) Proc. ICASSP , pp. 289-292
- Palomäki, K.J.¹ Brown, G.J.² Barker, J.P.³

55
- 80051661646
- Binaural sound source separation motivated by auditory processing
- Prague, Czech Republic
- C. Kim, K. Kumar, and R. M. Stern, "Binaural sound source separation motivated by auditory processing," in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 5072-5075.
- (2011) Proc. ICASSP , pp. 5072-5075
- Kim, C.¹ Kumar, K.² Stern, R.M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.