SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 20, Issue 5, 2012, Pages 1503-1512

Binaural localization of multiple sources in reverberant and noisy environments

(2) Woodruff, John a Wang, DeLiang a

a The Ohio State University (United States)

Author keywords

Binaural sound localization; Computational auditory scene analysis (CASA); Monaural grouping; Reverberation

Indexed keywords

REVERBERATION;

BINAURAL LOCALIZATION; BINAURAL SOUND LOCALIZATIONS; COMPUTATIONAL AUDITORY SCENE ANALYSIS; ENVIRONMENTAL CONDITIONS; MONAURAL GROUPING; ROBUST PERFORMANCE; SOUND SOURCE LOCALIZATION; STATIONARY SOURCES;

BINS;

EID: 84872299752 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2012.2183869 Document Type: Article

Times cited : (112)

References (40)

1
- 0036881034
- Self-localizing dynamic microphone arrays
- Nov.
- P. Aarabi, "Self-localizing dynamic microphone arrays," IEEE Trans. Syst., Man, Cybern. C, vol. 32, no. 4, pp. 474-484, Nov. 2002.
- (2002) IEEE Trans. Syst., Man, Cybern. C , vol.32 , Issue.4 , pp. 474-484
- Aarabi, P.¹

2
- 0018455820
- Image method for efficiently simulating small-room acoustics
- J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol. 65, pp. 943-950, 1979.
- (1979) J. Acoust. Soc. Amer. , vol.65 , pp. 943-950
- Allen, J.B.¹ Berkley, D.A.²

3
- 0033986263
- Adaptive eigenvalue decomposition algorithm for passive acoustic source localization
- J. Benesty, "Adaptive eigenvalue decomposition algorithm for passive acoustic source localization," J. Acoust. Soc. Amer., vol. 107, no. 5, pp. 384-391, 2000.
- (2000) J. Acoust. Soc. Amer. , vol.107 , Issue.5 , pp. 384-391
- Benesty, J.¹

4
- 33846655689
- Binaural interference and auditory grouping
- V. Best, F. J. Gallun, S. Carlile, and B. G. Shinn-Cunningham, "Binaural interference and auditory grouping," J. Acoust. Soc. Amer., vol. 121, no. 2, pp. 1070-1076, 2007.
- (2007) J. Acoust. Soc. Amer. , vol.121 , Issue.2 , pp. 1070-1076
- Best, V.¹ Gallun, F.J.² Carlile, S.³ Shinn-Cunningham, B.G.⁴

5
- 0003742220
- Cambridge, MA: MIT Press
- J. Blauert, Spatial Hearing: The Psychophysics of Human Sound Localization. Cambridge, MA: MIT Press, 1997.
- (1997) Spatial Hearing: The Psychophysics of Human Sound Localization
- Blauert, J.¹

6
- 0001835850
- Accurate short-time analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
- P. Boersma, "Accurate short-time analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound," Inst. Phon. Sci., vol. 17, pp. 97-110, 1993.
- (1993) Inst. Phon. Sci. , vol.17 , pp. 97-110
- Boersma, P.¹

7
- 0032918933
- Time-delay estimation of reverberated speech exploiting harmonic structure
- M. Brandstein, "Time-delay estimation of reverberated speech exploiting harmonic structure," J. Acoust. Soc. Amer., vol. 105, pp. 2914-2919, 1999.
- (1999) J. Acoust. Soc. Amer. , vol.105 , pp. 2914-2919
- Brandstein, M.¹

8
- 0003684441
- Cambridge, MA: MIT Press
- A. S. Bregman, Auditory Scene Analysis. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

9
- 33947676870
- [Online]. Available
- D. R. Campbell, The ROOMSIM User Guide (v3.3) 2004 [Online]. Available: http://media.paisley.ac.uk/campbell/Roomsim/
- (2004) The ROOMSIM User Guide (v3.3).
- Campbell, D.R.¹

10
- 70349210869
- A speech fragment approach to localizing multiple speakers in reverberant environments
- Apr.
- H. Christensen, N. Ma, S. N. Wrigley, and J. Barker, "A speech fragment approach to localizing multiple speakers in reverberant environments," in Proc. ICASSP, Apr. 2009, pp. 4593-4596.
- (2009) Proc. ICASSP , pp. 4593-4596
- Christensen, H.¹ Ma, N.² Wrigley, S.N.³ Barker, J.⁴

11
- 0003343412
- Robust localization in reverberant rooms
- M. Brandstein and D. Ward, Eds. New York: Springer, ch. 8
- J. H. DiBiase, H. F. Silverman, and M. S. Brandstein, "Robust localization in reverberant rooms," in Microphone Arrays: Signal Processing Techniques and Applications, M. Brandstein and D. Ward, Eds. New York: Springer, 2001, ch. 8, pp. 157-180.
- (2001) Microphone Arrays: Signal Processing Techniques and Applications , pp. 157-180
- DiBiase, J.H.¹ Silverman, H.F.² Brandstein, M.S.³

12
- 79953649387
- Auditory model based direction estimation of concurrent speakers from binaural signals
- M. Dietz, S. D. Ewert, and V. Hohmann, "Auditory model based direction estimation of concurrent speakers from binaural signals," Speech Commun., vol. 53, pp. 592-605, 2011.
- (2011) Speech Commun. , vol.53 , pp. 592-605
- Dietz, M.¹ Ewert, S.D.² Hohmann, V.³

13
- 0242334709
- Robust adaptive time delay estimation for speaker localization in noisy and reverberant acoustic environments
- S. Doclo and M. Moonen, "Robust adaptive time delay estimation for speaker localization in noisy and reverberant acoustic environments," EURASIP J. App. Signal Process., vol. 2003, pp. 1110-1124, 2003.
- (2003) EURASIP J. App. Signal Process. , vol.2003 , pp. 1110-1124
- Doclo, S.¹ Moonen, M.²

14
- 0031762046
- Range dependence of the response of a spherical head model
- R. O. Duda and W. L. Martens, "Range dependence of the response of a spherical head model," J. Acoust. Soc. Amer., vol. 104, no. 5, pp. 3048-3058, 1998.
- (1998) J. Acoust. Soc. Amer. , vol.104 , Issue.5 , pp. 3048-3058
- Duda, R.O.¹ Martens, W.L.²

15
- 0029041417
- HRTF measurements of a KEMAR
- W. G. Gardner and K. D. Martin, "HRTF measurements of a KEMAR," J. Acoust. Soc. Amer., vol. 97, pp. 3907-3908, 1995.
- (1995) J. Acoust. Soc. Amer. , vol.97 , pp. 3907-3908
- Gardner, W.G.¹ Martin, K.D.²

16
- 0003548585
- [Online]. Available
- J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, "DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus," 1993 [Online]. Available: http://www.ldc.upenn.edu/Catalog/LDC93S1.html
- (1993) DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.L.⁶

17
- 38849102154
- Auditory segmentation based on onset and offset analysis
- Feb.
- G. Hu and D. L. Wang, "Auditory segmentation based on onset and offset analysis," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 396-405, Feb. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 396-405
- Hu, G.¹ Wang, D.L.²

18
- 77955700868
- Dynamic precedence effect modeling for source separation in reverberant environments
- Sep.
- C. Hummersone, R. Mason, and T. Brookes, "Dynamic precedence effect modeling for source separation in reverberant environments," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1867-1871, Sep. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1867-1871
- Hummersone, C.¹ Mason, R.² Brookes, T.³

19
- 65249103478
- A supervised learning approach to monaural segregation of reverberant speech
- May
- Z. Jin and D. L. Wang, "A supervised learning approach to monaural segregation of reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 4, pp. 625-638, May 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.4 , pp. 625-638
- Jin, Z.¹ Wang, D.L.²

20
- 85008056718
- HMM-based multipitch tracking for noisy and reverberant speech
- Jul.
- Z. Jin and D. L. Wang, "HMM-based multipitch tracking for noisy and reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1091-1102, Jul. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.5 , pp. 1091-1102
- Jin, Z.¹ Wang, D.L.²

21
- 46749155862
- Joint position pitch tracking for 2-channel audio
- M. Képesi, F. Pernkopf, and M. Wohlmayr, "Joint position pitch tracking for 2-channel audio," in Proc. Int. Workshop Content Based Multimedia Indexing, 2007.
- (2007) Proc. Int. Workshop Content Based Multimedia Indexing
- Képesi, M.¹ Pernkopf, F.² Wohlmayr, M.³

22
- 0016990291
- The generalized correlation method for estimation of time delay
- Aug.
- C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-24, no. 4, pp. 320-327, Aug. 1976.
- (1976) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-24 , Issue.4 , pp. 320-327
- Knapp, C.H.¹ Carter, G.C.²

23
- 0032845228
- The precedence effect
- R. Y. Litovsky, H. S. Colburn, W. A. Yost, and S. J. Guzman, "The precedence effect," J. Acoust. Soc. Amer., vol. 106, pp. 1633-1654, 1999.
- (1999) J. Acoust. Soc. Amer. , vol.106 , pp. 1633-1654
- Litovsky, R.Y.¹ Colburn, H.S.² Yost, W.A.³ Guzman, S.J.⁴

24
- 0033778326
- Localization of multiple sound sources with two microphones
- C. Liu, B. C. Wheeler, W. D. O'Brien, R. C. Bilger, C. R. Lansing, and A. S. Feng, "Localization of multiple sound sources with two microphones," J. Acoust. Soc. Amer., vol. 108, pp. 1888-1905, 2000.
- (2000) J. Acoust. Soc. Amer. , vol.108 , pp. 1888-1905
- Liu, C.¹ Wheeler, B.C.² O'Brien, W.D.³ Bilger, R.C.⁴ Lansing, C.R.⁵ Feng, A.S.⁶

25
- 33750390953
- Tracking an unknown time-varying number of speakers using TDOA measurements: A random finite set approach
- Sep.
- W.-K. Ma, B.-N. Vo, S. Singh, and A. Baddeley, "Tracking an unknown time-varying number of speakers using TDOA measurements: A random finite set approach," IEEE Trans. Signal Process., vol. 54, no. 9, pp. 3291-3304, Sep. 2006.
- (2006) IEEE Trans. Signal Process. , vol.54 , Issue.9 , pp. 3291-3304
- Ma, W.-K.¹ Vo, B.-N.² Singh, S.³ Baddeley, A.⁴

26
- 85008544097
- Model-based expectation-maximization source separation and localization
- Feb.
- M. I. Mandel, R. J. Weiss, and D. P. W. Ellis, "Model-based expectation-maximization source separation and localization," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 2, pp. 382-394, Feb. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.2 , pp. 382-394
- Mandel, M.I.¹ Weiss, R.J.² Ellis, D.P.W.³

27
- 77957729908
- A probabilistic model for robust localization based on a binaural auditory front-end
- Jan.
- T. May, S. Van De Par, and A. Kohlrausch, "A probabilistic model for robust localization based on a binaural auditory front-end," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 1, pp. 1-13, Jan. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.1 , pp. 1-13
- May, T.¹ Van De Par, S.² Kohlrausch, A.³

28
- 50449096822
- Joint time delay and pitch estimation for speaker localization
- L. Y. Ngan, Y. Wu, C. So, P. C. Ching, and S. W. Lee, "Joint time delay and pitch estimation for speaker localization," in Proc. ICAS, 2003.
- (2003) Proc. ICAS
- Ngan, L.Y.¹ Wu, Y.² So, C.³ Ching, P.C.⁴ Lee, S.W.⁵

29
- 52149108294
- Combined estimation of spectral envelopes and sound source direction of concurrent voices by multidimensional statistical filtering
- Mar.
- J. Nix and V. Hohmann, "Combined estimation of spectral envelopes and sound source direction of concurrent voices by multidimensional statistical filtering," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 995-1008, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 995-1008
- Nix, J.¹ Hohmann, V.²

30
- 3142694930
- Blind separation of speech mixtures via time-frequency masking
- Jul.
- O. Yilmaz and S. Rickard, "Blind separation of speech mixtures via time-frequency masking," IEEE Trans. Signal Process., vol. 52, no. 7, pp. 1830-1847, Jul. 2004.
- (2004) IEEE Trans. Signal Process. , vol.52 , Issue.7 , pp. 1830-1847
- Yilmaz, O.¹ Rickard, S.²

31
- 0142056390
- Tech. Rep. MRC App. Psych. Unit, Cambridge, U.K.
- R. D. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "An efficient auditory filterbank based on the gammatone function," Tech. Rep. MRC App. Psych. Unit, Cambridge, U.K., 1988.
- (1988) An Efficient Auditory Filterbank based on the Gammatone Function
- Patterson, R.D.¹ Nimmo-Smith, I.² Holdsworth, J.³ Rice, P.⁴

32
- 70449394046
- Binaural source localization by joint estimation of ILD and ITD
- Jan.
- M. Raspaud, H. Viste, and G. Evangelista, "Binaural source localization by joint estimation of ILD and ITD," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 1, pp. 68-77, Jan. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.1 , pp. 68-77
- Raspaud, M.¹ Viste, H.² Evangelista, G.³

33
- 64849095806
- Binaural tracking of multiple moving sources
- May
- N. Roman and D. L. Wang, "Binaural tracking of multiple moving sources," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 4, pp. 728-739, May 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.4 , pp. 728-739
- Roman, N.¹ Wang, D.L.²

34
- 0142026377
- Speech segregation based on sound localization
- N. Roman, D. L. Wang, and G. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, pp. 2236-2252, 2003.
- (2003) J. Acoust. Soc. Amer. , vol.114 , pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.³

35
- 0031153687
- A new cepstral prefiltering technique for estimating time delay under reverberant conditions
- A. Stéphenne and B. Champagne, "A new cepstral prefiltering technique for estimating time delay under reverberant conditions," Signal Process., vol. 59, no. 3, pp. 253-266, 1997.
- (1997) Signal Process. , vol.59 , Issue.3 , pp. 253-266
- Stéphenne, A.¹ Champagne, B.²

36
- 84868663836
- Binaural sound localization
- D. L. Wang and G. J. Brown, Eds. New York: Wiley
- R. M. Stern, G. J. Brown, and D. L. Wang, "Binaural sound localization," in Computational Auditory Scene Analysis: Principles, Algorithms and Applications, D. L. Wang and G. J. Brown, Eds. New York: Wiley, 2006, pp. 147-185.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms and Applications , pp. 147-185
- Stern, R.M.¹ Brown, G.J.² Wang, D.L.³

37
- 82255178542
- Hoboken, NJ, Wiley/IEEE Press
- D. L. Wang and G. J. Brown, Eds., Computational Auditory Scene Analysis: Principles, Algorithms, and Applications Hoboken, NJ, Wiley/IEEE Press, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
- Wang, D.L.¹ Brown, G.J.²

38
- 77955678360
- Integrating monaural and binaural analysis for localizing multiple reverberant sound sources
- Mar.
- J. Woodruff and D. L. Wang, "Integrating monaural and binaural analysis for localizing multiple reverberant sound sources," in Proc. ICASSP, Mar. 2010, pp. 2706-2709.
- (2010) Proc. ICASSP , pp. 2706-2709
- Woodruff, J.¹ Wang, D.L.²

39
- 77955697785
- Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization
- Sep.
- J. Woodruff and D. L. Wang, "Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1856-1866, Sep. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1856-1866
- Woodruff, J.¹ Wang, D.L.²

40
- 77956285777
- A two microphone-based approach for source localization of multiple speech sources
- Dec.
- W. Zhang and B. D. Rao, "A two microphone-based approach for source localization of multiple speech sources," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 1913-1928, Dec. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.8 , pp. 1913-1928
- Zhang, W.¹ Rao, B.D.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.