메뉴 건너뛰기




Volumn 20, Issue 5, 2012, Pages 1503-1512

Binaural localization of multiple sources in reverberant and noisy environments

Author keywords

Binaural sound localization; Computational auditory scene analysis (CASA); Monaural grouping; Reverberation

Indexed keywords

REVERBERATION;

EID: 84872299752     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2183869     Document Type: Article
Times cited : (112)

References (40)
  • 1
    • 0036881034 scopus 로고    scopus 로고
    • Self-localization dynamic microphone arrays
    • Nov.
    • P. Aarabi, "Self-localization dynamic microphone arrays," IEEE Trans. Syst., Man, Cybern. C, vol. 32, no. 4, pp. 474-484, Nov. 2002.
    • (2002) IEEE Trans. Syst., Man, Cybern. C , vol.32 , Issue.4 , pp. 474-484
    • Aarabi, P.1
  • 2
    • 0018455820 scopus 로고
    • Image method for efficiently simulating small-room acoustics
    • J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol. 65, pp. 943-950, 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.65 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 3
    • 0033986263 scopus 로고    scopus 로고
    • Adaptive eigenvalue decomposition algorithm for passive acoustic source localization
    • J. Benesty, "Adaptive eigenvalue decomposition algorithm for passive acoustic source localization," J. Acoust. Soc. Amer., vol. 107, no. 5, pp. 384-391, 2000.
    • (2000) J. Acoust. Soc. Amer. , vol.107 , Issue.5 , pp. 384-391
    • Benesty, J.1
  • 6
    • 0001835850 scopus 로고
    • Accurate short-Time analysis of the fundamental frequency and the harmonics-To-noise ratio of a sampled sound
    • P. Boersma, "Accurate short-Time analysis of the fundamental frequency and the harmonics-To-noise ratio of a sampled sound," Inst. Phon. Sci., vol. 17, pp. 97-110, 1993.
    • (1993) Inst. Phon. Sci. , vol.17 , pp. 97-110
    • Boersma, P.1
  • 7
    • 0032918933 scopus 로고    scopus 로고
    • Time-delay estimation of reverberated speech exploiting harmonic structure
    • M. Brandstein, "Time-delay estimation of reverberated speech exploiting harmonic structure," J. Acoust. Soc. Amer., vol. 105, pp. 2914-2919, 1999.
    • (1999) J. Acoust. Soc. Amer. , vol.105 , pp. 2914-2919
    • Brandstein, M.1
  • 10
    • 70349210869 scopus 로고    scopus 로고
    • A speech fragment approach to localizing multiple speakers in reverberant environments
    • Apr.
    • H. Christensen, N. Ma, S. N. Wrigley, and J. Barker, "A speech fragment approach to localizing multiple speakers in reverberant environments," in Proc. ICASSP, Apr. 2009, pp. 4593-4596.
    • (2009) Proc. ICASSP , pp. 4593-4596
    • Christensen, H.1    Ma, N.2    Wrigley, S.N.3    Barker, J.4
  • 12
    • 79953649387 scopus 로고    scopus 로고
    • Auditory model based direction estimation of concurrent speakers from binaural signals
    • M. Dietz, S. D. Ewert, and V. Hohmann, "Auditory model based direction estimation of concurrent speakers from binaural signals," Speech Commun., vol. 53, pp. 592-605, 2011.
    • (2011) Speech Commun. , vol.53 , pp. 592-605
    • Dietz, M.1    Ewert, S.D.2    Hohmann, V.3
  • 13
    • 0242334709 scopus 로고    scopus 로고
    • Robust adaptive time delay estimation for speaker localization in noisy and reverberant acoustic environments
    • S. Doclo and M. Moonen, "Robust adaptive time delay estimation for speaker localization in noisy and reverberant acoustic environments," EURASIP J. App. Signal Process., vol. 2003, pp. 1110-1124, 2003.
    • (2003) EURASIP J. App. Signal Process. , vol.2003 , pp. 1110-1124
    • Doclo, S.1    Moonen, M.2
  • 14
    • 0031762046 scopus 로고    scopus 로고
    • Range dependence of the response of a spherical head model
    • R. O. Duda and W. L. Martens, "Range dependence of the response of a spherical head model," J. Acoust. Soc. Amer., vol. 104, no. 5, pp. 3048-3058, 1998.
    • (1998) J. Acoust. Soc. Amer. , vol.104 , Issue.5 , pp. 3048-3058
    • Duda, R.O.1    Martens, W.L.2
  • 15
    • 0029041417 scopus 로고
    • HRTF measurements of a KEMAR
    • W. G. Gardner and K. D. Martin, "HRTF measurements of a KEMAR," J. Acoust. Soc. Amer., vol. 97, pp. 3907-3908, 1995.
    • (1995) J. Acoust. Soc. Amer. , vol.97 , pp. 3907-3908
    • Gardner, W.G.1    Martin, K.D.2
  • 17
    • 38849102154 scopus 로고    scopus 로고
    • Auditory segmentation based on onset and offset analysis
    • Feb.
    • G. Hu and D. L. Wang, "Auditory segmentation based on onset and offset analysis," IEEE Trans. Acoust., Speech, Signal Process., vol. 15, no. 2, pp. 396-405, Feb. 2007.
    • (2007) IEEE Trans. Acoust., Speech, Signal Process. , vol.15 , Issue.2 , pp. 396-405
    • Hu, G.1    Wang, D.L.2
  • 18
    • 77955700868 scopus 로고    scopus 로고
    • Dynamic precedence effect modeling for source separation in reverberant environments
    • Sep.
    • C. Hummersone, R. Mason, and T. Brookes, "Dynamic precedence effect modeling for source separation in reverberant environments," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1867-1871, Sep. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1867-1871
    • Hummersone, C.1    Mason, R.2    Brookes, T.3
  • 19
    • 65249103478 scopus 로고    scopus 로고
    • A supervised learning approach to monaural segregation of reverberant speech
    • May
    • Z. Jin and D. L. Wang, "A supervised learning approach to monaural segregation of reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 4, pp. 625-638, May 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.4 , pp. 625-638
    • Jin, Z.1    Wang, D.L.2
  • 20
    • 85008056718 scopus 로고    scopus 로고
    • HMM-based multipitch tracking for noisy and reverberant speech
    • Jul.
    • Z. Jin and D. L.Wang, "HMM-based multipitch tracking for noisy and reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1091-1102, Jul. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.5 , pp. 1091-1102
    • Jin, Z.1    Wang, D.L.2
  • 22
    • 0016990291 scopus 로고
    • The generalized correlation method for estimation of time delay
    • Aug.
    • C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-24, no. 4, pp. 320-327, Aug. 1976.
    • (1976) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-24 , Issue.4 , pp. 320-327
    • Knapp, C.H.1    Carter, G.C.2
  • 25
    • 33750390953 scopus 로고    scopus 로고
    • Tracking an unknown time-varying number of speakers using TDOA measurements: A random finite set approach
    • Sep.
    • W.-K. Ma, B.-N. Vo, S. Singh, and A. Baddelay, "Tracking an unknown time-varying number of speakers using TDOA measurements: A random finite set approach," IEEE Trans. Signal Process., vol. 54, no. 9, pp. 3291-3304, Sep. 2006.
    • (2006) IEEE Trans. Signal Process. , vol.54 , Issue.9 , pp. 3291-3304
    • Ma, W.-K.1    Vo, B.-N.2    Singh, S.3    Baddelay, A.4
  • 26
    • 85008544097 scopus 로고    scopus 로고
    • Model-based expectation-maximization source separation and localization
    • Feb.
    • M. I. Mandel, R. J. Weiss, and D. P. W. Ellis, "Model-based expectation-maximization source separation and localization," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 2, pp. 382-394, Feb. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.2 , pp. 382-394
    • Mandel, M.I.1    Weiss, R.J.2    Ellis, D.P.W.3
  • 27
    • 77957729908 scopus 로고    scopus 로고
    • A probabilistic model for robust localization based on a binaural auditory frond-end
    • Jan.
    • T. May, S. Van De Par, and A. Kohlrausch, "A probabilistic model for robust localization based on a binaural auditory frond-end," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 1, pp. 1-13, Jan. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.1 , pp. 1-13
    • May, T.1    Van De Par, S.2    Kohlrausch, A.3
  • 28
    • 50449096822 scopus 로고    scopus 로고
    • Joint time delay and pitch estimation for speaker localization
    • L. Y. Ngan, Y.Wu, C. So, P. C. Ching, and S. W. Lee, "Joint time delay and pitch estimation for speaker localization," in Proc. ICAS, 2003.
    • (2003) Proc. ICAS
    • Ngan, L.Y.1    Wu, Y.2    So, C.3    Ching, P.C.4    Lee, S.W.5
  • 29
    • 52149108294 scopus 로고    scopus 로고
    • Combined estimation of spectral envelopes and sound source direction of concurrent voices by multidimensional statistical filtering
    • Mar.
    • J. Nix and V. Hohmann, "Combined estimation of spectral envelopes and sound source direction of concurrent voices by multidimensional statistical filtering," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 995-1008, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 995-1008
    • Nix, J.1    Hohmann, V.2
  • 30
    • 3142694930 scopus 로고    scopus 로고
    • Blind separation of speech mixtures via time-frequency masking
    • Jul.
    • O. Yilmaz and S. Rickard, "Blind separation of speech mixtures via time-frequency masking," IEEE Trans. Signal Process., vol. 52, no. 7, pp. 1830-1847, Jul. 2004.
    • (2004) IEEE Trans. Signal Process. , vol.52 , Issue.7 , pp. 1830-1847
    • Yilmaz, O.1    Rickard, S.2
  • 32
    • 70449394046 scopus 로고    scopus 로고
    • Binaural source localization by joint estimation of ILD and ITD
    • Jan.
    • M. Raspaud, H. Viste, and G. Evangelista, "Binaural source localization by joint estimation of ILD and ITD," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 1, pp. 68-77, Jan. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.1 , pp. 68-77
    • Raspaud, M.1    Viste, H.2    Evangelista, G.3
  • 33
    • 64849095806 scopus 로고    scopus 로고
    • Binaural tracking of multiple moving sources
    • May
    • N. Roman and D. L. Wang, "Binaural tracking of multiple moving sources," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 4, pp. 728-739, May 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.4 , pp. 728-739
    • Roman, N.1    Wang, D.L.2
  • 34
    • 0142026377 scopus 로고    scopus 로고
    • Speech segregation based on sound localization
    • N. Roman, D. L. Wang, and G. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, pp. 2236-2252, 2003.
    • (2003) J. Acoust. Soc. Amer. , vol.114 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.3
  • 35
    • 0031153687 scopus 로고    scopus 로고
    • A new cepstral prefiltering technique for estimating time delay under reverberant conditions
    • A. Stéphenne and B. Champagne, "A new cepstral prefiltering technique for estimating time delay under reverberant conditions," Signal Process., vol. 59, no. 3, pp. 253-266, 1997.
    • (1997) Signal Process. , vol.59 , Issue.3 , pp. 253-266
    • Stéphenne, A.1    Champagne, B.2
  • 38
    • 77955678360 scopus 로고    scopus 로고
    • Integrating monaural and binaural analysis for localizing multiple reverberant sound sources
    • Mar.
    • J. Woodruff and D. L. Wang, "Integrating monaural and binaural analysis for localizing multiple reverberant sound sources," in Proc. ICASSP, Mar. 2010, pp. 2706-2709.
    • (2010) Proc. ICASSP , pp. 2706-2709
    • Woodruff, J.1    Wang, D.L.2
  • 39
    • 77955697785 scopus 로고    scopus 로고
    • Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization
    • Sep.
    • J.Woodruff and D. L. Wang, "Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization," IEEE Trans. Acoust., Speech, Signal Process., vol. 18, no. 7, pp. 1856-1866, Sep. 2010.
    • (2010) IEEE Trans. Acoust., Speech, Signal Process. , vol.18 , Issue.7 , pp. 1856-1866
    • Woodruff, J.1    Wang, D.L.2
  • 40
    • 77956285777 scopus 로고    scopus 로고
    • A two microphone-based approach for source localization of multiple speech sources
    • Dec.
    • W. Zhang and B. D. Rao, "A two microphone-based approach for source localization of multiple speech sources," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 1913-1928, Dec. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.8 , pp. 1913-1928
    • Zhang, W.1    Rao, B.D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.