SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 18, Issue 7, 2010, Pages 1856-1866

Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization

(2) Woodruff, John a Wang, Deliang a

a The Ohio State University (United States)

Author keywords

Binaural speech segregation; computational auditory scene analysis; monaural grouping; sequential organization; sound localization

Indexed keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS; MONAURAL GROUPING; SEQUENTIAL ORGANIZATION; SOUND LOCALIZATION; SPEECH SEGREGATION;

PATIENT REHABILITATION; REVERBERATION; SPEECH ANALYSIS; SPEECH PROCESSING;

SEGREGATION (METALLOGRAPHY);

EID: 77955697785 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2010.2050087 Document Type: Article

Times cited : (25)

References (40)

1
- 0003980102
- M. Brandstein and D. Ward, Eds.. New York: Springer
- Microphone Arrays: Signal Processing Techniques and Applications, M. Brandstein and D. Ward, Eds.. New York: Springer, 2001.
- (2001) Microphone Arrays:Signal Processing Techniques and Applications

2
- 82255178542
- D. L. Wang and G. J. Brown, Eds.. Hoboken, NJ: Wiley/IEEE Press
- Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, D. L. Wang and G. J. Brown, Eds.. Hoboken, NJ: Wiley/IEEE Press, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications

3
- 85008004589
- Reverberation
- D. L. Wang and G. J. Brown, Eds. New York: Wiley/IEEE Press
- G. J. Brown and K. J. Palomaki, "Reverberation," in Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, D. L. Wang and G. J. Brown, Eds. New York: Wiley/IEEE Press, 2006, pp. 209-250.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications , pp. 209-250
- Brown, G.J.¹ Palomaki, K.J.²

4
- 33744971131
- Mask estimation for missing data speech recognition based on statistics of binaural interaction
- Jan.
- S. Harding, J. Barker, and G. J. Brown, "Mask estimation for missing data speech recognition based on statistics of binaural interaction," IEEE Trans. Audio, Speech, and Lang. Process., vol.14, no.1, pp. 58-67, Jan. 2006.
- (2006) IEEE Trans. Audio, Speech, and Lang. Process , vol.14 , Issue.1 , pp. 58-67
- Harding, S.¹ Barker, J.² Brown, G.J.³

5
- 33845361885
- Binaural segregation in multisource reverberant environments
- N. Roman, S. Srinivasan, and D. L. Wang, "Binaural segregation in multisource reverberant environments," J. Acoust. Soc. Amer., vol.120, no.6, pp. 4040-4051, 2006.
- (2006) J. Acoust. Soc. Amer , vol.120 , Issue.6 , pp. 4040-4051
- Roman, N.¹ Srinivasan, S.² Wang, D.L.³

6
- 50249101640
- Sparseness-based 2CH BSS using the em algorithm in reverberant environment
- Oct
- Y. Izumi, N. Ono, and S. Sagayama, "Sparseness-based 2CH BSS using the EM algorithm in reverberant environment," in Proc. WASPAA, Oct. 2007, pp. 147-150.
- (2007) Proc. WASPAA , pp. 147-150
- Izumi, Y.¹ Ono, N.² Sagayama, S.³

7
- 50249183469
- EM localization and separation using interaural level and phase cues
- Oct
- M. I. Mandel and D. P. W. Ellis, "EM localization and separation using interaural level and phase cues," in Proc. WASPAA, Oct. 2007, pp. 275-278.
- (2007) Proc. WASPAA , pp. 275-278
- Mandel, M.I.¹ Ellis, D.P.W.²

8
- 50249118229
- A two-state frequency-domain blind source separation method for underdetermined convolutive mixtures
- Oct
- H. Sawada, S. Araki, and S. Makino, "A two-state frequency-domain blind source separation method for underdetermined convolutive mixtures," in Proc. WASPAA, Oct. 2007, pp. 139-142.
- (2007) Proc. WASPAA , pp. 139-142
- Sawada, H.¹ Araki, S.² Makino, S.³

9
- 0003684441
- Cambridge MA: MIT Press
- A. S. Bregman, Auditory Scene Analysis. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

10
- 0029127703
- Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay
- J. F. Culling and Q. S. Summerfield, "Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay," J. Acoust. Soc. Amer., vol.98, pp. 785-797, 1995.
- (1995) J. Acoust. Soc. Amer , vol.98 , pp. 785-797
- Culling, J.F.¹ Summerfield, Q.S.²

11
- 0033144658
- Auditory objects of attention: The role of interaural time differences
- C. J. Darwin and R. W. Hukin, "Auditory objects of attention: The role of interaural time differences," J. Exp. Psychol. Hum. Percept. Perform., vol.25, pp. 617-629, 1999.
- (1999) J. Exp. Psychol. Hum. Percept. Perform , vol.25 , pp. 617-629
- Darwin, C.J.¹ Hukin, R.W.²

12
- 0003127954
- How we localize sounds
- Nov.
- W. M. Hartmann, "How we localize sounds," Phys. Today, pp. 24-29, Nov. 1999.
- (1999) Phys. Today , pp. 24-29
- Hartmann, W.M.¹

13
- 56249137775
- Spatial hearing and perceiving sources
- W. A. Yost, A. N. Popper, and R. R. Fay, Eds. New York: Springer
- C. J. Darwin, "Spatial hearing and perceiving sources," in Auditory Perception of Sound Sources, W. A. Yost, A. N. Popper, and R. R. Fay, Eds. New York: Springer, 2007, pp. 215-232.
- (2007) Auditory Perception of Sound Sources , pp. 215-232
- Darwin, C.J.¹

14
- 0035254668
- A sound segregation algorithm for reverberant conditions
- A. Shamsoddini and P. N. Denbigh, "A sound segregation algorithm for reverberant conditions," Speech Commun., vol.33, pp. 179-196, 2001.
- (2001) Speech Commun , vol.33 , pp. 179-196
- Shamsoddini, A.¹ Denbigh, P.N.²

15
- 70349210869
- A speech fragment approach to localising multiple speakers in reverberant environments
- Apr.
- H. Christensen, N. Ma, S. N. Wrigley, and J. Barker, "A speech fragment approach to localising multiple speakers in reverberant environments," in Proc. ICASSP, Apr. 2009, pp. 4593-4596.
- (2009) Proc. ICASSP , pp. 4593-4596
- Christensen, H.¹ Ma, N.² Wrigley, S.N.³ Barker, J.⁴

16
- 70349216477
- On the role of localization cues in binaural segregation of reverberant speech
- Apr.
- J. Woodruff and D. L. Wang, "On the role of localization cues in binaural segregation of reverberant speech," in Proc. ICASSP, Apr. 2009, pp. 2205-2208.
- (2009) Proc. ICASSP , pp. 2205-2208
- Woodruff, J.¹ Wang, D.L.²

17
- 77955678360
- Integrating monaural and binaural analysis for localizing multiple reverberant sound sources
- Mar.
- J. Woodruff and D. L. Wang, "Integrating monaural and binaural analysis for localizing multiple reverberant sound sources," in Proc. ICASSP, Mar. 2010, pp. 2706-2709.
- (2010) Proc. ICASSP , pp. 2706-2709
- Woodruff, J.¹ Wang, D.L.²

18
- 70349448618
- An algorithm for speech segregation of co-channel speech
- Apr.
- S. Vishnubhotla and C. Y. Epsy-Wilson, "An algorithm for speech segregation of co-channel speech," in Proc. ICASSP, Apr. 2009, pp. 109-112.
- (2009) Proc. ICASSP , pp. 109-112
- Vishnubhotla, S.¹ Epsy-Wilson, C.Y.²

19
- 65249103478
- A supervised learning approach to monaural segregation of reverberant speech
- Z. Jin and D. L. Wang, "A supervised learning approach to monaural segregation of reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol.17, pp. 625-638, 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , pp. 625-638
- Jin, Z.¹ Wang, D.L.²

20
- 49249107353
- Segregation of unvoiced speech from nonspeech interference
- G. Hu and D. L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol.124, pp. 1306-1319, 2008.
- (2008) J. Acoust. Soc. Amer , vol.124 , pp. 1306-1319
- Hu, G.¹ Wang, D.L.²

21
- 67349134831
- Sequential organization of speech in computational auditory scene analysis
- Y. Shao and D. L. Wang, "Sequential organization of speech in computational auditory scene analysis," Speech Commun., vol.51, pp. 657-667, 2009.
- (2009) Speech Commun , vol.51 , pp. 657-667
- Shao, Y.¹ Wang, D.L.²

22
- 33947676870
- D. R. Campbell, The ROOMSIM User Guide (v3.3) 2004.
- (2004) The ROOMSIM User Guide (v3.3)
- Campbell, D.R.¹

23
- 0029041417
- HRTF measurements of a KEMAR
- W. G. Gardner and K. D. Martin, "HRTF measurements of a KEMAR," J. Acoust. Soc. Amer., vol.97, pp. 3907-3908, 1995.
- (1995) J. Acoust. Soc. Amer , vol.97 , pp. 3907-3908
- Gardner, W.G.¹ Martin, K.D.²

24
- 0018455820
- Image method for efficiently simulating small-room acoustics
- J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol.65, pp. 943-950, 1979.
- (1979) J. Acoust. Soc. Amer , vol.65 , pp. 943-950
- Allen, J.B.¹ Berkley, D.A.²

25
- 0003548585
- J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, "DARPA TIMIT acoustic phonetic continuous speech corpus," 1993.
- (1993) DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.L.⁶

26
- 0142056390
- Cambridge, U.K., Tech. Rep., MRC Applied Psychology Unit
- R. D. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "An efficient auditory filterbank based on the gammatone function," Cambridge, U.K., Tech. Rep., MRC Applied Psychology Unit, 1988.
- (1988) An Efficient Auditory Filterbank Based on the Gammatone Function
- Patterson, R.D.¹ Nimmo-Smith, I.² Holdsworth, J.³ Rice, P.⁴

27
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol.47, pp. 103-138, 1990.
- (1990) Hear. Res , vol.47 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

28
- 77955695149
- A tandem algorithm for pitch estimation and voiced speech segregation
- to be published
- G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., 2010, to be published.
- (2010) IEEE Trans. Audio, Speech, Lang. Process
- Hu, G.¹ Wang, D.L.²

29
- 85045165251
- Ph.D. dissertation, The Ohio State Univ., Columbus, OH
- G. Hu, "Monaural speech organization and segregation," Ph.D. dissertation, The Ohio State Univ., Columbus, OH, 2006.
- (2006) Monaural Speech Organization and Segregation
- Hu, G.¹

30
- 0003742220
- Cambridge, MA: MIT Press
- J. Blauert, Spatial Hearing-The Psychophysics of Human Sound Localization. Cambridge, MA: MIT Press, 1997.
- (1997) Spatial Hearing-The Psychophysics of Human Sound Localization
- Blauert, J.¹

31
- 0003443397
- London, U.K.: Chapman & Hall
- B. Silverman, Density Estimation for Statistics and Data Analysis. London, U.K.: Chapman & Hall, 1986.
- (1986) Density Estimation for Statistics and Data Analysis
- Silverman, B.¹

32
- 0142026377
- Speech segregation based on sound localization
- N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol.114, no.4, pp. 2236-2252, 2003.
- (2003) J. Acoust. Soc. Amer , vol.114 , Issue.4 , pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

33
- 0032845228
- The precedence effect
- R. Y. Litovsky, H. S. Colburn, W. A. Yost, and S. J. Guzman, "The precedence effect," J. Acoust. Soc. Amer., vol.106, pp. 1633-1654, 1999.
- (1999) J. Acoust. Soc. Amer , vol.106 , pp. 1633-1654
- Litovsky, R.Y.¹ Colburn, H.S.² Yost, W.A.³ Guzman, S.J.⁴

34
- 9644281074
- Source localization in complex listening situations: Selection of binaural cues based on interaural coherence
- C. Faller and J. Merimaa, "Source localization in complex listening situations: Selection of binaural cues based on interaural coherence," J. Acoust. Soc. Amer., vol.116, no.5, pp. 3075-3089, 2004.
- (2004) J. Acoust. Soc. Amer , vol.116 , Issue.5 , pp. 3075-3089
- Faller, C.¹ Merimaa, J.²

35
- 33947155770
- Learning a precedence effect-like weighting function for the generalized cross-correlation framework
- Nov.
- K. W. Wilson and T. Darrell, "Learning a precedence effect-like weighting function for the generalized cross-correlation framework," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.6, pp. 2156-2164, Nov. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.6 , pp. 2156-2164
- Wilson, K.W.¹ Darrell, T.²

36
- 33744996003
- Model-based sequential organization in cochannel speech
- Jan.
- Y. Shao and D. L. Wang, "Model-based sequential organization in cochannel speech," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.1, pp. 289-298, Jan. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.1 , pp. 289-298
- Shao, Y.¹ Wang, D.L.²

37
- 0003343412
- Robust localization in reverberant rooms
- M. Brandstein and D.Ward, Eds. New York: Springer, ch. 8
- J. H. DiBiase, H. F. Silverman, and M. S. Brandstein, "Robust localization in reverberant rooms," in Microphone Arrays: Signal Processing Techniques and Applications, M. Brandstein and D.Ward, Eds. New York: Springer, 2001, ch. 8, pp. 157-180.
- (2001) Microphone Arrays: Signal Processing Techniques and Applications , pp. 157-180
- Dibiase, J.H.¹ Silverman, H.F.² Brandstein, M.S.³

38
- 0033778326
- Localization of multiple sound sources with two microphones
- C. Liu, B. C. Wheeler,W. D. O'Brien, R. C. Bilger, C. R. Lansing, and A. S. Feng, "Localization of multiple sound sources with two microphones," J. Acoust. Soc. Amer., vol.108, no.4, pp. 1888-1905, 2000.
- (2000) J. Acoust. Soc. Amer , vol.108 , Issue.4 , pp. 1888-1905
- Liu, C.¹ Wheeler, B.C.² O'Brien, W.D.³ Bilger, R.C.⁴ Lansing, C.R.⁵ Feng, A.S.⁶

39
- 84892233308
- On ideal binary masks as the computational goal of auditory scene analysis
- P. Divenyi, Ed. Boston, MA: Kluwer
- D. L.Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Boston, MA: Kluwer, 2005, pp. 181-197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

40
- 33845354768
- Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask
- D. Brungart, P. S. Chang, B. D. Simpson, and D. L. Wang, "Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask," J. Acoust. Soc. Amer., vol.120, pp. 4007-4018, 2006.
- (2006) J. Acoust. Soc. Amer , vol.120 , pp. 4007-4018
- Brungart, D.¹ Chang, P.S.² Simpson, B.D.³ Wang, D.L.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.