메뉴 건너뛰기




Volumn 18, Issue 7, 2010, Pages 1856-1866

Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization

Author keywords

Binaural speech segregation; computational auditory scene analysis; monaural grouping; sequential organization; sound localization

Indexed keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS; MONAURAL GROUPING; SEQUENTIAL ORGANIZATION; SOUND LOCALIZATION; SPEECH SEGREGATION;

EID: 77955697785     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2050087     Document Type: Article
Times cited : (25)

References (40)
  • 4
    • 33744971131 scopus 로고    scopus 로고
    • Mask estimation for missing data speech recognition based on statistics of binaural interaction
    • Jan.
    • S. Harding, J. Barker, and G. J. Brown, "Mask estimation for missing data speech recognition based on statistics of binaural interaction," IEEE Trans. Audio, Speech, and Lang. Process., vol.14, no.1, pp. 58-67, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, and Lang. Process , vol.14 , Issue.1 , pp. 58-67
    • Harding, S.1    Barker, J.2    Brown, G.J.3
  • 5
    • 33845361885 scopus 로고    scopus 로고
    • Binaural segregation in multisource reverberant environments
    • N. Roman, S. Srinivasan, and D. L. Wang, "Binaural segregation in multisource reverberant environments," J. Acoust. Soc. Amer., vol.120, no.6, pp. 4040-4051, 2006.
    • (2006) J. Acoust. Soc. Amer , vol.120 , Issue.6 , pp. 4040-4051
    • Roman, N.1    Srinivasan, S.2    Wang, D.L.3
  • 6
    • 50249101640 scopus 로고    scopus 로고
    • Sparseness-based 2CH BSS using the em algorithm in reverberant environment
    • Oct
    • Y. Izumi, N. Ono, and S. Sagayama, "Sparseness-based 2CH BSS using the EM algorithm in reverberant environment," in Proc. WASPAA, Oct. 2007, pp. 147-150.
    • (2007) Proc. WASPAA , pp. 147-150
    • Izumi, Y.1    Ono, N.2    Sagayama, S.3
  • 7
    • 50249183469 scopus 로고    scopus 로고
    • EM localization and separation using interaural level and phase cues
    • Oct
    • M. I. Mandel and D. P. W. Ellis, "EM localization and separation using interaural level and phase cues," in Proc. WASPAA, Oct. 2007, pp. 275-278.
    • (2007) Proc. WASPAA , pp. 275-278
    • Mandel, M.I.1    Ellis, D.P.W.2
  • 8
    • 50249118229 scopus 로고    scopus 로고
    • A two-state frequency-domain blind source separation method for underdetermined convolutive mixtures
    • Oct
    • H. Sawada, S. Araki, and S. Makino, "A two-state frequency-domain blind source separation method for underdetermined convolutive mixtures," in Proc. WASPAA, Oct. 2007, pp. 139-142.
    • (2007) Proc. WASPAA , pp. 139-142
    • Sawada, H.1    Araki, S.2    Makino, S.3
  • 10
    • 0029127703 scopus 로고
    • Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay
    • J. F. Culling and Q. S. Summerfield, "Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay," J. Acoust. Soc. Amer., vol.98, pp. 785-797, 1995.
    • (1995) J. Acoust. Soc. Amer , vol.98 , pp. 785-797
    • Culling, J.F.1    Summerfield, Q.S.2
  • 11
    • 0033144658 scopus 로고    scopus 로고
    • Auditory objects of attention: The role of interaural time differences
    • C. J. Darwin and R. W. Hukin, "Auditory objects of attention: The role of interaural time differences," J. Exp. Psychol. Hum. Percept. Perform., vol.25, pp. 617-629, 1999.
    • (1999) J. Exp. Psychol. Hum. Percept. Perform , vol.25 , pp. 617-629
    • Darwin, C.J.1    Hukin, R.W.2
  • 12
    • 0003127954 scopus 로고    scopus 로고
    • How we localize sounds
    • Nov.
    • W. M. Hartmann, "How we localize sounds," Phys. Today, pp. 24-29, Nov. 1999.
    • (1999) Phys. Today , pp. 24-29
    • Hartmann, W.M.1
  • 13
    • 56249137775 scopus 로고    scopus 로고
    • Spatial hearing and perceiving sources
    • W. A. Yost, A. N. Popper, and R. R. Fay, Eds. New York: Springer
    • C. J. Darwin, "Spatial hearing and perceiving sources," in Auditory Perception of Sound Sources, W. A. Yost, A. N. Popper, and R. R. Fay, Eds. New York: Springer, 2007, pp. 215-232.
    • (2007) Auditory Perception of Sound Sources , pp. 215-232
    • Darwin, C.J.1
  • 14
    • 0035254668 scopus 로고    scopus 로고
    • A sound segregation algorithm for reverberant conditions
    • A. Shamsoddini and P. N. Denbigh, "A sound segregation algorithm for reverberant conditions," Speech Commun., vol.33, pp. 179-196, 2001.
    • (2001) Speech Commun , vol.33 , pp. 179-196
    • Shamsoddini, A.1    Denbigh, P.N.2
  • 15
    • 70349210869 scopus 로고    scopus 로고
    • A speech fragment approach to localising multiple speakers in reverberant environments
    • Apr.
    • H. Christensen, N. Ma, S. N. Wrigley, and J. Barker, "A speech fragment approach to localising multiple speakers in reverberant environments," in Proc. ICASSP, Apr. 2009, pp. 4593-4596.
    • (2009) Proc. ICASSP , pp. 4593-4596
    • Christensen, H.1    Ma, N.2    Wrigley, S.N.3    Barker, J.4
  • 16
    • 70349216477 scopus 로고    scopus 로고
    • On the role of localization cues in binaural segregation of reverberant speech
    • Apr.
    • J. Woodruff and D. L. Wang, "On the role of localization cues in binaural segregation of reverberant speech," in Proc. ICASSP, Apr. 2009, pp. 2205-2208.
    • (2009) Proc. ICASSP , pp. 2205-2208
    • Woodruff, J.1    Wang, D.L.2
  • 17
    • 77955678360 scopus 로고    scopus 로고
    • Integrating monaural and binaural analysis for localizing multiple reverberant sound sources
    • Mar.
    • J. Woodruff and D. L. Wang, "Integrating monaural and binaural analysis for localizing multiple reverberant sound sources," in Proc. ICASSP, Mar. 2010, pp. 2706-2709.
    • (2010) Proc. ICASSP , pp. 2706-2709
    • Woodruff, J.1    Wang, D.L.2
  • 18
    • 70349448618 scopus 로고    scopus 로고
    • An algorithm for speech segregation of co-channel speech
    • Apr.
    • S. Vishnubhotla and C. Y. Epsy-Wilson, "An algorithm for speech segregation of co-channel speech," in Proc. ICASSP, Apr. 2009, pp. 109-112.
    • (2009) Proc. ICASSP , pp. 109-112
    • Vishnubhotla, S.1    Epsy-Wilson, C.Y.2
  • 19
    • 65249103478 scopus 로고    scopus 로고
    • A supervised learning approach to monaural segregation of reverberant speech
    • Z. Jin and D. L. Wang, "A supervised learning approach to monaural segregation of reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol.17, pp. 625-638, 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , pp. 625-638
    • Jin, Z.1    Wang, D.L.2
  • 20
    • 49249107353 scopus 로고    scopus 로고
    • Segregation of unvoiced speech from nonspeech interference
    • G. Hu and D. L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol.124, pp. 1306-1319, 2008.
    • (2008) J. Acoust. Soc. Amer , vol.124 , pp. 1306-1319
    • Hu, G.1    Wang, D.L.2
  • 21
    • 67349134831 scopus 로고    scopus 로고
    • Sequential organization of speech in computational auditory scene analysis
    • Y. Shao and D. L. Wang, "Sequential organization of speech in computational auditory scene analysis," Speech Commun., vol.51, pp. 657-667, 2009.
    • (2009) Speech Commun , vol.51 , pp. 657-667
    • Shao, Y.1    Wang, D.L.2
  • 23
    • 0029041417 scopus 로고
    • HRTF measurements of a KEMAR
    • W. G. Gardner and K. D. Martin, "HRTF measurements of a KEMAR," J. Acoust. Soc. Amer., vol.97, pp. 3907-3908, 1995.
    • (1995) J. Acoust. Soc. Amer , vol.97 , pp. 3907-3908
    • Gardner, W.G.1    Martin, K.D.2
  • 24
    • 0018455820 scopus 로고
    • Image method for efficiently simulating small-room acoustics
    • J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol.65, pp. 943-950, 1979.
    • (1979) J. Acoust. Soc. Amer , vol.65 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 27
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol.47, pp. 103-138, 1990.
    • (1990) Hear. Res , vol.47 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 28
    • 77955695149 scopus 로고    scopus 로고
    • A tandem algorithm for pitch estimation and voiced speech segregation
    • to be published
    • G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., 2010, to be published.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process
    • Hu, G.1    Wang, D.L.2
  • 32
    • 0142026377 scopus 로고    scopus 로고
    • Speech segregation based on sound localization
    • N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol.114, no.4, pp. 2236-2252, 2003.
    • (2003) J. Acoust. Soc. Amer , vol.114 , Issue.4 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 34
    • 9644281074 scopus 로고    scopus 로고
    • Source localization in complex listening situations: Selection of binaural cues based on interaural coherence
    • C. Faller and J. Merimaa, "Source localization in complex listening situations: Selection of binaural cues based on interaural coherence," J. Acoust. Soc. Amer., vol.116, no.5, pp. 3075-3089, 2004.
    • (2004) J. Acoust. Soc. Amer , vol.116 , Issue.5 , pp. 3075-3089
    • Faller, C.1    Merimaa, J.2
  • 35
    • 33947155770 scopus 로고    scopus 로고
    • Learning a precedence effect-like weighting function for the generalized cross-correlation framework
    • Nov.
    • K. W. Wilson and T. Darrell, "Learning a precedence effect-like weighting function for the generalized cross-correlation framework," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.6, pp. 2156-2164, Nov. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.6 , pp. 2156-2164
    • Wilson, K.W.1    Darrell, T.2
  • 36
    • 33744996003 scopus 로고    scopus 로고
    • Model-based sequential organization in cochannel speech
    • Jan.
    • Y. Shao and D. L. Wang, "Model-based sequential organization in cochannel speech," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.1, pp. 289-298, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.1 , pp. 289-298
    • Shao, Y.1    Wang, D.L.2
  • 39
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary masks as the computational goal of auditory scene analysis
    • P. Divenyi, Ed. Boston, MA: Kluwer
    • D. L.Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Boston, MA: Kluwer, 2005, pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 40
    • 33845354768 scopus 로고    scopus 로고
    • Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask
    • D. Brungart, P. S. Chang, B. D. Simpson, and D. L. Wang, "Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask," J. Acoust. Soc. Amer., vol.120, pp. 4007-4018, 2006.
    • (2006) J. Acoust. Soc. Amer , vol.120 , pp. 4007-4018
    • Brungart, D.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.L.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.