



Volume 20, Issue 7, 2012, Pages 2016-2030

A binaural scene analyzer for joint localization and recognition of speakers in the presence of interfering noise sources and reverberation

Author keywords

Automatic speaker recognition; binaural processing; computational auditory scene analysis (CASA); mask estimation; missing data

Indexed keywords

AUTOMATIC SPEAKER RECOGNITION; BINARY MASKS; BINAURAL LOCALIZATION; BINAURAL PROCESSING; BUILDING BLOCKS; COCKTAIL PARTY; COMPUTATIONAL AUDITORY SCENE ANALYSIS; MISSING DATA; NOISE SOURCE; PRIORI KNOWLEDGE; SOUND SOURCE; SOURCE DETECTION; SPEAKER IDENTIFICATION; SPEAKER RECOGNITION; SPEECH DETECTION; STATE OF THE ART; TARGET SPEAKER

EID: 84861514871     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2193391     Document Type: Article
Times cited: 74

References (55)
  • 1
    • 80052339383
    • Some experiments on the recognition of speech, with one and two ears
    • E. C. Cherry, "Some experiments on the recognition of speech, with one and two ears," J. Acoust. Soc. Amer., vol. 25, no. 5, pp. 975-979, Sep. 1953.
    • (1953) J. Acoust. Soc. Amer. , vol.25 , Issue.5 , pp. 975-979
    • Cherry, E.C.1
  • 2
    • 0039334758
    • The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions
    • A. W. Bronkhorst, "The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions," Acustica, vol. 86, pp. 117-128, 2000. (Pubitemid 34103984)
    • (2000) Acta Acustica united with Acustica , vol.86 , Issue.1 , pp. 117-128
    • Bronkhorst, A.W.1
  • 5
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, pp. 267-285, 2001. (Pubitemid 32284867)
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 6
    • 51449083412
    • Robust speaker identification using combined feature selection and missing data recognition
    • D. Pullella, M. Kühne, and R. Togneri, "Robust speaker identification using combined feature selection and missing data recognition," in Proc. ICASSP, Las Vegas, NV, 2008, pp. 4833-4836.
    • (2008) Proc. ICASSP , pp. 4833-4836
    • Pullella, D.1    Kühne, M.2    Togneri, R.3
  • 7
    • 81155132367
    • Noise-robust speaker recognition combining missing data techniques and universal background modeling
    • T. May, S. van de Par, and A. Kohlrausch, "Noise-robust speaker recognition combining missing data techniques and universal background modeling," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 108-121, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 108-121
    • May, T.1    Van De Par, S.2    Kohlrausch, A.3
  • 8
    • 33845354768
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • DOI 10.1121/1.2363929
    • D. S. Brungart, P. S. Chang, B. D. Simpson, and D. L. Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Amer., vol. 120, no. 6, pp. 4007-4018, Dec. 2006. (Pubitemid 44888096)
    • (2006) Journal of the Acoustical Society of America , vol.120 , Issue.6 , pp. 4007-4018
    • Brungart, D.S.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.4
  • 9
    • 34547627836
    • Factors influencing glimpsing of speech in noise
    • DOI 10.1121/1.2749454
    • N. Li and P. C. Loizou, "Factors influencing glimpsing of speech in noise," J. Acoust. Soc. Amer., vol. 122, no. 2, pp. 1165-1172, Aug. 2007. (Pubitemid 47205513)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.2 , pp. 1165-1172
    • Li, N.1    Loizou, P.C.2
  • 10
    • 64649103540
    • Speech intelligibility in background noise with ideal binary time-frequency masking
    • D. L. Wang, U. Kjems, M. S. Pedersen, and J. B. Boldt, "Speech intelligibility in background noise with ideal binary time-frequency masking," J. Acoust. Soc. Amer., vol. 125, no. 4, pp. 2336-2347, Apr. 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.125 , Issue.4 , pp. 2336-2347
    • Wang, D.L.1    Kjems, U.2    Pedersen, M.S.3    Boldt, J.B.4
  • 11
    • 84892233308
    • On ideal binary masks as the computational goal of auditory scene analysis
    • D. L. Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer, 2005, ch. 12, pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 12
    • 1042299913
    • The benefit of binaural hearing in a cocktail party: Effect of location and type of interferer
    • DOI 10.1121/1.1639908
    • M. L. Hawley, R. Y. Litovsky, and J. F. Culling, "The benefit of binaural hearing in a cocktail party: Effect of location and type of interferer," J. Acoust. Soc. Amer., vol. 115, no. 2, pp. 833-843, Feb. 2004. (Pubitemid 38200670)
    • (2004) Journal of the Acoustical Society of America , vol.115 , Issue.2 , pp. 833-843
    • Hawley, M.L.1    Litovsky, R.Y.2    Culling, J.F.3
  • 13
    • 70349187511
    • Detection and localization of speech in the presence of competing speech signals
    • B. D. Simpson, D. S. Brungart, N. Iyer, R. H. Gilkey, and J. T. Hamil, "Detection and localization of speech in the presence of competing speech signals," in Proc. ICAD, London, U.K., Jun. 2006, pp. 129-133.
    • (2006) Proc. ICAD , pp. 129-133
    • Simpson, B.D.1    Brungart, D.S.2    Iyer, N.3    Gilkey, R.H.4    Hamil, J.T.5
  • 14
    • 4644304197
    • A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation
    • K. J. Palomäki, G. J. Brown, and D. L. Wang, "A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation," Speech Commun., vol. 43, no. 4, pp. 361-378, 2004.
    • (2004) Speech Commun. , vol.43 , Issue.4 , pp. 361-378
    • Palomäki, K.J.1    Brown, G.J.2    Wang, D.L.3
  • 15
    • 33744971131
    • Mask estimation for missing data speech recognition based on statistics of binaural interaction
    • DOI 10.1109/TSA.2005.860354
    • S. Harding, J. Barker, and G. Brown, "Mask estimation for missing data speech recognition based on statistics of binaural interaction," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 58-67, Jan. 2006. (Pubitemid 43863453)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 58-67
    • Harding, S.1    Barker, J.2    Brown, G.J.3
  • 16
    • 0142026377
    • Speech segregation based on sound localization
    • DOI 10.1121/1.1610463
    • N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, no. 4, pp. 2236-2252, Oct. 2003. (Pubitemid 37266649)
    • (2003) Journal of the Acoustical Society of America , vol.114 , Issue.4 , pp. 2236-2252
    • Roman, N.1    Wang, D.2    Brown, G.J.3
  • 17
    • 70349210869
    • A speech fragment approach to localising multiple speakers in reverberant environments
    • H. Christensen, N. Ma, S. N. Wrigley, and J. Barker, "A speech fragment approach to localising multiple speakers in reverberant environments," in Proc. ICASSP, Taipei, Taiwan, Apr. 2009, pp. 4593-4596.
    • (2009) Proc. ICASSP , pp. 4593-4596
    • Christensen, H.1    Ma, N.2    Wrigley, S.N.3    Barker, J.4
  • 18
    • 77957729908
    • A probabilistic model for robust localization based on a binaural auditory front-end
    • T. May, S. van de Par, and A. Kohlrausch, "A probabilistic model for robust localization based on a binaural auditory front-end," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 1, pp. 1-13, Jan. 2011.
    • (2011) IEEE Trans. Audio Speech, Lang. Process. , vol.19 , Issue.1 , pp. 1-13
    • May, T.1    Van De Par, S.2    Kohlrausch, A.3
  • 19
    • 77955697785
    • Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization
    • J. Woodruff and D. L. Wang, "Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1856-1866, Sep. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1856-1866
    • Woodruff, J.1    Wang, D.L.2
  • 21
    • 77950410207
    • Speech localization in a multitalker mixture
    • N. Kopčo, V. Best, and S. Carlile, "Speech localization in a multitalker mixture," J. Acoust. Soc. Amer., vol. 127, no. 3, pp. 1450-1457, Mar. 2010.
    • (2010) J. Acoust. Soc. Amer. , vol.127 , Issue.3 , pp. 1450-1457
    • Kopčo, N.1    Best, V.2    Carlile, S.3
  • 22
    • 2942539074
    • Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
    • K. J. Palomäki, G. J. Brown, and J. P. Barker, "Techniques for handling convolutional distortion with 'missing data' automatic speech recognition," Speech Commun., vol. 43, no. 1-2, pp. 123-142, Jun. 2004.
    • (2004) Speech Commun. , vol.43 , Issue.1-2 , pp. 123-142
    • Palomäki, K.J.1    Brown, G.J.2    Barker, J.P.3
  • 23
    • 0025110885
    • Derivation of auditory filter shapes from notched-noise data
    • DOI 10.1016/0378-5955(90)90170-T
    • B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol. 47, no. 1-2, pp. 103-138, Aug. 1990. (Pubitemid 20244652)
    • (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 24
    • 0028531926
    • Computational auditory scene analysis
    • G. J. Brown and M. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, no. 4, pp. 297-336, Oct. 1994.
    • (1994) Comput. Speech Lang. , vol.8 , Issue.4 , pp. 297-336
    • Brown, G.J.1    Cooke, M.2
  • 25
    • 0023681706
    • Lateralization of complex binaural stimuli: A weighted-image model
    • R. M. Stern, A. S. Zeiberg, and C. Trahiotis, "Lateralization of complex binaural stimuli: A weighted-image model," J. Acoust. Soc. Amer., vol. 84, no. 1, pp. 156-165, Jul. 1988.
    • (1988) J. Acoust. Soc. Amer. , vol.84 , Issue.1 , pp. 156-165
    • Stern, R.M.1    Zeiberg, A.S.2    Trahiotis, C.3
  • 26
    • 0026694566
    • Across frequency integration in a model of lateralization
    • T. M. Shackleton, R. Meddis, and M. J. Hewitt, "Across frequency integration in a model of lateralization," J. Acoust. Soc. Amer., vol. 91, no. 4, pp. 2276-2279, Apr. 1992.
    • (1992) J. Acoust. Soc. Amer. , vol.91 , Issue.4 , pp. 2276-2279
    • Shackleton, T.M.1    Meddis, R.2    Hewitt, M.J.3
  • 27
    • 83455170710
    • Binaural detection of speech sources in complex acoustic scenes
    • T. May, S. van de Par, and A. Kohlrausch, "Binaural detection of speech sources in complex acoustic scenes," in Proc. WASPAA, New Paltz, NY, Oct. 2011, pp. 241-244.
    • (2011) Proc. WASPAA , pp. 241-244
    • May, T.1    Van De Par, S.2    Kohlrausch, A.3
  • 28
    • 85032752225
    • Missing-feature approaches in speech recognition
    • DOI 10.1109/MSP.2005.1511828
    • B. Raj and R. M. Stern, "Missing-feature approaches in speech recognition," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 101-116, Sep. 2005. (Pubitemid 41488524)
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 101-116
    • Raj, B.1    Stern, R.M.2
  • 29
    • 85135252448
    • Missing features detection and handling for robust speaker verification
    • M. El-Maliki and A. Drygajlo, "Missing features detection and handling for robust speaker verification," in Proc. Eurospeech, Budapest, Hungary, Sep. 1999, pp. 975-978.
    • (1999) Proc. Eurospeech , pp. 975-978
    • El-Maliki, M.1    Drygajlo, A.2
  • 30
    • 34547539772
    • M. Cooke and T.-W. Lee, "Speech separation and recognition competition," 2006, accessed on 12th Oct. 2010, [Online]. Available: http://staffwww.dcs.shef.ac.uk/people/M.Cooke/SpeechSeparationChallenge.htm
    • (2006) Speech Separation and Recognition Competition
    • Cooke, M.1    Lee, T.-W.2
  • 31
    • 0004319968
    • The NOISEX-92 study on the effect of additive noise on automatic speaker recognition
    • A. P. Varga, H. J. M. Steeneken, M. Tomlinson, and D. Jones, "The NOISEX-92 study on the effect of additive noise on automatic speaker recognition," Speech Research Unit, Defence Research Agency, Malvern, U.K., 1992, Tech. Rep.
    • (1992) Speech Research Unit
    • Varga, A.P.1    Steeneken, H.J.M.2    Tomlinson, M.3    Jones, D.4
  • 32
    • 0020102027
    • Least squares quantization in PCM
    • S. Lloyd, "Least squares quantization in PCM," IEEE Trans. Inf. Theory, vol. 28, no. 2, pp. 129-137, Mar. 1982.
    • (1982) IEEE Trans. Inf. Theory , vol.28 , Issue.2 , pp. 129-137
    • Lloyd, S.1
  • 33
    • 0002629270
    • Maximum likelihood estimation from incomplete data via the EM algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood estimation from incomplete data via the EM algorithm," J. R. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 34
    • 33644661135
    • A glimpsing model of speech perception in noise
    • DOI 10.1121/1.2166600
    • M. Cooke, "A glimpsing model of speech perception in noise," J. Acoust. Soc. Amer., vol. 119, no. 3, pp. 1562-1573, Mar. 2006. (Pubitemid 43326025)
    • (2006) Journal of the Acoustical Society of America , vol.119 , Issue.3 , pp. 1562-1573
    • Cooke, M.1
  • 38
    • 0018455820
    • Image method for efficiently simulating small-room acoustics
    • J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol. 65, no. 4, pp. 943-950, Apr. 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.65 , Issue.4 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 40
    • 84861505891
    • American National Standard Specification for Sound Level Meters, ANSI/ASA S1.4-1983 (R2001)
    • American National Standard Specification for Sound Level Meters, ANSI/ASA S1.4-1983 (R2001), Amer. Nat. Stand. Inst., 1983.
    • (1983) Amer. Nat. Stand. Inst.
  • 41
    • 36049044257
    • D. P. W. Ellis, PLP and RASTA (and MFCC, and Inversion) in Matlab, 2005, accessed on 11th October 2011 [Online]. Available: http://www.ee.columbia.edu/~dpwe/resources/matlab/rastamat
    • (2005) PLP and RASTA (and MFCC and Inversion) in Matlab
    • Ellis, D.P.W.1
  • 42
    • 0024035182
    • On the use of instantaneous and transitional spectral information in speaker recognition
    • F. K. Soong and A. E. Rosenberg, "On the use of instantaneous and transitional spectral information in speaker recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 6, pp. 871-879, Jun. 1988.
    • (1988) IEEE Trans. Acoust., Speech, Signal Process. , vol.36 , Issue.6 , pp. 871-879
    • Soong, F.K.1    Rosenberg, A.E.2
  • 44
    • 85135190755
    • Multi-band and adaptation approaches to robust speech recognition
    • S. Tibrewala and H. Hermansky, "Multi-band and adaptation approaches to robust speech recognition," in Proc. Eurospeech, Rhodes, Greece, Sep. 1997, pp. 2619-2622.
    • (1997) Proc. Eurospeech , pp. 2619-2622
    • Tibrewala, S.1    Hermansky, H.2
  • 45
    • 0028996871
    • Noise estimation techniques for robust speech recognition
    • H. G. Hirsch and C. Ehrlicher, "Noise estimation techniques for robust speech recognition," in Proc. ICASSP, Detroit, MI, 1995, vol. 1, pp. 153-156.
    • (1995) Proc. ICASSP , vol.1 , pp. 153-156
    • Hirsch, H.G.1    Ehrlicher, C.2
  • 46
    • 0021892216
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443-445, Apr. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 47
    • 78149476938
    • Signal-to-signal ratio independent speaker identification for co-channel speech signals
    • R. Saeidi, P. Mowlaee, T. Kinnunen, Z.-H. Tan, M. G. Christensen, S. H. Jensen, and P. Fränti, "Signal-to-signal ratio independent speaker identification for co-channel speech signals," in Proc. ICPR, Istanbul, Turkey, Aug. 2010, pp. 4565-4568.
    • (2010) Proc. ICPR , pp. 4565-4568
    • Saeidi, R.1    Mowlaee, P.2    Kinnunen, T.3    Tan, Z.-H.4    Christensen, M.G.5    Jensen, S.H.6    Fränti, P.7
  • 48
    • 33646023117
    • An introduction to ROC analysis
    • T. Fawcett, "An introduction to ROC analysis," Pattern Recog. Lett., vol. 27, no. 8, pp. 861-874, Jun. 2006.
    • (2006) Pattern Recog. Lett. , vol.27 , Issue.8 , pp. 861-874
    • Fawcett, T.1
  • 49
    • 70349093614
    • An algorithm that improves speech intelligibility in noise for normal-hearing listeners
    • G. Kim, Y. Lu, Y. Hu, and P. C. Loizou, "An algorithm that improves speech intelligibility in noise for normal-hearing listeners," J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1486-1494, Sep. 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.126 , Issue.3 , pp. 1486-1494
    • Kim, G.1    Lu, Y.2    Hu, Y.3    Loizou, P.C.4
  • 50
    • 33745004174
    • Effect of source location and listener location on ILD cues in a reverberant room
    • A. Ihlefeld and B. G. Shinn-Cunningham, "Effect of source location and listener location on ILD cues in a reverberant room," J. Acoust. Soc. Amer., vol. 115, no. 5, p. 2598, 2004.
    • (2004) J. Acoust. Soc. Amer. , vol.115 , Issue.5 , pp. 2598
    • Ihlefeld, A.1    Shinn-Cunningham, B.G.2
  • 51
    • 18744392833
    • Localizing nearby sound sources in a classroom: Binaural room impulse responses
    • DOI 10.1121/1.1872572
    • B. G. Shinn-Cunningham, N. Kopčo, and T. J. Martin, "Localizing nearby sound sources in a classroom: Binaural room impulse responses," J. Acoust. Soc. Amer., vol. 117, no. 5, pp. 3100-3115, May 2005. (Pubitemid 40675172)
    • (2005) Journal of the Acoustical Society of America , vol.117 , Issue.5 , pp. 3100-3115
    • Shinn-Cunningham, B.G.1    Kopčo, N.2    Martin, T.J.3
  • 52
    • 33750311718
    • Binary and ratio time-frequency masks for robust speech recognition
    • DOI 10.1016/j.specom.2006.09.003, PII S0167639306001129
    • S. Srinivasan, N. Roman, and D. L. Wang, "Binary and ratio time-frequency masks for robust speech recognition," Speech Commun., vol. 48, no. 11, pp. 1486-1501, Nov. 2006. (Pubitemid 44634774)
    • (2006) Speech Communication , vol.48 , Issue.11 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.3
  • 53
    • 85009145345
    • Observations on overlap: Findings and implications for automatic processing of multi-party conversation
    • E. Shriberg, A. Stolcke, and D. Baron, "Observations on overlap: Findings and implications for automatic processing of multi-party conversation," in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001, pp. 1359-1362.
    • (2001) Proc. Eurospeech , pp. 1359-1362
    • Shriberg, E.1    Stolcke, A.2    Baron, D.3
  • 54
    • 33947653296
    • Recognition of reverberant speech using full cepstral features and spectral missing data
    • K. J. Palomäki, G. J. Brown, and J. P. Barker, "Recognition of reverberant speech using full cepstral features and spectral missing data," in Proc. ICASSP, Toulouse, France, 2006, pp. 289-292.
    • (2006) Proc. ICASSP , pp. 289-292
    • Palomäki, K.J.1    Brown, G.J.2    Barker, J.P.3
  • 55
    • 80051661646
    • Binaural sound source separation motivated by auditory processing
    • C. Kim, K. Kumar, and R. M. Stern, "Binaural sound source separation motivated by auditory processing," in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 5072-5075.
    • (2011) Proc. ICASSP , pp. 5072-5075
    • Kim, C.1    Kumar, K.2    Stern, R.M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.