메뉴 건너뛰기




Volumn 127, Issue 3, 2010, Pages 1661-1672

Two-microphone separation of speech mixtures based on interclass variance maximization

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL COSTS; MULTILEVEL THRESHOLDING; REALISTIC SCENARIO; SOUND SOURCE; SPARSE METHODS; SPEECH SEPARATION; TIME FREQUENCY; TIME-FREQUENCY POINTS; VARIANCE MAXIMIZATION;

EID: 77950410617     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.3294713     Document Type: Article
Times cited : (15)

References (38)
  • 1
    • 80052339383 scopus 로고
    • Some experiments on the recognition of speech, with one and with two ears
    • JASMAN 0001-4966, 10.1121/1.1907229
    • E. C. Cherry, " Some experiments on the recognition of speech, with one and with two ears.," J. Acoust. Soc. Am. JASMAN 0001-4966 25, 975-979 (1953). 10.1121/1.1907229
    • (1953) J. Acoust. Soc. Am. , vol.25 , pp. 975-979
    • Cherry, E.C.1
  • 2
    • 0039334758 scopus 로고    scopus 로고
    • The cocktail party phenomenom: A review of research on speech intellibility in multiple-talker conditions
    • ACUSAY 0001-7884
    • A. W. Bronkhorst, " The cocktail party phenomenom: A review of research on speech intellibility in multiple-talker conditions.," Acustica ACUSAY 0001-7884 86, 117-128 (2000).
    • (2000) Acustica , vol.86 , pp. 117-128
    • Bronkhorst, A.W.1
  • 3
    • 33845361885 scopus 로고    scopus 로고
    • Binaural segregation in multisource reverberant environments
    • JASMAN 0001-4966, 10.1121/1.2355480
    • N. Roman, S. Srinivasan, and D. Wang, " Binaural segregation in multisource reverberant environments.," J. Acoust. Soc. Am. JASMAN 0001-4966 120, 4040-4051 (2006). 10.1121/1.2355480
    • (2006) J. Acoust. Soc. Am. , vol.120 , pp. 4040-4051
    • Roman, N.1    Srinivasan, S.2    Wang, D.3
  • 4
    • 0032187518 scopus 로고    scopus 로고
    • Blind signal separation: Statistical principles
    • IEEPAD 0018-9219, 10.1109/5.720250
    • J. F. Cardoso, " Blind signal separation: Statistical principles.," Proc. IEEE IEEPAD 0018-9219 86, 2009-2025 (1998). 10.1109/5.720250
    • (1998) Proc. IEEE , vol.86 , pp. 2009-2025
    • Cardoso, J.F.1
  • 6
    • 4344579404 scopus 로고    scopus 로고
    • A robust and precise method for solving the permutation problem of frequency-domain blind source separation
    • IESPEJ 1063-6676, 10.1109/TSA.2004.832994
    • H. Sawada, R. Mukai, S. Araki, and S. Makino, " A robust and precise method for solving the permutation problem of frequency-domain blind source separation.," IEEE Trans. Speech Audio Process. IESPEJ 1063-6676 12, 530-538 (2004). 10.1109/TSA.2004.832994
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , pp. 530-538
    • Sawada, H.1    Mukai, R.2    Araki, S.3    Makino, S.4
  • 7
    • 0035501128 scopus 로고    scopus 로고
    • Underdetermined blind source separation using sparse representations
    • SPRODR 0165-1684, 10.1016/S0165-1684(01)00120-7
    • P. Bofill and M. Zibulevski, " Underdetermined blind source separation using sparse representations.," Signal Process. SPRODR 0165-1684 81, 2353-2362 (2001). 10.1016/S0165-1684(01)00120-7
    • (2001) Signal Process. , vol.81 , pp. 2353-2362
    • Bofill, P.1    Zibulevski, M.2
  • 8
    • 0032131292 scopus 로고    scopus 로고
    • Atomic decomposition by basis pursuit
    • SJOCE3 1064-8275, 10.1137/S1064827596304010
    • S. Chen, D. L. Donoho, and M. A. Saunders, " Atomic decomposition by basis pursuit.," SIAM J. Sci. Comput. (USA) SJOCE3 1064-8275 20, 33-61 (1998). 10.1137/S1064827596304010
    • (1998) SIAM J. Sci. Comput. (USA) , vol.20 , pp. 33-61
    • Chen, S.1    Donoho, D.L.2    Saunders, M.A.3
  • 9
    • 0034133184 scopus 로고    scopus 로고
    • Learning overcomplete representations
    • NEUCEB 0899-7667, 10.1162/089976600300015826
    • M. S. Lewicki and T. J. Sejnowski, " Learning overcomplete representations.," Neural Comput. NEUCEB 0899-7667 12, 337-365 (2000). 10.1162/089976600300015826
    • (2000) Neural Comput. , vol.12 , pp. 337-365
    • Lewicki, M.S.1    Sejnowski, T.J.2
  • 12
    • 3142694930 scopus 로고    scopus 로고
    • Blind separation of speech mixtures via time-frequency masking
    • ITPRED 1053-587X, 10.1109/TSP.2004.828896
    • O. Yilmaz and S. Rickard, " Blind separation of speech mixtures via time-frequency masking.," IEEE Trans. Signal Process. ITPRED 1053-587X 52, 1830-1847 (2004). 10.1109/TSP.2004.828896
    • (2004) IEEE Trans. Signal Process. , vol.52 , pp. 1830-1847
    • Yilmaz, O.1    Rickard, S.2
  • 14
    • 0018306059 scopus 로고
    • A threshold selection method from gray-level histogram
    • ISYMAW 0018-9472, 10.1109/TSMC.1979.4310076
    • N. Otsu, " A threshold selection method from gray-level histogram.," IEEE Trans. Syst. Man Cybern. ISYMAW 0018-9472 9, 62-66 (1979). 10.1109/TSMC.1979.4310076
    • (1979) IEEE Trans. Syst. Man Cybern. , vol.9 , pp. 62-66
    • Otsu, N.1
  • 15
    • 50549096869 scopus 로고    scopus 로고
    • Stereo audio source separation based on time-frequency masking and multilevel thresholding
    • DSPREJ 1051-2004, 10.1016/j.ds2008.06.004
    • M. Cobos and J. J. Lopez, " Stereo audio source separation based on time-frequency masking and multilevel thresholding.," Digit. Signal Process. DSPREJ 1051-2004 18, 960-976 (2008). 10.1016/j.dsp.2008.06.004
    • (2008) Digit. Signal Process. , vol.18 , pp. 960-976
    • Cobos, M.1    Lopez, J.J.2
  • 16
    • 50949092983 scopus 로고    scopus 로고
    • edited by S. Makino, T. W. Lee, and H. Sawada (Springer, New York)
    • Blind Speech Separation, edited by, S. Makino, T. W. Lee, and, H. Sawada, (Springer, New York, 2007).
    • (2007) Blind Speech Separation
  • 17
    • 0003733873 scopus 로고
    • (Prentice-Hall, Englewood Cliffs, NJ)
    • L. Cohen, Time-Frequency Analysis (Prentice-Hall, Englewood Cliffs, NJ, 1995).
    • (1995) Time-Frequency Analysis
    • Cohen, L.1
  • 20
    • 67650166628 scopus 로고    scopus 로고
    • Improving isolation of blindly separated sources using time-frequency masking
    • ISPLEM 1070-9908, 10.1109/LSP.2008.2002927
    • M. Cobos and J. J. Lopez, " Improving isolation of blindly separated sources using time-frequency masking.," IEEE Signal Process. Lett. ISPLEM 1070-9908 15, 617-620 (2008). 10.1109/LSP.2008.2002927
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 617-620
    • Cobos, M.1    Lopez, J.J.2
  • 22
    • 56249144201 scopus 로고    scopus 로고
    • Time frequency masking for speech separation and its potential for hearing aid design
    • TAMPFF 1084-7138, 10.1177/1084713808326455
    • D. Wang, " Time frequency masking for speech separation and its potential for hearing aid design.," Trends Amplif. TAMPFF 1084-7138 12, 332-353 (2008). 10.1177/1084713808326455
    • (2008) Trends Amplif. , vol.12 , pp. 332-353
    • Wang, D.1
  • 23
    • 33749065640 scopus 로고    scopus 로고
    • Ph.D. thesis, École Polytechynique F́d́rale de Lausanne (EPFL), Lausanne, Switzerland
    • C. Faller, " Parametric coding of spatial audio.," Ph.D. thesis, École Polytechynique F́d́rale de Lausanne (EPFL), Lausanne, Switzerland (2004).
    • (2004) Parametric Coding of Spatial Audio
    • Faller, C.1
  • 24
    • 9644281074 scopus 로고    scopus 로고
    • Source localization in complex listening situations: Selection of binaural cues based on interaural coherence
    • DOI 10.1121/1.1791872
    • C. Faller and J. Merimaa, " Source localization in complex listening situations: Selection of binaural cues based on interaural coherence.," J. Acoust. Soc. Am. JASMAN 0001-4966 116, 3075-3089 (2004). 10.1121/1.1791872 (Pubitemid 39575553)
    • (2004) Journal of the Acoustical Society of America , vol.116 , Issue.5 , pp. 3075-3089
    • Faller, C.1    Merimaa, J.2
  • 25
    • 41849132137 scopus 로고    scopus 로고
    • Localization of multiple acoustic sources with small arrays using a coherence test
    • JASMAN 0001-4966, 10.1121/1.2871597
    • S. Mohan, M. E. Lockwood, M. L. Kramer, and D. L. Jones, " Localization of multiple acoustic sources with small arrays using a coherence test.," J. Acoust. Soc. Am. JASMAN 0001-4966 123, 2136-2147 (2008). 10.1121/1.2871597
    • (2008) J. Acoust. Soc. Am. , vol.123 , pp. 2136-2147
    • Mohan, S.1    Lockwood, M.E.2    Kramer, M.L.3    Jones, D.L.4
  • 27
    • 0023295449 scopus 로고
    • Coherence and time delay estimation
    • IEEPAD 0018-9219, 10.1109/PROC.1987.13723
    • G. C. Carter, " Coherence and time delay estimation.," Proc. IEEE IEEPAD 0018-9219 75, 236-255 (1987). 10.1109/PROC.1987.13723
    • (1987) Proc. IEEE , vol.75 , pp. 236-255
    • Carter, G.C.1
  • 28
    • 17744367667 scopus 로고    scopus 로고
    • The time-delay graph and the delayogram-New visualizations for time delay
    • ISPLEM 1070-9908, 10.1109/LSP.2004.842266
    • H. F. Silverman and J. M. Sachar, " The time-delay graph and the delayogram-New visualizations for time delay.," IEEE Signal Process. Lett. ISPLEM 1070-9908 12, 301-304 (2005). 10.1109/LSP.2004.842266
    • (2005) IEEE Signal Process. Lett. , vol.12 , pp. 301-304
    • Silverman, H.F.1    Sachar, J.M.2
  • 29
    • 0018455820 scopus 로고
    • Image method for efficiently simulating small-room acoustics
    • JASMAN 0001-4966, 10.1121/1.382599
    • J. B. Allen and D. A. Berkley, " Image method for efficiently simulating small-room acoustics.," J. Acoust. Soc. Am. JASMAN 0001-4966 65, 943-950 (1979). 10.1121/1.382599
    • (1979) J. Acoust. Soc. Am. , vol.65 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 30
    • 0023823049 scopus 로고
    • A survey of thresholding techniques
    • CVGPDB 0734-189X, 10.1016/0734-189X(88)90022-9
    • P. K. Sahoo, S. Soltani, and A. K. C. Wong, " A survey of thresholding techniques.," Comput. Vis. Graph. Image Process. CVGPDB 0734-189X 41, 233-260 (1988). 10.1016/0734-189X(88)90022-9
    • (1988) Comput. Vis. Graph. Image Process. , vol.41 , pp. 233-260
    • Sahoo, P.K.1    Soltani, S.2    Wong, A.K.C.3
  • 31
    • 1842422015 scopus 로고    scopus 로고
    • Survey over image thresholding techniques and quantitative performance evaluation
    • JEIME5 1017-9909, 10.1117/1.1631315
    • M. Sezgin and B. Sankur, " Survey over image thresholding techniques and quantitative performance evaluation.," J. Electron. Imaging JEIME5 1017-9909 13, 146-165 (2004). 10.1117/1.1631315
    • (2004) J. Electron. Imaging , vol.13 , pp. 146-165
    • Sezgin, M.1    Sankur, B.2
  • 32
    • 0035440214 scopus 로고    scopus 로고
    • A fast algorithm for multilevel thresholding
    • P. Liao, T. Chen, and P. Chung, " A fast algorithm for multilevel thresholding.," J. Inf. Sci. Eng. 17, 713-717 (2001).
    • (2001) J. Inf. Sci. Eng. , vol.17 , pp. 713-717
    • Liao, P.1    Chen, T.2    Chung, P.3
  • 33
    • 0031206664 scopus 로고    scopus 로고
    • A fast iterative scheme for multilevel thresholding methods
    • SPRODR 0165-1684, 10.1016/S0165-1684(97)00080-7
    • P. Y. Yin and L. H. Chen, " A fast iterative scheme for multilevel thresholding methods.," Signal Process. SPRODR 0165-1684 60, 305-313 (1997). 10.1016/S0165-1684(97)00080-7
    • (1997) Signal Process. , vol.60 , pp. 305-313
    • Yin, P.Y.1    Chen, L.H.2
  • 35
    • 67149088353 scopus 로고    scopus 로고
    • The 2008 signal separation evaluation campaign: A community-based approach to large-scale evaluation
    • LNCSD9 0302-9743, 10.1007/978-3-642-00599-2-92
    • E. Vincent, S. Araki, and P. Bofill, " The 2008 signal separation evaluation campaign: A community-based approach to large-scale evaluation.," Lect. Notes Comput. Sci. LNCSD9 0302-9743 5441, 734-741 (2009). 10.1007/978-3-642-00599-2-92
    • (2009) Lect. Notes Comput. Sci. , vol.5441 , pp. 734-741
    • Vincent, E.1    Araki, S.2    Bofill, P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.