메뉴 건너뛰기




Volumn 21, Issue 1, 2013, Pages 122-131

An unsupervised approach to cochannel speech separation

Author keywords

cochannel speech separation; Computational auditory scene analysis (CASA); sequential grouping; unsupervised clustering; unvoiced speech segregation

Indexed keywords

PATIENT REHABILITATION; SIGNAL TO NOISE RATIO; SOURCE SEPARATION; SPEECH ANALYSIS;

EID: 84867946385     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2215591     Document Type: Article
Times cited : (102)

References (36)
  • 2
    • 44949219122 scopus 로고    scopus 로고
    • Recent advances in speech fragment decoding techniques
    • J. Barker, A. Coy, N. Ma, and M. Cooke, "Recent advances in speech fragment decoding techniques," in Proc. Interspeech '06, 2006, pp. 85-88.
    • (2006) Proc. Interspeech '06 , pp. 85-88
    • Barker, J.1    Coy, A.2    Ma, N.3    Cooke, M.4
  • 3
    • 84867941359 scopus 로고    scopus 로고
    • [Online] Praat: doing phonetics by computer (version 5.0.02)
    • P. Boersma and D.Weenink [Online]. Available: http://www.fon.hum. uva.nl/praat, 2007, Praat: doing phonetics by computer (version 5.0.02)
    • (2007)
    • Boersma, P.1    Weenink, D.2
  • 4
    • 0014753348 scopus 로고
    • Interaction of competing speech signals with hearing losses
    • R. C. Carhart and T. W. Tillman, "Interaction of competing speech signals with hearing losses," Arch. Otolaryngol., vol. 91, pp. 273-279, 1970.
    • (1970) Arch. Otolaryngol , vol.91 , pp. 273-279
    • Carhart, R.C.1    Tillman, T.W.2
  • 6
    • 47749094114 scopus 로고    scopus 로고
    • [Online]
    • M. Cooke and T. Lee, Speech Separation Challenge, 2006. [Online]. Available: http://staffwww.dcs.shef.ac.uk/people/M.Cooke/Speech- SeparationChallenge.htm
    • (2006) Speech Separation Challenge
    • Cooke, M.1    Lee, T.2
  • 8
    • 0025259936 scopus 로고
    • Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing
    • J. M. Festen andR. Plomp, "Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing," J. Acoust. Soc. Amer., vol. 88, pp. 1725-1736, 1990.
    • (1990) J. Acoust. Soc. Amer , vol.88 , pp. 1725-1736
    • Festen, J.M.1    Plomp, R.2
  • 9
    • 84867933901 scopus 로고    scopus 로고
    • [Online]
    • G. Grindlay, 2010 [Online]. Available: http://code.google.com/p/nmflib/, NMFlib.
    • (2010)
    • Grindlay, G.1
  • 10
    • 69249222720 scopus 로고    scopus 로고
    • Superhuman multi-talker speech recognition: A graphical model approach
    • J. R. Hershey, S. J. Rennie, P. A. Olsen, and T. T. Kristjansson, "Superhuman multi-talker speech recognition: A graphical model approach," Comput. Speech Lang., vol. 24, pp. 45-66, 2010.
    • (2010) Comput. Speech Lang , vol.24 , pp. 45-66
    • Hershey, J.R.1    Rennie, S.J.2    Olsen, P.A.3    Kristjansson, T.T.4
  • 11
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Sep
    • G. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
    • (2004) IEEE Trans. Neural Netw , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 12
    • 38849102154 scopus 로고    scopus 로고
    • Auditory segmentation based on onset and offset analysis
    • Feb
    • G. Hu and D. L. Wang, "Auditory segmentation based on onset and offset analysis," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 396-405, Feb. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.2 , pp. 396-405
    • Hu, G.1    Wang, D.L.2
  • 13
    • 49249107353 scopus 로고    scopus 로고
    • Segregation of unvoiced speech from nonspeech interference
    • G. Hu and D. L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol. 124, pp. 1306-1319, 2008.
    • (2008) J. Acoust. Soc. Amer , vol.124 , pp. 1306-1319
    • Hu, G.1    Wang, D.L.2
  • 14
    • 77955695149 scopus 로고    scopus 로고
    • A tandem algorithm for pitch estimation and voiced speech segregation
    • Nov
    • G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process, vol. 18, no. 8, pp. 2067-2079, Nov. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.8 , pp. 2067-2079
    • Hu, G.1    Wang, D.L.2
  • 15
    • 80051610956 scopus 로고    scopus 로고
    • An approach to sequential grouping in cochannel speech
    • K. Hu and D. L. Wang, "An approach to sequential grouping in cochannel speech," in Proc. ICASSP'11, 2011, pp. 4636-4639.
    • (2011) Proc. ICASSP'11 , pp. 4636-4639
    • Hu, K.1    Wang, D.L.2
  • 16
    • 85008054377 scopus 로고    scopus 로고
    • Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction
    • Aug
    • K. Hu and D. L.Wang, "Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction," IEEE Trans. Audio, Speech, Lang. Process, vol. 19, no. 6, pp. 1600-1609, Aug. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.6 , pp. 1600-1609
    • Hu, K.1    Wang, D.L.2
  • 18
    • 0033592606 scopus 로고    scopus 로고
    • Learning the parts of objects by nonnegative matrix factorization
    • D. D. Lee and H. S. Seung, "Learning the parts of objects by nonnegative matrix factorization," Nature, vol. 401, pp. 788-791, 1999.
    • (1999) Nature , vol.401 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 19
    • 34250115918 scopus 로고
    • An examination of procedures for determining the number of clusters in a data set
    • G. W. Milligan and M. C. Cooper, "An examination of procedures for determining the number of clusters in a data set," Psychometrika, vol. 50, no. 2, pp. 159-179, 1985.
    • (1985) Psychometrika , vol.50 , Issue.2 , pp. 159-179
    • Milligan, G.W.1    Cooper, M.C.2
  • 23
    • 48849091396 scopus 로고    scopus 로고
    • Single-channel speech separation using soft mask filtering
    • Nov
    • M. H. Radfar and R.M. Dansereau, "Single-channel speech separation using soft mask filtering," IEEE Trans. Audio, Speech, Lang. Process, vol. 15, no. 8, pp. 2299-2310, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2299-2310
    • Radfar, M.H.1    Dansereau, R.M.2
  • 24
    • 56249144712 scopus 로고    scopus 로고
    • Soft mask methods for single-channel speaker separation
    • Aug
    • A. Reddy and B. Raj, "Soft mask methods for single-channel speaker separation," IEEE Trans. Audio, Speech, Lang. Process, vol. 15, no. 6, pp. 1766-1776, Aug. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.6 , pp. 1766-1776
    • Reddy, A.1    Raj, B.2
  • 26
    • 44949110218 scopus 로고    scopus 로고
    • Single-channel speech separation using sparse non-negative matrix factorization
    • M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. Interspeech' 06, 2006, pp. 2614-2617.
    • (2006) Proc. Interspeech'06 , pp. 2614-2617
    • Schmidt, M.N.1    Olsson, R.K.2
  • 28
    • 69249159165 scopus 로고    scopus 로고
    • A computational auditory scene analysis system for speech segregation and robust speech recognition
    • Y. Shao, S. Srinivasan, Z. Jin, and D. L. Wang, "A computational auditory scene analysis system for speech segregation and robust speech recognition," Comput. Speech Lang., vol. 24, pp. 77-93, 2010.
    • (2010) Comput. Speech Lang , vol.24 , pp. 77-93
    • Shao, Y.1    Srinivasan, S.2    Jin, Z.3    Wang, D.L.4
  • 29
    • 33744996003 scopus 로고    scopus 로고
    • Model-based sequential organization in cochannel speech
    • DOI 10.1109/TSA.2005.854106
    • Y. Shao and D. L. Wang, "Model-based sequential organization in cochannel speech," IEEE Trans. Audio, Speech, Lang. Process, vol. 14, no. 1, pp. 289-298, Jan. 2006. (Pubitemid 43863474)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 289-298
    • Shao, Y.1    Wang, D.2
  • 30
    • 67349134831 scopus 로고    scopus 로고
    • Sequential organization of speech in computational auditory scene analysis
    • Y. Shao and D. L. Wang, "Sequential organization of speech in computational auditory scene analysis," Speech Commun., vol. 51, pp. 657-667, 2009.
    • (2009) Speech Commun , vol.51 , pp. 657-667
    • Shao, Y.1    Wang, D.L.2
  • 31
    • 38049021850 scopus 로고    scopus 로고
    • Convolutive speech bases and their application to supervised speech separation
    • Jan
    • P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Process, vol. 15, no. 1, pp. 1-12, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.1 , pp. 1-12
    • Smaragdis, P.1
  • 32
    • 78049306672 scopus 로고    scopus 로고
    • Source-filter-based single-channel speech separation using pitch information
    • Feb
    • M. Stark, M. Wohlmayr, and F. Pernkopf, "Source-filter-based single-channel speech separation using pitch information," IEEE Trans. Audio, Speech, Lang. Process, vol. 19, no. 2, pp. 242-255, Feb. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.2 , pp. 242-255
    • Stark, M.1    Wohlmayr, M.2    Pernkopf, F.3
  • 35
    • 69249151355 scopus 로고    scopus 로고
    • Speech separation using speaker-adapted eigenvoice speech models
    • R.Weiss and D. Ellis, "Speech separation using speaker-adapted eigenvoice speech models," Comput. Speech Lang., vol. 24, no. 1, pp. 16-29, 2010.
    • (2010) Comput. Speech Lang , vol.24 , Issue.1 , pp. 16-29
    • Weiss, R.1    Ellis, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.