SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 21, Issue 1, 2013, Pages 122-131

An unsupervised approach to cochannel speech separation

Author keywords

cochannel speech separation; Computational auditory scene analysis (CASA); sequential grouping; unsupervised clustering; unvoiced speech segregation

Indexed keywords

PATIENT REHABILITATION; SIGNAL TO NOISE RATIO; SOURCE SEPARATION; SPEECH ANALYSIS;

COCHANNEL SPEECH SEPARATIONS; COMPUTATIONAL AUDITORY SCENE ANALYSIS; SEQUENTIAL GROUPING; UNSUPERVISED CLUSTERING; UNVOICED SPEECH SEGREGATIONS;

SPEECH;

EID: 84867946385 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2012.2215591 Document Type: Article

Times cited : (102)

References (36)

1
- 38849083727
- San Rafael CA: Morgan & Claypool
- J. B. Allen, Articulation and Intelligibility. San Rafael, CA: Morgan & Claypool, 2005.
- (2005) Articulation and Intelligibility
- Allen, J.B.¹

2
- 44949219122
- Recent advances in speech fragment decoding techniques
- J. Barker, A. Coy, N. Ma, and M. Cooke, "Recent advances in speech fragment decoding techniques," in Proc. Interspeech '06, 2006, pp. 85-88.
- (2006) Proc. Interspeech '06 , pp. 85-88
- Barker, J.¹ Coy, A.² Ma, N.³ Cooke, M.⁴

3
- 84867941359
- [Online] Praat: doing phonetics by computer (version 5.0.02)
- P. Boersma and D.Weenink [Online]. Available: http://www.fon.hum. uva.nl/praat, 2007, Praat: doing phonetics by computer (version 5.0.02)
- (2007)
- Boersma, P.¹ Weenink, D.²

4
- 0014753348
- Interaction of competing speech signals with hearing losses
- R. C. Carhart and T. W. Tillman, "Interaction of competing speech signals with hearing losses," Arch. Otolaryngol., vol. 91, pp. 273-279, 1970.
- (1970) Arch. Otolaryngol , vol.91 , pp. 273-279
- Carhart, R.C.¹ Tillman, T.W.²

5
- 33746239350
- Extended SMART algorithms for non-negative matrix factorization
- A. Cichocki, S.-I. Amari, R. Zdunek, R. Kompass, G. Hori, and Z. He, "Extended SMART algorithms for non-negative matrix factorization," in Proc. ICAISC '06, 2006, no. 548-562.
- (2006) Proc. ICAISC '06 , Issue.548-562
- Cichocki, A.¹ Amari, S.-I.² Zdunek, R.³ Kompass, R.⁴ Hori, G.⁵ He, Z.⁶

6
- 47749094114
- [Online]
- M. Cooke and T. Lee, Speech Separation Challenge, 2006. [Online]. Available: http://staffwww.dcs.shef.ac.uk/people/M.Cooke/Speech- SeparationChallenge.htm
- (2006) Speech Separation Challenge
- Cooke, M.¹ Lee, T.²

7
- 0004191790
- New York: Thieme
- H. Dillon, Hearing Aids. New York: Thieme, 2001.
- (2001) Hearing Aids
- Dillon, H.¹

8
- 0025259936
- Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing
- J. M. Festen andR. Plomp, "Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing," J. Acoust. Soc. Amer., vol. 88, pp. 1725-1736, 1990.
- (1990) J. Acoust. Soc. Amer , vol.88 , pp. 1725-1736
- Festen, J.M.¹ Plomp, R.²

9
- 84867933901
- [Online]
- G. Grindlay, 2010 [Online]. Available: http://code.google.com/p/nmflib/, NMFlib.
- (2010)
- Grindlay, G.¹

10
- 69249222720
- Superhuman multi-talker speech recognition: A graphical model approach
- J. R. Hershey, S. J. Rennie, P. A. Olsen, and T. T. Kristjansson, "Superhuman multi-talker speech recognition: A graphical model approach," Comput. Speech Lang., vol. 24, pp. 45-66, 2010.
- (2010) Comput. Speech Lang , vol.24 , pp. 45-66
- Hershey, J.R.¹ Rennie, S.J.² Olsen, P.A.³ Kristjansson, T.T.⁴

11
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Sep
- G. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
- (2004) IEEE Trans. Neural Netw , vol.15 , Issue.5 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

12
- 38849102154
- Auditory segmentation based on onset and offset analysis
- Feb
- G. Hu and D. L. Wang, "Auditory segmentation based on onset and offset analysis," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 396-405, Feb. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.2 , pp. 396-405
- Hu, G.¹ Wang, D.L.²

13
- 49249107353
- Segregation of unvoiced speech from nonspeech interference
- G. Hu and D. L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol. 124, pp. 1306-1319, 2008.
- (2008) J. Acoust. Soc. Amer , vol.124 , pp. 1306-1319
- Hu, G.¹ Wang, D.L.²

14
- 77955695149
- A tandem algorithm for pitch estimation and voiced speech segregation
- Nov
- G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process, vol. 18, no. 8, pp. 2067-2079, Nov. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.8 , pp. 2067-2079
- Hu, G.¹ Wang, D.L.²

15
- 80051610956
- An approach to sequential grouping in cochannel speech
- K. Hu and D. L. Wang, "An approach to sequential grouping in cochannel speech," in Proc. ICASSP'11, 2011, pp. 4636-4639.
- (2011) Proc. ICASSP'11 , pp. 4636-4639
- Hu, K.¹ Wang, D.L.²

16
- 85008054377
- Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction
- Aug
- K. Hu and D. L.Wang, "Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction," IEEE Trans. Audio, Speech, Lang. Process, vol. 19, no. 6, pp. 1600-1609, Aug. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.6 , pp. 1600-1609
- Hu, K.¹ Wang, D.L.²

17
- 67349235289
- Speaker distinguishing distances: A comparative study
- A. N. Iyer, U. O. Ofoegbu, R. E. Yantorno, and B. Y. Smolenski, "Speaker distinguishing distances: A comparative study," Int J. Speech Technol., vol. 10, pp. 95-107, 2007.
- (2007) Int J. Speech Technol , vol.10 , pp. 95-107
- Iyer, A.N.¹ Ofoegbu, U.O.² Yantorno, R.E.³ Smolenski, B.Y.⁴

18
- 0033592606
- Learning the parts of objects by nonnegative matrix factorization
- D. D. Lee and H. S. Seung, "Learning the parts of objects by nonnegative matrix factorization," Nature, vol. 401, pp. 788-791, 1999.
- (1999) Nature , vol.401 , pp. 788-791
- Lee, D.D.¹ Seung, H.S.²

19
- 34250115918
- An examination of procedures for determining the number of clusters in a data set
- G. W. Milligan and M. C. Cooper, "An examination of procedures for determining the number of clusters in a data set," Psychometrika, vol. 50, no. 2, pp. 159-179, 1985.
- (1985) Psychometrika , vol.50 , Issue.2 , pp. 159-179
- Milligan, G.W.¹ Cooper, M.C.²

20
- 0031237388
- Cochannel speaker separation by harmonic enhancement and suppression
- PII S1063667697063852
- D. P. Morgan, E. B. George, L. T. Lee, and S. M. Kay, "Cochannel speaker separation by harmonic enhancement and suppression," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 407-424, Sep. 1997. (Pubitemid 127746014)
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.5 , pp. 407-424
- Morgan, D.P.¹ Bryan George, E.² Lee, L.T.³ Kay, S.M.⁴

21
- 84873425853
- Non-negative hiddenMarkov modeling of audio with application to source separation
- G. J. Mysore, P. Smaragdis, and B. Raj, "Non-negative hiddenMarkov modeling of audio with application to source separation," in Proc. Int. Conf. Latent Variable Anal. Signal Separat. (LVA/ICA), 2010.
- (2010) Proc. Int. Conf. Latent Variable Anal. Signal Separat. (LVA/ICA)
- Mysore, G.J.¹ Smaragdis, P.² Raj, B.³

22
- 79959859356
- Unsupervised indexing of conversations with short speaker utterances
- U. O. Ofoegbu, A. N. Iyer, R. E. Yantorno, and S. Wenndt, "Unsupervised indexing of conversations with short speaker utterances," in Proc. IEEE Aerospace Conf., 2006, pp. 1-11.
- (2006) Proc. IEEE Aerospace Conf , pp. 1-11
- Ofoegbu, U.O.¹ Iyer, A.N.² Yantorno, R.E.³ Wenndt, S.⁴

23
- 48849091396
- Single-channel speech separation using soft mask filtering
- Nov
- M. H. Radfar and R.M. Dansereau, "Single-channel speech separation using soft mask filtering," IEEE Trans. Audio, Speech, Lang. Process, vol. 15, no. 8, pp. 2299-2310, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2299-2310
- Radfar, M.H.¹ Dansereau, R.M.²

24
- 56249144712
- Soft mask methods for single-channel speaker separation
- Aug
- A. Reddy and B. Raj, "Soft mask methods for single-channel speaker separation," IEEE Trans. Audio, Speech, Lang. Process, vol. 15, no. 6, pp. 1766-1776, Aug. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.6 , pp. 1766-1776
- Reddy, A.¹ Raj, B.²

25
- 0003584577
- 2nd ed. Englewood Cliffs NJ: Prentice-Hall
- S. Russell and P. Norvig, Artificial Intelligence-A Modern Approach, 2nd ed. Englewood Cliffs, NJ: Prentice-Hall, 2002.
- (2002) Artificial Intelligence-A Modern Approach
- Russell, S.¹ Norvig, P.²

26
- 44949110218
- Single-channel speech separation using sparse non-negative matrix factorization
- M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. Interspeech' 06, 2006, pp. 2614-2617.
- (2006) Proc. Interspeech'06 , pp. 2614-2617
- Schmidt, M.N.¹ Olsson, R.K.²

27
- 46049084086
- Ph.D. dissertation Dept. of Comput. Sci. & Eng., The Ohio State Univ., Columbus
- Y. Shao, "Sequential organization in computational auditory scene analysis," Ph.D. dissertation, Dept. of Comput. Sci. & Eng., The Ohio State Univ., Columbus, 2007.
- (2007) Sequential Organization in Computational Auditory Scene Analysis
- Shao, Y.¹

28
- 69249159165
- A computational auditory scene analysis system for speech segregation and robust speech recognition
- Y. Shao, S. Srinivasan, Z. Jin, and D. L. Wang, "A computational auditory scene analysis system for speech segregation and robust speech recognition," Comput. Speech Lang., vol. 24, pp. 77-93, 2010.
- (2010) Comput. Speech Lang , vol.24 , pp. 77-93
- Shao, Y.¹ Srinivasan, S.² Jin, Z.³ Wang, D.L.⁴

29
- 33744996003
- Model-based sequential organization in cochannel speech
- DOI 10.1109/TSA.2005.854106
- Y. Shao and D. L. Wang, "Model-based sequential organization in cochannel speech," IEEE Trans. Audio, Speech, Lang. Process, vol. 14, no. 1, pp. 289-298, Jan. 2006. (Pubitemid 43863474)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 289-298
- Shao, Y.¹ Wang, D.²

30
- 67349134831
- Sequential organization of speech in computational auditory scene analysis
- Y. Shao and D. L. Wang, "Sequential organization of speech in computational auditory scene analysis," Speech Commun., vol. 51, pp. 657-667, 2009.
- (2009) Speech Commun , vol.51 , pp. 657-667
- Shao, Y.¹ Wang, D.L.²

31
- 38049021850
- Convolutive speech bases and their application to supervised speech separation
- Jan
- P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Process, vol. 15, no. 1, pp. 1-12, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.1 , pp. 1-12
- Smaragdis, P.¹

32
- 78049306672
- Source-filter-based single-channel speech separation using pitch information
- Feb
- M. Stark, M. Wohlmayr, and F. Pernkopf, "Source-filter-based single-channel speech separation using pitch information," IEEE Trans. Audio, Speech, Lang. Process, vol. 19, no. 2, pp. 242-255, Feb. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.2 , pp. 242-255
- Stark, M.¹ Wohlmayr, M.² Pernkopf, F.³

33
- 34047261805
- An overview of automatic speaker diarization systems
- DOI 10.1109/TASL.2006.878256
- S. E. Tranter and D. A. Reynold, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Lang. Process, vol. 14, no. 5, pp. 1557-1565, Sep. 2006. (Pubitemid 46547580)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.E.¹ Reynolds, D.A.²

34
- 82255178542
- Hoboken,NJ: Wiley-IEEE
- Computational Auditory Scene Analysis: Principles, Algorithms and Applications,D. L.Wang and G. J. Brown, Eds. Hoboken,NJ:Wiley-IEEE, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms and Applications
- Wang, D.L.¹ Brown, G.J.²

35
- 69249151355
- Speech separation using speaker-adapted eigenvoice speech models
- R.Weiss and D. Ellis, "Speech separation using speaker-adapted eigenvoice speech models," Comput. Speech Lang., vol. 24, no. 1, pp. 16-29, 2010.
- (2010) Comput. Speech Lang , vol.24 , Issue.1 , pp. 16-29
- Weiss, R.¹ Ellis, D.²

36
- 70349335150
- Hoboken, NJ: Wiley-IEEE
- R. Xu and D. C. Wunsch, Clustering. Hoboken, NJ: Wiley-IEEE, 2009.
- (2009) Clustering
- Xu, R.¹ Wunsch, D.C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.