메뉴 건너뛰기




Volumn 19, Issue 2, 2011, Pages 242-255

Source-filter-based single-channel speech separation using pitch information

Author keywords

multi pitch estimation; Single channel speech separation (SCSS); sourcefilter representation

Indexed keywords

FAST APPROXIMATION; FILTER MODEL; FILTER-BASED; GAIN ESTIMATION; LIKELIHOOD COMPUTATION; LINEAR RELATIONSHIPS; MODEL-DRIVEN METHOD; NONNEGATIVE MATRIX FACTORIZATION; PITCH ESTIMATION; PITCH-TRACKING; SINGLE-CHANNEL; SOURCE SEPARATION; SOURCEFILTER REPRESENTATION; SPEECH SEPARATION; VOCAL-TRACTS;

EID: 78049306672     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2047419     Document Type: Article
Times cited : (64)

References (34)
  • 2
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Sep
    • G. Hu and D. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation", IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
    • (2004) IEEE Trans. Neural. Netw. , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.1    Wang, D.2
  • 3
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • May
    • D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation", IEEE Trans. Neural Netw., vol. 10, no. 3, pp. 684-697, May 1999.
    • (1999) IEEE Trans. Neural. Netw. , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 5
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary mask as the computational goal of auditory scene analysis
    • 1st ed. New York: Springer, Nov
    • D. Wang, "On ideal binary mask as the computational goal of auditory scene analysis", in Speech Separation by Humans and Machines, 1st ed. New York: Springer, Nov. 2005, p. 319.
    • (2005) Speech Separation by Humans and Machines , pp. 319
    • Wang, D.1
  • 7
    • 85009230793 scopus 로고    scopus 로고
    • Factorial models and refiltering for speech separation and denoising
    • sep
    • S. T. Roweis, "Factorial models and refiltering for speech separation and denoising", in Proc. Eurospeech, Sep. 2003, pp. 1009-1012.
    • (2003) Proc. Eurospeech , pp. 1009-1012
    • Roweis, S.T.1
  • 8
    • 0038705102 scopus 로고    scopus 로고
    • One microphone source separation
    • S. T. Roweis, "One microphone source separation", Neural Inf. Process. Syst., vol. 13, pp. 793-799, 2000.
    • (2000) Neural. Inf. Process. Syst. , vol.13 , pp. 793-799
    • Roweis, S.T.1
  • 9
    • 0033592606 scopus 로고    scopus 로고
    • Learning the parts of objects by nonnegative matrix factorization
    • D. D. Lee and H. S. Seung, "Learning the parts of objects by nonnegative matrix factorization", Nature, vol. 401, p. 788, 1999.
    • (1999) Nature , vol.401 , pp. 788
    • Lee, D.D.1    Seung, H.S.2
  • 11
    • 44949258898 scopus 로고    scopus 로고
    • Super-human multi-talker speech recognition: The IBM 2006 speech separation challenge system
    • T. Kristjansson, J. Hershey, P. Olsen, S. Rennie, and R. Gopinath, "Super-human multi-talker speech recognition: The IBM 2006 speech separation challenge system", in Proc. Interspeech, 2006, no. 1775.
    • (2006) Proc. Interspeech , Issue.1775
    • Kristjansson, T.1    Hershey, J.2    Olsen, P.3    Rennie, S.4    Gopinath, R.5
  • 12
    • 33750368310 scopus 로고    scopus 로고
    • An audiovisual corpus for speech perception and automatic speech recognition
    • M. P. Cooke, J. Barker, S. P. Cunningham, and X. Shao, "An audiovisual corpus for speech perception and automatic speech recognition", J. Acoust. Soc. Amer., vol. 120, no. 5, pp. 2421-2424, 2006.
    • (2006) J. Acoust. Soc. Amer. , vol.120 , Issue.5 , pp. 2421-2424
    • Cooke, M.P.1    Barker, J.2    Cunningham, S.P.3    Shao, X.4
  • 13
    • 33845940172 scopus 로고    scopus 로고
    • A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation
    • M. H. Radfar, R. M. Dansereau, and A. Sayadiyan, "A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation", J. Audio, Speech, Music Process., vol. 1, p. 15, 2007.
    • (2007) J. Audio, Speech, Music Process. , vol.1 , pp. 15
    • Radfar, M.H.1    Dansereau, R.M.2    Sayadiyan, A.3
  • 14
    • 0037767686 scopus 로고    scopus 로고
    • A multipitch tracking algorithm for noisy speech
    • Mar
    • M. Wu, D. Wang, and G. Brown, "A multipitch tracking algorithm for noisy speech", IEEE Trans. Speech Audio Process, vol. 11, no. 3, pp. 229-241, Mar. 2003.
    • (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.3 , pp. 229-241
    • Wu, M.1    Wang, D.2    Brown, G.3
  • 15
    • 0030846123 scopus 로고    scopus 로고
    • A unitary model of pitch perception
    • R. Meddis and L. O'Mard, "A unitary model of pitch perception", J. Acoust Soc. Amer., vol. 102, no. 3, pp. 1811-1820, 1997.
    • (1997) J. Acoust Soc. Amer. , vol.102 , Issue.3 , pp. 1811-1820
    • Meddis, R.1    O'Mard, L.2
  • 16
    • 0031268341 scopus 로고    scopus 로고
    • Factorial hidden Markov models
    • Z. Ghahramani and M. Jordan, "Factorial hidden Markov models", Mach. Learn., vol. 29, no. 2-3, pp. 245-273, 1997.
    • (1997) Mach. Learn. , vol.29 , Issue.2-3 , pp. 245-273
    • Ghahramani, Z.1    Jordan, M.2
  • 17
  • 18
    • 84867209792 scopus 로고    scopus 로고
    • Multipitch tracking using a factorial hidden Markov model
    • M. Wohlmayr and F. Pernkopf, "Multipitch tracking using a factorial hidden Markov model", in Proc. Interspeech, 2008.
    • (2008) Proc. Interspeech
    • Wohlmayr, M.1    Pernkopf, F.2
  • 19
    • 0001455934 scopus 로고
    • A robust algorithm for pitch tracking
    • Amsterdam, The Netherlands: Elsevier
    • D. Talkin, "A robust algorithm for pitch tracking", in Speech Coding and Synthesis. Amsterdam, The Netherlands: Elsevier, 1995, pp. 495-518.
    • (1995) Speech Coding and Synthesis , pp. 495-518
    • Talkin, D.1
  • 20
    • 24344483148 scopus 로고    scopus 로고
    • Genetic-based EM algorithm for learning Gaussian mixture models
    • Aug
    • F. Pernkopf and D. Bouchaffra, "Genetic-based EM algorithm for learning Gaussian mixture models", IEEE Trans. Pattern Anal Mach. Intell., vol. 27, no. 8, pp. 1344-1348, Aug. 2005.
    • (2005) IEEE Trans. Pattern Anal. Mach. Intell. , vol.27 , Issue.8 , pp. 1344-1348
    • Pernkopf, F.1    Bouchaffra, D.2
  • 21
    • 70450177302 scopus 로고    scopus 로고
    • Finite mixture spectrogram modeling for multipitch tracking using a factorial hidden Markov model
    • M. Wohlmayr and F. Pernkopf, "Finite mixture spectrogram modeling for multipitch tracking using a factorial hidden Markov model", in Proc. Interspeech, 2009.
    • (2009) Proc. Interspeech
    • Wohlmayr, M.1    Pernkopf, F.2
  • 23
    • 0035246564 scopus 로고    scopus 로고
    • Factor graphs and the sum-product algorithm
    • Feb
    • F. Kschischang, B. Frey, and H.-A. Loeliger, "Factor graphs and the sum-product algorithm", IEEE Trans. Inf. Theory, vol. 47, no. 2, pp. 498-519, Feb. 2001.
    • (2001) IEEE Trans. Inf. Theory , vol.47 , Issue.2 , pp. 498-519
    • Kschischang, F.1    Frey, B.2    Loeliger, H.-A.3
  • 24
    • 0002629270 scopus 로고
    • Maximum likelihood estimation from incomplete data via the EM algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood estimation from incomplete data via the EM algorithm", J. R. Statist. Soc, vol. B39, no. B, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.B39 , Issue.B , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 29
    • 0001935942 scopus 로고
    • Berlin, Germany: Elsevier, ch. 4, Sinusoidal Coding
    • R. McAulay and T. Quatieri, Speech Coding and Synthesis. Berlin, Germany: Elsevier, 1995, ch. 4, pp. 121-173, Sinusoidal Coding.
    • (1995) Speech Coding and Synthesis , pp. 121-173
    • McAulay, R.1    Quatieri, T.2
  • 31
    • 38049021850 scopus 로고    scopus 로고
    • Convolutive speech bases and their application to supervised speech separation
    • Jan
    • P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation", IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 1, pp. 1-12, Jan. 2007.
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.1 , pp. 1-12
    • Smaragdis, P.1
  • 32
    • 67349134831 scopus 로고    scopus 로고
    • Sequential organization of speech in computational auditory scene analysis
    • Aug
    • Y. Shao and D. Wang, "Sequential organization of speech in computational auditory scene analysis", Speech Commun., vol. 51, no. 8, pp. 657-667, Aug. 2009.
    • (2009) Speech Commun. , vol.51 , Issue.8 , pp. 657-667
    • Shao, Y.1    Wang, D.2
  • 33
    • 0027297381 scopus 로고
    • Vector quantization for the efficient computation of continuous density likelihoods
    • E. Bocchieri, "Vector quantization for the efficient computation of continuous density likelihoods", in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1993, vol. 2, pp. 692-695.
    • (1993) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.2 , pp. 692-695
    • Bocchieri, E.1
  • 34
    • 78049277624 scopus 로고    scopus 로고
    • On optimizing the computational complexity for VQ-based single channel source separation
    • Dallas, TX
    • M. Stark and F. Pernkopf, "On optimizing the computational complexity for VQ-based single channel source separation", in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Dallas, TX, 2010, pp. 237-240.
    • (2010) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 237-240
    • Stark, M.1    Pernkopf, F.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.