메뉴 건너뛰기




Volumn 15, Issue 5, 2007, Pages 1564-1578

Adaptation of bayesian models for single-channel source separation and its application to voice/music separation in popular songs

Author keywords

Adaptive Wiener filtering; Bayesian model; Expectation maximization (EM); Gaussian mixture model (GMM); Maximum a posteriori (MAP); Model adaptation; Single channel source separation; Time frequency masking

Indexed keywords

ADAPTIVE WIENER FILTERING; BAYESIAN MODEL; EXPECTATION MAXIMIZATION (EM); GAUSSIAN MIXTURE MODEL (GMM); MAXIMUM A POSTERIORI (MAP); MODEL ADAPTATION; SINGLE-CHANNEL SOURCE SEPARATION; TIME-FREQUENCY MASKING;

EID: 51449094735     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.899291     Document Type: Article
Times cited : (163)

References (42)
  • 1
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • May
    • D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, no. 3, pp. 684-697, May 1999.
    • (1999) IEEE Trans. Neural Netw , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 2
    • 8344232372 scopus 로고    scopus 로고
    • A maximum likelihood approach to single-channel source separation
    • G.-J. Jang and T.-W. Lee, "A maximum likelihood approach to single-channel source separation," J. Mach. Learning Res., no. 4, pp. 1365-1392, 2003.
    • (2003) J. Mach. Learning Res , Issue.4 , pp. 1365-1392
    • Jang, G.-J.1    Lee, T.-W.2
  • 5
    • 84898946024 scopus 로고    scopus 로고
    • One microphone source separation
    • Cambridge, MA: MIT Press
    • S. T. Roweis, "One microphone source separation," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2001, vol. 13, pp. 793-799.
    • (2001) Advances in Neural Information Processing Systems , vol.13 , pp. 793-799
    • Roweis, S.T.1
  • 8
    • 85159664446 scopus 로고    scopus 로고
    • SINOLA: A new analysis/synthesis method using spectrum peak shape distortion, phase and reassigned spectrum
    • Oct
    • G. Peeters and X. Rodet, "SINOLA: A new analysis/synthesis method using spectrum peak shape distortion, phase and reassigned spectrum," in Proc. Int. Comput. Music Conf. (ICMC'99), Oct. 1999, pp. 153-156.
    • (1999) Proc. Int. Comput. Music Conf. (ICMC'99) , pp. 153-156
    • Peeters, G.1    Rodet, X.2
  • 13
    • 33947659500 scopus 로고    scopus 로고
    • Model-based monaural source separation using a vector-quantized phase-vocoder representation
    • Toulouse, France, May
    • D. Ellis and R.Weiss, "Model-based monaural source separation using a vector-quantized phase-vocoder representation," in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'06), Toulouse, France, May 2006, vol. 5, pp. 957-960.
    • (2006) IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'06) , vol.5 , pp. 957-960
    • Ellis, D.1    Weiss, R.2
  • 15
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc , vol.39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 17
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains
    • Apr
    • J. Gauvain and C. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.1    Lee, C.2
  • 18
    • 0000159105 scopus 로고    scopus 로고
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • Aug
    • C.-H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, no. 8, pp. 1241-1269, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1241-1269
    • Lee, C.-H.1    Huo, Q.2
  • 19
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • A. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., no. 10, pp. 19-41, 2000.
    • (2000) Digital Signal Process , Issue.10 , pp. 19-41
    • Reynolds, A.1    Quatieri, T.2    Dunn, R.3
  • 20
    • 0013288412 scopus 로고    scopus 로고
    • Dynamic Bayesian networks: Representation, inference and learning,
    • Ph.D. dissertation, Univ. California Berkeley, Berkeley, CA, Jul
    • K. P. Murphy, "Dynamic Bayesian networks: Representation, inference and learning," Ph.D. dissertation, Univ. California Berkeley, Berkeley, CA, Jul. 2002.
    • (2002)
    • Murphy, K.P.1
  • 22
    • 0009623939 scopus 로고
    • Flexible speaker adaptation using maximum likelihood linear regression
    • C. Leggetter and P.Woodland, "Flexible speaker adaptation using maximum likelihood linear regression," in ARPA Spoken Lang. Technol. Workshop, 1995, pp. 104-109.
    • (1995) ARPA Spoken Lang. Technol. Workshop , pp. 104-109
    • Leggetter, C.1    Woodland, P.2
  • 23
    • 0030359637 scopus 로고    scopus 로고
    • Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation
    • Philadelphia, PA
    • M. Gales, D. Pye, and P. Woodland, "Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP'96), Philadelphia, PA, 1996, vol. 3, pp. 1832-1835.
    • (1996) Proc. Int. Conf. Spoken Lang. Process. (ICSLP'96) , vol.3 , pp. 1832-1835
    • Gales, M.1    Pye, D.2    Woodland, P.3
  • 24
    • 0030640789 scopus 로고    scopus 로고
    • Structural MAP speaker adaptation using hierarchical priors
    • Santa Barbara, CA, Dec
    • K. Shinoda and C.-H. Lee, "Structural MAP speaker adaptation using hierarchical priors," in Proc. IEEE Workshop Speech Recognition Understanding, Santa Barbara, CA, Dec. 1997, pp. 381-388.
    • (1997) Proc. IEEE Workshop Speech Recognition Understanding , pp. 381-388
    • Shinoda, K.1    Lee, C.-H.2
  • 25
    • 85009097035 scopus 로고    scopus 로고
    • Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
    • Beijing, China, Oct
    • K.-T. Chen, W.-W. Liau, H.-M. Wang, and L.-S. Lee, "Fast speaker adaptation using eigenspace-based maximum likelihood linear regression," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP'00), Beijing, China, Oct. 2000, pp. 742-745.
    • (2000) Proc. Int. Conf. Spoken Lang. Process. (ICSLP'00) , pp. 742-745
    • Chen, K.-T.1    Liau, W.-W.2    Wang, H.-M.3    Lee, L.-S.4
  • 27
    • 0028420014 scopus 로고
    • Integrated models of signal and background with application to speaker identification in noise
    • Apr
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 245-257, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 28
    • 4444245782 scopus 로고    scopus 로고
    • Blind clustering of popular music recordings based on singer voice characteristics
    • W.-H. Tsai, D. Rogers, and H.-M. Wang, "Blind clustering of popular music recordings based on singer voice characteristics," Comput. Music J., vol. 28, no. 3, pp. 68-78, 2004.
    • (2004) Comput. Music J , vol.28 , Issue.3 , pp. 68-78
    • Tsai, W.-H.1    Rogers, D.2    Wang, H.-M.3
  • 30
    • 4444229791 scopus 로고    scopus 로고
    • Singer identification in popular music recordings using voice coding features
    • Oct
    • Y. E. Kim and B. Whitman, "Singer identification in popular music recordings using voice coding features," in Proc. Int. Symp. Music Inf. Retrieval (ISMIR'02), Oct. 2002, pp. 164-169.
    • (2002) Proc. Int. Symp. Music Inf. Retrieval (ISMIR'02) , pp. 164-169
    • Kim, Y.E.1    Whitman, B.2
  • 31
    • 13444291977 scopus 로고    scopus 로고
    • Singing voice detection in popular music
    • New York, Oct
    • T. L. Nwe, A. Shenoy, and Y. Wang, "Singing voice detection in popular music," in Proc. ACM Multimedia Conf., New York, Oct. 2004, pp. 324-327.
    • (2004) Proc. ACM Multimedia Conf , pp. 324-327
    • Nwe, T.L.1    Shenoy, A.2    Wang, Y.3
  • 32
    • 4544255234 scopus 로고    scopus 로고
    • Automatic detection and tracking of target singer in multi-singer music recordings
    • Montreal, QC, Canada
    • W. H. Tsai and H. M. Wang, "Automatic detection and tracking of target singer in multi-singer music recordings," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04),Montreal, QC, Canada, 2004, vol. 4, pp. 221-224.
    • (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04) , vol.4 , pp. 221-224
    • Tsai, W.H.1    Wang, H.M.2
  • 33
    • 0032595188 scopus 로고    scopus 로고
    • Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition
    • Sep
    • R. Vergin, D. O'Shaughnessy, and A. Farhat, "Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 525-532, Sep. 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.5 , pp. 525-532
    • Vergin, R.1    O'Shaughnessy, D.2    Farhat, A.3
  • 35
    • 0028517016 scopus 로고
    • Space-alternating generalized expectation- maximization algorithm
    • Oct
    • J. A. Fessler and A. O. Hero, "Space-alternating generalized expectation- maximization algorithm," IEEE Trans. Signal Process., vol. 42, no. 10, pp. 2664-2677, Oct. 1994.
    • (1994) IEEE Trans. Signal Process , vol.42 , Issue.10 , pp. 2664-2677
    • Fessler, J.A.1    Hero, A.O.2
  • 39
    • 0020102027 scopus 로고
    • Least squares quantization in PCM
    • Mar
    • S. P. Lloyd, "Least squares quantization in PCM," IEEE Trans. Inf. Theory, vol. IT-28, no. 2, pp. 129-137, Mar. 1982.
    • (1982) IEEE Trans. Inf. Theory , vol.IT-28 , Issue.2 , pp. 129-137
    • Lloyd, S.P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.