메뉴 건너뛰기




Volumn 15, Issue 1, 2007, Pages 135-149

Variational probabilistic speech separation using microphone arrays

Author keywords

Approximate inference; Microphone arrays; Phase based speech processing; Probabilistic graphical models; Robust speech recognition; Speech separation; Variational methods

Indexed keywords

APPROXIMATE INFERENCE; MICROPHONE ARRAYS; PHASE-BASED SPEECH PROCESSING; PROBABILISTIC GRAPHICAL MODELS; ROBUST SPEECH RECOGNITION; SPEECH SEPARATION; VARIATIONAL METHODS;

EID: 53349111173     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.876865     Document Type: Article
Times cited : (13)

References (48)
  • 2
    • 85009074657 scopus 로고    scopus 로고
    • Algonquin: Iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition
    • Sep
    • B. J. Frey, L. Deng, A. Acero, and T. Kristjansson, "Algonquin: Iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition," in Eurospeech, Sep. 2001, pp. 901-904.
    • (2001) Eurospeech , pp. 901-904
    • Frey, B.J.1    Deng, L.2    Acero, A.3    Kristjansson, T.4
  • 3
    • 84898688036 scopus 로고    scopus 로고
    • Speech denoising and dereverberation using probabilistic models
    • Dec
    • H. Attias, J. C. Platt, A. Acero, and L. Deng, "Speech denoising and dereverberation using probabilistic models," in Proc. NIPS, Dec. 2001, pp. 758-764.
    • (2001) Proc. NIPS , pp. 758-764
    • Attias, H.1    Platt, J.C.2    Acero, A.3    Deng, L.4
  • 4
    • 0141479057 scopus 로고    scopus 로고
    • Robust digit recognition using phase-dependent time-frequency masking
    • Apr
    • G. Shi and P. Aarabi, "Robust digit recognition using phase-dependent time-frequency masking," in Proc. ICASSP, Apr. 2003, pp. 684-687.
    • (2003) Proc. ICASSP , pp. 684-687
    • Shi, G.1    Aarabi, P.2
  • 5
    • 0042826822 scopus 로고    scopus 로고
    • Independent component analysis: Algorithms and applications
    • A. Hyvarinen and E. Oja, "Independent component analysis: Algorithms and applications," Neural Netw., vol. 113, no. 4, pp. 411-430, 2000.
    • (2000) Neural Netw , vol.113 , Issue.4 , pp. 411-430
    • Hyvarinen, A.1    Oja, E.2
  • 6
    • 85153956759 scopus 로고
    • A non-linear information maximization algorithm that performs blind separation
    • Dec
    • A. J. Bell and T. J. Sejnowski, "A non-linear information maximization algorithm that performs blind separation," in Proc. NIPS, Dec. 1995, pp. 467-474.
    • (1995) Proc. NIPS , pp. 467-474
    • Bell, A.J.1    Sejnowski, T.J.2
  • 8
    • 0011006550 scopus 로고    scopus 로고
    • ICA mixture models for unsupervised classification and automatic context switching
    • Jan
    • T. Lee, M. S. Lewicki, and T. S. Sejnowski, "ICA mixture models for unsupervised classification and automatic context switching," in Proc. ICA and BSS, Jan. 1999, pp. 209-214.
    • (1999) Proc. ICA and BSS , pp. 209-214
    • Lee, T.1    Lewicki, M.S.2    Sejnowski, T.S.3
  • 9
    • 0029725825 scopus 로고    scopus 로고
    • Blind separation of delayed sources based on information maximization
    • Apr
    • K. Torkkola, "Blind separation of delayed sources based on information maximization," in Proc. ICASSP, Apr. 1996, pp. 3509-3512.
    • (1996) Proc. ICASSP , pp. 3509-3512
    • Torkkola, K.1
  • 10
    • 0029723365 scopus 로고    scopus 로고
    • Blind separation of convolved sources based on information maximization
    • Sep
    • -, "Blind separation of convolved sources based on information maximization," in Proc. INNSP, Sep. 1996, pp. 423-432.
    • (1996) Proc. INNSP , pp. 423-432
    • Torkkola, K.1
  • 11
    • 0034207888 scopus 로고    scopus 로고
    • Aunifying information- theoretic framework for independent component analysis
    • Mar. 1-21
    • T.-W. Lee, M. Girolami, A. J. Bell, and T. J. Sejnowski, "Aunifying information- theoretic framework for independent component analysis," Comput. Math. With Applicat., vol. 31, no. 11, Mar. 1-21, 2000.
    • (2000) Comput. Math. With Applicat , vol.31 , Issue.11
    • Lee, T.-W.1    Girolami, M.2    Bell, A.J.3    Sejnowski, T.J.4
  • 12
    • 64149123497 scopus 로고    scopus 로고
    • Nonlinear independent component analysis using power series and application to blind source separation
    • Dec
    • Z. Xiong and T. S. Huang, "Nonlinear independent component analysis using power series and application to blind source separation," in Proc. ICA and BSS, Dec. 2001, pp. 680-685.
    • (2001) Proc. ICA and BSS , pp. 680-685
    • Xiong, Z.1    Huang, T.S.2
  • 13
    • 64149110500 scopus 로고    scopus 로고
    • Blind source separation of mixtures of speech signals with unknown propagation delays
    • P. De Leon and Y. Ma, "Blind source separation of mixtures of speech signals with unknown propagation delays," J. Acoust. Soc. Amer., vol. 108, no. 5, p. 8629, 2000.
    • (2000) J. Acoust. Soc. Amer , vol.108 , Issue.5 , pp. 8629
    • De Leon, P.1    Ma, Y.2
  • 14
    • 85009113586 scopus 로고    scopus 로고
    • Blind source separation for speech based on a fast-convergence algorithm with ICA and beamforming
    • Sep
    • K. Shikano, H. Saruwatari, and T. Kawamura, "Blind source separation for speech based on a fast-convergence algorithm with ICA and beamforming," in Eurospeech, Sep. 2001, pp. 2603-2606.
    • (2001) Eurospeech , pp. 2603-2606
    • Shikano, K.1    Saruwatari, H.2    Kawamura, T.3
  • 15
    • 34547525365 scopus 로고    scopus 로고
    • Learning dynamic noise models from noisy speech for robust speech recognition
    • Dec
    • B. J. Frey, T. Kristjansson, L. Deng, and A. Acero, "Learning dynamic noise models from noisy speech for robust speech recognition," in Proc. NIPS, Dec. 2001, pp. 1165-1171.
    • (2001) Proc. NIPS , pp. 1165-1171
    • Frey, B.J.1    Kristjansson, T.2    Deng, L.3    Acero, A.4
  • 16
    • 0033561886 scopus 로고    scopus 로고
    • Independent factor analysis
    • H. Attias, "Independent factor analysis," Neural Comput., vol. 11, no. 4, pp. 803-851, 1999.
    • (1999) Neural Comput , vol.11 , Issue.4 , pp. 803-851
    • Attias, H.1
  • 17
    • 64149107043 scopus 로고    scopus 로고
    • Source separation with a sensor array using graphical models and subband filtering
    • Dec
    • -, "Source separation with a sensor array using graphical models and subband filtering," in Proc. NIPS, Dec. 2002, pp. 1205-1212.
    • (2002) Proc. NIPS , pp. 1205-1212
  • 18
    • 85009070292 scopus 로고    scopus 로고
    • Large-vocabulary speech recognition under adverse acoustic environments
    • M. Plumpe, L. Deng, A. Acero, and X. Huang, "Large-vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, 2000, vol. 3, pp. 806-809.
    • (2000) Proc. ICSLP , vol.3 , pp. 806-809
    • Plumpe, M.1    Deng, L.2    Acero, A.3    Huang, X.4
  • 19
    • 0027192618 scopus 로고
    • Speaker adaptation based on map estimation of hmm parameters
    • C. H. Lee and J. L. Gauvain, "Speaker adaptation based on map estimation of hmm parameters," in Proc. ICASSP, 1993, pp. 558-561.
    • (1993) Proc. ICASSP , pp. 558-561
    • Lee, C.H.1    Gauvain, J.L.2
  • 20
    • 0032528695 scopus 로고    scopus 로고
    • Blind source separation and deconvolution: The dynamic component analysis algorithm
    • H. Attias and C. E. Schreiner, "Blind source separation and deconvolution: The dynamic component analysis algorithm," Neural Comput., vol. 10, no. 6, pp. 1373-1424, 1998.
    • (1998) Neural Comput , vol.10 , Issue.6 , pp. 1373-1424
    • Attias, H.1    Schreiner, C.E.2
  • 21
    • 0031258231 scopus 로고    scopus 로고
    • Blind separation of convolutive mixtures and an application in automatic speech recognition in noisy environment
    • Oct
    • F. Ehlers and H. Schuster, "Blind separation of convolutive mixtures and an application in automatic speech recognition in noisy environment," IEEE Trans. Signal Process., vol. 45, no. 10, pp. 2608-2609, Oct. 1997.
    • (1997) IEEE Trans. Signal Process , vol.45 , Issue.10 , pp. 2608-2609
    • Ehlers, F.1    Schuster, H.2
  • 22
    • 0036753896 scopus 로고    scopus 로고
    • Geometric source separation: Merging convolutive source separation with geometric beamforming
    • Sep
    • C. V. Alvino and L. C. Parra, "Geometric source separation: Merging convolutive source separation with geometric beamforming," IEEE Trans. Speech Audio Process., vol. 10, no. 6, pp. 352-362, Sep. 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.6 , pp. 352-362
    • Alvino, C.V.1    Parra, L.C.2
  • 23
    • 0002343530 scopus 로고    scopus 로고
    • Blind separation for audio signals-Are we there yet?
    • K. Torkkola, "Blind separation for audio signals-Are we there yet?," in Proc. ICA and BSS, 1999, pp. 239-244.
    • (1999) Proc. ICA and BSS , pp. 239-244
    • Torkkola, K.1
  • 24
    • 0031640099 scopus 로고    scopus 로고
    • On the use of explicit speech modeling in microphone array applications
    • May
    • M. S. Brandstein, "On the use of explicit speech modeling in microphone array applications," in Proc. ICASSP,May 1998, pp. 3613-3616.
    • (1998) Proc. ICASSP , pp. 3613-3616
    • Brandstein, M.S.1
  • 25
    • 85009083964 scopus 로고    scopus 로고
    • Speech/noise separation using two microphones and a VQ model of speech signals
    • Oct
    • A. Acero, S. Attschuler, and L. Wu, "Speech/noise separation using two microphones and a VQ model of speech signals," in Proc. ICSLP, Oct. 2000, pp. 3613-3616.
    • (2000) Proc. ICSLP , pp. 3613-3616
    • Acero, A.1    Attschuler, S.2    Wu, L.3
  • 28
    • 0033225865 scopus 로고    scopus 로고
    • An introduction to variational methods for graphical models
    • M. I. Jordan, Z. Ghahramani, T. Jaakkola, and L. K. Saul, "An introduction to variational methods for graphical models," Mach. Learning, vol. 37, no. 2, pp. 183-233, 1999.
    • (1999) Mach. Learning , vol.37 , Issue.2 , pp. 183-233
    • Jordan, M.I.1    Ghahramani, Z.2    Jaakkola, T.3    Saul, L.K.4
  • 29
    • 64149124004 scopus 로고    scopus 로고
    • D. J. C. MacKay, Introduction to Monte Carlo methods, in Learning in Graphical Models, ser.NATOSci., M. I. Jordan, Ed. Norwell, MA: Kluwer, 1998, pp. 175-204.
    • D. J. C. MacKay, "Introduction to Monte Carlo methods," in Learning in Graphical Models, ser.NATOSci., M. I. Jordan, Ed. Norwell, MA: Kluwer, 1998, pp. 175-204.
  • 33
    • 0037445320 scopus 로고    scopus 로고
    • P. Aarabi, The fusion of distributed microphone arrays tor sound localization, EURASIP J. Appl. Signal Process. (Special Issue.), 2003 No. 4:338:347, Mar. 2003.
    • P. Aarabi, "The fusion of distributed microphone arrays tor sound localization," EURASIP J. Appl. Signal Process. (Special Issue.), 2003 No. 4:338:347, Mar. 2003.
  • 34
    • 0003649791 scopus 로고
    • A framework for speech source localization using sensor arrays,
    • Ph.D. dissertation, Brown University, Providence, RI, May
    • M. S. Brandstein, "A framework for speech source localization using sensor arrays," Ph.D. dissertation, Brown University, Providence, RI, May 1995.
    • (1995)
    • Brandstein, M.S.1
  • 35
    • 64149109675 scopus 로고    scopus 로고
    • Adaptive time-frequency data fusion for speech enhancement
    • Jul
    • G. Shi, P. Aarabi, and N. Lazic, "Adaptive time-frequency data fusion for speech enhancement," in Proc. Inf. Fusion, Jul. 2003, pp. 394-399.
    • (2003) Proc. Inf. Fusion , pp. 394-399
    • Shi, G.1    Aarabi, P.2    Lazic, N.3
  • 37
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb
    • L. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 38
    • 0024735448 scopus 로고
    • A minimum discrimination information approach for hidden Markov modeling
    • Sep
    • Y. Ephraim and L. R. Rabiner, "A minimum discrimination information approach for hidden Markov modeling," IEEE Trans. Inf. Theory, vol. 35, no. 5, pp. 1001-1013, Sep. 1989.
    • (1989) IEEE Trans. Inf. Theory , vol.35 , Issue.5 , pp. 1001-1013
    • Ephraim, Y.1    Rabiner, L.R.2
  • 39
    • 4644369641 scopus 로고    scopus 로고
    • New EM algorithms for source separation and deconvolution
    • Apr
    • H. Attias, "New EM algorithms for source separation and deconvolution," in Proc. ICASSP, Apr. 2003, pp. 297-300.
    • (2003) Proc. ICASSP , pp. 297-300
    • Attias, H.1
  • 40
    • 0013208713 scopus 로고    scopus 로고
    • The application of spatial likelihood functions to multicamera object localization
    • Apr
    • P. Aarabi, "The application of spatial likelihood functions to multicamera object localization," in Proc. Sensor Fusion, Apr. 2001, pp. 255-265.
    • (2001) Proc. Sensor Fusion , pp. 255-265
    • Aarabi, P.1
  • 41
    • 0016990291 scopus 로고
    • The generalized correlation method for estimation of time delay
    • Aug
    • C. H. Knapp and G. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust., Speech Signal Process., vol. ASSP-24, no. 4, pp. 320-327, Aug. 1976.
    • (1976) IEEE Trans. Acoust., Speech Signal Process , vol.ASSP-24 , Issue.4 , pp. 320-327
    • Knapp, C.H.1    Carter, G.2
  • 42
    • 0141488670 scopus 로고    scopus 로고
    • Geometric overcomplete ICA
    • Apr
    • F. Theis and E. Lang, "Geometric overcomplete ICA," in Proc. ESANN, Apr. 2002, pp. 217-223.
    • (2002) Proc. ESANN , pp. 217-223
    • Theis, F.1    Lang, E.2
  • 43
    • 0003205588 scopus 로고
    • Fundamentals of Statistical Signal Processing
    • Englewood Cliffs, NJ: Prentice-Hall
    • S. Kay, Fundamentals of Statistical Signal Processing: Volume I: Estimation Theory. Englewood Cliffs, NJ: Prentice-Hall, 1993.
    • (1993) Estimation Theory , vol.1
    • Kay, S.1
  • 46
    • 0004267646 scopus 로고    scopus 로고
    • Princeton, NJ: Princeton Univ. Press
    • T. Roekafellar, Convex Analysis. Princeton, NJ: Princeton Univ. Press, 1996.
    • (1996) Convex Analysis
    • Roekafellar, T.1
  • 47
    • 27844587869 scopus 로고    scopus 로고
    • Underdetermined blind source separation using a probabilistic source sparsity model
    • Dec
    • L. Vielva, D. Erdogmus, and J. C. Procipe, "Underdetermined blind source separation using a probabilistic source sparsity model," in Proc. ICA and BSS, Dec. 2001, pp. 189-194.
    • (2001) Proc. ICA and BSS , pp. 189-194
    • Vielva, L.1    Erdogmus, D.2    Procipe, J.C.3
  • 48
    • 0032624821 scopus 로고    scopus 로고
    • Blind source separation of more sources than mixtures using overcomplete representations
    • Apr
    • T.-W. Lee, M. S. Lewicki, M. Girolami, and T. J. Sejnowski, "Blind source separation of more sources than mixtures using overcomplete representations," IEEE Signal Process. Lett., vol. 6, no. 4, pp. 87-90, Apr. 1999.
    • (1999) IEEE Signal Process. Lett , vol.6 , Issue.4 , pp. 87-90
    • Lee, T.-W.1    Lewicki, M.S.2    Girolami, M.3    Sejnowski, T.J.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.