메뉴 건너뛰기




Volumn 27, Issue 6, 2010, Pages 66-80

Single-channel multitalker speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

AUDITION; DEEP NEURAL NETWORKS;

EID: 85032751986     PISSN: 10535888     EISSN: None     Source Type: Journal    
DOI: 10.1109/MSP.2010.938081     Document Type: Article
Times cited : (86)

References (49)
  • 3
    • 69249231059 scopus 로고    scopus 로고
    • Speech fragment decoding techniques for simultaneous speaker identification and speech recognition
    • Jan.
    • J. Barker, N. Ma, A. Coy, and M. Cooke, "Speech fragment decoding techniques for simultaneous speaker identification and speech recognition," Comput. Speech Lang., vol. 24, no. 1, pp. 94-111, Jan. 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 94-111
    • Barker, J.1    Ma, N.2    Coy, A.3    Cooke, M.4
  • 4
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, no. 3, pp. 267-285, 2001.
    • (2001) Speech Commun. , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 5
    • 69249115826 scopus 로고    scopus 로고
    • Combining missing-feature theory, speech enhancement, and speaker-dependent/independent modeling for speech separation
    • Jan.
    • J. Ming, T. Hazen, and J. Glass, "Combining missing-feature theory, speech enhancement, and speaker-dependent/independent modeling for speech separation," Comput. Speech Lang., vol. 24, no. 1, pp. 67-76, Jan. 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 67-76
    • Ming, J.1    Hazen, T.2    Glass, J.3
  • 6
    • 33749317042 scopus 로고    scopus 로고
    • Learning spectral clustering, with application to speech separation
    • Oct.
    • F. R. Bach and M. I. Jordan, "Learning spectral clustering, with application to speech separation," J. Mach. Learn. Res., vol. 7, pp. 1963-2001, Oct. 2006.
    • (2006) J. Mach. Learn. Res. , vol.7 , pp. 1963-2001
    • Bach, F.R.1    Jordan, M.I.2
  • 7
    • 44949110218 scopus 로고    scopus 로고
    • Single-channel speech separation using sparse non-negative matrixfactorization
    • Spoken Language Processing (ICSLP'06), Pittsburgh, PA, Sept.
    • M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrixfactorization," in Proc. ISCA 9th Int. Conf. Spoken Language Processing (ICSLP'06), Pittsburgh, PA, Sept. 2006, pp. 2614-2617.
    • (2006) Proc. ISCA 9th Int. Conf. , pp. 2614-2617
    • Schmidt, M.N.1    Olsson, R.K.2
  • 8
    • 8344232372 scopus 로고    scopus 로고
    • A maximum likelihood approach to single-channel source separation
    • Dec.
    • G.-J. Jang and T.-W. Lee, "A maximum likelihood approach to single-channel source separation," J. Mach. Learn. Res., vol. 4, pp. 1365-1392, Dec. 2003.
    • (2003) J. Mach. Learn. Res. , vol.4 , pp. 1365-1392
    • Jang, G.-J.1    Lee, T.-W.2
  • 9
    • 51449100115 scopus 로고    scopus 로고
    • Efficient model-based speech separation and denoising using non-negative subspace analysis
    • Speechand Signal Processing (ICASSP ' 0 8 ) , Las Vegas, N V Apr.
    • S. J. Rennie, J. R. Hershey, and P. A. Olsen, "Efficient model-based speech separation and denoising using non-negative subspace analysis," in Proc. IEEE Int. Conf. Acoustics, Speechand Signal Processing (ICASSP ' 0 8 ) , Las Vegas, N V, Apr. 2008, pp. 1833-1836.
    • (2008) Proc. IEEE Int. Conf. Acoustics , pp. 1833-1836
    • Rennie, S.J.1    Hershey, J.R.2    Olsen, P.A.3
  • 11
    • 50249152311 scopus 로고    scopus 로고
    • Monaural sound source separation by nonnegative matrix fac torization with temporal continuity and sparseness criteria
    • Mar.
    • [II] T. Virtanen, "Monaural sound source separation by nonnegative matrix fac torization with temporal continuity and sparseness criteria," IEEE Trans. Audio Speech Lang. Processing, vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
    • (2007) IEEE Trans. Audio Speech Lang. Processing , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 15
    • 0026843273 scopus 로고
    • A Bayesian estimation approach for speech enhancement using hidden Markov models
    • Apr.
    • Y. Ephraim, "A Bayesian estimation approach for speech enhancement using hidden Markov models," IEEE Trans. Signal Processing, vol. 40, no. 4, pp. 725-735, Apr. 1992.
    • (1992) IEEE Trans. Signal Processing , vol.40 , Issue.4 , pp. 725-735
    • Ephraim, Y.1
  • 16
    • 84899023315 scopus 로고    scopus 로고
    • ALGONQUIN\-learning dynamic noise models from noisy speech for robust speech recognition
    • Vancouver, British Columbia, Canada, Dec.
    • B. J. Frey, T. T. Kristjansson, L. Deng, and A. Acero, "ALGONQUIN\-learning dynamic noise models from noisy speech for robust speech recognition," in Proc. Neural Information Processing Systems (NIPS'01), Vancouver, British Columbia, Canada, Dec. 2001, pp. 1165-1171.
    • (2001) Proc. Neural Information Processing Systems (NIPS'01) , pp. 1165-1171
    • Frey, B.J.1    Kristjansson, T.T.2    Deng, L.3    Acero, A.4
  • 18
    • 44849140301 scopus 로고    scopus 로고
    • Speech recognition using factorial hidden Markov models for separation in the feature space
    • Pittsburgh, PA Sept.
    • T. Virtanen, "Speech recognition using factorial hidden Markov models for separation in the feature space," in Proc. ISCA 9th Int. Conf. Spoken Language Processing (ICSLP'06), Pittsburgh, PA, Sept. 2006, pp. 89-92.
    • (2006) Proc. ISCA 9th Int. Conf. Spoken Language Processing (ICSLP'06) , pp. 89-92
    • Virtanen, T.1
  • 20
    • 69249222720 scopus 로고    scopus 로고
    • Super-human multi-talker speech recognition: A graphical modeling approach
    • Jan.
    • J. R. Hershey, S. J. Rennie, P. A. Olsen, and T. T. Kristjansson, "Super-human multi-talker speech recognition: A graphical modeling approach," Comput. Speech Lang., vol. 24, no. 1, pp. 45-66, Jan. 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 45-66
    • Hershey, J.R.1    Rennie, S.J.2    Olsen, P.A.3    Kristjansson, T.T.4
  • 21
    • 69249202377 scopus 로고    scopus 로고
    • Monaural speech separation and recognition challenge
    • Jan.
    • M. Cooke, J. R. Hershey, and S. J. Rennie, "Monaural speech separation and recognition challenge," Comput. Speech Lang., vol. 24, no. 1, pp. 1-15, Jan. 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 1-15
    • Cooke, M.1    Hershey, J.R.2    Rennie, S.J.3
  • 22
    • 85156254941 scopus 로고
    • Factorial hidden Markov models
    • D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds. Cambridge, MA: MIT Press
    • Z. Ghahramani and M. I. Jordan, "Factorial hidden Markov models," in Proc. Advances in Neural Information Processing Systems (NIPS'95), vol. 8., D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds. Cambridge, MA: MIT Press, 1995, pp. 472-478.
    • (1995) Proc. Advances in Neural Information Processing Systems (NIPS'95) , vol.8 , pp. 472-478
    • Ghahramani, Z.1    Jordan, M.I.2
  • 24
    • 0035246323 scopus 로고    scopus 로고
    • On the optimality of solutions of the max-product belief-propagation algorithm in arbitrary graphs
    • Freeman Feb.
    • Y. Weiss and W. T. Freeman, "On the optimality of solutions of the max-product belief-propagation algorithm in arbitrary graphs," IEEE Trans. Inform. Theory, vol. 47, no. 2, pp. 736-744, Feb. 2001.
    • (2001) IEEE Trans. Inform. Theory , vol.47 , Issue.2 , pp. 736-744
    • Weiss, Y.1    Freeman, W.T.2
  • 25
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. A. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.A.1
  • 26
    • 0000342467 scopus 로고
    • Statistical inference for probabilistic functions of finite state Markov chains
    • L. E. Baum and T. Petrie, "Statistical inference for probabilistic functions of finite state Markov chains," Ann. Math. Statist., vol. 37, no. 6, pp. 1554-1563, 1966.
    • (1966) Ann. Math. Statist. , vol.37 , Issue.6 , pp. 1554-1563
    • Baum, L.E.1    Petrie, T.2
  • 27
    • 0016939124 scopus 로고
    • Continuous speech recognition by statistical methods
    • Apr.
    • F. Jelinek, "Continuous speech recognition by statistical methods," Proc. IEEE, vol. 64, no. 4, pp. 532-556, Apr. 1976.
    • (1976) Proc. IEEE , vol.64 , Issue.4 , pp. 532-556
    • Jelinek, F.1
  • 28
    • 65749118363 scopus 로고    scopus 로고
    • Graphical models, exponential families, and variational inference
    • M. J. Wainwright and M. I. Jordan, "Graphical models, exponential families, and variational inference," Found. Trends Mach. Learn., vol. 1, no. 1-2, pp. 1-305, 2008.
    • (2008) Found. Trends Mach. Learn. , vol.1 , Issue.1-2 , pp. 1-305
    • Wainwright, M.J.1    Jordan, M.I.2
  • 31
    • 0036602306 scopus 로고    scopus 로고
    • New analytical models and probability density functions for fading in wireless communications
    • G. D. Durgin, T. S. Rappaport, and D. A. De Wolf, "New analytical models and probability density functions for fading in wireless communications," IEEE Trans. Commun., vol. 50, no. 6, pp. 1005-1015, 2002.
    • (2002) IEEE Trans. Commun. , vol.50 , Issue.6 , pp. 1005-1015
    • Durgin, G.D.1    Rappaport, T.S.2    De Wolf, D.A.3
  • 32
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust. Speech Signal Processing, vol. 27, no. 3, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust. Speech Signal Processing , vol.27 , Issue.3 , pp. 113-120
    • Boll, S.F.1
  • 33
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environment-independent speech recognition
    • Atlanta, GA, May
    • P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment-independent speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, Atlanta, GA, May 1996, vol. 2, pp. 733-736.
    • (1996) Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing , vol.2 , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 34
    • 2142756950 scopus 로고    scopus 로고
    • Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise
    • Mar.
    • L. Deng, J. Droppo, and A. Acero, "Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise," IEEE Trans. Speech Audio Processing, vol. 12, no. 2, pp. 133-143, Mar. 2004.
    • (2004) IEEE Trans. Speech Audio Processing , vol.12 , Issue.2 , pp. 133-143
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 35
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speech recognition using parallel model combination
    • Sept.
    • M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Processing, vol. 4, no. 5, pp. 352-359, Sept. 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 39
    • 33745112980 scopus 로고    scopus 로고
    • Nonlinear minimum mean square error estimator for mixture-maximisation approximation
    • June
    • M. H. Radfar, A. H. Banihashemi, R. M. Dansereau, and A. Sayadiyan, "Nonlinear minimum mean square error estimator for mixture-maximisation approximation," Electron. Lett., vol. 42, no. 12, pp. 724-725, June 2006.
    • (2006) Electron. Lett. , vol.42 , Issue.12 , pp. 724-725
    • Radfar, M.H.1    Banihashemi, A.H.2    Dansereau, R.M.3    Sayadiyan, A.4
  • 41
    • 0033225865 scopus 로고    scopus 로고
    • An introduction to variational methods for graphical models
    • Nov.
    • M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul, "An introduction to variational methods for graphical models," Mach. Learn., vol. 37, no. 2, pp. 183-233, Nov. 1999.
    • (1999) Mach. Learn. , vol.37 , Issue.2 , pp. 183-233
    • Jordan, M.I.1    Ghahramani, Z.2    Jaakkola, T.S.3    Saul, L.K.4
  • 43
    • 85032766541 scopus 로고    scopus 로고
    • Incorporating expressive graphical models in variational approximations: Chain-graphs and hidden variables
    • T. El-Hay and N. Friedman, "Incorporating expressive graphical models in variational approximations: Chain-graphs and hidden variables," in Proc. 18th Conf. Uncertainty and Artificial Intelligence, 2002, pp. 136-143.
    • (2002) Proc. 18th Conf. Uncertainty and Artificial Intelligence , pp. 136-143
    • El-Hay, T.1    Friedman, N.2
  • 45
  • 49
    • 85032751937 scopus 로고    scopus 로고
    • Dynamic graphical models
    • Nov.
    • J. Bilmes, "Dynamic graphical models," IEEE Signal Processing Mag., vol. 27, no. 6, pp. 29-42, Nov. 2010.
    • (2010) IEEE Signal Processing Mag. , vol.27 , Issue.6 , pp. 29-42
    • Bilmes, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.