



Volume , Issue , 2008, Pages

Distant speech recognition: No black boxes allowed

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH COMMUNICATION;

EID: 85091829120     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1)

References (37)
  • 1
    • 0018455820
    • Image method for efficiently simulating small-room acoustics
    • J. B. Allen and D. A. Berkley. Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am., 65(4):943-950, April 1979.
    • (1979) J. Acoust. Soc. Am , vol.65 , Issue.4 , pp. 943-950
    • Allen, J. B.1    Berkley, D. A.2
  • 3
    • 0015403739
    • Atmospheric absorption of sound: analytical expression
    • H. E. Bass, H.-J. Bauer, and L. B. Evans. Atmospheric absorption of sound: analytical expression. Jour. of ASA, pages 821-825, 1972.
    • (1972) Jour. of ASA , pp. 821-825
    • Bass, H. E.1    Bauer, H.-J.2    Evans, L. B.3
  • 4
    • 0003980102
    • Microphone Arrays
    • M. Brandstein and D. Ward, editors. Microphone Arrays. Springer Verlag, Heidelberg, Germany, 2001.
    • (2001) Microphone Arrays
    • Brandstein, M.1    Ward, D.2
  • 9
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales. Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language, 12, 1998.
    • (1998) Computer Speech and Language , vol.12
    • Gales, M. J. F.1
  • 11
    • 40249109687
    • Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters
    • T. Gehrig, U. Klee, J. McDonough, S. Ikbal, M. Wölfel, and C. Fügen. Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. In Proc. Interspeech, pages 2594-2597, 2006.
    • (2006) Proc. Interspeech , pp. 2594-2597
    • Gehrig, T.1    Klee, U.2    McDonough, J.3    Ikbal, S.4    Wölfel, M.5    Fügen, C.6
  • 12
    • 0042826822
    • Independent component analysis: Algorithms and applications
    • A. Hyvärinen and E. Oja. Independent component analysis: Algorithms and applications. Neural Networks, 13:411-430, 2000.
    • (2000) Neural Networks , vol.13 , pp. 411-430
    • Hyvärinen, A.1    Oja, E.2
  • 17
    • 51449092343
    • Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming
    • K. Kumatani, J. McDonough, S. Schacht, D. Klakow, P. N. Garner, and W. Li. Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming. In Proc. ICASSP, 2008.
    • (2008) Proc. ICASSP
    • Kumatani, K.1    McDonough, J.2    Schacht, S.3    Klakow, D.4    Garner, P. N.5    Li, W.6
  • 19
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9:171-185, April 1995.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C. J.1    Woodland, P. C.2
  • 20
    • 33846217002
    • The multi-channel Wall Street Journal audio visual corpus (mc-wsj-av): Specification and initial experiments
    • M. Lincoln, I. McCowan, I. Vepa, and H. K. Maganti. The multi-channel Wall Street Journal audio visual corpus (mc-wsj-av): Specification and initial experiments. In Proc. ASRU, pages 357-362, 2005.
    • (2005) Proc. ASRU , pp. 357-362
    • Lincoln, M.1    McCowan, I.2    Vepa, I.3    Maganti, H. K.4
  • 21
    • 0032072917
    • Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering
    • C. Marro, Y. Mahieux, and K. U. Simmer. Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering. IEEE Transactions on Speech and Audio Processing, 6:240-259, 1998.
    • (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , pp. 240-259
    • Marro, C.1    Mahieux, Y.2    Simmer, K. U.3
  • 22
    • 27644556974
    • Speech enhancement based on minimum mean-square error estimation and supergaussian priors
    • R. Martin. Speech enhancement based on minimum mean-square error estimation and supergaussian priors. IEEE Trans. Speech Audio Proc., 13(5):845-856, Sept. 2005.
    • (2005) IEEE Trans. Speech Audio Proc , vol.13 , Issue.5 , pp. 845-856
    • Martin, R.1
  • 24
    • 44849112578
    • An algorithm for fast composition of weighted finite-state transducers
    • J. McDonough, E. Stoimenov, and D. Klakow. An algorithm for fast composition of weighted finite-state transducers. In Proc. ASRU, December 2007.
    • (2007) Proc. ASRU
    • McDonough, J.1    Stoimenov, E.2    Klakow, D.3
  • 26
    • 0027634633
    • Proper complex random processes with applications to information theory
    • F. D. Neeser and J. L. Massey. Proper complex random processes with applications to information theory. IEEE Trans. Info. Theory, 39(4):1293-1302, July 1993.
    • (1993) IEEE Trans. Info. Theory , vol.39 , Issue.4 , pp. 1293-1302
    • Neeser, F. D.1    Massey, J. L.2
  • 27
    • 56149112846
    • Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria
    • T. Nishiura, Y. Hirano, Y. Denda, and M. Nakayama. Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria. In Proc. of Interspeech, 2007.
    • (2007) Proc. of Interspeech
    • Nishiura, T.1    Hirano, Y.2    Denda, Y.3    Nakayama, M.4
  • 28
    • 78649242855
    • A cepstral domain maximum likelihood beamformer for speech recognition
    • D. Raub, J. McDonough, and M. Wölfel. A cepstral domain maximum likelihood beamformer for speech recognition. In Proc. Interspeech, 2004.
    • (2004) Proc. Interspeech
    • Raub, D.1    McDonough, J.2    Wölfel, M.3
  • 30
    • 4344607755
    • Likelihood-maximizing beamforming for robust hands-free speech recognition
    • M. L. Seltzer, B. Raj, and R. M. Stern. Likelihood-maximizing beamforming for robust hands-free speech recognition. IEEE Trans. Speech Audio Proc., 12(5):489-498, September 2004.
    • (2004) IEEE Trans. Speech Audio Proc , vol.12 , Issue.5 , pp. 489-498
    • Seltzer, M. L.1    Raj, B.2    Stern, R. M.3
  • 31
    • 0034855183
    • Improvements in linear transform based speaker adaptation
    • L. Uebel and P. Woodland. Improvements in linear transform based speaker adaptation. In Proc. ICASSP, 2001.
    • (2001) Proc. ICASSP
    • Uebel, L.1    Woodland, P.2
  • 34
    • 0036753897
    • Speaker adaptive modeling by vocal tract normalization
    • L. Welling, H. Ney, and S. Kanthak. Speaker adaptive modeling by vocal tract normalization. IEEE Trans. Speech Audio Proc., 10(6):415-426, 2002.
    • (2002) IEEE Trans. Speech Audio Proc , vol.10 , Issue.6 , pp. 415-426
    • Welling, L.1    Ney, H.2    Kanthak, S.3
  • 36
    • 50449110561
    • A joint particle filter and multi-step linear prediction framework to provide enhanced speech features prior to automatic recognition
    • M. Wölfel. A joint particle filter and multi-step linear prediction framework to provide enhanced speech features prior to automatic recognition. In Proc. Hands-Free Speech Communication and Microphone Arrays, Trento, Italy, May 2008.
    • (2008) Proc. Hands-Free Speech Communication and Microphone Arrays
    • Wölfel, M.1


* This information was extracted from Elsevier's SCOPUS database through analysis by KISTI.