SCOPUS Information Retrieval Platform

Sprachkommunikation 2008 - 8. ITG-Fachtagung

Volume , Issue , 2008, Pages

Distant speech recognition: No black boxes allowed

(6) McDonough, John a Wölfel, Matthias b Kumatani, Kenichi a Rauch, Barbara a Faubel, Friedrich a Klakow, Dietrich a

a SAARLAND UNIVERSITY (Germany)

b UNIVERSITY OF KARLSRUHE (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH COMMUNICATION;

BLACK BOXES; COMPLETE SYSTEM; DISTANT SPEECH RECOGNITION; INDIVIDUAL COMPONENTS; OPTIMAL PERFORMANCE;

SPEECH RECOGNITION;

EID: 85091829120 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (1)

References (37)

1
- 0018455820
- Image method for efficiently simulating small-room acoustics
- April
- J. B. Allen and D. A. Berkley. Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am., 65(4):943-950, April 1979.
- (1979) J. Acoust. Soc. Am , vol.65 , Issue.4 , pp. 943-950
- Allen, J. B.¹ Berkley, D. A.²

2
- 0030362995
- A compact model for speaker-adaptive training
- T. Anastasakos, J. McDonough, R. Schwarz, and J. Makhoul. A compact model for speaker-adaptive training. In Proc. ICSLP, pages 1137-1140, 1996.
- (1996) Proc. ICSLP , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwarz, R.³ Makhoul, J.⁴

3
- 0015403739
- Atmospheric absorption of sound: analytical expression
- H. E. Bass, H.-J. Bauer, and L. B. Evans. Atmospheric absorption of sound: analytical expression. Jour. of ASA, pages 821-825, 1972.
- (1972) Jour. of ASA , pp. 821-825
- Bass, H. E.¹ Bauer, H.-J.² Evans, L. B.³

4
- 0003980102
- editors. Springer Verlag, Heidelberg, Germany
- M. Brandstein and D. Ward, editors. Microphone Arrays. Springer Verlag, Heidelberg, Germany, 2001.
- (2001) Microphone Arrays
- Brandstein, M.¹ Ward, D.²

5
- 4544262407
- Blind source separation for convolutive mixtures: A unified treatment
- Kluwer Academic, Boston
- H. Buchner, R. Aichner, and W. Kellermann. Blind source separation for convolutive mixtures: A unified treatment. In Audio Signal Processing for Next-Generation Multimedia Communication Systems, pages 255-289. Kluwer Academic, Boston, 2004.
- (2004) Audio Signal Processing for Next-Generation Multimedia Communication Systems , pp. 255-289
- Buchner, H.¹ Aichner, R.² Kellermann, W.³

6
- 0037229734
- Filter bank design for subband adaptive microphone arrays
- Jan
- J. M. de Haan, N. Grbic, I. Claesson, and S. E. Nordholm. Filter bank design for subband adaptive microphone arrays. IEEE Trans. Speech Audio Proc., 11(1):14-23, Jan. 2003.
- (2003) IEEE Trans. Speech Audio Proc , vol.11 , Issue.1 , pp. 14-23
- de Haan, J. M.¹ Grbic, N.² Claesson, I.³ Nordholm, S. E.⁴

7
- 0003424145
- Macmillan Publishing, New York
- J. Deller, J. Hansen, and J. Proakis. Discrete-Time Processing of Speech Signals. Macmillan Publishing, New York, 1993.
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.¹ Hansen, J.² Proakis, J.³

8
- 51449104842
- Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors
- J. S. Erkelens, R. C. Hendriks, R. Heusdens, and J. Jensen. Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors. IEEE Transactions on Audio, Speech and Language Processing, 15:1741-1752, 2007.
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , pp. 1741-1752
- Erkelens, J. S.¹ Hendriks, R. C.² Heusdens, R.³ Jensen, J.⁴

9
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales. Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language, 12, 1998.
- (1998) Computer Speech and Language , vol.12
- Gales, M. J. F.¹

10
- 0003887760
- John Wiley & Sons, New York
- R. G. Gallager. Information Theory and Reliable Communication. John Wiley & Sons, New York, 1968.
- (1968) Information Theory and Reliable Communication
- Gallager, R. G.¹

11
- 40249109687
- Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters
- T. Gehrig, U. Klee, J. McDonough, S. Ikbal, M. Wölfel, and C. Fügen. Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. In Proc. Interspeech, pages 2594-2597, 2006.
- (2006) Proc. Interspeech , pp. 2594-2597
- Gehrig, T.¹ Klee, U.² McDonough, J.³ Ikbal, S.⁴ Wölfel, M.⁵ Fügen, C.⁶

12
- 0042826822
- Independent component analysis: Algorithms and applications
- A. Hyvärinen and E. Oja. Independent component analysis: Algorithms and applications. Neural Networks, 13:411-430, 2000.
- (2000) Neural Networks , vol.13 , pp. 411-430
- Hyvärinen, A.¹ Oja, E.²

13
- 38049134625
- Kalman filters for time delay of arrival-based source localization
- August
- Ulrich Klee, Tobias Gehrig, and John McDonough. Kalman filters for time delay of arrival-based source localization. Journal of Advanced Signal Processing, Special Issue on Multi-Channel Speech Processing, August 2005.
- (2005) Journal of Advanced Signal Processing, Special Issue on Multi-Channel Speech Processing
- Klee, Ulrich¹ Gehrig, Tobias² McDonough, John³

14
- 50449099480
- Adaptive beamforming with a minimum mutual information criterion
- K. Kumatani, T. Gehrig, U. Mayer, E. Stoimenov, J. McDonough, and M. Wölfel. Adaptive beamforming with a minimum mutual information criterion. IEEE Transactions on Audio, Speech and Language Processing, 15:2527-2541, 2007.
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , pp. 2527-2541
- Kumatani, K.¹ Gehrig, T.² Mayer, U.³ Stoimenov, E.⁴ McDonough, J.⁵ Wölfel, M.⁶

15
- 50449089368
- Adaptive beamforming with a maximum negentropy criterion
- Trento, Italy, May
- K. Kumatani, J. McDonough, D. Klakow, P. N. Garner, and W. Li. Adaptive beamforming with a maximum negentropy criterion. In Proc. Hands-Free Speech Communication and Microphone Arrays, Trento, Italy, May 2008.
- (2008) Proc. Hands-Free Speech Communication and Microphone Arrays
- Kumatani, K.¹ McDonough, J.² Klakow, D.³ Garner, P. N.⁴ Li, W.⁵

16
- 84867218783
- Maximum kurtosis beamforming with the generalized sidelobe canceller
- September
- K. Kumatani, J. McDonough, B. Rauch, P. N. Garner, W. Li, and J. Dines. Maximum kurtosis beamforming with the generalized sidelobe canceller. In Proc. Interspeech, September 2008.
- (2008) Proc. Interspeech
- Kumatani, K.¹ McDonough, J.² Rauch, B.³ Garner, P. N.⁴ Li, W.⁵ Dines, J.⁶

17
- 51449092343
- Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming
- K. Kumatani, J. McDonough, S. Schacht, D. Klakow, P. N. Garner, and W. Li. Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming. In Proc. ICASSP, 2008.
- (2008) Proc. ICASSP
- Kumatani, K.¹ McDonough, J.² Schacht, S.³ Klakow, D.⁴ Garner, P. N.⁵ Li, W.⁶

18
- 0003870155
- Elsevier Applied Science
- H. Kuttruff. Room Acoustics. Elsevier Applied Science, 2000.
- (2000) Room Acoustics
- Kuttruff, H.¹

19
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- April
- C. J. Leggetter and P. C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9:171-185, April 1995.
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C. J.¹ Woodland, P. C.²

20
- 33846217002
- The multi-channel Wall Street Journal audio visual corpus (mc-wsj-av): Specification and initial experiments
- M. Lincoln, I. McCowan, I. Vepa, and H. K. Maganti. The multi-channel Wall Street Journal audio visual corpus (mc-wsj-av): Specification and initial experiments. In Proc. ASRU, pages 357-362, 2005.
- (2005) Proc. ASRU , pp. 357-362
- Lincoln, M.¹ McCowan, I.² Vepa, I.³ Maganti, H. K.⁴

21
- 0032072917
- Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering
- C. Marro, Y. Mahieux, and K. U. Simmer. Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering. IEEE Transactions on Speech and Audio Processing, 6:240-259, 1998.
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , pp. 240-259
- Marro, C.¹ Mahieux, Y.² Simmer, K. U.³

22
- 27644556974
- Speech enhancement based on minimum mean-square error estimation and supergaussian priors
- Sept
- R. Martin. Speech enhancement based on minimum mean-square error estimation and supergaussian priors. IEEE Trans. Speech Audio Proc., 13(5):845-856, Sept. 2005.
- (2005) IEEE Trans. Speech Audio Proc , vol.13 , Issue.5 , pp. 845-856
- Martin, R.¹

23
- 50449099053
- To separate speech! a system for recognizing simultaneous speech
- J. McDonough, K. Kumatani, T. Gehrig, E. Stoimenov, U. Mayer, S. Schacht, M. Wölfel, and D. Klakow. To separate speech! a system for recognizing simultaneous speech. In Proc. Machine Learning and Multi-modal Interfaces, 2007.
- (2007) Proc. Machine Learning and Multi-modal Interfaces
- McDonough, J.¹ Kumatani, K.² Gehrig, T.³ Stoimenov, E.⁴ Mayer, U.⁵ Schacht, S.⁶ Wölfel, M.⁷ Klakow, D.⁸

24
- 44849112578
- An algorithm for fast composition of weighted finite-state transducers
- December
- J. McDonough, E. Stoimenov, and D. Klakow. An algorithm for fast composition of weighted finite-state transducers. In Proc. ASRU, December 2007.
- (2007) Proc. ASRU
- McDonough, J.¹ Stoimenov, E.² Klakow, D.³

25
- 50449097590
- Distant speech recognition: Bridging the gaps
- J. McDonough and M. Wölfel. Distant speech recognition: Bridging the gaps. In Proc. Hands-Free Speech Communication and Microphone Arrays, 2008.
- (2008) Proc. Hands-Free Speech Communication and Microphone Arrays
- McDonough, J.¹ Wölfel, M.²

26
- 0027634633
- Proper complex random processes with applications to information theory
- July
- F. D. Neeser and J. L. Massey. Proper complex random processes with applications to information theory. IEEE Trans. Info. Theory, 39(4):1293-1302, July 1993.
- (1993) IEEE Trans. Info. Theory , vol.39 , Issue.4 , pp. 1293-1302
- Neeser, F. D.¹ Massey, J. L.²

27
- 56149112846
- Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria
- T. Nishiura, Y. Hirano, Y. Denda, and M. Nakayama. Investigations into early and late reflections on distant-talking speech recognition toward suitable reverberation criteria. In Proc. of Interspeech, 2007.
- (2007) Proc. of Interspeech
- Nishiura, T.¹ Hirano, Y.² Denda, Y.³ Nakayama, M.⁴

28
- 78649242855
- A cepstral domain maximum likelihood beamformer for speech recognition
- D. Raub, J. McDonough, and M. Wölfel. A cepstral domain maximum likelihood beamformer for speech recognition. In Proc. Interspeech, 2004.
- (2004) Proc. Interspeech
- Raub, D.¹ McDonough, J.² Wölfel, M.³

29
- 84890499695
- Hidden Markov model beamforming with a maximum negentropy optimization criterion
- September
- B. Rauch, K. Kumatani, J. McDonough, and D. Klakow. Hidden Markov model beamforming with a maximum negentropy optimization criterion. In Proc. International Workshop on Acoustic Echo and Noise Control, September 2008.
- (2008) Proc. International Workshop on Acoustic Echo and Noise Control
- Rauch, B.¹ Kumatani, K.² McDonough, J.³ Klakow, D.⁴

30
- 4344607755
- Likelihood-maximizing beamforming for robust hands-free speech recognition
- September
- M. L. Seltzer, B. Raj, and R. M. Stern. Likelihood-maximizing beamforming for robust hands-free speech recognition. IEEE Trans. Speech Audio Proc., 12(5):489-498, September 2004.
- (2004) IEEE Trans. Speech Audio Proc , vol.12 , Issue.5 , pp. 489-498
- Seltzer, M. L.¹ Raj, B.² Stern, R. M.³

31
- 0034855183
- Improvements in linear transform based speaker adaptation
- L. Uebel and P. Woodland. Improvements in linear transform based speaker adaptation. In Proc. ICASSP, 2001.
- (2001) Proc. ICASSP
- Uebel, L.¹ Woodland, P.²

32
- 0003433734
- Prentice Hall, Englewood Cliffs
- P. P. Vaidyanathan. Multirate Systems and Filter Banks. Prentice Hall, Englewood Cliffs, 1993.
- (1993) Multirate Systems and Filter Banks
- Vaidyanathan, P. P.¹

33
- 51449110329
- Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller
- Las Vegas, NV, U.S.A
- E. Warsitz, A. Krueger, and R. Haeb-Umbach. Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, NV, U.S.A, 2008.
- (2008) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Warsitz, E.¹ Krueger, A.² Haeb-Umbach, R.³

34
- 0036753897
- Speaker adaptive modeling by vocal tract normalization
- L. Welling, H. Ney, and S. Kanthak. Speaker adaptive modeling by vocal tract normalization. IEEE Trans. Speech Audio Proc., 10(6):415-426, 2002.
- (2002) IEEE Trans. Speech Audio Proc , vol.10 , Issue.6 , pp. 415-426
- Welling, L.¹ Ney, H.² Kanthak, S.³

35
- 0020126973
- Signal cancellation phenomena in adaptive antennas: Causes and cures
- K. M. D., and, AP-30
- B. Widrow, K. M. D., R. P. Gooch, and W. C. Newman. Signal cancellation phenomena in adaptive antennas: Causes and cures. IEEE Transactions on Antennas and Propagation, AP-30:469-478, 1982.
- (1982) IEEE Transactions on Antennas and Propagation , pp. 469-478
- Widrow, B.¹ Gooch, R. P.² Newman, W. C.³

36
- 50449110561
- A joint particle filter and multi-step linear prediction framework to provide enhanced speech features prior to automatic recognition
- Trento, Italy, May
- M. Wölfel. A joint particle filter and multi-step linear prediction framework to provide enhanced speech features prior to automatic recognition. In Proc. Hands-Free Speech Communication and Microphone Arrays, Trento, Italy, May 2008.
- (2008) Proc. Hands-Free Speech Communication and Microphone Arrays
- Wölfel, M.¹

37
- 50449083999
- Wiley & Sons, Chichester, West Sussex, England
- M. Wölfel and J. McDonough. Distant Speech Recognition. Wiley & Sons, Chichester, West Sussex, England, 2009.
- (2009) Distant Speech Recognition
- Wölfel, M.¹ McDonough, J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.