SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 4892 LNCS, Issue , 2008, Pages 283-294

To separate speech: A system for recognizing simultaneous speech

(8) McDonough, John a,c Kumatani, Kenichi b,c Gehrig, Tobias c Stoimenov, Emilian c Mayer, Uwe c Schacht, Stefan a Wölfel, Matthias c Klakow, Dietrich a

a SAARLAND UNIVERSITY (Germany)

b IDIAP RESEARCH INSTITUTE (Switzerland)

c UNIVERSITY OF KARLSRUHE (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

BEAMFORMING; MICROPHONES; TRANSDUCERS;

FILTER BANK DESIGN; GENERALIZED SIDELOBE CANCELLER (GSC); MINIMUM MUTUAL INFORMATION (MMI); WORD ERROR RATE;

SPEECH RECOGNITION;

EID: 40249114843 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-78155-4_25 Document Type: Conference Paper

Times cited : (9)

References (31)

1
- 40249109687
- Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters
- Gehrig, T., Klee, U., McDonough, J., Ikbal, S., Wölfel, M., Fügen, C.: Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. In: Proc. Interspeech, pp. 2594-2597 (2006)
- (2006) Proc. Interspeech , pp. 2594-2597
- Gehrig, T.¹ Klee, U.² McDonough, J.³ Ikbal, S.⁴ Wölfel, M.⁵ Fügen, C.⁶

2
- 0004235293
- Academic Press, San Diego
- Bar-Shalom, Y., Fortmann, T.E.: Tracking and Data Association. Academic Press, San Diego (1988)
- (1988) Tracking and Data Association
- Bar-Shalom, Y.¹ Fortmann, T.E.²

3
- 0003964055
- Wiley-Interscience, Chichester
- Van Trees, H.L.: Optimum Array Processing. Wiley-Interscience, Chichester (2002)
- (2002) Optimum Array Processing
- Van Trees, H.L.¹

4
- 0042826822
- Independent component analysis: Algorithms and applications
- Hyvärinen, A., Oja, E.: Independent component analysis: Algorithms and applications. Neural Networks 13, 411-430 (2000)
- (2000) Neural Networks , vol.13 , pp. 411-430
- Hyvärinen, A.¹ Oja, E.²

5
- 40249114589
- McDonough, J., Kumatani, K.: Minimum mutual information beamforming. Technical Report 107, Interactive Systems Lab, Universität Karlsruhe (August 2006)
- McDonough, J., Kumatani, K.: Minimum mutual information beamforming. Technical Report 107, Interactive Systems Lab, Universität Karlsruhe (August 2006)

6
- 50449099480
- Adaptive beamforming with a minimum mutual information criterion. IEEE Trans
- to appear
- Kumatani, K., Gehrig, T., Mayer, U., Stoimenov, E., McDonough, J., Wölfel, M.: Adaptive beamforming with a minimum mutual information criterion. IEEE Trans. Audio Speech and Lang. Proc. (to appear)
- Audio Speech and Lang. Proc
- Kumatani, K.¹ Gehrig, T.² Mayer, U.³ Stoimenov, E.⁴ McDonough, J.⁵ Wölfel, M.⁶

7
- 0003433734
- Prentice-Hall, Englewood Cliffs
- Vaidyanathan, P.P.: Multirate Systems and Filter Banks. Prentice-Hall, Englewood Cliffs (1993)
- (1993) Multirate Systems and Filter Banks
- Vaidyanathan, P.P.¹

8
- 0037229734
- Filter bank design for subband adaptive microphone arrays
- de Haan, J.M., Grbic, N., Claesson, I., Nordholm, S.E.: Filter bank design for subband adaptive microphone arrays. IEEE Trans. Speech and Audio Proc. 11(1), 14-23 (2003)
- (2003) IEEE Trans. Speech and Audio Proc , vol.11 , Issue.1 , pp. 14-23
- de Haan, J.M.¹ Grbic, N.² Claesson, I.³ Nordholm, S.E.⁴

9
- 0023313586
- Description and generation of spherically invariant speech-model signals
- Brehm, H., Stammler, W.: Description and generation of spherically invariant speech-model signals. Signal Processing 12, 119-141 (1987)
- (1987) Signal Processing , vol.12 , pp. 119-141
- Brehm, H.¹ Stammler, W.²

10
- 84892168937
- Full expansion of contextdependent networks in large vocabulary speech recognition
- Seattle
- Mohri, M., Riley, M., Hindle, D., Ljolje, A., Perlera, F.: Full expansion of contextdependent networks in large vocabulary speech recognition. In: Proc. ICASSP, Seattle, vol. II, pp. 665-668 (1998)
- (1998) Proc. ICASSP , vol.2 , pp. 665-668
- Mohri, M.¹ Riley, M.² Hindle, D.³ Ljolje, A.⁴ Perlera, F.⁵

11
- 0036460907
- Weighted finite-state transducers in speech recognition
- Mohri, M., Pereira, F., Riley, M.: Weighted finite-state transducers in speech recognition. Computer Speech and Language 16, 69-88 (2002)
- (2002) Computer Speech and Language , vol.16 , pp. 69-88
- Mohri, M.¹ Pereira, F.² Riley, M.³

12
- 0012263430
- Network optimizations for large vocabulary speech recognition
- Mohri, M., Riley, M.: Network optimizations for large vocabulary speech recognition. Speech Communication 25(3) (1998)
- (1998) Speech Communication , vol.25 , Issue.3
- Mohri, M.¹ Riley, M.²

13
- 33947638544
- Modeling polyphone context with weighted finitestate transducers
- Stoimenov, E., McDonough, J.: Modeling polyphone context with weighted finitestate transducers. In: Proc. ICASSP (2006)
- (2006) Proc. ICASSP
- Stoimenov, E.¹ McDonough, J.²

14
- 85164641250
- Memory efficient modeling of polyphone context with weighted finite-state transducers
- Stoimenov, E., McDonough, J.: Memory efficient modeling of polyphone context with weighted finite-state transducers. In: Proc. Interspeech (2007)
- (2007) Proc. Interspeech
- Stoimenov, E.¹ McDonough, J.²

15
- 0348198473
- Finite-state transducers in language and speech processing
- Mohri, M.: Finite-state transducers in language and speech processing. Computational Linguistics 23(2) (1997)
- (1997) Computational Linguistics , vol.23 , Issue.2
- Mohri, M.¹

16
- 85009070232
- A weight pushing algorithm for large vocabulary speech recognition
- Aarlborg, Denmark, September
- Mohri, M., Riley, M.: A weight pushing algorithm for large vocabulary speech recognition. In: Proc. ASRU, Aarlborg, Denmark, September 2001, pp. 1603-1606 (2001)
- (2001) Proc. ASRU , pp. 1603-1606
- Mohri, M.¹ Riley, M.²

17
- 0000116303
- Minimization algorithms for sequential transducers
- Mohri, M.: Minimization algorithms for sequential transducers. Theoretical Computer Science 234(1-2), 177-201 (2000)
- (2000) Theoretical Computer Science , vol.234 , Issue.1-2 , pp. 177-201
- Mohri, M.¹

18
- 33846217002
- The multi-channel wall street journal audio visual corpus (mc-wsj-av): Specification and initial experiments
- November
- Lincoln, M., McCowan, I., Vepa, J., Maganti, H.: The multi-channel wall street journal audio visual corpus (mc-wsj-av): specification and initial experiments. In: Proc. ASRU, pp. 357-362 (November 2005)
- (2005) Proc. ASRU , pp. 357-362
- Lincoln, M.¹ McCowan, I.² Vepa, J.³ Maganti, H.⁴

19
- 85032772258
- Minimum variance distortionless response spectral estimation, review and refinements
- Wölfel, M., McDonough, J.: Minimum variance distortionless response spectral estimation, review and refinements. IEEE Signal Processing Magazine 22(5), 117-126 (2005)
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 117-126
- Wölfel, M.¹ McDonough, J.²

20
- 0032097263
- Academic Press, New York
- Fukunaga, K.: Introduction to Statistical Pattern Recognition. Academic Press, New York (1990)
- (1990) Introduction to Statistical Pattern Recognition
- Fukunaga, K.¹

21
- 4243460174
- Semi-tied covariance matrices
- Gales, M.J.F.: Semi-tied covariance matrices. In: Proc. ICASSP (1998)
- (1998) Proc. ICASSP
- Gales, M.J.F.¹

22
- 40249085519
- Fransen, J., Pye, D., Robinson, T., Woodland, P., Young, S.: Wsjcam0 corpus and recording description. Technical Report CUED/F-INFENG/TR.192, Cambridge University Engineering Department (CUED) Speech Group (September 1994)
- Fransen, J., Pye, D., Robinson, T., Woodland, P., Young, S.: Wsjcam0 corpus and recording description. Technical Report CUED/F-INFENG/TR.192, Cambridge University Engineering Department (CUED) Speech Group (September 1994)

23
- 0003424145
- Macmillan Publishing, New York
- Deller, J., Hansen, J., Proakis, J.: Discrete-Time Processing of Speech Signals. Macmillan Publishing, New York (1993)
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.¹ Hansen, J.² Proakis, J.³

24
- 0030362995
- A compact model for speaker-adaptive training
- Anastasakos, T., McDonough, J., Schwarz, R., Makhoul, J.: A compact model for speaker-adaptive training. In: Proc. ICSLP, pp. 1137-1140 (1996)
- (1996) Proc. ICSLP , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwarz, R.³ Makhoul, J.⁴

25
- 0034855183
- Improvements in linear transform based speaker adaptation
- Uebel, L., Woodland, P.: Improvements in linear transform based speaker adaptation. In: Proc. ICASSP (2001)
- (2001) Proc. ICASSP
- Uebel, L.¹ Woodland, P.²

26
- 40249105704
- Mel-Frequenzanpassung der Minimum Varianz Distortionless Response Einhüllenden
- Wölfel, M.: Mel-Frequenzanpassung der Minimum Varianz Distortionless Response Einhüllenden. In: Proc. of ESSV (2003)
- (2003) Proc. of ESSV
- Wölfel, M.¹

27
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- Gales, M. J.F.: Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language 12 (1998)
- (1998) Computer Speech and Language , vol.12
- Gales, M.J.F.¹

28
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
- Leggetter, C.J., Woodland, P.C.: Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. Computer Speech and Language 9, 171-185 (1995)
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

29
- 44849112578
- An algorithm for fast composition of weighted finite-state transducers
- submitted
- McDonough, J., Stoimenov, E., Klakow, D.: An algorithm for fast composition of weighted finite-state transducers. In: Proc. ASRU (submitted, 2007)
- (2007) Proc. ASRU
- McDonough, J.¹ Stoimenov, E.² Klakow, D.³

30
- 0009653561
- Post-filtering techniques
- Branstein, M, Ward, D, eds, Springer, Heidelberg
- Simmer, K.U., Bitzer, J., Marro, C.: Post-filtering techniques. In: Branstein, M., Ward, D. (eds.) Microphone Arrays, pp. 39-60. Springer, Heidelberg (2001)
- (2001) Microphone Arrays , pp. 39-60
- Simmer, K.U.¹ Bitzer, J.² Marro, C.³

31
- 33750570839
- McCowan, I., Hari-Krishna, M., Gatica-Perez, D., Moore, D., Ba, S.: Speech acquisition in meetings with an audio-visual sensor array. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) (July 2005)
- McCowan, I., Hari-Krishna, M., Gatica-Perez, D., Moore, D., Ba, S.: Speech acquisition in meetings with an audio-visual sensor array. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) (July 2005)

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.