-
1
-
-
40249109687
-
Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters
-
Gehrig, T., Klee, U., McDonough, J., Ikbal, S., Wölfel, M., Fügen, C.: Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. In: Proc. Interspeech, pp. 2594-2597 (2006)
-
(2006)
Proc. Interspeech
, pp. 2594-2597
-
-
Gehrig, T.1
Klee, U.2
McDonough, J.3
Ikbal, S.4
Wölfel, M.5
Fügen, C.6
-
4
-
-
0042826822
-
Independent component analysis: Algorithms and applications
-
Hyvärinen, A., Oja, E.: Independent component analysis: Algorithms and applications. Neural Networks 13, 411-430 (2000)
-
(2000)
Neural Networks
, vol.13
, pp. 411-430
-
-
Hyvärinen, A.1
Oja, E.2
-
5
-
-
40249114589
-
-
McDonough, J., Kumatani, K.: Minimum mutual information beamforming. Technical Report 107, Interactive Systems Lab, Universität Karlsruhe (August 2006)
-
McDonough, J., Kumatani, K.: Minimum mutual information beamforming. Technical Report 107, Interactive Systems Lab, Universität Karlsruhe (August 2006)
-
-
-
-
6
-
-
50449099480
-
Adaptive beamforming with a minimum mutual information criterion. IEEE Trans
-
to appear
-
Kumatani, K., Gehrig, T., Mayer, U., Stoimenov, E., McDonough, J., Wölfel, M.: Adaptive beamforming with a minimum mutual information criterion. IEEE Trans. Audio Speech and Lang. Proc. (to appear)
-
Audio Speech and Lang. Proc
-
-
Kumatani, K.1
Gehrig, T.2
Mayer, U.3
Stoimenov, E.4
McDonough, J.5
Wölfel, M.6
-
8
-
-
0037229734
-
Filter bank design for subband adaptive microphone arrays
-
de Haan, J.M., Grbic, N., Claesson, I., Nordholm, S.E.: Filter bank design for subband adaptive microphone arrays. IEEE Trans. Speech and Audio Proc. 11(1), 14-23 (2003)
-
(2003)
IEEE Trans. Speech and Audio Proc
, vol.11
, Issue.1
, pp. 14-23
-
-
de Haan, J.M.1
Grbic, N.2
Claesson, I.3
Nordholm, S.E.4
-
9
-
-
0023313586
-
Description and generation of spherically invariant speech-model signals
-
Brehm, H., Stammler, W.: Description and generation of spherically invariant speech-model signals. Signal Processing 12, 119-141 (1987)
-
(1987)
Signal Processing
, vol.12
, pp. 119-141
-
-
Brehm, H.1
Stammler, W.2
-
10
-
-
84892168937
-
Full expansion of contextdependent networks in large vocabulary speech recognition
-
Seattle
-
Mohri, M., Riley, M., Hindle, D., Ljolje, A., Perlera, F.: Full expansion of contextdependent networks in large vocabulary speech recognition. In: Proc. ICASSP, Seattle, vol. II, pp. 665-668 (1998)
-
(1998)
Proc. ICASSP
, vol.2
, pp. 665-668
-
-
Mohri, M.1
Riley, M.2
Hindle, D.3
Ljolje, A.4
Perlera, F.5
-
11
-
-
0036460907
-
Weighted finite-state transducers in speech recognition
-
Mohri, M., Pereira, F., Riley, M.: Weighted finite-state transducers in speech recognition. Computer Speech and Language 16, 69-88 (2002)
-
(2002)
Computer Speech and Language
, vol.16
, pp. 69-88
-
-
Mohri, M.1
Pereira, F.2
Riley, M.3
-
12
-
-
0012263430
-
Network optimizations for large vocabulary speech recognition
-
Mohri, M., Riley, M.: Network optimizations for large vocabulary speech recognition. Speech Communication 25(3) (1998)
-
(1998)
Speech Communication
, vol.25
, Issue.3
-
-
Mohri, M.1
Riley, M.2
-
13
-
-
33947638544
-
Modeling polyphone context with weighted finitestate transducers
-
Stoimenov, E., McDonough, J.: Modeling polyphone context with weighted finitestate transducers. In: Proc. ICASSP (2006)
-
(2006)
Proc. ICASSP
-
-
Stoimenov, E.1
McDonough, J.2
-
14
-
-
85164641250
-
Memory efficient modeling of polyphone context with weighted finite-state transducers
-
Stoimenov, E., McDonough, J.: Memory efficient modeling of polyphone context with weighted finite-state transducers. In: Proc. Interspeech (2007)
-
(2007)
Proc. Interspeech
-
-
Stoimenov, E.1
McDonough, J.2
-
15
-
-
0348198473
-
Finite-state transducers in language and speech processing
-
Mohri, M.: Finite-state transducers in language and speech processing. Computational Linguistics 23(2) (1997)
-
(1997)
Computational Linguistics
, vol.23
, Issue.2
-
-
Mohri, M.1
-
16
-
-
85009070232
-
A weight pushing algorithm for large vocabulary speech recognition
-
Aarlborg, Denmark, September
-
Mohri, M., Riley, M.: A weight pushing algorithm for large vocabulary speech recognition. In: Proc. ASRU, Aarlborg, Denmark, September 2001, pp. 1603-1606 (2001)
-
(2001)
Proc. ASRU
, pp. 1603-1606
-
-
Mohri, M.1
Riley, M.2
-
17
-
-
0000116303
-
Minimization algorithms for sequential transducers
-
Mohri, M.: Minimization algorithms for sequential transducers. Theoretical Computer Science 234(1-2), 177-201 (2000)
-
(2000)
Theoretical Computer Science
, vol.234
, Issue.1-2
, pp. 177-201
-
-
Mohri, M.1
-
18
-
-
33846217002
-
The multi-channel wall street journal audio visual corpus (mc-wsj-av): Specification and initial experiments
-
November
-
Lincoln, M., McCowan, I., Vepa, J., Maganti, H.: The multi-channel wall street journal audio visual corpus (mc-wsj-av): specification and initial experiments. In: Proc. ASRU, pp. 357-362 (November 2005)
-
(2005)
Proc. ASRU
, pp. 357-362
-
-
Lincoln, M.1
McCowan, I.2
Vepa, J.3
Maganti, H.4
-
19
-
-
85032772258
-
Minimum variance distortionless response spectral estimation, review and refinements
-
Wölfel, M., McDonough, J.: Minimum variance distortionless response spectral estimation, review and refinements. IEEE Signal Processing Magazine 22(5), 117-126 (2005)
-
(2005)
IEEE Signal Processing Magazine
, vol.22
, Issue.5
, pp. 117-126
-
-
Wölfel, M.1
McDonough, J.2
-
21
-
-
4243460174
-
Semi-tied covariance matrices
-
Gales, M.J.F.: Semi-tied covariance matrices. In: Proc. ICASSP (1998)
-
(1998)
Proc. ICASSP
-
-
Gales, M.J.F.1
-
22
-
-
40249085519
-
-
Fransen, J., Pye, D., Robinson, T., Woodland, P., Young, S.: Wsjcam0 corpus and recording description. Technical Report CUED/F-INFENG/TR.192, Cambridge University Engineering Department (CUED) Speech Group (September 1994)
-
Fransen, J., Pye, D., Robinson, T., Woodland, P., Young, S.: Wsjcam0 corpus and recording description. Technical Report CUED/F-INFENG/TR.192, Cambridge University Engineering Department (CUED) Speech Group (September 1994)
-
-
-
-
23
-
-
0003424145
-
-
Macmillan Publishing, New York
-
Deller, J., Hansen, J., Proakis, J.: Discrete-Time Processing of Speech Signals. Macmillan Publishing, New York (1993)
-
(1993)
Discrete-Time Processing of Speech Signals
-
-
Deller, J.1
Hansen, J.2
Proakis, J.3
-
24
-
-
0030362995
-
A compact model for speaker-adaptive training
-
Anastasakos, T., McDonough, J., Schwarz, R., Makhoul, J.: A compact model for speaker-adaptive training. In: Proc. ICSLP, pp. 1137-1140 (1996)
-
(1996)
Proc. ICSLP
, pp. 1137-1140
-
-
Anastasakos, T.1
McDonough, J.2
Schwarz, R.3
Makhoul, J.4
-
25
-
-
0034855183
-
Improvements in linear transform based speaker adaptation
-
Uebel, L., Woodland, P.: Improvements in linear transform based speaker adaptation. In: Proc. ICASSP (2001)
-
(2001)
Proc. ICASSP
-
-
Uebel, L.1
Woodland, P.2
-
26
-
-
40249105704
-
Mel-Frequenzanpassung der Minimum Varianz Distortionless Response Einhüllenden
-
Wölfel, M.: Mel-Frequenzanpassung der Minimum Varianz Distortionless Response Einhüllenden. In: Proc. of ESSV (2003)
-
(2003)
Proc. of ESSV
-
-
Wölfel, M.1
-
27
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
Gales, M. J.F.: Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language 12 (1998)
-
(1998)
Computer Speech and Language
, vol.12
-
-
Gales, M.J.F.1
-
28
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
-
Leggetter, C.J., Woodland, P.C.: Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. Computer Speech and Language 9, 171-185 (1995)
-
(1995)
Computer Speech and Language
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
29
-
-
44849112578
-
An algorithm for fast composition of weighted finite-state transducers
-
submitted
-
McDonough, J., Stoimenov, E., Klakow, D.: An algorithm for fast composition of weighted finite-state transducers. In: Proc. ASRU (submitted, 2007)
-
(2007)
Proc. ASRU
-
-
McDonough, J.1
Stoimenov, E.2
Klakow, D.3
-
30
-
-
0009653561
-
Post-filtering techniques
-
Branstein, M, Ward, D, eds, Springer, Heidelberg
-
Simmer, K.U., Bitzer, J., Marro, C.: Post-filtering techniques. In: Branstein, M., Ward, D. (eds.) Microphone Arrays, pp. 39-60. Springer, Heidelberg (2001)
-
(2001)
Microphone Arrays
, pp. 39-60
-
-
Simmer, K.U.1
Bitzer, J.2
Marro, C.3
-
31
-
-
33750570839
-
-
McCowan, I., Hari-Krishna, M., Gatica-Perez, D., Moore, D., Ba, S.: Speech acquisition in meetings with an audio-visual sensor array. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) (July 2005)
-
McCowan, I., Hari-Krishna, M., Gatica-Perez, D., Moore, D., Ba, S.: Speech acquisition in meetings with an audio-visual sensor array. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) (July 2005)
-
-
-
|