-
1
-
-
84977901887
-
A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise
-
H. Attias, L. Deng, A. Acero, and J.C. Platt, "A New Method for Speech Denoising and Robust Speech Recognition Using Probabilistic Models for Clean Speech and for Noise," Proc. Eurospeech, 2001.
-
Proc. Eurospeech, 2001
-
-
Attias, H.1
Deng, L.2
Acero, A.3
Platt, J.C.4
-
2
-
-
0032528695
-
Blind source separation and deconvolution: The dynamic component analysis algorithm
-
H. Attias and C.E. Schreiner, "Blind Source Separation and Deconvolution: The Dynamic Component Analysis Algorithm," Neural Computation, vol. 10, 1998.
-
(1998)
Neural Computation
, vol.10
-
-
Attias, H.1
Schreiner, C.E.2
-
3
-
-
0032675797
-
Audio-visual person verification
-
S. Ben-Yacoub, J. Luttin, K. Jonsson, J. Matas, and J. Kittler, "Audio-Visual Person Verification," Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 2000.
-
Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 2000
-
-
Ben-Yacoub, S.1
Luttin, J.2
Jonsson, K.3
Matas, J.4
Kittler, J.5
-
5
-
-
0009590598
-
-
M. Brandstein and D. Ward, eds. Springer
-
Microphone Arrays, M. Brandstein and D. Ward, eds. Springer, 2001.
-
(2001)
Microphone Arrays
-
-
-
6
-
-
0032918933
-
Time-delay estimation of reverberant speech exploiting harmonic structure
-
M.S. Brandstein, "Time-Delay Estimation of Reverberant Speech Exploiting Harmonic Structure," J. Accoustic Soc. Am., vol. 105, no. 5, pp. 2914-2919, 1999.
-
(1999)
J. Accoustic Soc. Am.
, vol.105
, Issue.5
, pp. 2914-2919
-
-
Brandstein, M.S.1
-
7
-
-
85013597845
-
Eigenlips for robust speech recognition
-
C. Bregler and Y. Konig, "Eigenlips for Robust Speech Recognition," Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 1994.
-
Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 1994
-
-
Bregler, C.1
Konig, Y.2
-
10
-
-
0038715064
-
Distributed meetings: A meeting capture and broadcasting system
-
R. Cutler, Y. Rui, A. Gupta, J.J. Cadiz, I. Tashev, L.-W. He, A. Colburn, Z. Zhang, Z. Liu, and S. Silverberg, "Distributed Meetings: A Meeting Capture and Broadcasting System," Proc. ACM Multimedia, 2002.
-
Proc. ACM Multimedia, 2002
-
-
Cutler, R.1
Rui, Y.2
Gupta, A.3
Cadiz, J.J.4
Tashev, I.5
He, L.-W.6
Colburn, A.7
Zhang, Z.8
Liu, Z.9
Silverberg, S.10
-
11
-
-
0034842488
-
Active speech source localization by a dual coarse-to-fine search
-
R. Duraiswami, D. Zotkin, and L. David, "Active Speech Source Localization by a Dual Coarse-to-Fine Search," Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 2001.
-
Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 2001
-
-
Duraiswami, R.1
Zotkin, D.2
David, L.3
-
13
-
-
0344077914
-
Advances in algorithms for inference and learning in complex probability models
-
pending publication
-
B.J. Frey and N. Jojic, "Advances in Algorithms for Inference and Learning in Complex Probability Models," IEEE Trans. Pattern Analysis and Machine Intelligence, pending publication.
-
IEEE Trans. Pattern Analysis and Machine Intelligence
-
-
Frey, B.J.1
Jojic, N.2
-
16
-
-
84910034222
-
Stereo vision lip-tracking for audio-video speech processing
-
R. Goecke, J.B. Millar, A. Zelinsky, and J. Robert-Ribes, "Stereo Vision Lip-Tracking for Audio-Video Speech Processing," Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 2001.
-
Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 2001
-
-
Goecke, R.1
Millar, J.B.2
Zelinsky, A.3
Robert-Ribes, J.4
-
18
-
-
84899028297
-
Using audio-visual synchrony to locate sounds
-
S.A. Solla, T.K. Leen, and K.-R. Muller, eds.
-
J. Hershey and J.R. Movellan, "Using Audio-Visual Synchrony to Locate Sounds," Proc. Advances in Neural Information Processing Systems 1999, S.A. Solla, T.K. Leen, and K.-R. Muller, eds., vol. 12, 2000.
-
(2000)
Proc. Advances in Neural Information Processing Systems 1999
, vol.12
-
-
Hershey, J.1
Movellan, J.R.2
-
19
-
-
84898954418
-
Learning joint statistical models for audio-visual fusion and segregation
-
J.W. Fisher III, T. Darrell, W.T. Freeman, and P.A. Viola, "Learning Joint Statistical Models for Audio-Visual Fusion and Segregation," Proc. Advances in Neural Information Processing Systems 2000, vol. 14, 2001.
-
(2001)
Proc. Advances in Neural Information Processing Systems 2000
, vol.14
-
-
Fisher J.W. III1
Darrell, T.2
Freeman, W.T.3
Viola, P.A.4
-
20
-
-
0035680076
-
Robust, on-line appearance models for vision tracking
-
A.D. Jepson, D.J. Fleet, and T. El-Maraghi, "Robust, On-Line Appearance Models for Vision Tracking," Proc. IEEE Conf. Computer Vision and Pattern Recognition, Dec. 2001.
-
Proc. IEEE Conf. Computer Vision and Pattern Recognition, Dec. 2001
-
-
Jepson, A.D.1
Fleet, D.J.2
El-Maraghi, T.3
-
22
-
-
0033698724
-
Transformed hidden Markov models: Estimating mixture models of images and inferring spatial transformations in video sequences
-
N. Jojic, N. Petrovic, B.J. Frey, and T.S. Huang, "Transformed Hidden Markov Models: Estimating Mixture Models of Images and Inferring Spatial Transformations in Video Sequences," Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 2000.
-
Proc. IEEE Conf. Computer Vision and Pattern Recognition, June 2000
-
-
Jojic, N.1
Petrovic, N.2
Frey, B.J.3
Huang, T.S.4
-
23
-
-
0000935895
-
An introduction to variational methods for graphical models
-
M.I. Jordan, ed. Norwell Mass.: Kluwer Academic Publishers
-
M.I. Jordan, Z. Ghahramani, T.S. Jaakkola, and L.K. Saul, "An Introduction to Variational Methods for Graphical Models," Learning in Graphical Models, M.I. Jordan, ed. Norwell Mass.: Kluwer Academic Publishers, 1998.
-
(1998)
Learning in Graphical Models
-
-
Jordan, M.I.1
Ghahramani, Z.2
Jaakkola, T.S.3
Saul, L.K.4
-
24
-
-
84880877816
-
Real-time auditory and visual multiple-object tracking for robots
-
K. Nakadai, K. Hidai, H. Mizoguchi, H.G. Okuno, and H. Kitano, "Real-Time Auditory and Visual Multiple-Object Tracking for Robots," Proc. Int'l Joint Conf. Artificial Intelligence, 2001.
-
Proc. Int'l Joint Conf. Artificial Intelligence, 2001
-
-
Nakadai, K.1
Hidai, K.2
Mizoguchi, H.3
Okuno, H.G.4
Kitano, H.5
-
25
-
-
0002788893
-
A view of the EM algorithm that justifies incremental, sparse, and other variants
-
M.I. Jordan, ed.; Norwell Mass.: Kluwer Academic Publishers
-
R.M. Neal and G.E. Hinton, "A View of the EM Algorithm that Justifies Incremental, Sparse, and Other Variants," Learning in Graphical Models, M.I. Jordan, ed. pp. 355-368, Norwell Mass.: Kluwer Academic Publishers, 1998.
-
(1998)
Learning in Graphical Models
, pp. 355-368
-
-
Neal, R.M.1
Hinton, G.E.2
-
29
-
-
84898931254
-
Facesync: A linear operator for measuring synchronization of video facial images and audio tracks
-
M. Slaney and M. Covell, "Facesync: A Linear Operator for Measuring Synchronization of Video Facial Images and Audio Tracks," Proc. Advances in Neural Information Processing Systems 2000, vol. 14, 2001.
-
(2001)
Proc. Advances in Neural Information Processing Systems 2000
, vol.14
-
-
Slaney, M.1
Covell, M.2
-
30
-
-
0030681710
-
Tracking multiple talkers using microphone-array measurements
-
D.E. Sturim, M.S. Brandstein, and H.F. Solverman, "Tracking Multiple Talkers Using Microphone-Array Measurements," Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 1997.
-
Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 1997
-
-
Sturim, D.E.1
Brandstein, M.S.2
Solverman, H.F.3
-
31
-
-
0034844366
-
Sequential Monte Carlo fusion of sound and vision for speaker tracking
-
J. Vermaak, M. Gangnet, A. Blake, and P. Perez, "Sequential Monte Carlo Fusion of Sound and Vision for Speaker Tracking," Proc. IEEE Int'l Conf. Computer Vision, 2001.
-
Proc. IEEE Int'l Conf. Computer Vision, 2001
-
-
Vermaak, J.1
Gangnet, M.2
Blake, A.3
Perez, P.4
-
32
-
-
0031385284
-
Voice source localization for automatic camera pointing system in cideoconferencing
-
H. Wang and P. Chu, "Voice Source Localization for Automatic Camera Pointing System in Cideoconferencing," Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 1997.
-
Proc. IEEE Conf. Acoustics, Speech, and Signal Processing, 1997
-
-
Wang, H.1
Chu, P.2
-
33
-
-
33846628402
-
Audio-video array source localization for perceptual user interfaces
-
K. Wilson, N. Checka, D. Demirdjian, and T. Darrell, "Audio-Video Array Source Localization for Perceptual User Interfaces," Proc. Workshop Perceptive User Interfaces, 2001.
-
Proc. Workshop Perceptive User Interfaces, 2001
-
-
Wilson, K.1
Checka, N.2
Demirdjian, D.3
Darrell, T.4
-
34
-
-
0036874485
-
Joint audio-visual tracking using particle filters
-
D.N. Zotkin, R. Duraiswami, and L.S. Davis, "Joint Audio-Visual Tracking Using Particle Filters," EURASIP J. Applied Signal Processing, vol. 11, pp. 1154-1164, 2002.
-
(2002)
EURASIP J. Applied Signal Processing
, vol.11
, pp. 1154-1164
-
-
Zotkin, D.N.1
Duraiswami, R.2
Davis, L.S.3
|