-
1
-
-
70449558459
-
Audiovisual event detection towards scene understanding
-
C. Canton-Ferrer, T. Butko, C. Segura, X. Giro, C. Nadeu, J. Hernando, and J. Casas, "Audiovisual event detection towards scene understanding," in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2009, pp. 81-88.
-
IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2009
, pp. 81-88
-
-
Canton-Ferrer, C.1
Butko, T.2
Segura, C.3
Giro, X.4
Nadeu, C.5
Hernando, J.6
Casas, J.7
-
3
-
-
0141479982
-
Using speech/non-speech detection to bias recognition search on noisy data
-
F. Beaufays, D. Boies, M. Weintraub, and Q. Zhu, "Using speech/non-speech detection to bias recognition search on noisy data," in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003.
-
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003
-
-
Beaufays, F.1
Boies, D.2
Weintraub, M.3
Zhu, Q.4
-
5
-
-
77955558847
-
Real-world acoustic event detection
-
X. Zhuang, X. Zhou, M. A. Hasegawa-Johnson, and T. S. Huang, "Real-world acoustic event detection," Pattern Recognition Letters, vol. 31, no. 12, pp. 1543-1551, 2010.
-
(2010)
Pattern Recognition Letters
, vol.31
, Issue.12
, pp. 1543-1551
-
-
Zhuang, X.1
Zhou, X.2
Hasegawa-Johnson, M.A.3
Huang, T.S.4
-
7
-
-
55349112006
-
Audio-visual event recognition with application in sports video
-
Z. Xiong, R. Radhakrishnan, A. Divakaran, and T. Huang, "Audio-visual event recognition with application in sports video," in Intelligent Multimedia Processing with Soft Computing, ser. Studies in Fuzziness and Soft Computing, 2005, vol. 168, pp. 129-149.
-
(2005)
Intelligent Multimedia Processing with Soft Computing, Ser. Studies in Fuzziness and Soft Computing
, vol.168
, pp. 129-149
-
-
Xiong, Z.1
Radhakrishnan, R.2
Divakaran, A.3
Huang, T.4
-
8
-
-
0036295989
-
Audio-visual speech modeling using coupled hidden Markov models
-
S. M. Chu and T. S. Huang, "Audio-visual speech modeling using coupled hidden Markov models," in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002.
-
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002
-
-
Chu, S.M.1
Huang, T.S.2
-
9
-
-
4544228318
-
Identity verification using speech and face information
-
C. Sanderson and K. K. Paliwal, "Identity verification using speech and face information," Digital Signal Processing, vol. 14, no. 5, pp. 449-480, 2004.
-
(2004)
Digital Signal Processing
, vol.14
, Issue.5
, pp. 449-480
-
-
Sanderson, C.1
Paliwal, K.K.2
-
10
-
-
38349007037
-
A duality based approach for realtime tv-l1 optical flow
-
C. Zach, T. Pock, and H. Bischof, "A duality based approach for realtime tv-l1 optical flow," in Pattern Recognition (Proc. DAGM), Heidelberg, Germany, 2007, pp. 214-223.
-
Pattern Recognition (Proc. DAGM), Heidelberg, Germany, 2007
, pp. 214-223
-
-
Zach, C.1
Pock, T.2
Bischof, H.3
-
12
-
-
33845572523
-
Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
-
S. Lazebnik, C. Schmid, and J. Ponce, "Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories," in CVPR, 2006.
-
(2006)
CVPR
-
-
Lazebnik, S.1
Schmid, C.2
Ponce, J.3
-
13
-
-
0001432664
-
On the integration of auditory and visual parameters in an HMM-based ASR
-
D. G. Stork and M. E. Hennecke (Eds.), Berlin: Springer-Verlag
-
A. Adjoudani and C. Benoit, "On the integration of auditory and visual parameters in an HMM-based ASR," In D. G. Stork and M. E. Hennecke (Eds.), Speechreading by Humans and Machines. Berlin: Springer-Verlag, pp. 461-471, 1996.
-
(1996)
Speechreading by Humans and Machines
, pp. 461-471
-
-
Adjoudani, A.1
Benoit, C.2
-
14
-
-
0003770986
-
Comparingmodels for audiovisual fusion in a noisy-vowel recognition task
-
P. Teissier, J. Robert-Ribes, and J. L. Schwartz, "Comparingmodels for audiovisual fusion in a noisy-vowel recognition task," IEEE Trans. Speech Audio Processing, vol. vol. 7, pp. 629-642, 1999.
-
(1999)
IEEE Trans. Speech Audio Processing
, vol.7
, pp. 629-642
-
-
Teissier, P.1
Robert-Ribes, J.2
Schwartz, J.L.3
-
16
-
-
0036650148
-
Statistical multimodal integration for audio-visual speech processing
-
S. Nakamura, "Statistical multimodal integration for audio-visual speech processing," IEEE Transactions on Neural Networks, vol. 13, no. 4, 2002.
-
(2002)
IEEE Transactions on Neural Networks
, vol.13
, Issue.4
-
-
Nakamura, S.1
-
17
-
-
51449101221
-
Feature analysis and selection for acoustic event detection
-
X. Zhuang, X. Zhou, T. S. Huang, and M. Hasegawa-Johnson, "Feature analysis and selection for acoustic event detection," in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2008.
-
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2008
-
-
Zhuang, X.1
Zhou, X.2
Huang, T.S.3
Hasegawa-Johnson, M.4
|