-
1
-
-
0036874999
-
Dynamic bayesian networks for audio-visual speech recognition
-
November
-
A. Nefian, L. Liang, X. Pi, X. Liu, and K. Murphy, "Dynamic bayesian networks for audio-visual speech recognition," EURASIP, Journal of Applied Signal Processing, vol. 2002, p. 1274, November 2002.
-
(2002)
EURASIP, Journal of Applied Signal Processing
, vol.2002
, pp. 1274
-
-
Nefian, A.1
Liang, L.2
Pi, X.3
Liu, X.4
Murphy, K.5
-
2
-
-
11444253586
-
Layered representations for learning and inferring office activity from multiple sensory channels
-
(Pittsburgh, PA), October
-
N. Oliver, E. Horvitz, and A. Garg, "Layered representations for learning and inferring office activity from multiple sensory channels," in Proc. Int. Conf. on Multimodal Interfaces (ICMI'02), (Pittsburgh, PA), October 2002.
-
(2002)
Proc. Int. Conf. on Multimodal Interfaces (ICMI'02)
-
-
Oliver, N.1
Horvitz, E.2
Garg, A.3
-
3
-
-
1542347786
-
Automatic image annotation and retrieval using cross-media relevance models
-
(Toronto, Canada), July 28-Aug.-1
-
J. Jeon, V. Lavrenko, and R. Manmatha, "Automatic image annotation and retrieval using cross-media relevance models," in Proc. SIGIR-03, (Toronto, Canada), pp. 119-126, July 28-Aug.-1 2003.
-
(2003)
Proc. SIGIR-03
, pp. 119-126
-
-
Jeon, J.1
Lavrenko, V.2
Manmatha, R.3
-
4
-
-
85026972772
-
Probabilistic latent semantic indexing
-
ACM Press
-
T. Hofmann, "Probabilistic latent semantic indexing," in Proc. ACM SIGIR, pp. 50-57, ACM Press, 1999.
-
(1999)
Proc. ACM SIGIR
, pp. 50-57
-
-
Hofmann, T.1
-
7
-
-
14644416738
-
The IBM semantic concept detection framework
-
A. Amir, G. lyengar, C.-Y. Lin, C. Dorai, M. Naphade, A. Natsev, C. Neti, H. Nock, I. Sachdev, J. Smith, Y. Wu, B. Tseng, and D. Zhang, "The IBM semantic concept detection framework," in TRECVID Workshop, 2003.
-
(2003)
TRECVID Workshop
-
-
Amir, A.1
Lyengar, G.2
Lin, C.-Y.3
Dorai, C.4
Naphade, M.5
Natsev, A.6
Neti, C.7
Nock, H.8
Sachdev, I.9
Smith, J.10
Wu, Y.11
Tseng, B.12
Zhang, D.13
-
8
-
-
8844259704
-
Discovery and fusion of salient multi-modal features towards news story segmentation
-
January
-
W. H.-M. Hsu, S.-F. Chang, C.-W. Huang, L. Kennedy, C.-Y. Lin, and G. lyengar, "Discovery and fusion of salient multi-modal features towards news story segmentation," in SPIE Electronic Imaging, January 2004.
-
(2004)
SPIE Electronic Imaging
-
-
Hsu, W.H.-M.1
Chang, S.-F.2
Huang, C.-W.3
Kennedy, L.4
Lin, C.-Y.5
Lyengar, G.6
-
9
-
-
79959829262
-
Generation of sports highlights using a combination of supervised and unsupervised learning in audio domain
-
R. Radhakrishan, Z. Xiong, A. Divakaran, and Y. Ishikawa, "Generation of sports highlights using a combination of supervised and unsupervised learning in audio domain," in Proc. Pacific Rim Conference on Multimedia, 2003.
-
(2003)
Proc. Pacific Rim Conference on Multimedia
-
-
Radhakrishan, R.1
Xiong, Z.2
Divakaran, A.3
Ishikawa, Y.4
-
10
-
-
8844236394
-
-
ch. 10. Kluwer Academic Publishers
-
L. Xie, S.-F. Chang, A. Divakaran, and H. Sun, Unsupervised Mining of Statistical Temporal Structures in Video,ch. 10. Kluwer Academic Publishers, 2003.
-
(2003)
Unsupervised Mining of Statistical Temporal Structures in Video
-
-
Xie, L.1
Chang, S.-F.2
Divakaran, A.3
Sun, H.4
-
11
-
-
20444449726
-
Discover meaningful multimedia patterns with audio-visual concepts and associated text
-
October
-
L. Xie, L. Kennedy, S.-F. Chang, A. Divakaran, H. Sun, and C.-Y. Lin, "Discover meaningful multimedia patterns with audio-visual concepts and associated text," in Int. Conf. Image Processing (ICIP), October 2004.
-
(2004)
Int. Conf. Image Processing (ICIP)
-
-
Xie, L.1
Kennedy, L.2
Chang, S.-F.3
Divakaran, A.4
Sun, H.5
Lin, C.-Y.6
|