-
1
-
-
0030389403
-
VisualSEEk: A Fully Automated Content-Based Image Query System
-
Boston, MA, Nov
-
J. R. Smith and S. F. Chang, VisualSEEk: A Fully Automated Content-Based Image Query System, in Proceedings of ACM Multimedia, Boston, MA, Nov. 1996.
-
(1996)
Proceedings of ACM Multimedia
-
-
Smith, J.R.1
Chang, S.F.2
-
2
-
-
0031341848
-
Spatio-Temporal Video Search Using the Object-based Video Representation
-
Santa Barbara, CA, Oct
-
D. Zhong and S. F. Chang, Spatio-Temporal Video Search Using the Object-based Video Representation, in Proceedings of IEEE International Conference on Image Processing, Santa Barbara, CA, Oct. 1997, Vol. 1, pp. 21-24.
-
(1997)
Proceedings of IEEE International Conference on Image Processing
, vol.1
, pp. 21-24
-
-
Zhong, D.1
Chang, S.F.2
-
3
-
-
0031383542
-
Content Based Search of Video Using Color, Texture and Motion
-
Santa Barbara, CA, Oct
-
Y. Deng and B. S. Manjunath, Content Based Search of Video Using Color, Texture and Motion, in Proceedings of IEEE International Conference on Image Processing, Santa Barbara, CA, Oct. 1997, Vol. 2, pp. 534-537.
-
(1997)
Proceedings of IEEE International Conference on Image Processing
, vol.2
, pp. 534-537
-
-
Deng, Y.1
Manjunath, B.S.2
-
4
-
-
0031381333
-
Content-based Video Retrieval and Compression: A Unified Solution
-
Santa Barbara, CA, Oct
-
H. Zhang, A. Wang, and Y. Altunbasak, Content-based Video Retrieval and Compression: A Unified Solution, in Proceedings of IEEE International Conference on Image Processing, Santa Barbara, CA, Oct. 1997, Vol. 1, pp. 13-16.
-
(1997)
Proceedings of IEEE International Conference on Image Processing
, vol.1
, pp. 13-16
-
-
Zhang, H.1
Wang, A.2
Altunbasak, Y.3
-
5
-
-
0029780234
-
Efficient Matching and Clustering of Video Shots
-
Washington, D.C., Oct
-
M. M. Yeung and B. Liu, Efficient Matching and Clustering of Video Shots, in Proceedings of IEEE International Conference on Image Processing, Washington, D.C., Oct. 1995, Vol. 1, pp. 338-341.
-
(1995)
Proceedings of IEEE International Conference on Image Processing
, vol.1
, pp. 338-341
-
-
Yeung, M.M.1
Liu, B.2
-
6
-
-
0033883721
-
A Novel Scheme for Fast and Efficient Video Sequence Matching Using Compact Signatures
-
Jan
-
M. R. Naphade, M. M. Yeung, and B. L. Yeo, A Novel Scheme for Fast and Efficient Video Sequence Matching Using Compact Signatures, in Proceedings of SPIE Storage and Retrieval for Multimedia Databases, Jan. 2000, Vol. 3972, pp. 564-572.
-
(2000)
Proceedings of SPIE Storage and Retrieval for Multimedia Databases
, vol.3972
, pp. 564-572
-
-
Naphade, M.R.1
Yeung, M.M.2
Yeo, B.L.3
-
8
-
-
0003794341
-
-
Ph.D. Thesis, MIT, Cambridge, MA
-
D. Ellis, Prediction-driven Computational Auditory Scene Analysis, Ph.D. Thesis, MIT, Cambridge, MA, 1996.
-
(1996)
Prediction-Driven Computational Auditory Scene Analysis
-
-
Ellis, D.1
-
9
-
-
0032115209
-
Video Handling with Music and Speech Detection
-
M. Akutsu, A. Hamada, and Y. Tonomura, Video Handling with Music and Speech Detection, IEEE Multimedia, Vol. 5, No. 3, pp. 17-25, 1998.
-
(1998)
IEEE Multimedia
, vol.5
, Issue.3
, pp. 17-25
-
-
Akutsu, M.1
Hamada, A.2
Tonomura, Y.3
-
10
-
-
0033327198
-
Learning to Recognize Speech by Watching Television
-
P. Jang and A. Hauptmann, Learning to Recognize Speech by Watching Television, IEEE Intelligent Systems Magazine, Vol. 14, No. 5, pp. 51-58, 1999.
-
(1999)
IEEE Intelligent Systems Magazine
, vol.14
, Issue.5
, pp. 51-58
-
-
Jang, P.1
Hauptmann, A.2
-
11
-
-
0030242072
-
Content-based Classification, Search, and Retrieval of Audio
-
E. Wold, T. Blum, D. Keislar, and J. Wheaton, Content-based Classification, Search, and Retrieval of Audio, IEEE Multimedia, Vol. 3, No. 3, pp. 27-36, 1996.
-
(1996)
IEEE Multimedia
, vol.3
, Issue.3
, pp. 27-36
-
-
Wold, E.1
Blum, T.2
Keislar, D.3
Wheaton, J.4
-
12
-
-
0033897763
-
An Integrated Approach to Multimodal Media Content Analysis
-
Jan
-
T. Zhang and C. Kuo, An Integrated Approach to Multimodal Media Content Analysis, in Proceedings of SPIE, IS&T Storage and Retrieval for Media Databases, Jan. 2000, Vol. 3972, pp. 506-517.
-
(2000)
Proceedings of SPIE, IS&T Storage and Retrieval for Media Databases
, vol.3972
, pp. 506-517
-
-
Zhang, T.1
Kuo, C.2
-
13
-
-
0032318109
-
Probabilistic Multimedia Objects (Multijects): A Novel Approach to Indexing and Retrieval in Multimedia Systems
-
Chicago, IL, Oct
-
M. Naphade, T. Kristjansson, B. Frey, and T. S. Huang, Probabilistic Multimedia Objects (Multijects): A Novel Approach to Indexing and Retrieval in Multimedia Systems, in Proceedings of IEEE International Conference on Image Processing, Chicago, IL, Oct. 1998, Vol. 3, pp. 536-540.
-
(1998)
Proceedings of IEEE International Conference on Image Processing
, vol.3
, pp. 536-540
-
-
Naphade, M.1
Kristjansson, T.2
Frey, B.3
Huang, T.S.4
-
14
-
-
0035055957
-
Multimodal Pattern Matching for Audio-Visual Query and Retrieval
-
Jan
-
M. R. Naphade, R. Wang, and T. S. Huang, Multimodal Pattern Matching for Audio-Visual Query and Retrieval, in Proceedings of SPIE, Storage and Retrieval for Media Databases, Jan. 2001, Vol. 4315, pp. 188-195.
-
(2001)
Proceedings of SPIE, Storage and Retrieval for Media Databases
, vol.4315
, pp. 188-195
-
-
Naphade, M.R.1
Wang, R.2
Huang, T.S.3
-
15
-
-
0029513797
-
Rapid Scene Change Detection on Compressed Video
-
Dec
-
B. L. Yeo and B. Liu, Rapid Scene Change Detection on Compressed Video, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 5, No. 6, pp. 533-544, Dec. 1995.
-
(1995)
IEEE Transactions on Circuits and Systems for Video Technology
, vol.5
, Issue.6
, pp. 533-544
-
-
Yeo, B.L.1
Liu, B.2
-
16
-
-
84947120161
-
Scene Change Detection in a MPEG Compressed Video Sequence
-
San Jose, CA, Feb
-
J. Meng, Y. Juan, and S. F. Chang, Scene Change Detection in a MPEG Compressed Video Sequence, in Proceedings of the SPIE Symposium, San Jose, CA, Feb. 1995, Vol. 2419, pp. 1-11.
-
(1995)
Proceedings of the SPIE Symposium
, vol.2419
, pp. 1-11
-
-
Meng, J.1
Juan, Y.2
Chang, S.F.3
-
17
-
-
85076589825
-
Video Parsing Using Compressed Data
-
San Jose, CA
-
H. J. Zhang, C. Y. Low, and S. Smoliar, Video Parsing Using Compressed Data, in Proceedings of SPIE Conference on Image and Video Processing 2, San Jose, CA, 1994, pp. 142-149.
-
(1994)
Proceedings of SPIE Conference on Image and Video Processing
, vol.2
, pp. 142-149
-
-
Zhang, H.J.1
Low, C.Y.2
Smoliar, S.3
-
18
-
-
0032306765
-
A High Performance Shot Boundary Detection Algorithm Using Multiple Cues
-
Chicago, IL, Oct
-
M. Naphade, R. Mehrotra, A. M. Ferman, J. Warnick, T. S. Huang, and A. M. Tekalp, A High Performance Shot Boundary Detection Algorithm Using Multiple Cues, in Proceedings of IEEE International Conference on Image Processing, Chicago, IL, Oct. 1998, Vol. 2, pp. 884-887.
-
(1998)
Proceedings of IEEE International Conference on Image Processing
, vol.2
, pp. 884-887
-
-
Naphade, M.1
Mehrotra, R.2
Ferman, A.M.3
Warnick, J.4
Huang, T.S.5
Tekalp, A.M.6
-
19
-
-
0032314489
-
Semantic Visual Templates-Linking Features to Semantics
-
Chicago, IL, Oct
-
S. F. Chang, W. Chen, and H. Sundaram, Semantic Visual Templates-Linking Features to Semantics, in Proceedings of IEEE International Conference on Image Processing, Chicago, IL, Oct. 1998, Vol. 3, pp. 531-535.
-
(1998)
Proceedings of IEEE International Conference on Image Processing
, vol.3
, pp. 531-535
-
-
Chang, S.F.1
Chen, W.2
Sundaram, H.3
-
20
-
-
0032666227
-
A Computational Approach to Semantic Event Detection
-
Fort Collins, CO, June
-
R. Qian, N. Haering, and I. Sezan, A Computational Approach to Semantic Event Detection, in Proceedings of Computer Vision and Pattern Recognition, Fort Collins, CO, June 1999, Vol. 1, pp. 200-206.
-
(1999)
Proceedings of Computer Vision and Pattern Recognition
, vol.1
, pp. 200-206
-
-
Qian, R.1
Haering, N.2
Sezan, I.3
-
23
-
-
0001688787
-
Bayesian Modeling of Video Editing and Structure: Semantic Features for Video Summarization and Browsing
-
Chicago, IL, Oct
-
N. Vasconcelos and A. Lippman, Bayesian Modeling of Video Editing and Structure: Semantic Features for Video Summarization and Browsing, in Proceedings of IEEE International Conference on Image Processing, Chicago, IL, Oct. 1998, Vol. 2, pp. 550-555.
-
(1998)
Proceedings of IEEE International Conference on Image Processing
, vol.2
, pp. 550-555
-
-
Vasconcelos, N.1
Lippman, A.2
-
24
-
-
0035281949
-
A Probabilistic Framework for Semantic Video Indexing, Filtering and Retrieval
-
March
-
M. R. Naphade and T. S. Huang, A Probabilistic Framework for Semantic Video Indexing, Filtering and Retrieval, IEEE Transactions on Multimedia, Special issue on Multimedia over IP, Vol. 3, No. 1, pp. 141-151, March 2001.
-
(2001)
IEEE Transactions on Multimedia, Special Issue on Multimedia over IP
, vol.3
, Issue.1
, pp. 141-151
-
-
Naphade, M.R.1
Huang, T.S.2
-
26
-
-
0024610919
-
A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition
-
Feb
-
L. R. Rabiner, A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition, Proceedings IEEE, Vol. 77, No. 2, pp. 257-286, Feb. 1989.
-
(1989)
Proceedings IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
27
-
-
0030685285
-
Coupled Hidden Markov Models for Complex Action Recognition
-
M. Brand, N. Oliver, and A. Pentland, Coupled Hidden Markov Models for Complex Action Recognition, in Proceedings of Computer Vision and Pattern Recognition, 1997, pp. 994-999.
-
(1997)
Proceedings of Computer Vision and Pattern Recognition
, pp. 994-999
-
-
Brand, M.1
Oliver, N.2
Pentland, A.3
-
28
-
-
0031268341
-
Factorial Hidden Markov Models
-
Z. Ghahramani and M. Jordan, Factorial Hidden Markov Models, Machine Learning, Vol. 29, pp. 245-273, 1997.
-
(1997)
Machine Learning
, vol.29
, pp. 245-273
-
-
Ghahramani, Z.1
Jordan, M.2
-
29
-
-
0033898334
-
Identifying Sports Video Using Replay, Text and Camera Motion Features
-
Jan
-
V. Kobla, D. DeMenthon, and D. Doermann, Identifying Sports Video Using Replay, Text and Camera Motion Features, in Proceedings of SPIE Storage and Retrieval for Media Databases, Jan. 2000, Vol. 3972, pp. 332-343.
-
(2000)
Proceedings of SPIE Storage and Retrieval for Media Databases
, vol.3972
, pp. 332-343
-
-
Kobla, V.1
DeMenthon, D.2
Doermann, D.3
-
31
-
-
0032181880
-
Audio Feature Extraction and Analysis for Scene Segmentation and Classification
-
Oct
-
Z. Liu, Y. Wang, and T. Chen, Audio Feature Extraction and Analysis for Scene Segmentation and Classification, VLSI Signal Processing Systems for Signal, Image and Video Technology, Vol. 20, pp. 61-79, Oct. 1998.
-
(1998)
VLSI Signal Processing Systems for Signal, Image and Video Technology
, vol.20
, pp. 61-79
-
-
Liu, Z.1
Wang, Y.2
Chen, T.3
-
32
-
-
0030648077
-
Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator
-
Munich, Germany
-
E. Scheirer and M. Slaney, Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator, in Proceedings of IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Munich, Germany, 1997, Vol. 2, pp. 1331-1334.
-
(1997)
Proceedings of IEEE Intl. Conf. on Acoustics, Speech and Signal Processing
, vol.2
, pp. 1331-1334
-
-
Scheirer, E.1
Slaney, M.2
-
33
-
-
0031374433
-
Speaker Identification and Video Analysis for Hierarchical Video Shot Classification
-
Santa Barbara, CA, Oct
-
J. Nam, A.E. Cetin, and A.H. Tewfik, Speaker Identification and Video Analysis for Hierarchical Video Shot Classification, in Proceedings of IEEE International Conference on Image Processing, Santa Barbara, CA, Oct. 1997, Vol. 2, pp. 550-555.
-
(1997)
Proceedings of IEEE International Conference on Image Processing
, vol.2
, pp. 550-555
-
-
Nam, J.1
Cetin, A.E.2
Tewfik, A.H.3
-
34
-
-
85032751556
-
Multimedia Content Analysis Using Audio and Visual Information
-
Nov
-
Y. Wang, Z. Liu, and J. Huang, Multimedia Content Analysis Using Audio and Visual Information, IEEE Signal Processing Magazine, Vol. 17, No. 6, pp. 12-36, Nov. 2000.
-
(2000)
IEEE Signal Processing Magazine
, vol.17
, Issue.6
, pp. 12-36
-
-
Wang, Y.1
Liu, Z.2
Huang, J.3
-
35
-
-
0030151506
-
Intelligent Access to Digital Video: The Informedia Project
-
May
-
H. Wactlar, T. Kanade, M. Smith, and S. Stevens, Intelligent Access to Digital Video: The Informedia Project, IEEE Computer Digital Library Initiative Special Issue, No. 5, May 1996.
-
(1996)
IEEE Computer Digital Library Initiative Special Issue
, Issue.5
-
-
Wactlar, H.1
Kanade, T.2
Smith, M.3
Stevens, S.4
-
37
-
-
85076098160
-
Automated Analysis and Annotation of Basketball Video
-
D. D. Saur, Y. P. Tan, S. R. Kulkarni, and P. J. Ramadge, Automated Analysis and Annotation of Basketball Video, in Proceedings of SPIE Symposium, 1997, Vol. 3022, pp. 176-187.
-
(1997)
Proceedings of SPIE Symposium
, vol.3022
, pp. 176-187
-
-
Saur, D.D.1
Tan, Y.P.2
Kulkarni, S.R.3
Ramadge, P.J.4
-
39
-
-
0032639979
-
Vision-based Speaker Detection Using Bayesian Networks
-
Fort Collins, CO, June
-
J. Rehg, K. Murphy, and P. Fieguth, Vision-based Speaker Detection Using Bayesian Networks, in Proceedings of Computer Vision and Pattern Recognition, Fort Collins, CO, June 1999, Vol. 2, pp. 110-116.
-
(1999)
Proceedings of Computer Vision and Pattern Recognition
, vol.2
, pp. 110-116
-
-
Rehg, J.1
Murphy, K.2
Fieguth, P.3
-
41
-
-
84908294933
-
Duration Dependent Input Output Markov Models for Audio-Visual Event Detection
-
Tokyo, Japan
-
M. R. Naphade, A. Garg, and T. S. Huang, Duration Dependent Input Output Markov Models for Audio-Visual Event Detection, submitted to IEEE International Conference on Multimedia and Expo, Tokyo, Japan, 2001.
-
(2001)
IEEE International Conference on Multimedia and Expo
-
-
Naphade, M.R.1
Garg, A.2
Huang, T.S.3
-
42
-
-
84905395655
-
Integrated Audio/Visual Speaker Detection Using Dynamic Bayesian Networks
-
March
-
A. Garg, V. Pavlovic, J. Rehg, and T. S. Huang, Integrated Audio/Visual Speaker Detection Using Dynamic Bayesian Networks, in Proceedings of IEEE Conference on Automatic Face and Gesture Recognition, March 2000.
-
(2000)
Proceedings of IEEE Conference on Automatic Face and Gesture Recognition
-
-
Garg, A.1
Pavlovic, V.2
Rehg, J.3
Huang, T.S.4
-
43
-
-
0032074310
-
Audio-Visual Integration in Multimodal Communication
-
T. Chen and R. Rao, Audio-Visual Integration in Multimodal Communication, IEEE Proceedings, Vol. 86, No. 5, pp. 837-852, 1998.
-
(1998)
IEEE Proceedings
, vol.86
, Issue.5
, pp. 837-852
-
-
Chen, T.1
Rao, R.2
-
46
-
-
0000342467
-
Statistical Inference for Probabilistic Functions of Finite State Markov Chains
-
L. E. Baum and T. Petrie, Statistical Inference for Probabilistic Functions of Finite State Markov Chains, Annals of Mathematical Statistics, Vol. 37, pp. 1559-1563, 1966.
-
(1966)
Annals of Mathematical Statistics
, vol.37
, pp. 1559-1563
-
-
Baum, L.E.1
Petrie, T.2
-
47
-
-
0030242097
-
Input/Output HMMs for Sequence Processing
-
Y. Bengio and P. Frasconi, Input/Output HMMs for Sequence Processing, IEEE Transactions on Neural Networks, Vol. 7, No. 5, pp. 1231-1249, 1996.
-
(1996)
IEEE Transactions on Neural Networks
, vol.7
, Issue.5
, pp. 1231-1249
-
-
Bengio, Y.1
Frasconi, P.2
-
48
-
-
84978308497
-
Modeling State Durations in Hidden Markov Models for Automatic Speech Recognition
-
Mar
-
P. Ramesh and J. Wilpon, Modeling State Durations in Hidden Markov Models for Automatic Speech Recognition, in Proceedings of International Conference on Acoustics, Speech and Signal Processing, Mar. 1992, Vol. 1, pp. 381-384.
-
(1992)
Proceedings of International Conference on Acoustics, Speech and Signal Processing
, vol.1
, pp. 381-384
-
-
Ramesh, P.1
Wilpon, J.2
-
49
-
-
33750919959
-
Semantic Video Indexing Using a Probabilistic Framework
-
Barcelona, Spain, Sep
-
M. R. Naphade and T. S. Huang, Semantic Video Indexing Using a Probabilistic Framework, in Proceedings of IAPR International Conference on Pattern Recognition, Barcelona, Spain, Sep. 2000, Vol. 3, pp. 83-88.
-
(2000)
Proceedings of IAPR International Conference on Pattern Recognition
, vol.3
, pp. 83-88
-
-
Naphade, M.R.1
Huang, T.S.2