-
1
-
-
0030389403
-
VisualSEEk: A fully automated content-based image query system
-
ACM, Boston, Mass, USA, November
-
J. R. Smith and S.-F. Chang, "VisualSEEk: a fully automated content-based image query system," in Proc. 4th ACM International Conference on Multimedia, pp. 87-98, ACM, Boston, Mass, USA, November 1996.
-
(1996)
Proc. 4th ACM International Conference on Multimedia
, pp. 87-98
-
-
Smith, J.R.1
Chang, S.-F.2
-
2
-
-
0032318109
-
Probabilistic multimedia objects (multijects): A novel approach to video indexing and retrieval in multimedia systems
-
IEEE, Chicago, Ill, USA, October
-
M. Naphade, T. Kristjansson, B. Frey, and T. S. Huang, "Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems," in Proc. IEEE International Conference on Image Processing, vol. 3, pp. 536-540, IEEE, Chicago, Ill, USA, October 1998.
-
(1998)
Proc. IEEE International Conference on Image Processing
, vol.3
, pp. 536-540
-
-
Naphade, M.1
Kristjansson, T.2
Frey, B.3
Huang, T.S.4
-
3
-
-
0032314489
-
Semantic visual templates - Linking features to semantics
-
IEEE, Chicago, Ill, USA, October
-
S. F. Chang, W. Chen, and H. Sundaram, "Semantic visual templates - linking features to semantics," in Proc. IEEE International Conference on Image Processing, vol. 3, pp. 531-535, IEEE, Chicago, Ill, USA, October 1998.
-
(1998)
Proc. IEEE International Conference on Image Processing
, vol.3
, pp. 531-535
-
-
Chang, S.F.1
Chen, W.2
Sundaram, H.3
-
4
-
-
0032666227
-
A computational approach to semantic event detection
-
IEEE, Fort Collins, Colo, USA, June
-
R. Qian, N. Haering, and I. Sezan, "A computational approach to semantic event detection," in Proc. Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 200-206, IEEE, Fort Collins, Colo, USA, June 1999.
-
(1999)
Proc. Conference on Computer Vision and Pattern Recognition
, vol.1
, pp. 200-206
-
-
Qian, R.1
Haering, N.2
Sezan, I.3
-
5
-
-
0033897763
-
An integrated approach to multimodal media content analysis
-
Storage and Retrieval for Media Databases 2000, SPIE, San Jose, Calif, USA, January
-
T. Zhang and C. Kuo, "An integrated approach to multimodal media content analysis," in Storage and Retrieval for Media Databases 2000, vol. 3972 of SPIE Proceedings, pp. 506-517, SPIE, San Jose, Calif, USA, January 2000.
-
(2000)
SPIE Proceedings
, vol.3972
, pp. 506-517
-
-
Zhang, T.1
Kuo, C.2
-
6
-
-
0003794341
-
-
Ph.D. thesis, MIT Department of Electrical Engineering and Computer Science, Cambridge, Mass, USA
-
D. Ellis, Prediction-driven computational auditory scene analysis, Ph.D. thesis, MIT Department of Electrical Engineering and Computer Science, Cambridge, Mass, USA, 1996.
-
(1996)
Prediction-Driven Computational Auditory Scene Analysis
-
-
Ellis, D.1
-
7
-
-
0034857154
-
Learning the semantics of words and pictures
-
IEEE, Vancouver, Canada, July
-
K. Barnard and D. Forsyth, "Learning the semantics of words and pictures," in Proc. International Conf. on Computer Vision, vol. 2, pp. 408-415, IEEE, Vancouver, Canada, July 2001.
-
(2001)
Proc. International Conf. on Computer Vision
, vol.2
, pp. 408-415
-
-
Barnard, K.1
Forsyth, D.2
-
8
-
-
0012532765
-
Reduced-rank spectra and minimum-entropy priors as consistent and reliable cues for generalized sound recognition
-
Aalborg, Denmark, September
-
M. A. Casey, "Reduced-rank spectra and minimum-entropy priors as consistent and reliable cues for generalized sound recognition," in Proc. Eurospeech, Aalborg, Denmark, September 2001.
-
(2001)
Proc. Eurospeech
-
-
Casey, M.A.1
-
9
-
-
0030705367
-
Hidden Markov model parsing of video programs
-
IEEE, Munich, Germany, April
-
W. Wolf, "Hidden Markov model parsing of video programs," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 4, pp. 2609-2611, IEEE, Munich, Germany, April 1997.
-
(1997)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.4
, pp. 2609-2611
-
-
Wolf, W.1
-
10
-
-
0032223813
-
Models for automatic classification of video sequences
-
Storage and Retrieval for Image and Video Databases VI, SPIE, San Jose, Calif, USA, January
-
G. Iyengar and A. B. Lippman, "Models for automatic classification of video sequences," in Storage and Retrieval for Image and Video Databases VI, vol. 3312 of SPIE Proceedings, pp. 216-227, SPIE, San Jose, Calif, USA, January 1998.
-
(1998)
SPIE Proceedings
, vol.3312
, pp. 216-227
-
-
Iyengar, G.1
Lippman, A.B.2
-
11
-
-
0033316687
-
Probabilistic analysis and extraction of video content
-
IEEE, Kobe, Japan, October
-
A. M. Ferman and A. M. Tekalp, "Probabilistic analysis and extraction of video content," in Proc. IEEE International Conference on Image Processing, vol. 2, pp. 91-95, IEEE, Kobe, Japan, October 1999.
-
(1999)
Proc. IEEE International Conference on Image Processing
, vol.2
, pp. 91-95
-
-
Ferman, A.M.1
Tekalp, A.M.2
-
12
-
-
0032311673
-
Bayesian modeling of video editing and structure: Semantic features for video summarization and browsing
-
IEEE, Chicago, Ill, USA
-
N. Vasconcelos and A. Lippman, "Bayesian modeling of video editing and structure: semantic features for video summarization and browsing," in Proc. IEEE International Conference on Image Processing, vol. 3, pp. 153-157, IEEE, Chicago, Ill, USA, 1998.
-
(1998)
Proc. IEEE International Conference on Image Processing
, vol.3
, pp. 153-157
-
-
Vasconcelos, N.1
Lippman, A.2
-
13
-
-
0030648077
-
Construction and evaluation of a robust multifeature speech/music discriminator
-
IEEE, Munich, Germany, April
-
E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, pp. 1331-1334, IEEE, Munich, Germany, April 1997.
-
(1997)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 1331-1334
-
-
Scheirer, E.1
Slaney, M.2
-
14
-
-
0034509230
-
Towards automatic extraction of expressive elements from motion pictures: Tempo
-
IEEE, New York, NY, USA, July
-
B. Adams, C. Dorai, and S. Venkatesh, "Towards automatic extraction of expressive elements from motion pictures: Tempo," in Proc. IEEE International Conference on Multimedia and Expo, vol. II, pp. 641-645, IEEE, New York, NY, USA, July 2000.
-
(2000)
Proc. IEEE International Conference on Multimedia and Expo
, vol.2
, pp. 641-645
-
-
Adams, B.1
Dorai, C.2
Venkatesh, S.3
-
15
-
-
0012583743
-
The 10th text retrieval conference (TREC 2001)
-
NIST, Gaithersburg, Md, USA
-
E. M. Voorhees and D. K. Harman, Eds., The 10th Text REtrieval Conference (TREC 2001), vol. 500-250 of NIST Special Publication, NIST, Gaithersburg, Md, USA, 2001.
-
(2001)
NIST Special Publication
, vol.500
, Issue.250
-
-
Voorhees, E.M.1
Harman, D.K.2
-
16
-
-
0001839555
-
Media Streams: An iconic visual language for video annotation
-
M. Davis, "Media Streams: an iconic visual language for video annotation," Telektronikk, vol. 89, no. 4, pp. 59-71, 1993.
-
(1993)
Telektronikk
, vol.89
, Issue.4
, pp. 59-71
-
-
Davis, M.1
-
17
-
-
0003322357
-
Audio-visual speech recognition
-
The Johns Hopkins University, Baltimore, Md, USA, October
-
C. Neti, G. Potamianos, J. Luettin, et al., "Audio-visual speech recognition," Final workshop 2000 report, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, Md, USA, October 2000.
-
(2000)
Final Workshop 2000 Report, Center for Language and Speech Processing
-
-
Neti, C.1
Potamianos, G.2
Luettin, J.3
-
18
-
-
84931854786
-
Speaker change detection using joint audio-visual statistics
-
Paris, France, April
-
G. Iyengar and C. Neti, "Speaker change detection using joint audio-visual statistics," in Proc. RIAO, Paris, France, April 2000.
-
(2000)
Proc. RIAO
-
-
Iyengar, G.1
Neti, C.2
-
20
-
-
0004244302
-
-
Prentice-Hall, Englewood Cliffs, NJ, USA, 1st edition
-
L. R. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, Englewood Cliffs, NJ, USA, 1st edition, 1993.
-
(1993)
Fundamentals of Speech Recognition
-
-
Rabiner, L.R.1
Juang, B.-H.2
-
21
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," Journal of Royal Statistical Society, Series B, vol. 39, no. 1, pp. 1-38, 1977.
-
(1977)
Journal of Royal Statistical Society, Series B
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
22
-
-
0003450542
-
-
Springer-Verlag, New York, NY, USA
-
V. Vapnik, The Nature of Statistical Learning Theory, Springer-Verlag, New York, NY, USA, 1995.
-
(1995)
The Nature of Statistical Learning Theory
-
-
Vapnik, V.1
-
23
-
-
17344389852
-
Robust speech recognition in noisy environments: The IBM SPINE-2 evaluation system
-
Orlando, Fla, USA, May
-
B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust speech recognition in noisy environments: The IBM SPINE-2 evaluation system," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Orlando, Fla, USA, May 2002.
-
(2002)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
-
-
Kingsbury, B.1
Saon, G.2
Mangu, L.3
Padmanabhan, M.4
Sarikaya, R.5
-
24
-
-
0012611082
-
TREC-6 Ad-hoc retrieval
-
Proc. 6th Text REtrieval Conference (TREC-6), NIST, Gaithersburg, Md, USA
-
M. Franz and S. Roukos, "TREC-6 Ad-hoc retrieval," in Proc. 6th Text REtrieval Conference (TREC-6), vol. 500-240 of NIST Special Publication, pp. 511-516, NIST, Gaithersburg, Md, USA, 1998.
-
(1998)
NIST Special Publication
, vol.500
, Issue.240
, pp. 511-516
-
-
Franz, M.1
Roukos, S.2
-
25
-
-
0004289791
-
-
MIT Press, Cambridge, Mass, USA
-
C. Fellbaum, Ed., WordNet: An Electronic Lexical Database, MIT Press, Cambridge, Mass, USA, 1998.
-
(1998)
WordNet: An Electronic Lexical Database
-
-
Fellbaum, C.1
-
26
-
-
0001319911
-
Okapi at TREC-3
-
The 3rd Text REtrieval Conference (TREC-3), NIST, Gaithersburg, Md, USA
-
S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, and M. Gatford, "Okapi at TREC-3," in The 3rd Text REtrieval Conference (TREC-3), vol. 500-225 of NIST Special Publication, pp. 109-126, NIST, Gaithersburg, Md, USA, 1995.
-
(1995)
NIST Special Publication
, vol.500
, Issue.225
, pp. 109-126
-
-
Robertson, S.E.1
Walker, S.2
Jones, S.3
Hancock-Beaulieu, M.M.4
Gatford, M.5
-
27
-
-
0012577594
-
-
IBM Almaden Research Center, "The IBM cuevideo project," 1997, www.almaden.ibm.com/projects/cuevideo.shtml.
-
(1997)
The IBM Cuevideo Project
-
-
-
28
-
-
0012529167
-
Integrating features, models, and semantics for TREC video retrieval
-
Proc. 10th Text REtrieval Conference (TREC 2001), NIST, Gaithersburg, Md, USA
-
J. R. Smith, S. Srinivasan, A. Amir, et al., "Integrating features, models, and semantics for TREC video retrieval," in Proc. 10th Text REtrieval Conference (TREC 2001), vol. 500-250 of NIST Special Publication, pp. 240-249, NIST, Gaithersburg, Md, USA, 2001.
-
(2001)
NIST Special Publication
, vol.500
, Issue.250
, pp. 240-249
-
-
Smith, J.R.1
Srinivasan, S.2
Amir, A.3
-
29
-
-
0032164964
-
Shape-based retrieval: A case study with trademark image databases
-
A. K. Jain and A. Vailaya, "Shape-based retrieval: A case study with trademark image databases," Pattern Recognition, vol. 31, no. 9, pp. 1369-1390, 1998.
-
(1998)
Pattern Recognition
, vol.31
, Issue.9
, pp. 1369-1390
-
-
Jain, A.K.1
Vailaya, A.2
-
30
-
-
0032598856
-
Query by video clip
-
A. K. Jain, A. Vailaya, and W. Xiong, "Query by video clip," Multimedia Systems: Special Issue on Video Libraries, vol. 7, no. 5, pp. 369-384, 1999.
-
(1999)
Multimedia Systems: Special Issue on Video Libraries
, vol.7
, Issue.5
, pp. 369-384
-
-
Jain, A.K.1
Vailaya, A.2
Xiong, W.3
-
31
-
-
0017442627
-
Aircraft identification by moment invariants
-
S. Dudani, K. Breeding, and R. McGhee, "Aircraft identification by moment invariants," IEEE Trans. on Computers, vol. 26, no. 1, pp. 39-45, 1977.
-
(1977)
IEEE Trans. on Computers
, vol.26
, Issue.1
, pp. 39-45
-
-
Dudani, S.1
Breeding, K.2
McGhee, R.3
-
32
-
-
0030706661
-
Transcription of broadcast news - System robustness issues and adaptation techniques
-
IEEE, Munich, Germany, April
-
R. Bakis, S. Chen, P. Gopalakrishnan, R. Gopinath, S. Maes, and L. Polymenakos, "Transcription of broadcast news - system robustness issues and adaptation techniques," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, pp. 711-714, IEEE, Munich, Germany, April 1997.
-
(1997)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 711-714
-
-
Bakis, R.1
Chen, S.2
Gopalakrishnan, P.3
Gopinath, R.4
Maes, S.5
Polymenakos, L.6