-
1
-
-
0031619139
-
A hidden Markov model framework for video segmentation using audio and image features
-
Seattle, WA, May 12-15
-
J. S. Boreczky and L. D. Wilcox, "A hidden Markov model framework for video segmentation using audio and image features," in Proc. ICASSP'1998, Seattle, WA, May 12-15, 1998, vol. 6, pp. 3741-3744.
-
(1998)
Proc. ICASSP'1998
, vol.6
, pp. 3741-3744
-
-
Boreczky, J.S.1
Wilcox, L.D.2
-
2
-
-
0029304865
-
Human and machine recognition of faces: A survey
-
May
-
R. Chellappa, C. L. Wilson, and S. Sirohey, "Human and machine recognition of faces: A survey," Proc. IEEE, vol. 83, no. 5, pp. 705-741, May 1995.
-
(1995)
Proc. IEEE
, vol.83
, Issue.5
, pp. 705-741
-
-
Chellappa, R.1
Wilson, C.L.2
Sirohey, S.3
-
4
-
-
0037860595
-
Look who's talking: Speaker detection using video and audio corrlation
-
New York, Nov
-
R. Cutler and L. Davis, "Look who's talking: Speaker detection using video and audio corrlation," in Proc. ICME, New York, Nov. 2000.
-
(2000)
Proc. ICME
-
-
Cutler, R.1
Davis, L.2
-
5
-
-
0029375609
-
Query by image and video content: The QBIC system
-
Sep
-
M. Flickner et al., "Query by image and video content: The QBIC system," IEEE Comput., vol. 28, no. 9, pp. 23-32, Sep. 1995.
-
(1995)
IEEE Comput
, vol.28
, Issue.9
, pp. 23-32
-
-
Flickner, M.1
-
6
-
-
0032296995
-
Efficient filtering and clustering methods for temporal video segmentation and visual summarization
-
Dec
-
A. Ferman and A. Tekalp, "Efficient filtering and clustering methods for temporal video segmentation and visual summarization," J. Vis. Comun. Image Repres., vol. 9, no. 4, pp. 336-351, Dec. 1998.
-
(1998)
J. Vis. Comun. Image Repres
, vol.9
, Issue.4
, pp. 336-351
-
-
Ferman, A.1
Tekalp, A.2
-
7
-
-
67649123507
-
Semantic indexing of multimedia using audio, text and visual cues
-
Lausanne, Switzerland
-
C. N. G. Iyengar, H. Nock, and M. Franz, "Semantic indexing of multimedia using audio, text and visual cues," in Proc. ICME, Lausanne, Switzerland, 2002, pp. 369-372.
-
(2002)
Proc. ICME
, pp. 369-372
-
-
Iyengar, C.N.G.1
Nock, H.2
Franz, M.3
-
8
-
-
0032320287
-
Integration of audio and visual information for content-based video segmentation
-
Chicago, IL, Oct. 4-7
-
J. Huang, Z. Liu, and Y. Wang, "Integration of audio and visual information for content-based video segmentation," in Proc. ICIP'1998, Chicago, IL, Oct. 4-7, 1998, vol. 3, pp. 526-530.
-
(1998)
Proc. ICIP'1998
, vol.3
, pp. 526-530
-
-
Huang, J.1
Liu, Z.2
Wang, Y.3
-
9
-
-
0009622481
-
Learning joint statistical models for audio-visual fusion and segregation
-
Denver, CO
-
W. F. J. Fisher, T. Darrell, and P. Viola, "Learning joint statistical models for audio-visual fusion and segregation," in Advances in Neural Information Processing Systems., Denver, CO, 2000.
-
(2000)
Advances in Neural Information Processing Systems
-
-
Fisher, W.F.J.1
Darrell, T.2
Viola, P.3
-
11
-
-
0032692096
-
Scene determination based on video and audio features
-
Jun. 7-11
-
R. Lienhart, S. Pfeiffer, and W. Effelsberg, "Scene determination based on video and audio features," in Proc. IEEE Int. Conf. Multimedia Computing and Systems, Jun. 7-11, 1999, vol. 1, pp. 685-690.
-
(1999)
Proc. IEEE Int. Conf. Multimedia Computing and Systems
, vol.1
, pp. 685-690
-
-
Lienhart, R.1
Pfeiffer, S.2
Effelsberg, W.3
-
12
-
-
0033690894
-
A new distance measure for probability distribution function of mixture type
-
Istanbul, Turkey, Jun. 5-9
-
Z. Liu and Q. Huang, "A new distance measure for probability distribution function of mixture type," in Proc. ICASSP '2000, Istanbul, Turkey, Jun. 5-9, 2000.
-
(2000)
Proc. ICASSP '2000
-
-
Liu, Z.1
Huang, Q.2
-
13
-
-
0034445531
-
Face detection and tracking in video using dynamic programming
-
Vancouver, BC, Canada, Sep. 10-13
-
Z. Liu and Y. Wang, "Face detection and tracking in video using dynamic programming," in Proc. ICIP'2000, Vancouver, BC, Canada, Sep. 10-13, 2000.
-
(2000)
Proc. ICIP'2000
-
-
Liu, Z.1
Wang, Y.2
-
14
-
-
0034841928
-
Major cast detection in video using both audio and visual information
-
Salt Lake City, UT, May 7-11
-
_, "Major cast detection in video using both audio and visual information," in Proc. ICASSP'2001, Salt Lake City, UT, May 7-11, 2001.
-
(2001)
Proc. ICASSP'2001
-
-
Liu, Z.1
Wang, Y.2
-
15
-
-
0032181880
-
Audio feature extraction and analysis for scene segmentation and classification
-
Oct
-
Z. Liu, Y. Wang, and T. Chen, "Audio feature extraction and analysis for scene segmentation and classification," J. VLSI Signal Process. Syst. for Signal, Image, and Video Technol., vol. 20, no. 1/2, pp. 61-79, Oct. 1998.
-
(1998)
J. VLSI Signal Process. Syst. for Signal, Image, and Video Technol
, vol.20
, Issue.1-2
, pp. 61-79
-
-
Liu, Z.1
Wang, Y.2
Chen, T.3
-
16
-
-
0031650491
-
Scene break detection: A comparison
-
Feb
-
G. Lupatini, C. Saraceno, and R. Leonardi, "Scene break detection: A comparison," in Proc. 8th Int. Workshop on Research Issues In Data Engineering, Feb. 1998, pp. 34-41.
-
(1998)
Proc. 8th Int. Workshop on Research Issues In Data Engineering
, pp. 34-41
-
-
Lupatini, G.1
Saraceno, C.2
Leonardi, R.3
-
17
-
-
0004171986
-
Available
-
Online
-
A. Martinez and R. Benavente, The AR Face Database 1998 [Online]. Available: http://rvl1.ecn.purdue.edu/~aleix/aleix_face_DB.html
-
(1998)
-
-
Martinez, A.1
Benavente, R.2
-
18
-
-
0036223025
-
Detecting faces in images: A survey
-
Jan
-
M. Y. N. Ahuja and D. Kriegman, "Detecting faces in images: A survey," IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 1, pp. 34-58, Jan. 2002.
-
(2002)
IEEE Trans. Pattern Anal. Mach. Intell
, vol.24
, Issue.1
, pp. 34-58
-
-
Ahuja, M.Y.N.1
Kriegman, D.2
-
19
-
-
0029209272
-
Robust text-independent speaker identification using Gaussian mixture speaker models
-
Jan
-
D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
-
(1995)
IEEE Trans. Speech Audio Process
, vol.3
, Issue.1
, pp. 72-83
-
-
Reynolds, D.A.1
Rose, R.C.2
-
20
-
-
0031672526
-
Neural network-baesd face detection
-
Jan
-
H. A. Rowley, S. Baluja, and T. Kanade, "Neural network-baesd face detection," IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, no. 1, pp. 22-38, Jan. 1998.
-
(1998)
IEEE Trans. Pattern Anal. Mach. Intell
, vol.20
, Issue.1
, pp. 22-38
-
-
Rowley, H.A.1
Baluja, S.2
Kanade, T.3
-
21
-
-
0032663356
-
Image retrieval: Current technologies, promising directions, and open issues
-
Mar
-
Y. Rui, T. S. Huang, and S.-F. Chang, "Image retrieval: Current technologies, promising directions, and open issues," J. Vis. Commun. Image Represen., vol. 10, no. 1, pp. 39-62, Mar. 1999.
-
(1999)
J. Vis. Commun. Image Represen
, vol.10
, Issue.1
, pp. 39-62
-
-
Rui, Y.1
Huang, T.S.2
Chang, S.-F.3
-
22
-
-
0026755044
-
Automatic recognition and analysis of human faces and facial expressions: A survey
-
A. Samal and P. A. Iyengar, "Automatic recognition and analysis of human faces and facial expressions: A survey," Pattern Recognit., vol. 25, no. 1, pp. 65-77, 1992.
-
(1992)
Pattern Recognit
, vol.25
, Issue.1
, pp. 65-77
-
-
Samal, A.1
Iyengar, P.A.2
-
23
-
-
0032306091
-
Identification of story units in audio-visual sequencies by joint audio and video processing
-
Chicago, IL, Oct. 4-7
-
C. Saraceno and R. Leonardi, "Identification of story units in audio-visual sequencies by joint audio and video processing," in Proc. ICIP'1998, Chicago, IL, Oct. 4-7, 1998, vol. 1, pp. 363-367.
-
(1998)
Proc. ICIP'1998
, vol.1
, pp. 363-367
-
-
Saraceno, C.1
Leonardi, R.2
-
24
-
-
0032660827
-
Name-it: Naming and detecting faces in news videos
-
Jan.-Mar
-
S. Satoh, Y. Nakamura, and T. Kanade, "Name-it: Naming and detecting faces in news videos," IEEE Multimedia Mag., vol. 6, no. 1, pp. 22-35, Jan.-Mar. 1999.
-
(1999)
IEEE Multimedia Mag
, vol.6
, Issue.1
, pp. 22-35
-
-
Satoh, S.1
Nakamura, Y.2
Kanade, T.3
-
25
-
-
0029765670
-
Real-time discrimination of broadcast speech/music
-
Atlanta, GA, May 7-10
-
J. Saunders, "Real-time discrimination of broadcast speech/music," in Proc. ICASSP'1996, Atlanta, GA, May 7-10, 1996, vol. 2, pp. 993-996.
-
(1996)
Proc. ICASSP'1996
, vol.2
, pp. 993-996
-
-
Saunders, J.1
-
27
-
-
0030242072
-
Content-based classification, search, and retrieval of audio
-
Sep
-
E. Wold, T. Blum, D. Keislar, and J. Wheaton, "Content-based classification, search, and retrieval of audio," IEEE Multimedia Mag., vol. 3, no. 3, pp. 27-36, Sep. 1996.
-
(1996)
IEEE Multimedia Mag
, vol.3
, Issue.3
, pp. 27-36
-
-
Wold, E.1
Blum, T.2
Keislar, D.3
Wheaton, J.4
-
28
-
-
34250082473
-
Automatic partioning of video
-
H. J. Zhang, A. Kankanhalli, and S. W. Smoliar, "Automatic partioning of video," Multimedia Syst., vol. 1, no. 1, pp. 10-28, 1993.
-
(1993)
Multimedia Syst
, vol.1
, Issue.1
, pp. 10-28
-
-
Zhang, H.J.1
Kankanhalli, A.2
Smoliar, S.W.3
-
29
-
-
0032629748
-
Hierarchical classification of audio data for archiving and retrieving
-
Phoenix, AZ, Mar. 15-19
-
T. Zhang and C.-C. J. Kuo, "Hierarchical classification of audio data for archiving and retrieving," in Proc. ICASSP'1999, Phoenix, AZ, Mar. 15-19, 1999, vol. 6, pp. 3001-3004.
-
(1999)
Proc. ICASSP'1999
, vol.6
, pp. 3001-3004
-
-
Zhang, T.1
Kuo, C.-C.J.2
|