SCOPUS 정보 검색 플랫폼

Proceedings of SPIE - The International Society for Optical Engineering

Volumn 3022, Issue , 1997, Pages 218-225

Video classification using speaker identification

(2) Patel, Nilesh V a Sethi, Ishwar K a

a Wayne State University (United States)

Author keywords

Audio classification; Cepstral coefficients; Nearest neighbor classification; Speaker identification; Video indexing

Indexed keywords

AUDIO ACOUSTICS; DATABASE SYSTEMS; INDEXING (OF INFORMATION); LOUDSPEAKERS; VIDEO CAMERAS; VIDEO RECORDING;

AUDIO CLASSIFICATION; CEPSTRAL COEFFICIENTS; NEAREST NEIGHBOR CLASSIFICATION; SPEAKER IDENTIFICATION; VIDEO INDEXING;

SPEECH RECOGNITION;

EID: 77954021581 PISSN: 0277786X EISSN: 1996756X Source Type: Conference Proceeding
DOI: 10.1117/12.263411 Document Type: Conference Paper

Times cited : (14)

References (30)

1
- 85076113879
- Indexes for use access to large video databases
- L. A. Rowe, J. S. Boreczky, and C. A. Eads, "Indexes for use access to large video databases," in SPIE Proccedings, 1993.
- (1993) SPIE Proccedings
- Rowe, L.A.¹ Boreczky, J.S.² Eads, C.A.³

2
- 85040089385
- Media stream: An iconic language for video annotation
- Norway
- M. Davis, "Media stream: An iconic language for video annotation," in IEEE symposium on visual languages, (Norway), pp. 196-202, 1993.
- (1993) IEEE Symposium on Visual Languages , pp. 196-202
- Davis, M.¹

3
- 0029463062
- Indexing in video databases
- (San Jose), February
- A. ilampapur, It. Jam, and T.E.Weymouth, "Indexing in video databases," in SPIE/IS&T Proceedings on Storage and Refrieval in Image and Video Databases, vol. 2420, (San Jose), pp. 292-306, February 1995.
- (1995) SPIE/IS&T Proceedings on Storage and Refrieval in Image and Video Databases , vol.2420 , pp. 292-306
- Ilampapur, A.¹ Jam, I.² Weymouth, T.E.³

4
- 0024866229
- Scene retrieval method for video database applications using temporal condition changes
- (Tokyo, Japan), April
- S. Abe, Y. Tonomura, and H. Kasahara, Scene retrieval method for video database applications using temporal condition changes," in Proc. Conf. Machine Intelligence and Vision, (Tokyo, Japan), pp. 355-359, April 1989.
- (1989) Proc. Conf. Machine Intelligence and Vision , pp. 355-359
- Abe, S.¹ Tonomura, Y.² Kasahara, H.³

5
- 0002063538
- Structured video computing
- August
- Y. Tonomura, A. Akutsu, Y. Toniguchi, and G. Suzuki, "Structured video computing," IEEE MuUimedza, vol. 1, pp. 34-44, August 1994.
- (1994) IEEE MuUimedza , vol.1 , pp. 34-44
- Tonomura, Y.¹ Akutsu, A.² Toniguchi, Y.³ Suzuki, G.⁴

6
- 0000112911
- Content oriented visual interface using video icons for visual database system
- Y. Tonormura and S. Abe, "Content oriented visual interface using video icons for visual database system," Journal of Visual Languages and Computing, no. 1, pp. 183-198, 1990.
- (1990) Journal of Visual Languages and Computing , Issue.1 , pp. 183-198
- Tonormura, Y.¹ Abe, S.²

7
- 0027561007
- Stored video handling techniques
- March
- Y. Tonomura, K. Otsuji, A. Akutsu, and Y. Ohba, "Stored video handling techniques," NTT Review, vol. 5, pp. 82-90, March 1993.
- (1993) NTT Review , vol.5 , pp. 82-90
- Tonomura, Y.¹ Otsuji, K.² Akutsu, A.³ Ohba, Y.⁴

8
- 34250082473
- Automatic partitioning of full-motion video
- H. Zhang, A. Kankanhalli, and S. Smoliar, "Automatic partitioning of full-motion video," Multimedia Systems, vol. 1, pp. 10-28, 1993.
- (1993) Multimedia Systems , vol.1 , pp. 10-28
- Zhang, H.¹ Kankanhalli, A.² Smoliar, S.³

9
- 0029461607
- Video query formulation
- (San Jose), February
- G. Ahanger, D. Benson, and T. Little, "Video query formulation," in SPIE/IS&T Proceed2ngs on Storage and Retrieval in Image and Video Databases, vol. 2420, (San Jose), pp. 280-291, February 1995.
- (1995) SPIE/IS&T Proceed2ngs on Storage and Retrieval in Image and Video Databases , vol.2420 , pp. 280-291
- Ahanger, G.¹ Benson, D.² Little, T.³

10
- 0003895738
- Tech. Rep. 278, M.I.T Media Lab
- S. Mann and R. W. Picard, '"video orbits': Charactorizing the cordinate transformation between two images using the projective group," Tech. Rep. 278, M.I.T Media Lab., 1995.
- (1995) 'Video Orbits': Charactorizing the Cordinate Transformation between Two Images Using the Projective Group
- Mann, S.¹ Picard, R.W.²

11
- 85076114863
- Compressed video processing for video segmentation
- To appear
- N. V. Patel and I. K. Sethi, "Compressed video processing for video segmentation," To appear IEE: Vision, Image and Signal Processing.
- IEE: Vision, Image and Signal Processing
- Patel, N.V.¹ Sethi, I.K.²

12
- 0029778685
- Audio characterization for video indexing
- (San Jose), February
- N. T. Patel and I. K. Sethi, "Audio Characterization for Video Indexing," in IS&T SPIE Proceedings:Sorage and Retrieval for Image and Video Databases IV, (San Jose), February 1996.
- (1996) IS&T SPIE Proceedings:Sorage and Retrieval for Image and Video Databases IV
- Patel, N.T.¹ Sethi, I.K.²

13
- 0029456574
- Query by humming: Musical information retrieval in audio database
- (San Fransisco), ACM Press, Nov.
- A. Ghias, J. Logan, and D. Chamberlin, "Query by humming: Musical information retrieval in audio database," in Proceedings of Mulimedia-95, (San Fransisco), pp. 231-237, ACM Press, Nov. 1995.
- (1995) Proceedings of Mulimedia-95 , pp. 231-237
- Ghias, A.¹ Logan, J.² Chamberlin, D.³

14
- 84944453945
- Automatic recognition of film geners
- (San Fransisco), ACM Press, Nov.
- S. Fisher, R. Lienhart, and W. Effelsburg, "Automatic recognition of film geners," in Proceedings of Multimedia-95, (San Fransisco), pp. 295-305, ACM Press, Nov. 1995.
- (1995) Proceedings of Multimedia-95 , pp. 295-305
- Fisher, S.¹ Lienhart, R.² Effelsburg, W.³

15
- 85076112176
- PhD thesis, MIT, Cambrdge
- S. MR., Speaker recognition and verifiaion using Linear Prediction Analysis. PhD thesis, MIT, Cambrdge, 1972.
- (1972) Speaker Recognition and Verifiaion Using Linear Prediction Analysis
- Mr, S.¹

16
- 0016067897
- Effectivness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. Atal, "Effectivness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," JASA, vol. 55, no. 6, pp. 1304-1312, 1974.
- (1974) JASA , vol.55 , Issue.6 , pp. 1304-1312
- Atal, B.¹

17
- 85135124348
- Speaker recognition using concatenated phoneme models
- M. T. and F. S., "Speaker recognition using concatenated phoneme models," in ICSLF, p. 603, 1992.
- (1992) ICSLF , pp. 603

18
- 0028460895
- Comparison of text-independent speaker recognition methods using vq-distortion and descrete/continuous hmm's
- July
- T. Matsui and S. Furui, "Comparison of text-independent speaker recognition methods using vq-distortion and descrete/continuous hmm's," IEEE Tran. on Speech and Audio Processing, vol. 2, July 1994.
- (1994) IEEE Tran. On Speech and Audio Processing , vol.2
- Matsui, T.¹ Furui, S.²

19
- 0020594710
- An approach to text-independent speaker recognition with short utterances
- K. P. Li, W. Jr., and E. H, "An approach to text-independent speaker recognition with short utterances," in ICASSP-83, pp. 55-558, 1983.
- (1983) ICASSP-83 , pp. 555-558
- Li, K.P.¹

20
- 0022229052
- A vector quantization approach to speaker recognition
- F. K. S. Soong, A. E. Rosenberg, L. R. Rabiner, and B. H. Juang, "A vector quantization approach to speaker recognition," in ICASSP 85, pp. 387-390, 1985.
- (1985) ICASSP 85 , pp. 387-390
- Soong, F.K.S.¹ Rosenberg, A.E.² Rabiner, L.R.³ Juang, B.H.⁴

21
- 0028425051
- Text-independent speaker identification using neural nets and ar-vector models
- S. Hadjitodorov, B. Boyanov, T. Ivanov, and N. Dalakchieva, "Text-independent speaker identification using neural nets and ar-vector models," Elecfronzcs Letters, vol. 30, no. 11, pp. 838-839, 1994.
- (1994) Elecfronzcs Letters , vol.30 , Issue.11 , pp. 838-839
- Hadjitodorov, S.¹ Boyanov, B.² Ivanov, T.³ Dalakchieva, N.⁴

22
- 85135374882
- Discriminantar-vector models for free-text speaker verification
- M. C. and L. F. J.L., "Discriminantar-vector models for free-text speaker verification," in EUROSPEECH, p. 161, 1993.
- (1993) EUROSPEECH , pp. 161

23
- 0026117640
- On the application of mixture ar hiden markov model to text independent speaker recognition
- 370
- N. Z. Tishby, "On the application of mixture ar hiden markov model to text independent speaker recognition," IEEE Trans. on ASSP, vol. ASSP-30, no. 3, pp. 563-370, 1991.
- (1991) IEEE Trans. On ASSP , vol.ASSP-30 , Issue.3 , pp. 563
- Tishby, N.Z.¹

24
- 0027252185
- Voice identification using nearest-neighbor distance measure
- A. L. Higgins, L. G. Bahler, and J. E. Porter, "Voice identification using nearest-neighbor distance measure," in ICASSP-93, pp. 375-378, 1993.
- (1993) ICASSP-93 , pp. 375-378
- Higgins, A.L.¹ Bahler, L.G.² Porter, J.E.³

25
- 0000107098
- Improved voice identification using nearest-neighbor distance measure
- L. G. Bahier, J. E. Porter, and A. L. Higgins, "Improved voice identification using nearest-neighbor distance measure," in ICASSP-94, pp. 321-323, 1994.
- (1994) ICASSP-94 , pp. 321-323
- Bahier, L.G.¹ Porter, J.E.² Higgins, A.L.³

26
- 0016939165
- Automatic speaker recognition
- A. E. Rosenberg, "Automatic speaker recognition," Proc. of IEEE, vol. 64, no. 4, pp. 475-487, 1976.
- (1976) Proc. Of IEEE , vol.64 , Issue.4 , pp. 475-487
- Rosenberg, A.E.¹

27
- 85073381166
- An evaluation of supervised and unsupervised classifiers for speaker recognition
- K. Farrell and M. R., "An evaluation of supervised and unsupervised classifiers for speaker recognition," in Esca Workshop on Speaker Recognition, Identification, and Verification, pp. 67-70, 1994.
- (1994) Esca Workshop on Speaker Recognition, Identification, and Verification , pp. 67-70
- Farrell, K.¹

28
- 0028204659
- K. Farell, R., R. Mammone, and K. Assaleh, "Speaker recognition using neural networks and conventional classifiers," vol. 2, no. 1, pp. 194-204, 1994.
- (1994) Speaker Recognition Using Neural Networks and Conventional Classifiers , vol.2 , Issue.1 , pp. 194-204
- Farell, K.¹ Mammone, R.R.² Assaleh, K.³

29
- 0029461608
- A statistical approach to scene change detection
- (San Jose), February
- I. K. Sethi and N. V. Patel, "A statistical approach to scene change detection," in IS&T SPIE Proceedings: Storage and Retrieval for Image and Video Databases III, vol. 2420, (San Jose), pp. 329-339, February 1995.
- (1995) IS&T SPIE Proceedings: Storage and Retrieval for Image and Video Databases III , vol.2420 , pp. 329-339
- Sethi, I.K.¹ Patel, N.V.²

30
- 0005043770
- Video shot detection and characterization for video databases
- To appear September
- N. V. Patel and I. K. Sethi, "Video shot detection and characterization for video databases," To appear in Pattern Recognition special issue on multimedia, September 1996.
- (1996) Pattern Recognition Special Issue on Multimedia
- Patel, N.V.¹ Sethi, I.K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.