메뉴 건너뛰기




Volumn 3022, Issue , 1997, Pages 218-225

Video classification using speaker identification

Author keywords

Audio classification; Cepstral coefficients; Nearest neighbor classification; Speaker identification; Video indexing

Indexed keywords

AUDIO ACOUSTICS; DATABASE SYSTEMS; INDEXING (OF INFORMATION); LOUDSPEAKERS; VIDEO CAMERAS; VIDEO RECORDING;

EID: 77954021581     PISSN: 0277786X     EISSN: 1996756X     Source Type: Conference Proceeding    
DOI: 10.1117/12.263411     Document Type: Conference Paper
Times cited : (14)

References (30)
  • 2
    • 85040089385 scopus 로고
    • Media stream: An iconic language for video annotation
    • Norway
    • M. Davis, "Media stream: An iconic language for video annotation," in IEEE symposium on visual languages, (Norway), pp. 196-202, 1993.
    • (1993) IEEE Symposium on Visual Languages , pp. 196-202
    • Davis, M.1
  • 4
    • 0024866229 scopus 로고
    • Scene retrieval method for video database applications using temporal condition changes
    • (Tokyo, Japan), April
    • S. Abe, Y. Tonomura, and H. Kasahara, Scene retrieval method for video database applications using temporal condition changes," in Proc. Conf. Machine Intelligence and Vision, (Tokyo, Japan), pp. 355-359, April 1989.
    • (1989) Proc. Conf. Machine Intelligence and Vision , pp. 355-359
    • Abe, S.1    Tonomura, Y.2    Kasahara, H.3
  • 6
    • 0000112911 scopus 로고
    • Content oriented visual interface using video icons for visual database system
    • Y. Tonormura and S. Abe, "Content oriented visual interface using video icons for visual database system," Journal of Visual Languages and Computing, no. 1, pp. 183-198, 1990.
    • (1990) Journal of Visual Languages and Computing , Issue.1 , pp. 183-198
    • Tonormura, Y.1    Abe, S.2
  • 7
    • 0027561007 scopus 로고
    • Stored video handling techniques
    • March
    • Y. Tonomura, K. Otsuji, A. Akutsu, and Y. Ohba, "Stored video handling techniques," NTT Review, vol. 5, pp. 82-90, March 1993.
    • (1993) NTT Review , vol.5 , pp. 82-90
    • Tonomura, Y.1    Otsuji, K.2    Akutsu, A.3    Ohba, Y.4
  • 8
    • 34250082473 scopus 로고
    • Automatic partitioning of full-motion video
    • H. Zhang, A. Kankanhalli, and S. Smoliar, "Automatic partitioning of full-motion video," Multimedia Systems, vol. 1, pp. 10-28, 1993.
    • (1993) Multimedia Systems , vol.1 , pp. 10-28
    • Zhang, H.1    Kankanhalli, A.2    Smoliar, S.3
  • 13
    • 0029456574 scopus 로고
    • Query by humming: Musical information retrieval in audio database
    • (San Fransisco), ACM Press, Nov.
    • A. Ghias, J. Logan, and D. Chamberlin, "Query by humming: Musical information retrieval in audio database," in Proceedings of Mulimedia-95, (San Fransisco), pp. 231-237, ACM Press, Nov. 1995.
    • (1995) Proceedings of Mulimedia-95 , pp. 231-237
    • Ghias, A.1    Logan, J.2    Chamberlin, D.3
  • 14
    • 84944453945 scopus 로고
    • Automatic recognition of film geners
    • (San Fransisco), ACM Press, Nov.
    • S. Fisher, R. Lienhart, and W. Effelsburg, "Automatic recognition of film geners," in Proceedings of Multimedia-95, (San Fransisco), pp. 295-305, ACM Press, Nov. 1995.
    • (1995) Proceedings of Multimedia-95 , pp. 295-305
    • Fisher, S.1    Lienhart, R.2    Effelsburg, W.3
  • 16
    • 0016067897 scopus 로고
    • Effectivness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. Atal, "Effectivness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," JASA, vol. 55, no. 6, pp. 1304-1312, 1974.
    • (1974) JASA , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.1
  • 17
    • 85135124348 scopus 로고
    • Speaker recognition using concatenated phoneme models
    • M. T. and F. S., "Speaker recognition using concatenated phoneme models," in ICSLF, p. 603, 1992.
    • (1992) ICSLF , pp. 603
  • 18
    • 0028460895 scopus 로고
    • Comparison of text-independent speaker recognition methods using vq-distortion and descrete/continuous hmm's
    • July
    • T. Matsui and S. Furui, "Comparison of text-independent speaker recognition methods using vq-distortion and descrete/continuous hmm's," IEEE Tran. on Speech and Audio Processing, vol. 2, July 1994.
    • (1994) IEEE Tran. On Speech and Audio Processing , vol.2
    • Matsui, T.1    Furui, S.2
  • 19
    • 0020594710 scopus 로고
    • An approach to text-independent speaker recognition with short utterances
    • K. P. Li, W. Jr., and E. H, "An approach to text-independent speaker recognition with short utterances," in ICASSP-83, pp. 55-558, 1983.
    • (1983) ICASSP-83 , pp. 555-558
    • Li, K.P.1
  • 20
  • 21
    • 0028425051 scopus 로고
    • Text-independent speaker identification using neural nets and ar-vector models
    • S. Hadjitodorov, B. Boyanov, T. Ivanov, and N. Dalakchieva, "Text-independent speaker identification using neural nets and ar-vector models," Elecfronzcs Letters, vol. 30, no. 11, pp. 838-839, 1994.
    • (1994) Elecfronzcs Letters , vol.30 , Issue.11 , pp. 838-839
    • Hadjitodorov, S.1    Boyanov, B.2    Ivanov, T.3    Dalakchieva, N.4
  • 22
    • 85135374882 scopus 로고
    • Discriminantar-vector models for free-text speaker verification
    • M. C. and L. F. J.L., "Discriminantar-vector models for free-text speaker verification," in EUROSPEECH, p. 161, 1993.
    • (1993) EUROSPEECH , pp. 161
  • 23
    • 0026117640 scopus 로고
    • On the application of mixture ar hiden markov model to text independent speaker recognition
    • 370
    • N. Z. Tishby, "On the application of mixture ar hiden markov model to text independent speaker recognition," IEEE Trans. on ASSP, vol. ASSP-30, no. 3, pp. 563-370, 1991.
    • (1991) IEEE Trans. On ASSP , vol.ASSP-30 , Issue.3 , pp. 563
    • Tishby, N.Z.1
  • 24
    • 0027252185 scopus 로고
    • Voice identification using nearest-neighbor distance measure
    • A. L. Higgins, L. G. Bahler, and J. E. Porter, "Voice identification using nearest-neighbor distance measure," in ICASSP-93, pp. 375-378, 1993.
    • (1993) ICASSP-93 , pp. 375-378
    • Higgins, A.L.1    Bahler, L.G.2    Porter, J.E.3
  • 25
    • 0000107098 scopus 로고
    • Improved voice identification using nearest-neighbor distance measure
    • L. G. Bahier, J. E. Porter, and A. L. Higgins, "Improved voice identification using nearest-neighbor distance measure," in ICASSP-94, pp. 321-323, 1994.
    • (1994) ICASSP-94 , pp. 321-323
    • Bahier, L.G.1    Porter, J.E.2    Higgins, A.L.3
  • 26
    • 0016939165 scopus 로고
    • Automatic speaker recognition
    • A. E. Rosenberg, "Automatic speaker recognition," Proc. of IEEE, vol. 64, no. 4, pp. 475-487, 1976.
    • (1976) Proc. Of IEEE , vol.64 , Issue.4 , pp. 475-487
    • Rosenberg, A.E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.