Volume 2, Issue 4, 2000, Pages 147-162

Multimedia document retrieval using speech and speaker recognition

Author keywords

Audio indexing; Speaker recognition; Speaker segmentation; Speech recognition; Spoken document analysis

Indexed keywords

AUDIO INDEXING; MULTIMEDIA CONTENTS; MULTIMEDIA DOCUMENTS; SPEAKER IDENTIFICATION; SPEAKER RECOGNITION; SPEAKER RECOGNITION SYSTEM; SPEAKER SEGMENTATIONS; SPOKEN DOCUMENT

EID: 33750180399     PISSN: 14332833     EISSN: 14332825     Source Type: Journal    
DOI: 10.1007/PL00021522     Document Type: Article
Times cited: 4

References (29)
  • 1
    • 0016355478
    • A new look at the statistical model for identification
    • [Akaike74]
    • [Akaike74] H. Akaike. A new look at the statistical model for identification. IEEE Trans on Autom Control, AC19, 1974, pp. 716-723
    • (1974) IEEE Trans on Autom Control , vol.AC19 , pp. 716-723
    • Akaike, H.1
  • 4
    • 84892185828
    • A distance measure between collections of distributions and its application to speaker recognition
    • Seattle, Washington, [Beigi98a]
    • [Beigi98a] H.S.M. Beigi, S. Maes, J.S. Sorenson. A distance measure between collections of distributions and its application to speaker recognition. Proc Int Conf on Acoustics, Speech, and Signal Process, Seattle, Washington, 1998, pp. 753-756
    • (1998) Proc Int Conf on Acoustics, Speech, and Signal Process , pp. 753-756
    • Beigi, H.S.M.1    Maes, S.2    Sorenson, J.S.3
  • 7
    • 0002595416
    • Speaker, environment and channel change detection and clustering via the Bayesian information criterion
    • Lansdowne, VA, [Chen98]
    • [Chen98] S.S. Chen, P.S. Gopalakrishnan. Speaker, environment and channel change detection and clustering via the Bayesian information criterion. Proc DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, VA, 1998, pp. 127-132
    • (1998) Proc DARPA Broadcast News Transcription and Understanding Workshop , pp. 127-132
    • Chen, S.S.1    Gopalakrishnan, P.S.2
  • 8
    • 85135281671
    • Speech recognition with automatic punctuation
    • Budapest, Hungary, [Chen99]
    • [Chen99] C.J. Chen. Speech recognition with automatic punctuation. Proc EuroSpeech99, Budapest, Hungary, 1999, pp. 447-480
    • (1999) Proc EuroSpeech99 , pp. 447-480
    • Chen, C.J.1
  • 9
    • 85135270151
    • Speaker-based segmentation for audio data indexing
    • Budapest, Hungary, [Delacourt99]
    • [Delacourt99] P. Delacourt, D. Kryze, C.J. Wellekens. Speaker-based segmentation for audio data indexing. Proc EuroSpeech99, Budapest, Hungary, 1999, pp. 1195-1198
    • (1999) Proc EuroSpeech99 , pp. 1195-1198
    • Delacourt, P.1    Kryze, D.2    Wellekens, C.J.3
  • 11
    • 85135252230
    • Story Segmentation and Topic Detection for Recognized Speech
    • Budapest, Hungary, [Dharanipragada99a]
    • [Dharanipragada99a] S. Dharanipragada, M. Franz, J.S. McCarley, S. Roukos, and R.T. Ward. Story Segmentation and Topic Detection for Recognized Speech. Proc EuroSpeech99, Budapest, Hungary, 1999, pp. 2435-2438
    • (1999) Proc EuroSpeech99 , pp. 2435-2438
    • Dharanipragada, S.1    Franz, M.2    McCarley, J.S.3    Roukos, S.4    Ward, R.T.5
  • 12
    • 0342321399
    • Audio Indexing for Broadcast News
    • E.M. Voorhees, D.K. Harman (eds.) NIST Special Publication 500-242, [Dharanipragada99b]
    • [Dharanipragada99b] S. Dharanipragada, M. Franz, S. Roukos. Audio Indexing for Broadcast News. Proc Seventh Text REtrieval Conference (TREC-7), E.M. Voorhees, D.K. Harman (eds.) NIST Special Publication 500-242, 1999, pp. 115-120
    • (1999) Proc Seventh Text REtrieval Conference (TREC-7) , pp. 115-120
    • Dharanipragada, S.1    Franz, M.2    Roukos, S.3
  • 13
    • 0004072715
    • Digital Speech Processing, Synthesis and Recognition
    • Marcel Dekker, [Furui89]
    • [Furui89] S. Furui. Digital Speech Processing, Synthesis and Recognition. Marcel Dekker, 1989
    • (1989) Digital Speech Processing, Synthesis and Recognition
    • Furui, S.1
  • 14
    • 0026400244
    • Segregation of Speakers for Speech Recognition and Speaker identification
    • Seattle, Washington, [Gish91]
    • [Gish91] H. Gish, M. Siu, R. Rohlicek. Segregation of Speakers for Speech Recognition and Speaker identification. Proc Int Conf on Acoustics, Speech, and Signal Processing, Seattle, Washington, 1991, pp. 873-876
    • (1991) Proc Int Conf on Acoustics, Speech, and Signal Processing , pp. 873-876
    • Gish, H.1    Siu, M.2    Rohlicek, R.3
  • 15
    • 0028516097
    • Text-Independent Speaker identification
    • [Gish94]
    • [Gish94] H. Gish, M. Schmidt. Text-Independent Speaker identification. IEEE Signal Processing, Volume 11, Number 4, 1994, pp. 18-32
    • (1994) IEEE Signal Processing , vol.11 , Issue.4 , pp. 18-32
    • Gish, H.1    Schmidt, M.2
  • 17
    • 84892706143
    • Pushing Streaming Video-Indexing Video Archives
    • October-December, [Grosky97]
    • [Grosky97] W. Grosky. Pushing Streaming Video-Indexing Video Archives. IEEE Multimedia, October-December 1997, pp. 7-8
    • (1997) IEEE Multimedia , pp. 7-8
    • Grosky, W.1
  • 19
    • 85119434191
    • Fast Speaker Change Detection for Broadcast News Transcription and Indexing
    • Budapest, Hungary, [Liu99]
    • [Liu99] D. Liu, F. Kubala. Fast Speaker Change Detection for Broadcast News Transcription and Indexing. Proc EuroSpeech99, Budapest, Hungary, 1999, pp. 1031-1034
    • (1999) Proc EuroSpeech99 , pp. 1031-1034
    • Liu, D.1    Kubala, F.2
  • 20
    • 0006317638
    • Overview of the 1997 DARPA Speech Recognition Workshop
    • Chantilly, Virginia, [Pallet97]
    • [Pallet97] D. Pallet. Overview of the 1997 DARPA Speech Recognition Workshop. Proc DARPA Speech Recognition Workshop, Chantilly, Virginia, 1997, pp. 1-2
    • (1997) Proc DARPA Speech Recognition Workshop , pp. 1-2
    • Pallet, D.1
  • 25
    • 0032660827
    • Name-It: Naming and Detecting Faces in News Videos
    • January-March, [Satoh99]
    • [Satoh99] S. Satoh, Y. Nakamura, T. Kanade. Name-It: Naming and Detecting Faces in News Videos. IEEE Multimedia, Volume 6, Number 1, January-March 1999, pp. 22-35
    • (1999) IEEE Multimedia , vol.6 , Issue.1 , pp. 22-35
    • Satoh, S.1    Nakamura, Y.2    Kanade, T.3
  • 27
    • 78650540904
    • Improved Speaker Segmentation and Segments Clustering Using the Bayesian Information Criterion
    • Budapest, Hungary, [Tritschler99]
    • [Tritschler99] A. Tritschler, R.A. Gopinath. Improved Speaker Segmentation and Segments Clustering Using the Bayesian Information Criterion. Proc EuroSpeech99, Budapest, Hungary, 1999, pp. 679-682
    • (1999) Proc EuroSpeech99 , pp. 679-682
    • Tritschler, A.1    Gopinath, R.A.2
  • 29
    • 0030242072
    • Content-based Classification, Search, and Retrieval of Audio
    • Fall, [Wold96]
    • [Wold96] E. Wold, T. Blum, D. Keislar. Content-based Classification, Search, and Retrieval of Audio. IEEE Multimedia, Volume 3, Number 3, Fall 1996, pp. 27-36
    • (1996) IEEE Multimedia , vol.3 , Issue.3 , pp. 27-36
    • Wold, E.1    Blum, T.2    Keislar, D.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.