메뉴 건너뛰기




Volumn 34, Issue 1, 2012, Pages 79-93

Multimodal Speaker diarization

Author keywords

audiovisual fusion; dynamic Bayesian networks; Speaker diarization

Indexed keywords

AUDIO AND VIDEO; AUDIO STREAM; AUDIO-BASED; AUDIO-VISUAL FUSION; BROADCAST VIDEO; DATA SETS; DYNAMIC BAYESIAN NETWORK; DYNAMIC BAYESIAN NETWORKS; EXPECTATION-MAXIMIZATION ALGORITHMS; LABELED TRAINING DATA; MEETING VIDEO; MODALITY ANALYSIS; MODEL PARAMETERS; MULTI-MODAL; MULTIMODAL FRAMEWORKS; PROBABILISTIC FRAMEWORK; RECORDING EQUIPMENT; SPEAKER DIARIZATION; VIDEO STREAMS;

EID: 81855191839     PISSN: 01628828     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPAMI.2011.47     Document Type: Article
Times cited : (60)

References (36)
  • 6
    • 33745196256 scopus 로고    scopus 로고
    • Spectral cross-correlation features for audio indexing of broadcast news and meetings
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • M. Yamaguchi, M. Yamashita, and S. Matsunaga, "Spectral Cross-Correlation Features for Audio Indexing of Broadcast News and Meetings," Proc. Ninth European Conf. Speech Comm. and Technology, pp. 613-616, 2005. (Pubitemid 43908137)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 613-616
    • Yamaguchi, M.1    Yamashita, M.2    Matsunaga, S.3
  • 7
    • 0034273195 scopus 로고    scopus 로고
    • DISTBIC: A speaker-based segmentation for audio data indexing
    • P. Delacourt, D. Kryze, and C.J. Wellekens, "DISTBIC: A Speaker-Based Segmentation for Audio Data Indexing," Speech Comm., vol. 32, pp. 111-126, 2000.
    • (2000) Speech Comm. , vol.32 , pp. 111-126
    • Delacourt, P.1    Kryze, D.2    Wellekens, C.J.3
  • 16
    • 4243096131 scopus 로고    scopus 로고
    • Multimodal processing by finding common cause
    • H.J. Nock, G. Iyengar, and C. Neti, "Multimodal Processing by Finding Common Cause," Comm. ACM, vol. 47, no. 1, pp. 51-56, 2004.
    • (2004) Comm. ACM , vol.47 , Issue.1 , pp. 51-56
    • Nock, H.J.1    Iyengar, G.2    Neti, C.3
  • 20
    • 21244492850 scopus 로고    scopus 로고
    • Real-time speaker tracking using particle filter sensor fusion
    • DOI 10.1109/JPROC.2003.823146, Sequential State Estimation: From Kalman Filters to Particles Filters
    • Y. Chen and Y. Rui, "Real-Time Speaker Tracking Using Particle Filter Sensor Fusion," Proc. IEEE, vol. 92, no. 3, pp. 485-494, Mar. 2004. (Pubitemid 40890755)
    • (2004) Proceedings of the IEEE , vol.92 , Issue.3 , pp. 485-494
    • Chen, Y.1    Rui, Y.2
  • 25
    • 0024610919 scopus 로고
    • A tutorial on hidden markov models and selected applications in speech recognition
    • Feb.
    • L.R. Rabiner, "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 29
    • 0001185873 scopus 로고    scopus 로고
    • An essay towards solving a problem in the doctrine of chances
    • B. Thomas, "An Essay Towards Solving a Problem in the Doctrine of Chances," Philosophical Trans. Royal Soc., vol. 53, pp. 370-418, 1763.
    • Philosophical Trans. Royal Soc. , vol.53 , Issue.1763 , pp. 370-418
    • Thomas, B.1
  • 34
    • 58049136519 scopus 로고    scopus 로고
    • Announcing the AMI meeting corpus
    • Jan.-Mar.
    • J. Carletta, "Announcing the AMI Meeting Corpus," The ELRA Newsletter, vol. 1, no. 1, pp. 3-5, Jan.-Mar. 2006.
    • (2006) The ELRA Newsletter , vol.1 , Issue.1 , pp. 3-5
    • Carletta, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.