Volume 5, Issue 4, 2010, Pages 322-331

Multimodal speaker segmentation and identification in presence of overlapped speech segments

Author keywords

Bayesian filtering; Joint probabilistic data association; Microphone array; Multimodal fusion; Parameter estimation; Speaker segmentation

Indexed keywords

BAYESIAN FILTERING; JOINT PROBABILISTIC DATA ASSOCIATION; MICROPHONE ARRAY; MULTI-MODAL FUSION; SPEAKER SEGMENTATIONS

EID: 78651568988     PISSN: 17962048     EISSN: None     Source Type: Journal    
DOI: 10.4304/jmm.5.4.322-331     Document Type: Article
Times cited: 8

References (32)
  • 4
    • 0344425668
    • Location based speaker segmentation
    • G. Lathoud and I. McCowan, "Location based speaker segmentation," in Proc. ICASSP, 2003, pp. 621-624.
    • (2003) Proc. ICASSP , pp. 621-624
    • Lathoud, G.1    McCowan, I.2
  • 7
    • 84962200855
    • Activity monitoring and summarization for an intelligent meeting room
    • I. Mikić, K. Huang, and M. Trivedi, "Activity monitoring and summarization for an intelligent meeting room," in Proc. IEEE Workshop on Human Motion, 2000, pp. 107-112.
    • (2000) Proc. IEEE Workshop on Human Motion , pp. 107-112
    • Mikić, I.1    Huang, K.2    Trivedi, M.3
  • 9
    • 34547535369
    • Real-time monitoring of participants interaction in a meeting using audio-visual sensors
    • C. Busso, P. Georgiou, and S. Narayanan, "Real-time monitoring of participants interaction in a meeting using audio-visual sensors," in Proc. ICASSP, 2007, pp. 685-688.
    • (2007) Proc. ICASSP , pp. 685-688
    • Busso, C.1    Georgiou, P.2    Narayanan, S.3
  • 10
    • 48149099232
    • Speaker tracking and segmentation with microphone array using mixture particle filter: Improvement of multimodal meeting monitoring system
    • V. Rozgić, C. Busso, P. G. Georgiou, and S. Narayanan, "Speaker tracking and segmentation with microphone array using mixture particle filter: Improvement of multimodal meeting monitoring system," in Proc. of Multi Media Signal Processing Conference, 2007, pp. 60-65.
    • (2007) Proc. of Multi Media Signal Processing Conference , pp. 60-65
    • Rozgić, V.1    Busso, C.2    Georgiou, P.G.3    Narayanan, S.4
  • 11
    • 0030247355
    • Robust speaker recognition: A feature-based approach
    • R. J. Mammone, X. Zhang, and R. P. Ramachandran, "Robust speaker recognition: a feature-based approach," IEEE Signal Processing Magazine, vol. 13, no. 5, pp. 58-71, September 1996.
    • (1996) IEEE Signal Processing Magazine , vol.13 , Issue.5 , pp. 58-71
    • Mammone, R.J.1    Zhang, X.2    Ramachandran, R.P.3
  • 12
    • 0031233424
    • Speaker recognition: A tutorial
    • J. P. Campbell, "Speaker recognition: a tutorial," Proceedings of the IEEE, vol. 85, no. 9, pp. 1437-1462, September 1997.
    • (1997) Proceedings of the IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 13
    • 0036293830
    • An overview of automatic speaker recognition technology
    • D. A. Reynolds, "An overview of automatic speaker recognition technology," in Proc. ICASSP, 2002, pp. 4072-4075.
    • (2002) Proc. ICASSP , pp. 4072-4075
    • Reynolds, D.A.1
  • 16
    • 4544339441
    • Clustering and segmenting speakers and their locations in meetings
    • J. Ajmera, G. Lathoud, and I. McCowan, "Clustering and segmenting speakers and their locations in meetings," in Proc. of the ICASSP, 2004, pp. 605-608.
    • (2004) Proc. of the ICASSP , pp. 605-608
    • Ajmera, J.1    Lathoud, G.2    McCowan, I.3
  • 19
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. on Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, 1995.
    • (1995) IEEE Trans. on Speech and Audio Processing , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 21
    • 75249093579
    • Fundamentals of Stochastic Filtering
    • A. Bain and D. Crisan, Fundamentals of Stochastic Filtering, ser. Stochastic Modelling and Applied Probability. Springer Verlag, 2008.
    • (2008) Fundamentals of Stochastic Filtering
    • Bain, A.1    Crisan, D.2
  • 22
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. of IEEE, vol. 77, no. 2, pp. 257-286, 1989.
    • (1989) Proc. of IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 25
    • 0028996655
    • A comparison of the JPDAF and PMHT tracking
    • C. Rago, P. Willett, and R. Streit, "A comparison of the JPDAF and PMHT tracking," in Proc. ICASSP, 1995, pp. 3571-3574.
    • (1995) Proc. ICASSP , pp. 3571-3574
    • Rago, C.1    Willett, P.2    Streit, R.3
  • 26
    • 0002595416
    • Speaker, environment and channel change detection and clustering via the Bayesian information criterion
    • S. S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," in Proc. DARPA Broadcast News Transcription and Understanding Workshop, February 1998, pp. 127-132.
    • (1998) Proc. DARPA Broadcast News Transcription and Understanding Workshop , pp. 127-132
    • Chen, S.S.1    Gopalakrishnan, P.S.2
  • 28
    • 0034273195
    • DISTBIC: A speaker-based segmentation for audio data indexing
    • P. Delacourt, D. Kryze, and C. J. Wellekens, "DISTBIC: A speaker-based segmentation for audio data indexing," Speech Communication, no. 32, pp. 111-126, 2000.
    • (2000) Speech Communication , Issue.32 , pp. 111-126
    • Delacourt, P.1    Kryze, D.2    Wellekens, C.J.3
  • 29
    • 85009109772
    • A fast, accurate and stream-based speaker segmentation and clustering algorithm
    • A. Vandecatseye and J.-P. Martens, "A fast, accurate and stream-based speaker segmentation and clustering algorithm," in Proc. Interspeech, September 2003, pp. 941-944.
    • (2003) Proc. Interspeech , pp. 941-944
    • Vandecatseye, A.1    Martens, J.P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.