SCOPUS Information Retrieval Platform

Volume 5, Issue 4, 2010, Pages 322-331

Multimodal speaker segmentation and identification in presence of overlapped speech segments

(4) Rozgić, Viktor a; Han, Kyu J. a; Georgiou, Panayiotis G. a; Narayanan, Shrikanth a

a University of Southern California (United States)

Author keywords

Bayesian filtering; Joint probabilistic data association; Microphone array; Multimodal fusion; Parameter estimation; Speaker segmentation

Indexed keywords

BAYESIAN FILTERING; JOINT PROBABILISTIC DATA ASSOCIATION; MICROPHONE ARRAY; MULTI-MODAL FUSION; SPEAKER SEGMENTATIONS;

ARRAY PROCESSING; BAYESIAN NETWORKS; DECODING; HIDDEN MARKOV MODELS; LOUDSPEAKERS; MICROPHONES; SPEECH RECOGNITION; VITERBI ALGORITHM;

PARAMETER ESTIMATION;

EID: 78651568988 PISSN: 17962048 EISSN: None Source Type: Journal
DOI: 10.4304/jmm.5.4.322-331 Document Type: Article

Times cited : (8)

References (32)

1
- 84869244850
- Augmented Multiparty Interaction Project, http://www.amiproject.org/.
- Augmented Multiparty Interaction Project

2
- 34250692938
- Computers in Human Interaction Loop Project, http://chil.server.de/.
- Computers in Human Interaction Loop Project

3
- 64149093817
- Audiovisual probabilistic tracking of multiple speakers in meetings
- D. Gatica-Perez, G. Lathoud, J. M. Odobez, and I. McCowan, "Audiovisual probabilistic tracking of multiple speakers in meetings," IEEE Trans. on Audio, Speech and Language Processing, vol. 15, no. 2, pp. 601-616, February 2007.
- (2007) IEEE Trans. on Audio, Speech and Language Processing , vol.15 , Issue.2 , pp. 601-616
- Gatica-Perez, D.¹ Lathoud, G.² Odobez, J.M.³ McCowan, I.⁴

4
- 0344425668
- Location based speaker segmentation
- G. Lathoud and I. McCowan, "Location based speaker segmentation," in Proc. ICASSP, 2003, pp. 621-624.
- (2003) Proc. ICASSP , pp. 621-624
- Lathoud, G.¹ McCowan, I.²

5
- 14944382167
- Memory cues for meeting video retrieval
- A. Jaimes, T. Okmura, T. Nagamine, and K. Hirata, "Memory cues for meeting video retrieval," in Proc. ACM Workshop of archival and retrieval of personal experiences, 2004, pp. 74-85.
- (2004) Proc. ACM Workshop of archival and retrieval of personal experiences , pp. 74-85
- Jaimes, A.¹ Okmura, T.² Nagamine, T.³ Hirata, K.⁴

6
- 15044354466
- Automatic analysis of multimodal group actions in meetings
- I. McCowan, D. Gatica-Perez, S. Bengio, G. Lathoud, M. Barnard, and D. Zhang, "Automatic analysis of multimodal group actions in meetings," IEEE Trans. on PAMI, vol. 27, no. 3, pp. 305-317, March 2005.
- (2005) IEEE Trans. on PAMI , vol.27 , Issue.3 , pp. 305-317
- McCowan, I.¹ Gatica-Perez, D.² Bengio, S.³ Lathoud, G.⁴ Barnard, M.⁵ Zhang, D.⁶

7
- 84962200855
- Activity monitoring and summarization for an intelligent meeting room
- I. Mikić, K. Huang, and M. Trivedi, "Activity monitoring and summarization for an intelligent meeting room," in Proc. IEEE Workshop on Human Motion, 2000, pp. 107-112.
- (2000) Proc. IEEE Workshop on Human Motion , pp. 107-112
- Mikić, I.¹ Huang, K.² Trivedi, M.³

8
- 33646779170
- Smart room: Participant and speaker localization and identification
- C. Busso, S. Hernanz, C. W. Chu, S. I. Kwon, S. Lee, P. G. Georgiou, I. Cohen, and S. Narayanan, "Smart room: Participant and speaker localization and identification," in Proc. of the ICASSP, 2005, pp. 1117-1120.
- (2005) Proc. of the ICASSP , pp. 1117-1120
- Busso, C.¹ Hernanz, S.² Chu, C.W.³ Kwon, S.I.⁴ Lee, S.⁵ Georgiou, P.G.⁶ Cohen, I.⁷ Narayanan, S.⁸

9
- 34547535369
- Real-time monitoring of participants interaction in a meeting using audio-visual sensors
- C. Busso, P. Georgiou, and S. Narayanan, "Real-time monitoring of participants interaction in a meeting using audio-visual sensors," in Proc. ICASSP, 2007, pp. 685-688.
- (2007) Proc. ICASSP , pp. 685-688
- Busso, C.¹ Georgiou, P.² Narayanan, S.³

10
- 48149099232
- Speaker tracking and segmentation with microphone array using mixture particle filter: Improvement of multimodal meeting monitoring system
- V. Rozgić, C. Busso, P. G. Georgiou, and S. Narayanan, "Speaker tracking and segmentation with microphone array using mixture particle filter: Improvement of multimodal meeting monitoring system," in Proc. of Multi Media Signal Processing Conference, 2007, pp. 60-65.
- (2007) Proc. of Multi Media Signal Processing Conference , pp. 60-65
- Rozgić, V.¹ Busso, C.² Georgiou, P.G.³ Narayanan, S.⁴

11
- 0030247355
- Robust speaker recognition: A feature-based approach
- R. J. Mammone, X. Zhang, and R. P. Ramachandran, "Robust speaker recognition: a feature-based approach," IEEE Signal Processing Magazine, vol. 13, no. 5, pp. 58-71, September 1996.
- (1996) IEEE Signal Processing Magazine , vol.13 , Issue.5 , pp. 58-71
- Mammone, R.J.¹ Zhang, X.² Ramachandran, R.P.³

12
- 0031233424
- Speaker recognition: A tutorial
- J. P. Campbell, "Speaker recognition: a tutorial," Proceedings of the IEEE, vol. 85, no. 9, pp. 1437-1462, September 1997.
- (1997) Proceedings of the IEEE , vol.85 , Issue.9 , pp. 1437-1462
- Campbell, J.P.¹

13
- 0036293830
- An overview of automatic speaker recognition technology
- D. A. Reynolds, "An overview of automatic speaker recognition technology," in Proc. ICASSP, 2002, pp. 4072-4075.
- (2002) Proc. ICASSP , pp. 4072-4075
- Reynolds, D.A.¹

14
- 2942594475
- A tutorial on text-independent speaker verification
- F. Bimbot, J.-F. Bonastre, C. Fredouille, G. Gravier, I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-García, D. Petrovska-Delacrétaz, and D. A. Reynolds, "A tutorial on text-independent speaker verification," EURASIP Journal on Applied Signal Processing, vol. 4, pp. 430-451, 2004.
- (2004) EURASIP Journal on Applied Signal Processing , vol.4 , pp. 430-451
- Bimbot, F.¹ Bonastre, J.F.² Fredouille, C.³ Gravier, G.⁴ Magrin-Chagnolleau, I.⁵ Meignier, S.⁶ Merlin, T.⁷ Ortega-García, J.⁸ Petrovska-Delacrétaz, D.⁹ Reynolds, D.A.¹⁰

15
- 36249031961
- ser. Lecture Notes in Computer Science. Springer Berlin-Heidelberg
- D. E. Sturim, W. M. Campbell, and D. A. Reynolds, Classification Methods for Speaker Recognition, ser. Lecture Notes in Computer Science. Springer Berlin-Heidelberg, 2007, pp. 278-297.
- (2007) Classification Methods for Speaker Recognition , pp. 278-297
- Sturim, D.E.¹ Campbell, W.M.² Reynolds, D.A.³

16
- 4544339441
- Clustering and segmenting speakers and their locations in meetings
- J. Ajmera, G. Lathoud, and I. McCowan, "Clustering and segmenting speakers and their locations in meetings," in Proc. of the ICASSP, 2004, pp. 605-608.
- (2004) Proc. of the ICASSP , pp. 605-608
- Ajmera, J.¹ Lathoud, G.² McCowan, I.³

17
- 0003980102
- Prentice Hall, 2001
- M. Brandstein and D. Ward, Microphone Arrays: Signal Processing Techniques and Applications. Prentice Hall, 2001.
- (2001) Microphone Arrays: Signal Processing Techniques and Applications
- Brandstein, M.¹ Ward, D.²

18
- 33746653380
- Time delay estimation in room acoustic environments: An overview
- J. Chen, J. Benesty, and Y. Huang, "Time delay estimation in room acoustic environments: an overview," EURASIP Journal on Applied Signal Processing, pp. 170-187, January 2006.
- (2006) EURASIP Journal on Applied Signal Processing , pp. 170-187
- Chen, J.¹ Benesty, J.² Huang, Y.³

19
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. on Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, 1995.
- (1995) IEEE Trans. on Speech and Audio Processing , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

20
- 0003665481
- Springer-Verlag
- A. Doucet, N. DeFreitas, and N. Gordon, Sequential Monte Carlo Methods in Practice. Springer-Verlag, 2001.
- (2001) Sequential Monte Carlo Methods in Practice
- Doucet, A.¹ DeFreitas, N.² Gordon, N.³

21
- 75249093579
- ser. Stochastic Modelling and Applied Probability. Springer Verlag
- A. Bain and D. Crisan, Fundamentals of Stochastic Filtering, ser. Stochastic Modelling and Applied Probability. Springer Verlag, 2008.
- (2008) Fundamentals of Stochastic Filtering
- Bain, A.¹ Crisan, D.²

22
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. of IEEE, vol. 77, no. 2, pp. 257-286, 1989.
- (1989) Proc. of IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

23
- 0004182828
- ser. Springer Series in Statistics. Springer Verlag
- J. S. Liu, Monte Carlo Strategies in Scientific Computing, ser. Springer Series in Statistics. Springer Verlag, 2001.
- (2001) Monte Carlo Strategies in Scientific Computing
- Liu, J.S.¹

24
- 0004160361
- ser. Texts in Statistical Science Series. Chapman & Hall
- D. Gamerman and H. F. Lopes, Markov Chain Monte Carlo: Stochastic Simulation for Bayesian Inference, ser. Texts in Statistical Science Series. Chapman & Hall, 2006.
- (2006) Markov Chain Monte Carlo: Stochastic Simulation for Bayesian Inference
- Gamerman, D.¹ Lopes, H.F.²

25
- 0028996655
- A comparison of the JPDAF and PMHT tracking
- C. Rago, P. Willett, and R. Streit, "A comparison of the JPDAF and PMHT tracking," in Proc. ICASSP, 1995, pp. 3571-3574.
- (1995) Proc. ICASSP , pp. 3571-3574
- Rago, C.¹ Willett, P.² Streit, R.³

26
- 0002595416
- Speaker, environment and channel change detection and clustering via the bayesian information criterion
- S. S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the bayesian information criterion," in Proc. DARPA Broadcast News Transcription and Understanding Workshop, February 1998, pp. 127-132.
- (1998) Proc. DARPA Broadcast News Transcription and Understanding Workshop , pp. 127-132
- Chen, S.S.¹ Gopalakrishnan, P.S.²

27
- 85119434191
- Fast speaker change detection for broadcast news transcription and indexing
- D. Liu and F. Kubala, "Fast speaker change detection for broadcast news transcription and indexing," in Proc. 6th European Conference on Speech Communication and Technology, September 1999, pp. 1031-1034.
- (1999) Proc. 6th European Conference on Speech Communication and Technology , pp. 1031-1034
- Liu, D.¹ Kubala, F.²

28
- 0034273195
- DISTBIC: A speaker-based segmentation for audio data indexing
- P. Delacourt, D. Kryze, and C. J. Wellekens, "DISTBIC: A speaker-based segmentation for audio data indexing," Speech Communication, vol. 32, pp. 111-126, 2000.
- (2000) Speech Communication , vol.32 , pp. 111-126
- Delacourt, P.¹ Kryze, D.² Wellekens, C.J.³

29
- 85009109772
- A fast, accurate and stream-based speaker segmentation and clustering algorithm
- A. Vandecatseye and J.-P. Martens, "A fast, accurate and stream-based speaker segmentation and clustering algorithm," in Proc. Interspeech, September 2003, pp. 941-944.
- (2003) Proc. Interspeech , pp. 941-944
- Vandecatseye, A.¹ Martens, J.P.²

30
- 34047261805
- An overview of automatic speaker diarization systems
- S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. on Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1557-1565, 2006.
- (2006) IEEE Trans. on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.E.¹ Reynolds, D.A.²

31
- 0012614532
- Performance measures for information extraction
- J. Makhoul, F. Kubala, R. Schwartz, and R. Weischedel, "Performance measures for information extraction," in Proc. of DARPA Broadcast News Workshop, 1999, pp. 249-252.
- (1999) Proc. of DARPA Broadcast News Workshop , pp. 249-252
- Makhoul, J.¹ Kubala, F.² Schwartz, R.³ Weischedel, R.⁴

32
- 0004162649
- Prentice Hall
- D. A. Forsyth and J. Ponce, Computer Vision, A Modern Approach. Prentice Hall, 2003.
- (2003) Computer Vision, A Modern Approach
- Forsyth, D.A.¹ Ponce, J.²

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.