SCOPUS 정보 검색 플랫폼

Computer Speech and Language

Volumn 20, Issue 2-3 SPEC. ISS., 2006, Pages 303-330

Step-by-step and integrated approaches in broadcast news speaker diarization

(5) Meignier, Sylvain a,c Moraru, Daniel b Fredouille, Corinne a Bonastre, Jean François a Besacier, Laurent b

a UNIVERSITY OF AVIGNON (France)

b CNRS (France)

c UNIVERSITÉ DU MAINE (France)

Author keywords

E HMM; Integrated approach; Speaker diarization; Speaker indexing; Speaker segmentation and clustering; Step by step approach

Indexed keywords

IMAGE SEGMENTATION; PATTERN RECOGNITION SYSTEMS; TELEVISION BROADCASTING;

E-HMM; INTEGRATED APPROACH; SPEAKER DIARIZATION; SPEAKER INDEXING; SPEAKER SEGMENTATION AND CLUSTERING; STEP-BY-STEP APPROACH;

SPEECH RECOGNITION;

EID: 29044442235 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2005.08.002 Document Type: Conference Paper

Times cited : (108)

References (37)

1
- 0036288688
- A new speaker change detection method for two-speaker segmentation
- Adami, A., Kajarekar, S.S., Hermansky, H., 2002. A new speaker change detection method for two-speaker segmentation. In: Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2002), vol. IV, pp. 3908-3911.
- (2002) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2002) , vol.4 , pp. 3908-3911
- Adami, A.¹ Kajarekar, S.S.² Hermansky, H.³

2
- 84946742526
- A robust speaker clustering algorithm
- IEEE, ASRU 2003, St. Thomas, US Virgin Islands
- Ajmera, J., Wooters, C., 2003. A robust speaker clustering algorithm. In: Automatic Speech Recognition and Understanding, IEEE, ASRU 2003, St. Thomas, US Virgin Islands, pp. 411-416.
- (2003) Automatic Speech Recognition and Understanding , pp. 411-416
- Ajmera, J.¹ Wooters, C.²

3
- 29044446864
- Speaker, environment and channel change detection and clustering via the bayesian information criterion
- Landsdowne, VA
- Chen, S., Gopalakrishnan, P., 1998. Speaker, environment and channel change detection and clustering via the bayesian information criterion. In: DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, VA.
- (1998) DARPA Broadcast News Transcription and Understanding Workshop
- Chen, S.¹ Gopalakrishnan, P.²

4
- 29044435386
- Darpa speech recognition evaluation workshop. Available from: 〈http://www.nist.gov/speech/publications/〉.
- Darpa Speech Recognition Evaluation Workshop

5
- 0034273195
- DISTBIC: A speaker based segmentation for audio data indexing
- P. Delacourt, and C.J. Welkens DISTBIC: a speaker based segmentation for audio data indexing Speech Communication 32 2000 111 126
- (2000) Speech Communication , vol.32 , pp. 111-126
- Delacourt, P.¹ Welkens, C.J.²

6
- 0033876604
- The ELISA systems for the NIST 99 evaluation in speaker detection and tracking
- ELISA, 2000. The ELISA systems for the NIST 99 evaluation in speaker detection and tracking. Digital Signal Processing (DSP), a review journal - Special issue on NIST 1999 speaker recognition workshop 10 (1-3), pp. 143-153.
- (2000) Digital Signal Processing (DSP), a Review Journal - Special Issue on NIST 1999 Speaker Recognition Workshop , vol.10 , Issue.1-3 , pp. 143-153

7
- 29044436483
- The NIST 2004 spring rich transcription evaluation: Two-axis merging strategy in the context of multiple distance microphone based meeting speaker segmentation
- Fredouille, C., Moraru, D., Meignier, S., Besacier, L., Bonastre, J.-F., 2004. The NIST 2004 spring rich transcription evaluation: two-axis merging strategy in the context of multiple distance microphone based meeting speaker segmentation, In: RT2004 Spring Meeting Recognition Workshop, p. 5.
- (2004) RT2004 Spring Meeting Recognition Workshop , pp. 5
- Fredouille, C.¹ Moraru, D.² Meignier, S.³ Besacier, L.⁴ Bonastre, J.-F.⁵

8
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- J.-L. Gauvain, and C.H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Transactions on Speech and Audio Processing 22 1994 291 298
- (1994) IEEE Transactions on Speech and Audio Processing , vol.22 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.H.²

9
- 85128356454
- Partitioning and transcription of broadcast news data
- Gauvain, J.-L., Lamel, L., Adda, G., 1998. Partitioning and transcription of broadcast news data, In: Proceedings of International Conference on Spoken Language Processing (ICSLP 98).
- (1998) Proceedings of International Conference on Spoken Language Processing (ICSLP 98)
- Gauvain, J.-L.¹ Lamel, L.² Adda, G.³

10
- 0035367375
- Audio partitioning and transcription for broadcast data indexation
- J.-L. Gauvain, L. Lamel, and G. Adda Audio partitioning and transcription for broadcast data indexation Multimedia Tools and Applications 2001 187 200
- (2001) Multimedia Tools and Applications , pp. 187-200
- Gauvain, J.-L.¹ Lamel, L.² Adda, G.³

11
- 0036567851
- The LIMSI broadcast news transcription system
- J.-L. Gauvain, L. Lamel, and G. Adda The LIMSI broadcast news transcription system Speech Communication 37 1-2 2002 89 108
- (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 89-108
- Gauvain, J.-L.¹ Lamel, L.² Adda, G.³

12
- 85071069033
- Segmentation and classification of broadcast news audio
- Sydney, Australia
- Hain, T., Woodland, P., 1998. Segmentation and classification of broadcast news audio. In: Proceedings of International Conference on Spoken Language Processing (ICSLP 98), Sydney, Australia.
- (1998) Proceedings of International Conference on Spoken Language Processing (ICSLP 98)
- Hain, T.¹ Woodland, P.²

13
- 84946740232
- Recent advances in broadcast news transcription
- IEEE, ASRU 2003, St. Thomas, US Virgin Islands
- Kim, D.Y., Evermann, G., Hain, T., Mrva, D., Tranter, S., Wang, L., Woodland, P.C., 2003. Recent advances in broadcast news transcription. In: Automatic Speech Recognition and Understanding, IEEE, ASRU 2003, St. Thomas, US Virgin Islands, pp. 105-110.
- (2003) Automatic Speech Recognition and Understanding , pp. 105-110
- Kim, D.Y.¹ Evermann, G.² Hain, T.³ Mrva, D.⁴ Tranter, S.⁵ Wang, L.⁶ Woodland, P.C.⁷

14
- 84889985371
- Overview of the ELISA consortium research activities
- for the ELISA consortium Chania, Crete
- Magrin-Chagnolleau, I., Gravier, G., Blouet, R., 2001. for the ELISA consortium, Overview of the ELISA consortium research activities. In: 2001: A Speaker Odyssey. The Speaker Recognition Workshop, Chania, Crete, pp. 67-72.
- (2001) 2001: A Speaker Odyssey. The Speaker Recognition Workshop , pp. 67-72
- Magrin-Chagnolleau, I.¹ Gravier, G.² Blouet, R.³

15
- 0033677065
- Evolutive HMM for speaker tracking system
- Istanbul, Turkey
- Meignier, S., Bonastre, J.-F., Fredouille, C., Merlin, T., 2000. Evolutive HMM for speaker tracking system. In: Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. 1177-1180.
- (2000) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2000) , pp. 1177-1180
- Meignier, S.¹ Bonastre, J.-F.² Fredouille, C.³ Merlin, T.⁴

16
- 0141809272
- E-HMM approach for learning and adapting sound models for speaker indexing
- Chania, Crete
- Meignier, S., Bonastre, J.-F., Igounet, S., 2001. E-HMM approach for learning and adapting sound models for speaker indexing. In: 2001: a Speaker Odyssey. The Speaker Recognition Workshop, Chania, Crete, pp. 175-180.
- (2001) 2001: A Speaker Odyssey. The Speaker Recognition Workshop , pp. 175-180
- Meignier, S.¹ Bonastre, J.-F.² Igounet, S.³

17
- 0141590307
- The ELISA consortium approaches in speaker segmentation during the NIST 2002 speaker recognition evaluation
- Hong Kong
- Moraru, D., Meignier, S., Besacier, L., Bonastre, J.-F., Magrin-Chagnolleau, Y., 2003. The ELISA consortium approaches in speaker segmentation during the NIST 2002 speaker recognition evaluation. In: Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2003), vol. II, Hong Kong, pp. 89-92.
- (2003) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2003) , vol.2 , pp. 89-92
- Moraru, D.¹ Meignier, S.² Besacier, L.³ Bonastre, J.-F.⁴ Magrin-Chagnolleau, Y.⁵

18
- 4544361649
- The ELISA consortium approaches in broadcast news speaker segmentation during the NIST 2003 rich transcription evaluation
- Montreal, Canada
- Moraru, D., Meignier, S., Fredouille, C., Besacier, L., Bonastre, J.-F., 2004. The ELISA consortium approaches in broadcast news speaker segmentation during the NIST 2003 rich transcription evaluation. In: Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2004), Montreal, Canada.
- (2004) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2004)
- Moraru, D.¹ Meignier, S.² Fredouille, C.³ Besacier, L.⁴ Bonastre, J.-F.⁵

19
- 33947623018
- Using a priori information for speaker diarization
- Toledo, Spain
- Moraru, D., Besacier, L., Castelli, E., 2004. Using a priori information for speaker diarization. In: 2004: A Speaker Odyssey. The Speaker Recognition Workshop, Toledo, Spain, pp. 355-362.
- (2004) 2004: A Speaker Odyssey. The Speaker Recognition Workshop , pp. 355-362
- Moraru, D.¹ Besacier, L.² Castelli, E.³

20
- 4544273245
- Light supervision in acoustic model training
- Montreal, Canada
- Nguyen, L., Xiang, B., 2004. Light supervision in acoustic model training. In: Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2004), Montreal, Canada.
- (2004) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 2004)
- Nguyen, L.¹ Xiang, B.²

21
- 84902054006
- v2.4
- NIST, Reference data cookbook for who spoke when diarization task. Available from: 〉http://www.nist.gov/speech/tests/rt/rt2003/spring/docs/ ref-cookbook-v2_4.pdf〉, v2.4 (2003).
- (2003) Reference Data Cookbook for Who Spoke When Diarization Task

22
- 29044440118
- NIST, Rt-03s workshop agenda and presentations. Available from: 〈http://www.nist.gov/speech/tests/rt/rt2003/spring/presentations〉.
- Rt-03s Workshop Agenda and Presentations

23
- 0004097709
- March
- NIST, The NIST 2001 speaker recognition evaluation plan. Available from: 〈http://www.nist.gov/speech/tests/spk/2001/doc/2001-spkrec-evalplan-v05.9. pdf〉 (March 2001).
- (2001) The NIST 2001 Speaker Recognition Evaluation Plan

24
- 4143152576
- February
- NIST, The NIST year 2002 speaker recognition evaluation plan. Available from: 〈http://www.nist.gov/speech/tests/spk/2002/doc/2002-spkrec-evalplan- v60.pdf〉 (February 2002).
- (2002) The NIST Year 2002 Speaker Recognition Evaluation Plan

25
- 29044441444
- (Version 4, Updated 02/25/2003) (February)
- NIST, The rich transcription spring 2003 (RT-03S) evaluation plan. Available from: 〈http://www.nist.gov/speech/tests/rt/rt2003/spring/docs/ rt03-spring-eval-plan-v4.pdf〉, (Version 4, Updated 02/25/2003) (February 2003).
- (2003) The Rich Transcription Spring 2003 (RT-03S) Evaluation Plan

26
- 29044433805
- February
- NIST, Spring 2004 (rt-04s) rich transcription meeting recognition evaluation plan. Available from: 〈http://www.nist.gov/speech/tests/rt/ rt2004/spring/documents/rt04s-meeting-eval-plan-v1.pdf〉 (February 2004).
- (2004) Spring 2004 (rt-04s) Rich Transcription Meeting Recognition Evaluation Plan

27
- 33750543737
- Clips-imag at trec-11: Experiments in video retrieval
- Gaithersburg, MD, USA
- Quénot, G., Moraru, D., Besacier, L., Mulhem, P., 2002. Clips-imag at trec-11: Experiments in video retrieval. In: TREC 2002, Gaithersburg, MD, USA.
- (2002) TREC 2002
- Quénot, G.¹ Moraru, D.² Besacier, L.³ Mulhem, P.⁴

28
- 29044439703
- Clips at trecvid: Shot boundary detection and feature detection
- Gaithersburg, MD, USA
- Quénot, G., Moraru, D., Besacier, L., 2003. Clips at trecvid: Shot boundary detection and feature detection. In: TREC 2003, Gaithersburg, MD, USA.
- (2003) TREC 2003
- Quénot, G.¹ Moraru, D.² Besacier, L.³

29
- 85009152889
- The Lincoln speaker recognition system: NIST EVAL2000
- Beijing, China
- Reynolds, D.A., Dunm, R.B., Laughlin, J.J., 2000. The Lincoln speaker recognition system: NIST EVAL2000. In: Proceedings of International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, Beijing, China, pp. 470-473.
- (2000) Proceedings of International Conference on Spoken Language Processing (ICSLP 2000) , vol.2 , pp. 470-473
- Reynolds, D.A.¹ Dunm, R.B.² Laughlin, J.J.³

30
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- Reynolds, D.A., Quatieri, T.F., Dunn, R.B., 2000. Speaker verification using adapted Gaussian mixture models, Digital Signal Processing (DSP), a review journal - Special issue on NIST 1999 speaker recognition workshop 10 (1-3), pp. 19-41.
- (2000) Digital Signal Processing (DSP), a Review Journal - Special Issue on NIST 1999 Speaker Recognition Workshop , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

31
- 0000120766
- Estimating the dimension of a model
- G. Schwarz Estimating the dimension of a model The Annals of Statistics 6 2 1978 461 464
- (1978) The Annals of Statistics , vol.6 , Issue.2 , pp. 461-464
- Schwarz, G.¹

32
- 0002782496
- Automatic segmentation and clustering of broadcast news audio
- Westfields, Chantilly, Virginia
- Siegler, M., Jain, U., Raj, B., Stern, R., 1997. Automatic segmentation and clustering of broadcast news audio. In: the DARPA Speech Recognition Workshop, Westfields, Chantilly, Virginia.
- (1997) The DARPA Speech Recognition Workshop
- Siegler, M.¹ Jain, U.² Raj, B.³ Stern, R.⁴

33
- 85009265801
- An unsupervised, sequential learning algorithm for segmentation of speech waveforms with multi-speakers
- San Francisco, CA
- Siu, M.-H., Rohlicek, R., Gish, H., 1992. An unsupervised, sequential learning algorithm for segmentation of speech waveforms with multi-speakers. In: Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 92), vol. 2, San Francisco, CA, pp. 189-192.
- (1992) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 92) , vol.2 , pp. 189-192
- Siu, M.-H.¹ Rohlicek, R.² Gish, H.³

34
- 24644472826
- TRECVID 2003 - An introduction
- Smeaton, A., Kraaij, W., Over, P., 2003. TRECVID 2003 - an introduction. In: 12th Text Retrieval Conference.
- (2003) 12th Text Retrieval Conference
- Smeaton, A.¹ Kraaij, W.² Over, P.³

35
- 79952385877
- Segmentation of speech using speaker identification
- Adelaide, Australia
- Wilcox, L., Chen, F., Kimber, D., Balasubramanian, V., 1994. Segmentation of speech using speaker identification, In: Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 94), Adelaide, Australia, pp. 161-164.
- (1994) Proceedings of International Conference on Acoustics Speech and Signal Processing (ICASSP 94) , pp. 161-164
- Wilcox, L.¹ Chen, F.² Kimber, D.³ Balasubramanian, V.⁴

36
- 85076109151
- Audio indexing using speaker identification
- San Diego, CA
- Wilcox, L., Kimber, D., Chen, F., 1994. Audio indexing using speaker identification. In: Proceedings SPIE Conference on Automatic Systems for the Inspection and Identification of Humans, San Diego, CA, pp. 149-157.
- (1994) Proceedings SPIE Conference on Automatic Systems for the Inspection and Identification of Humans , pp. 149-157
- Wilcox, L.¹ Kimber, D.² Chen, F.³

37
- 0036567794
- The development of the HTK broadcast news transcription system: An overview
- P. Woodland The development of the HTK broadcast news transcription system: an overview Speech Communication 37 1-2 2002 291 299
- (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 291-299
- Woodland, P.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.