



2013, Pages 1477-1481

An open-source state-of-the-art toolbox for broadcast news diarization

Author keywords

Broadcast news; Open source; Speaker diarization

Indexed keywords

COMPUTER APPLICATIONS; COMPUTER SIMULATION

EID: 84906274473     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 114

References (31)
  • 1
    • 33745196067
    • August, [Online]. Available
    • NIST, "Fall 2004 rich transcription (RT-04F) evaluation plan," August 2004. [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2004/fall/docs/rt04f-eval-plan-v14.pdf.
    • (2004) Fall 2004 Rich Transcription (RT-04F) Evaluation Plan
  • 3
    • 70450180496
    • The ESTER 2 evaluation campaign for the rich transcription of French radio broadcasts
    • September
    • S. Galliano, G. Gravier, and L. Chaubard, "The ESTER 2 evaluation campaign for the rich transcription of French radio broadcasts," in Proceedings of Interspeech, September 2009.
    • (2009) Proceedings of Interspeech
    • Galliano, S.1    Gravier, G.2    Chaubard, L.3
  • 6
    • 78650898482
    • LIUM SpkDiarization: An open source toolkit for diarization
    • Dallas, Texas (USA), March
    • S. Meignier and T. Merlin, "LIUM SpkDiarization: An open source toolkit for diarization," in CMU SPUD Workshop, Dallas, Texas (USA), March 2010.
    • (2010) CMU SPUD Workshop
    • Meignier, S.1    Merlin, T.2
  • 7
    • 0002782496
    • Automatic segmentation, classification and clustering of broadcast news audio
    • Chantilly, VA, USA, February
    • M. Siegler, U. U. Jain, B. Raj, and R. Stern, "Automatic segmentation, classification and clustering of broadcast news audio," in the DARPA Speech Recognition Workshop, Chantilly, VA, USA, February 1997.
    • (1997) The DARPA Speech Recognition Workshop
    • Siegler, M.1    Jain, U.U.2    Raj, B.3    Stern, R.4
  • 12
    • 84878591156
    • DIARTK: An open source toolkit for research in multistream speaker diarization and its application to meetings recordings
    • Portland, Oregon (USA)
    • D. Vijayasenan and F. Valente, "DIARTK: An open source toolkit for research in multistream speaker diarization and its application to meetings recordings," in Proceedings of Interspeech, Portland, Oregon (USA), 2012.
    • (2012) Proceedings of Interspeech
    • Vijayasenan, D.1    Valente, F.2
  • 15
    • 70450167910
    • Speaker diarization using normalized cross-likelihood ratio
    • V.-B. Le, O. Mella, and D. Fohr, "Speaker diarization using normalized cross-likelihood ratio," in Proceedings of Interspeech, 2007.
    • (2007) Proceedings of Interspeech
    • Le, V.-B.1    Mella, O.2    Fohr, D.3
  • 16
    • 84905255747
    • A global optimization framework for speaker diarization
    • Singapore
    • M. Rouvier and S. Meignier, "A global optimization framework for speaker diarization," in Odyssey Workshop, Singapore, 2012.
    • (2012) Odyssey Workshop
    • Rouvier, M.1    Meignier, S.2
  • 19
    • 84865753339
    • Intersession compensation and scoring methods in the i-vectors space for speaker recognition
    • Florence, Italy
    • P.-M. Bousquet, D. Matrouf, and J.-F. Bonastre, "Intersession compensation and scoring methods in the i-vectors space for speaker recognition," in Proceedings of Interspeech, Florence, Italy, 2011.
    • (2011) Proceedings of Interspeech
    • Bousquet, P.-M.1    Matrouf, D.2    Bonastre, J.-F.3
  • 20
    • 84865776156
    • Comparing multistage approaches for cross-show speaker diarization
    • Florence, Italy
    • V.-A. Tran, V. B. Le, C. Barras, and L. Lamel, "Comparing multistage approaches for cross-show speaker diarization," in Proceedings of Interspeech, Florence, Italy, 2011.
    • (2011) Proceedings of Interspeech
    • Tran, V.-A.1    Le, V.B.2    Barras, C.3    Lamel, L.4
  • 21
    • 84865734172
    • Investigation of cross-show speaker diarization
    • Florence, Italy
    • Q. Yang, Q. Jin, and T. Schultz, "Investigation of cross-show speaker diarization," in Proceedings of Interspeech, Florence, Italy, 2011.
    • (2011) Proceedings of Interspeech
    • Yang, Q.1    Jin, Q.2    Schultz, T.3
  • 22
    • 84878543097
    • I-vectors and ILP clustering adapted to cross-show speaker diarization
    • Portland, Oregon (USA)
    • G. Dupuy, M. Rouvier, S. Meignier, and Y. Esteve, "I-vectors and ILP clustering adapted to cross-show speaker diarization," in Proceedings of Interspeech, Portland, Oregon (USA), 2012.
    • (2012) Proceedings of Interspeech
    • Dupuy, G.1    Rouvier, M.2    Meignier, S.3    Esteve, Y.4
  • 23
    • 44849109123
    • A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system
    • K. J. Han and S. S. Narayanan, "A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system," in Proceedings of Interspeech, 2007, pp. 1853-1856.
    • (2007) Proceedings of Interspeech , pp. 1853-1856
    • Han, K.J.1    Narayanan, S.S.2
  • 25
    • 84867205879
    • T-test distance and clustering criterion for speaker diarization
    • September
    • T. H. Nguyen, E. S. Chng, and H. Li, "T-test distance and clustering criterion for speaker diarization," in Interspeech 2008, September 2008.
    • (2008) Interspeech 2008
    • Nguyen, T.H.1    Chng, E.S.2    Li, H.3
  • 27
    • 34548310397
    • Speaker diarization for multiple-distant-microphone meetings using several sources of information
    • J. M. Pardo, X. Anguera, and C. Wooters, "Speaker diarization for multiple-distant-microphone meetings using several sources of information," IEEE Transactions on Computers, vol. 56, no. 9, pp. 1212-1224, 2007.
    • (2007) IEEE Transactions on Computers , vol.56 , Issue.9 , pp. 1212-1224
    • Pardo, J.M.1    Anguera, X.2    Wooters, C.3
  • 29
    • 84858388270
    • Crosspollination of normalisation techniques from speaker to face authentication using Gaussian mixture models
    • R. Wallace, M. McLaren, C. McCool, and S. Marcel, "Crosspollination of normalisation techniques from speaker to face authentication using Gaussian Mixture Models," IEEE Transactions on Information Forensics and Security, vol. 7, no. 2, pp. 553-562, 2012.
    • (2012) IEEE Transactions on Information Forensics and Security , vol.7 , Issue.2 , pp. 553-562
    • Wallace, R.1    McLaren, M.2    McCool, C.3    Marcel, S.4
  • 31
    • 77249161746
    • Video shot boundary detection: Seven years of TRECVID activity
    • A. F. Smeaton, P. Over, and A. R. Doherty, "Video shot boundary detection: Seven years of TRECVid activity," Computer Vision and Image Understanding, vol. 114, no. 4, pp. 411-418, 2010.
    • (2010) Computer Vision and Image Understanding , vol.114 , Issue.4 , pp. 411-418
    • Smeaton, A.F.1    Over, P.2    Doherty, A.R.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.