메뉴 건너뛰기




Volumn , Issue , 2016, Pages 632-638

The 2015 Sheffield system for longitudinal diarisation of broadcast media

Author keywords

adaptation; linking; neural networks; speaker diarisation

Indexed keywords

AUDIO RECORDINGS; NEURAL NETWORKS;

EID: 84964507800     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2015.7404855     Document Type: Conference Paper
Times cited : (11)

References (25)
  • 3
    • 84875953283 scopus 로고    scopus 로고
    • Clustering via the Bayesian information criterion with applications in speech recognition
    • (Seattle,WA)
    • S. S. Chen and P. S. Gopalakrishnan, "Clustering via the Bayesian information criterion with applications in speech recognition," in ICASSP, (Seattle,WA), pp. 645-648, 1998
    • (1998) ICASSP , pp. 645-648
    • Chen, S.S.1    Gopalakrishnan, P.S.2
  • 4
    • 84865770392 scopus 로고    scopus 로고
    • Speaker linking in large data sets
    • Brno, Czech Republic, June 28-July 1, 2010
    • D. A. Leeuwen, "Speaker linking in large data sets," in Odyssey 2010, Brno, Czech Republic, June 28-July 1, 2010, p. 35, 2010
    • (2010) Odyssey 2010 , pp. 35
    • Leeuwen, D.A.1
  • 5
    • 84865759467 scopus 로고    scopus 로고
    • The speaker partitioning problem
    • Brno, Czech Republic, June 28-July 1, 2010
    • N. Brummer and E. Villiers, "The speaker partitioning problem," in Odyssey 2010, Brno, Czech Republic, June 28-July 1, 2010, p. 34, 2010
    • (2010) Odyssey 2010 , pp. 34
    • Brummer, N.1    Villiers, E.2
  • 6
    • 84865729834 scopus 로고    scopus 로고
    • Partitioning of two-speaker conversation datasets
    • Florence, Italy, August 27-31, 2011
    • C. Vaquero, A. Ortega, and E. Lleida, "Partitioning of two-speaker conversation datasets," in INTERSPEECH, Florence, Italy, August 27-31, 2011, pp. 385-388, 2011
    • (2011) INTERSPEECH , pp. 385-388
    • Vaquero, C.1    Ortega, A.2    Lleida, E.3
  • 7
    • 84867606902 scopus 로고    scopus 로고
    • Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach
    • Kyoto, Japan, March 25-30, 2012
    • H. Ghaemmaghami, D. Dean, R. Vogt, and S. Sridharan, "Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach," in ICASSP 2012, Kyoto, Japan, March 25-30, 2012, pp. 4185-4188, 2012
    • (2012) ICASSP 2012 , pp. 4185-4188
    • Ghaemmaghami, H.1    Dean, D.2    Vogt, R.3    Sridharan, S.4
  • 8
    • 84874227906 scopus 로고    scopus 로고
    • Speaker diarization and linking of large corpora
    • Miami, FL, USA, December 2-5, 2012
    • M. Ferras and H. Boudard, "Speaker diarization and linking of large corpora," in IEEE SLT, Miami, FL, USA, December 2-5, 2012, pp. 280-285, 2012
    • (2012) IEEE SLT , pp. 280-285
    • Ferras, M.1    Boudard, H.2
  • 9
    • 84865776156 scopus 로고    scopus 로고
    • Comparing multi-stage approaches for cross-show speaker diarization
    • Florence, Italy, August 27-31, 2011
    • V. Tran, V. B. Le, C. Barras, and L. Lamel, "Comparing multi-stage approaches for cross-show speaker diarization," in INTERSPEECH, Florence, Italy, August 27-31, 2011, pp. 1053-1056, 2011
    • (2011) INTERSPEECH , pp. 1053-1056
    • Tran, V.1    Le, V.B.2    Barras, C.3    Lamel, L.4
  • 10
    • 84865734172 scopus 로고    scopus 로고
    • Investigation of crossshow speaker diarization
    • Italy, August 27-31, 2011
    • Q. Yang, Q. Jin, and T. Schultz, "Investigation of crossshow speaker diarization," in INTERSPEECH Florence, Italy, August 27-31, 2011, pp. 2925-2928, 2011
    • (2011) INTERSPEECH Florence , pp. 2925-2928
    • Yang, Q.1    Jin, Q.2    Schultz, T.3
  • 11
    • 84906274473 scopus 로고    scopus 로고
    • An open-source state-of-The-art toolbox for broadcast news diarization
    • Lyon, France, August 25-29, 2013
    • M. Rouvier, G. Dupuy, P. Gay, E. el Khoury, T. Merlin, and S. Meignier, "An open-source state-of-The-art toolbox for broadcast news diarization," in INTERSPEECH, Lyon, France, August 25-29, 2013, pp. 1477-1481, 2013
    • (2013) INTERSPEECH , pp. 1477-1481
    • Rouvier, M.1    Dupuy, G.2    Gay, P.3    El Khoury, E.4    Merlin, T.5    Meignier, S.6
  • 12
    • 84973386174 scopus 로고    scopus 로고
    • Corpus description of the ESTER evaluation campaign for the rich transcription of French broadcast news
    • (Genoa, Italy)
    • S. Galliano, E. Geoffrois, G. Gravier, J. F. Bonastre, D. Mostefa, and K. Choukri, "Corpus description of the ESTER evaluation campaign for the rich transcription of French broadcast news," in LREC, (Genoa, Italy), pp. 139-142, 2006
    • (2006) LREC , pp. 139-142
    • Galliano, S.1    Geoffrois, E.2    Gravier, G.3    Bonastre, J.F.4    Mostefa, D.5    Choukri, K.6
  • 13
    • 84910061411 scopus 로고    scopus 로고
    • The first official REPERE evaluation
    • O. Galibert and J. Kahn, "The first official REPERE evaluation," in SLAM, 2013
    • (2013) SLAM
    • Galibert, O.1    Kahn, J.2
  • 16
    • 84964437387 scopus 로고    scopus 로고
    • Accessed: 08-07-2015
    • "Diarisation error rate scoring code, NIST." http://www.itl.nist.gov/iad/mig/tests/rt/2006-spring/code/md-eval-v21.pl. Accessed: 08-07-2015
    • Diarisation Error Rate Scoring Code
  • 17
    • 84905238677 scopus 로고    scopus 로고
    • Brno University Accessed: 08-07-2015
    • "Neural Network Trainer TNet, Brno University." http://speech.fit.vutbr.cz/software/neural-networktrainer-tnet. Accessed: 08-07-2015
    • Neural Network Trainer TNet
  • 19
    • 84946687643 scopus 로고    scopus 로고
    • Semi-supervised DNN training in meeting recognition
    • (South Lake Tahoe, CA)
    • P. Zhang, Y. Liu, and T. Hain, "Semi-supervised DNN training in meeting recognition," in Proceedings of SLT, (South Lake Tahoe, CA), 2014
    • (2014) Proceedings of SLT
    • Zhang, P.1    Liu, Y.2    Hain, T.3
  • 20
    • 40249083942 scopus 로고    scopus 로고
    • The segmentation of multi-channel meeting recordings for automatic speech recognition
    • J. Dines, J. Vepa, and T. Hain, "The segmentation of multi-channel meeting recordings for automatic speech recognition," in Interspeech'06, 2006
    • (2006) Interspeech'06
    • Dines, J.1    Vepa, J.2    Hain, T.3
  • 21
    • 70450152040 scopus 로고    scopus 로고
    • SHoUT, the university of twente submission to the n-best 2008 speech recognition evaluation for Dutch
    • Brighton, United Kingdom, September 6-10, 2009
    • M. Huijbregts, R. Ordelman, L. Werff, and F. M. G. Jong, "SHoUT, the university of twente submission to the n-best 2008 speech recognition evaluation for dutch," in INTERSPEECH, Brighton, United Kingdom, September 6-10, 2009, pp. 2575-2578, 2009
    • (2009) INTERSPEECH , pp. 2575-2578
    • Huijbregts, M.1    Ordelman, R.2    Werff, L.3    Jong, F.M.G.4
  • 25
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
    • B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in INTERSPEECH, 2012
    • (2012) INTERSPEECH
    • Kingsbury, B.1    Sainath, T.N.2    Soltau, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.