SCOPUS 정보 검색 플랫폼

2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings

Volumn , Issue , 2016, Pages 632-638

The 2015 Sheffield system for longitudinal diarisation of broadcast media

(6) Milner, Rosanna a Saz, Oscar a Deena, Salil a Doulaty, Mortaza a Ng, Raymond W M a Hain, Thomas a

a UNIVERSITY OF SHEFFIELD (United Kingdom)

Author keywords

adaptation; linking; neural networks; speaker diarisation

Indexed keywords

AUDIO RECORDINGS; NEURAL NETWORKS;

ADAPTATION; BACKGROUND CONDITIONS; CONSTRAINED RESOURCES; IMPROVE PERFORMANCE; LINKING; SPEAKER DIARISATION; SPEAKER SEGMENTATION AND CLUSTERING; SPEECH ACTIVITY DETECTIONS;

SPEECH RECOGNITION;

EID: 84964507800 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2015.7404855 Document Type: Conference Paper

Times cited : (11)

References (25)

1
- 34047261805
- An overview of automatic speaker diarization systems
- S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Transactions on Audio, Speech &Language Processing, vol. 14, no. 5, pp. 1557-1565, 2006
- (2006) IEEE Transactions on Audio, Speech &Language Processing , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.E.¹ Reynolds, D.A.²

2
- 85008530405
- Speaker diarization: A review of recent research
- X. A. Miró, S. Bozonnet, N.W. D. Evans, C. Fredouille, G. Friedland, and O. Vinyals, "Speaker diarization: A review of recent research," IEEE Transactions on Audio, Speech &Language Processing, vol. 20, no. 2, pp. 356-370, 2012
- (2012) IEEE Transactions on Audio, Speech &Language Processing , vol.20 , Issue.2 , pp. 356-370
- Miró, X.A.¹ Bozonnet, S.² Evans, N.W.D.³ Fredouille, C.⁴ Friedland, G.⁵ Vinyals, O.⁶

3
- 84875953283
- Clustering via the Bayesian information criterion with applications in speech recognition
- (Seattle,WA)
- S. S. Chen and P. S. Gopalakrishnan, "Clustering via the Bayesian information criterion with applications in speech recognition," in ICASSP, (Seattle,WA), pp. 645-648, 1998
- (1998) ICASSP , pp. 645-648
- Chen, S.S.¹ Gopalakrishnan, P.S.²

4
- 84865770392
- Speaker linking in large data sets
- Brno, Czech Republic, June 28-July 1, 2010
- D. A. Leeuwen, "Speaker linking in large data sets," in Odyssey 2010, Brno, Czech Republic, June 28-July 1, 2010, p. 35, 2010
- (2010) Odyssey 2010 , pp. 35
- Leeuwen, D.A.¹

5
- 84865759467
- The speaker partitioning problem
- Brno, Czech Republic, June 28-July 1, 2010
- N. Brummer and E. Villiers, "The speaker partitioning problem," in Odyssey 2010, Brno, Czech Republic, June 28-July 1, 2010, p. 34, 2010
- (2010) Odyssey 2010 , pp. 34
- Brummer, N.¹ Villiers, E.²

6
- 84865729834
- Partitioning of two-speaker conversation datasets
- Florence, Italy, August 27-31, 2011
- C. Vaquero, A. Ortega, and E. Lleida, "Partitioning of two-speaker conversation datasets," in INTERSPEECH, Florence, Italy, August 27-31, 2011, pp. 385-388, 2011
- (2011) INTERSPEECH , pp. 385-388
- Vaquero, C.¹ Ortega, A.² Lleida, E.³

7
- 84867606902
- Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach
- Kyoto, Japan, March 25-30, 2012
- H. Ghaemmaghami, D. Dean, R. Vogt, and S. Sridharan, "Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach," in ICASSP 2012, Kyoto, Japan, March 25-30, 2012, pp. 4185-4188, 2012
- (2012) ICASSP 2012 , pp. 4185-4188
- Ghaemmaghami, H.¹ Dean, D.² Vogt, R.³ Sridharan, S.⁴

8
- 84874227906
- Speaker diarization and linking of large corpora
- Miami, FL, USA, December 2-5, 2012
- M. Ferras and H. Boudard, "Speaker diarization and linking of large corpora," in IEEE SLT, Miami, FL, USA, December 2-5, 2012, pp. 280-285, 2012
- (2012) IEEE SLT , pp. 280-285
- Ferras, M.¹ Boudard, H.²

9
- 84865776156
- Comparing multi-stage approaches for cross-show speaker diarization
- Florence, Italy, August 27-31, 2011
- V. Tran, V. B. Le, C. Barras, and L. Lamel, "Comparing multi-stage approaches for cross-show speaker diarization," in INTERSPEECH, Florence, Italy, August 27-31, 2011, pp. 1053-1056, 2011
- (2011) INTERSPEECH , pp. 1053-1056
- Tran, V.¹ Le, V.B.² Barras, C.³ Lamel, L.⁴

10
- 84865734172
- Investigation of crossshow speaker diarization
- Italy, August 27-31, 2011
- Q. Yang, Q. Jin, and T. Schultz, "Investigation of crossshow speaker diarization," in INTERSPEECH Florence, Italy, August 27-31, 2011, pp. 2925-2928, 2011
- (2011) INTERSPEECH Florence , pp. 2925-2928
- Yang, Q.¹ Jin, Q.² Schultz, T.³

11
- 84906274473
- An open-source state-of-The-art toolbox for broadcast news diarization
- Lyon, France, August 25-29, 2013
- M. Rouvier, G. Dupuy, P. Gay, E. el Khoury, T. Merlin, and S. Meignier, "An open-source state-of-The-art toolbox for broadcast news diarization," in INTERSPEECH, Lyon, France, August 25-29, 2013, pp. 1477-1481, 2013
- (2013) INTERSPEECH , pp. 1477-1481
- Rouvier, M.¹ Dupuy, G.² Gay, P.³ El Khoury, E.⁴ Merlin, T.⁵ Meignier, S.⁶

12
- 84973386174
- Corpus description of the ESTER evaluation campaign for the rich transcription of French broadcast news
- (Genoa, Italy)
- S. Galliano, E. Geoffrois, G. Gravier, J. F. Bonastre, D. Mostefa, and K. Choukri, "Corpus description of the ESTER evaluation campaign for the rich transcription of French broadcast news," in LREC, (Genoa, Italy), pp. 139-142, 2006
- (2006) LREC , pp. 139-142
- Galliano, S.¹ Geoffrois, E.² Gravier, G.³ Bonastre, J.F.⁴ Mostefa, D.⁵ Choukri, K.⁶

13
- 84910061411
- The first official REPERE evaluation
- O. Galibert and J. Kahn, "The first official REPERE evaluation," in SLAM, 2013
- (2013) SLAM
- Galibert, O.¹ Kahn, J.²

14
- 84873834890
- Speaker diarization of broadcast news in Albayzin 2010 evaluation campaign
- M. Zelenak, H. Schulz, and J. Hernando, "Speaker diarization of broadcast news in Albayzin 2010 evaluation campaign," EURASIP Journal on Audio, Speech and Music Processing, vol. 19, pp. 1-9, 2012
- (2012) EURASIP Journal on Audio, Speech and Music Processing , vol.19 , pp. 1-9
- Zelenak, M.¹ Schulz, H.² Hernando, J.³

15
- 85010742974
- The MGB challenge: Evaluating multi-genre broadcast media transcription
- Scottsdale, AZ, 2015
- P. Bell, M. J. F. Gales, T. Hain, J. Kilgour, P. Lanchantin, X. Liu, A. McParland, S. Renals, O. Saz, M. Webster, and P. Woodland, "The MGB Challenge: Evaluating Multi-genre Broadcast Media Transcription," in ASRU 2015, Scottsdale, AZ, 2015, 2015
- (2015) ASRU 2015
- Bell, P.¹ Gales, M.J.F.² Hain, T.³ Kilgour, J.⁴ Lanchantin, P.⁵ Liu, X.⁶ McParland, A.⁷ Renals, S.⁸ Saz, O.⁹ Webster, M.¹⁰ Woodland, P.¹¹

16
- 84964437387
- Accessed: 08-07-2015
- "Diarisation error rate scoring code, NIST." http://www.itl.nist.gov/iad/mig/tests/rt/2006-spring/code/md-eval-v21.pl. Accessed: 08-07-2015
- Diarisation Error Rate Scoring Code

17
- 84905238677
- Brno University Accessed: 08-07-2015
- "Neural Network Trainer TNet, Brno University." http://speech.fit.vutbr.cz/software/neural-networktrainer-tnet. Accessed: 08-07-2015
- Neural Network Trainer TNet

18
- 85009254284
- TRAPS-classifiers of temporal patterns
- Sydney, Australia, 30th November-4th December 1998
- H. Hermansky and S. Sharma, "TRAPS-classifiers of temporal patterns," in The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November-4th December 1998, 1998
- (1998) The 5th International Conference on Spoken Language Processing, Incorporating the 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre
- Hermansky, H.¹ Sharma, S.²

19
- 84946687643
- Semi-supervised DNN training in meeting recognition
- (South Lake Tahoe, CA)
- P. Zhang, Y. Liu, and T. Hain, "Semi-supervised DNN training in meeting recognition," in Proceedings of SLT, (South Lake Tahoe, CA), 2014
- (2014) Proceedings of SLT
- Zhang, P.¹ Liu, Y.² Hain, T.³

20
- 40249083942
- The segmentation of multi-channel meeting recordings for automatic speech recognition
- J. Dines, J. Vepa, and T. Hain, "The segmentation of multi-channel meeting recordings for automatic speech recognition," in Interspeech'06, 2006
- (2006) Interspeech'06
- Dines, J.¹ Vepa, J.² Hain, T.³

21
- 70450152040
- SHoUT, the university of twente submission to the n-best 2008 speech recognition evaluation for Dutch
- Brighton, United Kingdom, September 6-10, 2009
- M. Huijbregts, R. Ordelman, L. Werff, and F. M. G. Jong, "SHoUT, the university of twente submission to the n-best 2008 speech recognition evaluation for dutch," in INTERSPEECH, Brighton, United Kingdom, September 6-10, 2009, pp. 2575-2578, 2009
- (2009) INTERSPEECH , pp. 2575-2578
- Huijbregts, M.¹ Ordelman, R.² Werff, L.³ Jong, F.M.G.⁴

22
- 72449169479
- Phd thesis, University of Twente
- M. Huijbregts, Segmentation, Diarization and Speech Transcription: Surprise Data Unraveled. Phd thesis, University of Twente, 2008
- (2008) Segmentation, Diarization and Speech Transcription: Surprise Data Unraveled
- Huijbregts, M.¹

23
- 85008520364
- Transcribing meetings with the AMIDA systems
- T. Hain, L. Burget, J. Dines, P. Garner, F. Grezl, A. Hannani, M. Huijbregts, M. Karafiat, M. Lincoln, and V. Wan, "Transcribing meetings with the AMIDA systems," IEEE Transactions on Audio, Speech &Language Processing, vol. 20, no. 2, pp. 486-498, 2012
- (2012) IEEE Transactions on Audio, Speech &Language Processing , vol.20 , Issue.2 , pp. 486-498
- Hain, T.¹ Burget, L.² Dines, J.³ Garner, P.⁴ Grezl, F.⁵ Hannani, A.⁶ Huijbregts, M.⁷ Karafiat, M.⁸ Lincoln, M.⁹ Wan, V.¹⁰

24
- 84905269643
- Using neural network front-ends on far field multiple microphones based speech recognition
- May
- Y. Liu, P. Zhang, and T. Hain, "Using neural network front-ends on far field multiple microphones based speech recognition," in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp. 5542-5546, May 2014
- (2014) Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on , pp. 5542-5546
- Liu, Y.¹ Zhang, P.² Hain, T.³

25
- 84878379108
- Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
- B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in INTERSPEECH, 2012
- (2012) INTERSPEECH
- Kingsbury, B.¹ Sainath, T.N.² Soltau, H.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.