SCOPUS Information Search Platform

2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings

Volume , Issue , 2016, Pages 647-653

The development of the Cambridge University alignment systems for the multi-genre broadcast challenge

(8) Lanchantin, P.ᵃ Gales, M.J.F.ᵃ Karanasou, P.ᵃ Liu, X.ᵃ Qian, Y.ᵃ Wang, L.ᵃ Woodland, P.C.ᵃ Zhang, C.ᵃ

ᵃ University of Cambridge (United Kingdom)

Author keywords

Alignment; Lightly Supervised Training; Multi genre Broadcast transcription

Indexed keywords

ALIGNMENT; AUDIO ACOUSTICS; TRANSCRIPTION;

ACOUSTIC MODEL; ALIGNMENT SYSTEM; AUDIO SEGMENTATION; CAMBRIDGE UNIVERSITY; CONFIDENCE SCORE; DEEP NEURAL NETWORKS; SPLIT POINTS; SUPERVISED TRAININGS;

SPEECH RECOGNITION;

EID: 84964556219 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2015.7404857 Document Type: Conference Paper

Times cited : (11)

References (33)

1
- 70450190034
- Podcastle: Collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription
- J. Ogata and M. Goto, "Podcastle: Collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription," in Proc. Interspeech, 2009
- (2009) Proc. Interspeech
- Ogata, J.¹ Goto, M.²

2
- 70349217247
- An audio indexing system for election video material
- C. Alberti, M. Bacchiani, A. Bezman, C. Chelba, A. Drofa, H. Liao, P. Moreno, T. Power, A. Sahuguet, M. Shugrina, and O. Siohan, "An audio indexing system for election video material," in Proc ICASSP, 2009, pp. 4873-4876
- (2009) Proc ICASSP , pp. 4873-4876
- Alberti, C.¹ Bacchiani, M.² Bezman, A.³ Chelba, C.⁴ Drofa, A.⁵ Liao, H.⁶ Moreno, P.⁷ Power, T.⁸ Sahuguet, A.⁹ Shugrina, M.¹⁰ Siohan, O.¹¹

3
- 84874275817
- Generative kernels and score-spaces for classification of speech: Progress report
- R.C. Dalen, J. Yang, and M.J.F. Gales, "Generative kernels and score-spaces for classification of speech: Progress report," Tech. Rep., Cambridge University Engineering Department, 2012
- (2012) Tech. Rep., Cambridge University Engineering Department
- Dalen, R.C.¹ Yang, J.² Gales, M.J.F.³

4
- 84862147437
- Overview of MediaEval 2011 Rich speech retrieval task and genre tagging task
- M. Larson, M. Eskevich, R. Ordelman, C. Kofler, S. Schmiedeke, and G.J.F. Jones, "Overview of MediaEval 2011 Rich speech retrieval task and genre tagging task," in Working Notes Proceedings of the MediaEval 2011 Workshop, 2011
- (2011) Working Notes Proceedings of the MediaEval 2011 Workshop
- Larson, M.¹ Eskevich, M.² Ordelman, R.³ Kofler, C.⁴ Schmiedeke, S.⁵ Jones, G.J.F.⁶

5
- 84874284321
- Automatic semantic tagging of speech audio
- Y. Raimond, C. Lowis, R. Hodgson, and J. Tweed, "Automatic semantic tagging of speech audio," in Proc. WWW 2012, 2012
- (2012) Proc. WWW 2012
- Raimond, Y.¹ Lowis, C.² Hodgson, R.³ Tweed, J.⁴

6
- 84905279052
- Automatic transcription of multi-genre media archives
- P. Lanchantin, P.J. Bell, M.J.F. Gales, T. Hain, X. Liu, Y. Long, J. Quinnell, S. Renals, O. Saz, M.S. Seigel, P. Swietojanski, and P.C. Woodland, "Automatic transcription of multi-genre media archives," in Proc of the first Workshop on Speech, Language and Audio in Multimedia, 2013
- (2013) Proc of the First Workshop on Speech, Language and Audio in Multimedia
- Lanchantin, P.¹ Bell, P.J.² Gales, M.J.F.³ Hain, T.⁴ Liu, X.⁵ Long, Y.⁶ Quinnell, J.⁷ Renals, S.⁸ Saz, O.⁹ Seigel, M.S.¹⁰ Swietojanski, P.¹¹ Woodland, P.C.¹²

7
- 84964470805
- The MGB Challenge: Evaluating multi-genre broadcast media transcription
- P. Bell, M.J.F. Gales, T. Hain, J. Kilgour, P. Lanchantin, X. Liu, A. McParland, S. Renals, O. Saz, M. Wester, and P.C. Woodland, "The MGB Challenge: evaluating multi-genre broadcast media transcription," in Proc. IEEE ASRU, 2015
- (2015) Proc. IEEE ASRU
- Bell, P.¹ Gales, M.J.F.² Hain, T.³ Kilgour, J.⁴ Lanchantin, P.⁵ Liu, X.⁶ McParland, A.⁷ Renals, S.⁸ Saz, O.⁹ Wester, M.¹⁰ Woodland, P.C.¹¹

8
- 84910039499
- Automatic generation of hyperlinks between audio and transcript
- J. Robert-Ribes and R. Mukhtar, "Automatic generation of hyperlinks between audio and transcript," in Proc. Eurospeech, 1997
- (1997) Proc. Eurospeech
- Robert-Ribes, J.¹ Mukhtar, R.²

9
- 84885726863
- A recursive algorithm for the forced alignment of very long audio segments
- P.J. Moreno, C. Joerg, J.M.V. Thong, and O. Glickman, "A recursive algorithm for the forced alignment of very long audio segments," in International Conference on Spoken Language Processing, 1998, vol. 8
- (1998) International Conference on Spoken Language Processing , vol.8
- Moreno, P.J.¹ Joerg, C.² Thong, J.M.V.³ Glickman, O.⁴

10
- 0036460908
- Lightly supervised and unsupervised acoustic model training
- L. Lamel, J.L. Gauvain, and G. Adda, "Lightly supervised and unsupervised acoustic model training," in Computer Speech and Language, 2002, vol. 16, pp. 115-129
- (2002) Computer Speech and Language , vol.16 , pp. 115-129
- Lamel, L.¹ Gauvain, J.L.² Adda, G.³

11
- 84907336951
- An efficient repair procedure for quick transcriptions
- A. Venkataraman, A. Stolcke, W. Wang, D. Vergyri, V.R.R. Gadde, and J. Zheng, "An efficient repair procedure for quick transcriptions," in Proc. ICSLP, 2004
- (2004) Proc. ICSLP
- Venkataraman, A.¹ Stolcke, A.² Wang, W.³ Vergyri, D.⁴ Gadde, V.R.R.⁵ Zheng, J.⁶

12
- 4544253838
- Improving broadcast news transcription by lightly supervised discriminative training
- H.Y. Chan and P.C. Woodland, "Improving broadcast news transcription by lightly supervised discriminative training," in Proc. ICASSP, 2004, vol. 1, pp. 737-740
- (2004) Proc. ICASSP , vol.1 , pp. 737-740
- Chan, H.Y.¹ Woodland, P.C.²

13
- 33646762098
- Discriminative training of acoustic models applied to domains with unreliable transcripts
- L. Mathias, G. Yegnanarayanan, and J. Fritsch, "Discriminative training of acoustic models applied to domains with unreliable transcripts," in Proc. ICASSP, 2005, vol. 1, pp. 109-112
- (2005) Proc. ICASSP , vol.1 , pp. 109-112
- Mathias, L.¹ Yegnanarayanan, G.² Fritsch, J.³

14
- 44949259199
- Imperfect transcript driven speech recognition
- B. Lecouteux, G. Linares, P. Nocera, and J.F. Bonastre, "Imperfect transcript driven speech recognition," in Proc. Inter-Speech'06, 2006, pp. 1626-1629
- (2006) Proc. Inter-Speech'06 , pp. 1626-1629
- Lecouteux, B.¹ Linares, G.² Nocera, P.³ Bonastre, J.F.⁴

15
- 46449097482
- Alignment of speech to highly imperfect text transcriptions
- A. Haubold and J. Kender, "Alignment of speech to highly imperfect text transcriptions," in IEEE International Conference on Multimedia and Expo, 2007, pp. 224-227
- (2007) IEEE International Conference on Multimedia and Expo , pp. 224-227
- Haubold, A.¹ Kender, J.²

16
- 79959817774
- Lightly supervised recognition for automatic alignment of large coherent speech recordings
- N. Braunschweiler, M.J.F. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings," in Proc. Interspeech, 2010, pp. 2222-2225
- (2010) Proc. Interspeech , pp. 2222-2225
- Braunschweiler, N.¹ Gales, M.J.F.² Buchholz, S.³

17
- 84906260292
- Text-to-speech alignment of long recordings using universal phone models
- S. Hoffmann and B. Pfister, "Text-to-speech alignment of long recordings using universal phone models," in Proc Interspeech, 2013, pp. 1520-1524
- (2013) Proc Interspeech , pp. 1520-1524
- Hoffmann, S.¹ Pfister, B.²

18
- 84878532221
- A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions
- G. Bordel, S. Nieto, M. Penagarikano, L.J. Rodriguez-Fuentes, and A. Varona, "A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions," in Proc Interspeech, 2012
- (2012) Proc Interspeech
- Bordel, G.¹ Nieto, S.² Penagarikano, M.³ Rodriguez-Fuentes, L.J.⁴ Varona, A.⁵

19
- 84905284228
- Long audio alignment for automatic subtitling using different phone-relatedness measures
- A. Alvarez, H. Arzelus, and P. Ruiz, "Long audio alignment for automatic subtitling using different phone-relatedness measures," in Proc ICASSP, 2014
- (2014) Proc ICASSP
- Alvarez, A.¹ Arzelus, H.² Ruiz, P.³

20
- 84959132764
- Towards fully automatic annotation of audiobooks for TTS
- O. Boeffard, L. Charonnat, S. L. Maguer, D. Lolive, and G. Vidal, "Towards fully automatic annotation of audiobooks for TTS," in International Conference on Language Resources and Evaluation, 2012
- (2012) International Conference on Language Resources and Evaluation
- Boeffard, O.¹ Charonnat, L.² Maguer, S.L.³ Lolive, D.⁴ Vidal, G.⁵

21
- 0030374920
- Automatic text-independent pronunciation scoring of foreign language student speech
- L. Neumeyer, H. Franco, M. Weintraub, and P. Price, "Automatic text-independent pronunciation scoring of foreign language student speech," in Proc. of ICSLP 96, 1996
- (1996) Proc. of ICSLP 96
- Neumeyer, L.¹ Franco, H.² Weintraub, M.³ Price, P.⁴

22
- 0001790722
- Automatic evaluation and training in English pronunciation
- J. Bernstein, M. Cohen, H. Murveit, D. Rtischev, and M. Weintraub, "Automatic evaluation and training in English pronunciation," in Proc. of ICSLP, 1990, pp. 1185-1188
- (1990) Proc. of ICSLP , pp. 1185-1188
- Bernstein, J.¹ Cohen, M.² Murveit, H.³ Rtischev, D.⁴ Weintraub, M.⁵

23
- 0342321399
- Audio-indexing for broadcast news
- S. Dharanipragada, M. Franz, and S. Roukos, "Audio-indexing for broadcast news," in Proc. of TREC6, 1997
- (1997) Proc. of TREC6
- Dharanipragada, S.¹ Franz, M.² Roukos, S.³

24
- 0032646977
- An overview of audio information retrieval
- J. Foote, "An overview of audio information retrieval," in ACM Multimedia Systems, 1999
- (1999) ACM Multimedia Systems
- Foote, J.¹

25
- 34047266379
- Progress in the CU-HTK broadcast news transcription system
- M.J.F. Gales, D.Y. Kim, P.C. Woodland, D. Mrva, R. Sinha, and S.E. Tranter, "Progress in the CU-HTK broadcast news transcription system," in IEEE Transactions on Audio Speech and Language Processing, September 2006
- (2006) IEEE Transactions on Audio Speech and Language Processing, September
- Gales, M.J.F.¹ Kim, D.Y.² Woodland, P.C.³ Mrva, D.⁴ Sinha, R.⁵ Tranter, S.E.⁶

26
- 84946728861
- Design of fast LVCSR systems
- G. Evermann and P.C. Woodland, "Design of fast LVCSR systems," in Proc. ASRU Workshop, 2003
- (2003) Proc. ASRU Workshop
- Evermann, G.¹ Woodland, P.C.²

27
- 85053488053
- Respeaking the BBC news: A strategic analysis of respeaking on the BBC
- C. Eugeni, "Respeaking the BBC news: A strategic analysis of respeaking on the BBC," The Sign Language Translator and Interpreter, vol. 3, no. 1, pp. 29-68, 2009
- (2009) The Sign Language Translator and Interpreter , vol.3 , Issue.1 , pp. 29-68
- Eugeni, C.¹

28
- 84906276653
- Improving lightly supervised training for broadcast transcriptions
- Y. Long, M.J.F. Gales, P. Lanchantin, X. Liu, M. S. Seigel, and P.C. Woodland, "Improving lightly supervised training for broadcast transcriptions," in Proc. Interspeech, 2013
- (2013) Proc. Interspeech
- Long, Y.¹ Gales, M.J.F.² Lanchantin, P.³ Liu, X.⁴ Seigel, M.S.⁵ Woodland, P.C.⁶

29
- 84964503174
- HTK 3.5, http://htk.eng.cam.ac.uk

30
- 84959142742
- A general artificial neural network extension for HTK
- C. Zhang and P.C. Woodland, "A general artificial neural network extension for HTK," in Proc. Interspeech, 2015
- (2015) Proc. Interspeech
- Zhang, C.¹ Woodland, P.C.²

31
- 84964475976
- Cambridge University transcription systems for the multi-genre broadcast challenge
- P.C. Woodland, X. Liu, Y. Qian, C. Zhang, M.J.F. Gales, P. Karanasou, P. Lanchantin, and L. Wang, "Cambridge University transcription systems for the Multi-Genre Broadcast Challenge," in Proc. ASRU, 2015
- (2015) Proc. ASRU
- Woodland, P.C.¹ Liu, X.² Qian, Y.³ Zhang, C.⁴ Gales, M.J.F.⁵ Karanasou, P.⁶ Lanchantin, P.⁷ Wang, L.⁸

32
- 84964556678
- Speaker diarisation and longitudinal linking on multi-genre broadcast data
- P. Karanasou, M.J.F. Gales, P. Lanchantin, X. Liu, Y. Qian, L. Wang, P.C. Woodland, and C. Zhang, "Speaker diarisation and longitudinal linking on multi-genre broadcast data," in Proc. ASRU, 2015
- (2015) Proc. ASRU
- Karanasou, P.¹ Gales, M.J.F.² Lanchantin, P.³ Liu, X.⁴ Qian, Y.⁵ Wang, L.⁶ Woodland, P.C.⁷ Zhang, C.⁸

33
- 84964503191
- The Cambridge University March 2005 speaker diarisation system
- R. Sinha, S. E. Tranter, M. J. F. Gales, and P. C. Woodland, "The Cambridge University March 2005 speaker diarisation system," in Interspeech, 2005
- (2005) Interspeech
- Sinha, R.¹ Tranter, S.E.² Gales, M.J.F.³ Woodland, P.C.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.