-
1
-
-
70450190034
-
Podcastle: Collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription
-
J. Ogata and M. Goto, "Podcastle: Collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription," in Proc. Interspeech, 2009
-
(2009)
Proc. Interspeech
-
-
Ogata, J.1
Goto, M.2
-
2
-
-
70349217247
-
An audio indexing system for election video material
-
C. Alberti, M. Bacchiani, A. Bezman, C. Chelba, A. Drofa, H. Liao, P. Moreno, T. Power, A. Sahuguet, M. Shugrina, and O. Siohan, "An audio indexing system for election video material," in Proc ICASSP, 2009, pp. 4873-7876
-
(2009)
Proc ICASSP
, pp. 4873-7876
-
-
Alberti, C.1
Bacchiani, M.2
Bezman, A.3
Chelba, C.4
Drofa, A.5
Liao, H.6
Moreno, P.7
Power, T.8
Sahuguet, A.9
Shugrina, M.10
Siohan, O.11
-
3
-
-
84874275817
-
-
Tech. Rep. , Cambridge University Engineering Department
-
R.C. Dalen, J. Yang, and M.J.F. Gales, "Generative kernels and score-spaces for classification of speech: Progress report," in Tech. Rep. , Cambridge University Engineering Department, 2012
-
(2012)
Generative Kernels and Score-spaces for Classification of Speech: Progress Report
-
-
Dalen, R.C.1
Yang, J.2
Gales, M.J.F.3
-
4
-
-
84862147437
-
Overview of Mediaeval 2011 Rich speech retrieval task and genre tagging task
-
M. Larson, M. Eskevitch, R. Orderlman, C. Kofler, S. Schmiedeke, and G.J.F. Jones, "Overview of Mediaeval 2011 Rich speech retrieval task and genre tagging task," in Working Notes Proceedings of the MediaEval 2011 Workshop, 2011
-
(2011)
Working Notes Proceedings of the MediaEval 2011 Workshop
-
-
Larson, M.1
Eskevitch, M.2
Orderlman, R.3
Kofler, C.4
Schmiedeke, S.5
Jones, G.J.F.6
-
5
-
-
84874284321
-
Automatic semantic tagging of speech audio
-
Y. Raimond, C. Lowis, R. Hodgson, and J. Tweed, "Automatic semantic tagging of speech audio," in Proc. WWW 2012, 2012
-
(2012)
Proc. WWW 2012
-
-
Raimond, Y.1
Lowis, C.2
Hodgson, R.3
Tweed, J.4
-
6
-
-
84905279052
-
Automatic transcription of multi-genre media archives
-
P. Lanchantin, P.J. Bell, M.J.F. Gales, T. Hain, X. Liu, Y. Long, J. Quinnell, S. Renals, O. Saz, M.S. Seigel, P. Swietojanski, and P.C. Woodland, "Automatic transcription of multi-genre media archives," in Proc of the first Workshop on Speech, Language and Audio in Multimedia, 2013
-
(2013)
Proc of the First Workshop on Speech, Language and Audio in Multimedia
-
-
Lanchantin, P.1
Bell, P.J.2
Gales, M.J.F.3
Hain, T.4
Liu, X.5
Long, Y.6
Quinnell, J.7
Renals, S.8
Saz, O.9
Seigel, M.S.10
Swietojanski, P.11
Woodland, P.C.12
-
7
-
-
84964470805
-
The MGB Challenge: Evaluating multi-genre broadcast media transcription
-
P. Bell, M.J.F. Gales, T. Hain, J. Kilgour, P. Lanchantin, X. Liu, A. McParland, S. Renals, O. Saz, M. Wester, and P.C. Woodland, "The MGB Challenge: evaluating multi-genre broadcast media transcription," in Proc. IEEE ASRU, 2015
-
(2015)
Proc. IEEE ASRU
-
-
Bell, P.1
Gales, M.J.F.2
Hain, T.3
Kilgour, J.4
Lanchantin, P.5
Liu, X.6
McParland, A.7
Renals, S.8
Saz, O.9
Wester, M.10
Woodland, P.C.11
-
8
-
-
84910039499
-
Automatic generation of hyperlinks between audio and transcript
-
J. Robert-Ribes and R. Mukhtar, "Automatic generation of hyperlinks between audio and transcript," in Proc. Eurospeech, 1997
-
(1997)
Proc. Eurospeech
-
-
Robert-Ribes, J.1
Mukhtar, R.2
-
9
-
-
84885726863
-
A recursive algorithm for the forced alignment of very long audio segments
-
P.J. Moreno, C. Joerg, J.M.V. Thong, and O. Glickman, "A recursive algorithm for the forced alignment of very long audio segments," in International Conference on Spoken Language Processing, 1998, vol. 8
-
(1998)
International Conference on Spoken Language Processing
, vol.8
-
-
Moreno, P.J.1
Joerg, C.2
Thong, J.M.V.3
Glickman, O.4
-
10
-
-
0036460908
-
Lightly supervised and unsupervised acoustic model training
-
L. Lamel, J.L. Gauvain, and G. Adda, "Lightly supervised and unsupervised acoustic model training," in Computer Speech and Language, 2002, vol. 16, pp. 115-129
-
(2002)
Computer Speech and Language
, vol.16
, pp. 115-129
-
-
Lamel, L.1
Gauvain, J.L.2
Adda, G.3
-
11
-
-
84907336951
-
An efficient repair procedure for quick transcriptions
-
A. Venkataraman, A. Stolcke, W. Wang, D. Vergyri, V.R.R. Gadde, and J. Zheng, "An efficient repair procedure for quick transcriptions," in Proc. ICSLP, 2004
-
(2004)
Proc. ICSLP
-
-
Venkataraman, A.1
Stolcke, A.2
Wang, W.3
Vergyri, D.4
Gadde, V.R.R.5
Zheng, J.6
-
12
-
-
4544253838
-
Improving broadcast news transcription by lightly supervised discriminative training
-
H.Y. Chan and P.C. Woodland, "Improving broadcast news transcription by lightly supervised discriminative training," in Proc. ICASSP, 2004, vol. 1, pp. 737-740
-
(2004)
Proc. ICASSP
, vol.1
, pp. 737-740
-
-
Chan, H.Y.1
Woodland, P.C.2
-
13
-
-
33646762098
-
Discriminative training of acoustic models applied to domains with unreliable transcripts
-
L. Mathias, G. Yegnanarayanan, and J. Fritsch, "Discriminative training of acoustic models applied to domains with unreliable transcripts," in Proc. ICASSP, 2005, vol. 1, pp. 109-112
-
(2005)
Proc. ICASSP
, vol.1
, pp. 109-112
-
-
Mathias, L.1
Yegnanarayanan, G.2
Fritsch, J.3
-
14
-
-
44949259199
-
Imperfect transcript driven speech recognition
-
B. Lecouteux, G. Linares, P. Nocera, and J.F. Bonastre, "Imperfect transcript driven speech recognition," in Proc. Inter-Speech'06, 2006, pp. 1626-1629
-
(2006)
Proc. Inter-Speech'06
, pp. 1626-1629
-
-
Lecouteux, B.1
Linares, G.2
Nocera, P.3
Bonastre, J.F.4
-
16
-
-
79959817774
-
Lightly supervised recognition for automatic alignment of large coherent speech recordings
-
N. Braunschweiler, M.J.F Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings," in Proc. Interspeech, 2010, pp. 2222-2225
-
(2010)
Proc. Interspeech
, pp. 2222-2225
-
-
Braunschweiler, N.1
Gales, M.J.F.2
Buchholz, S.3
-
17
-
-
84906260292
-
Text-to-speech alignment of long recordings using universal phone models
-
S. Hoffman and B. Pfister, "Text-to-speech alignment of long recordings using universal phone models," in Proc Interspeech, 2013, pp. 1520-1524
-
(2013)
Proc Interspeech
, pp. 1520-1524
-
-
Hoffman, S.1
Pfister, B.2
-
18
-
-
84878532221
-
A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions
-
G. Bordel, S. Nieto, M. Penagarikano, L.J. Rodriguez-Fuentes, and A. Varona, "A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions," in Proc Interspeech, 2012
-
(2012)
Proc Interspeech
-
-
Bordel, G.1
Nieto, S.2
Penagarikano, M.3
Rodriguez-Fuentes, L.J.4
Varona, A.5
-
19
-
-
84905284228
-
Long audio alignment for automatic subtitling using different phone-relatedness measures
-
A. Alvarez, H. Arzelus, and P. Ruiz, "Long audio alignment for automatic subtitling using different phone-relatedness measures," in Proc ICASSP, 2014
-
(2014)
Proc ICASSP
-
-
Alvarez, A.1
Arzelus, H.2
Ruiz, P.3
-
20
-
-
84959132764
-
Towards fully automatic annotation of audiobooks for TTS
-
O. Boeffard, L. Charonnat, S. L. Maguer, D. Lolive, and G. Vidal, "Towards fully automatic annotation of audiobooks for TTS," in International Conference on Language Resources and Evaluation, 2012
-
(2012)
International Conference on Language Resources and Evaluation
-
-
Boeffard, O.1
Charonnat, L.2
Maguer, S.L.3
Lolive, D.4
Vidal, G.5
-
21
-
-
0030374920
-
Automatic text-independent pronunciation scoring of foreign language student speech
-
L. Neumeyer, H. Franco, M. Weintraub, and P. Price, "Automatic text-independent pronunciation scoring of foreign language student speech," in Proc. of ICSLP 96, 1996
-
(1996)
Proc. of ICSLP 96
-
-
Neumeyer, L.1
Franco, H.2
Weintraub, M.3
Price, P.4
-
22
-
-
0001790722
-
Automatic evaluation and training in english pronunciation
-
J. Bernstein, M. Cohen, H. Murveit, D. Rtischev, and M.Weintraub, "Automatic evaluation and training in english pronunciation," in Proc. of ICSLP, 1990, pp. 1185-1188
-
(1990)
Proc. of ICSLP
, pp. 1185-1188
-
-
Bernstein, J.1
Cohen, M.2
Murveit, H.3
Rtischev, D.4
Weintraub, M.5
-
24
-
-
0032646977
-
An overview of audio information retrieval
-
J. Foote, "An overview of audio information retrieval," in ACM Multimedia Systems, 1999
-
(1999)
ACM Multimedia Systems
-
-
Foote, J.1
-
25
-
-
34047266379
-
Progress in the CU-HTK broadcast news transcription system
-
M.J.F. Gales, D.Y. Kim, P.C. Woodland, D. Mrva, R. Sinha, and S.E Tranter, "Progress in the CU-HTK broadcast news transcription system," in IEEE Transactions on Audio Speech and Language Processing, September 2006
-
(2006)
IEEE Transactions on Audio Speech and Language Processing, September
-
-
Gales, M.J.F.1
Kim, D.Y.2
Woodland, P.C.3
Mrva, D.4
Sinha, R.5
Tranter, S.E.6
-
27
-
-
85053488053
-
Respeaking the BBC news: A strategic analysis of respeaking on the BBC
-
C. Eugeni, "Respeaking the BBC news a strategic analysis of respeaking on the BBC," The Sign Language Translator and Interpreter, vol. 3, no. 1, pp. 29-68, 2009
-
(2009)
The Sign Language Translator and Interpreter
, vol.3
, Issue.1
, pp. 29-68
-
-
Eugeni, C.1
-
28
-
-
84906276653
-
Improving lightly supervised training for broadcast transcriptions
-
Y. Long, M.J.F. Gales, P. Lanchantin, X. Liu, M. S. Seigel, and P.C. Woodland, "Improving lightly supervised training for broadcast transcriptions," in Proc. Interspeech, 2013
-
(2013)
Proc. Interspeech
-
-
Long, Y.1
Gales, M.J.F.2
Lanchantin, P.3
Liu, X.4
Seigel, M.S.5
Woodland, P.C.6
-
29
-
-
84964503174
-
-
HTK 3.5, http://htk.eng.cam.ac.uk
-
-
-
-
30
-
-
84959142742
-
A general artificial neural network extension for HTK
-
C. Zhang and P.C. Woodland, "A general artificial neural network extension for HTK," in Proc. Interspeech, 2015
-
(2015)
Proc. Interspeech
-
-
Zhang, C.1
Woodland, P.C.2
-
31
-
-
84964475976
-
Cambridge university transcription systems for the multi-genre broadcast challenge
-
P.C. Woodland, X. Liu, Y. Qian, C. Zhang, M.J.F. Gales, P. Karanasou, P. Lanchantin, and L. Wang, "Cambridge University transcription systems for the Multi-Genre Broadcast Challenge," in Proc. ASRU, 2015
-
(2015)
Proc. ASRU
-
-
Woodland, P.C.1
Liu, X.2
Qian, Y.3
Zhang, C.4
Gales, M.J.F.5
Karanasou, P.6
Lanchantin, P.7
Wang, L.8
-
32
-
-
84964556678
-
Speaker diarisation and longitudinal linking on multi-genre broadcast data
-
P. Karanasou, M.J.F. Gales, P. Lanchantin, X. Liu, Y. Qian, L. Wang, P.C. Woodland, and C. Zhang, "Speaker diarisation and longitudinal linking on multi-genre broadcast data," in Proc. ASRU, 2015
-
(2015)
Proc. ASRU
-
-
Karanasou, P.1
Gales, M.J.F.2
Lanchantin, P.3
Liu, X.4
Qian, Y.5
Wang, L.6
Woodland, P.C.7
Zhang, C.8
-
33
-
-
84964503191
-
-
The Cambridge University March 2005 speaker diarisation system
-
R. Sinha, S. E. Tranter, M. J. F. Gales, and P. C. Woodland, "The Cambridge University March 2005 speaker diarisation system," in Interspeech, 2005
-
(2005)
Interspeech
-
-
Sinha, R.1
Tranter, S.E.2
Gales, M.J.F.3
Woodland, P.C.4
|