-
1
-
-
80155162494
-
Spoken book alignment using WFSTS
-
D. Caseiro, H. Meinedo, A. Serralheiro, I. Trancoso, and J. a. Neto, "Spoken Book Alignment using WFSTS, " Proc. of the second international conference on Human Language Technology Research, pp. 3-5, 2002.
-
(2002)
Proc. of the Second International Conference on Human Language Technology Research
, pp. 3-5
-
-
Caseiro, D.1
Meinedo, H.2
Serralheiro, A.3
Trancoso, I.4
Neto, J.A.5
-
2
-
-
80155146584
-
Automatic synchronization of electronic and audio books via TTS alignment and silence filtering
-
X. Anguera, N. Perez, A. Urruela, and N. Oliver, "Automatic Synchronization of Electronic and Audio Books via TTS Alignment and Silence Filtering, " in Proc. ICME, 2011.
-
(2011)
Proc. ICME
-
-
Anguera, X.1
Perez, N.2
Urruela, A.3
Oliver, N.4
-
3
-
-
79956282392
-
Segmentation of monologues in audio books for building synthetic voices
-
K. Prahallad and A. W. Black, "Segmentation of Monologues in Audio Books for Building Synthetic Voices, " Trans. Audio, Speech and Language Processing, vol. 19, no. 5, pp. 1444-1449, 2011.
-
(2011)
Trans. Audio, Speech and Language Processing
, vol.19
, Issue.5
, pp. 1444-1449
-
-
Prahallad, K.1
Black, A.W.2
-
4
-
-
34547521678
-
Automatic alignment and error correction of human generated transcripts for long speech recordings
-
T. J. Hazen, "Automatic Alignment and Error Correction of Human Generated Transcripts for Long Speech Recordings, " in Proc. Inter Speech, 2006, pp. 1606-1609.
-
(2006)
Proc. Inter Speech
, pp. 1606-1609
-
-
Hazen, T.J.1
-
5
-
-
0343950213
-
Improving acoustic models by watching television
-
Carnegie Mellon University, Tech. Rep
-
M. J.Witbrock and A. G. Hauptmann, "Improving Acoustic Models by Watching Television, " Technical Report CMU-CS-98-110, Carnegie Mellon University, Tech. Rep., 1998.
-
(1998)
Technical Report CMU-CS-98-110
-
-
Witbrock, M.J.1
Hauptmann, A.G.2
-
7
-
-
46449097482
-
Alignment of speech to highly imperfect text transcriptions
-
A. Haubold and J. R. Kender, "Alignment of Speech to Highly Imperfect Text Transcriptions, " in Proc. ICME, 2007.
-
(2007)
Proc. ICME
-
-
Haubold, A.1
Kender, J.R.2
-
8
-
-
84893599214
-
Text spotting in large speech databases for under-resourced languages
-
A. Buzo, H. Cucu, and C. Burileanu, "Text Spotting In Large Speech Databases For Under-Resourced Languages, " in Proc. Speech Technology and Human-Computer Dialogue (SpeD) Conference, no. 1, 2013.
-
(2013)
Proc. Speech Technology and Human-Computer Dialogue (SpeD) Conference
, Issue.1
-
-
Buzo, A.1
Cucu, H.2
Burileanu, C.3
-
9
-
-
84893644777
-
Processing spoken lectures in resource-scarse environments
-
C. J. van Heerden, P. de Villiers, E. Barnard, and M. H. Davel, "Processing spoken Lectures in Resource-Scarse Environments, " in Proc. Proceedings of the 22nd Annual Symposium of the Pattern Recognition Association of South Africa, 2011, pp. 138-143.
-
(2011)
Proc. Proceedings of the 22nd Annual Symposium of the Pattern Recognition Association of South Africa
, pp. 138-143
-
-
Van Heerden, C.J.1
De Villiers, P.2
Barnard, E.3
Davel, M.H.4
-
10
-
-
84865744412
-
-
August
-
M. H. Davel, C. V. Heerden, N. Kleynhans, and E. Barnard, "Efficient harvesting of Internet audio for resource-scarce ASR, " no. August, 2011, pp. 3153-3156.
-
(2011)
Efficient Harvesting of Internet Audio for Resource-scarce ASR
, pp. 3153-3156
-
-
Davel, M.H.1
Heerden, C.V.2
Kleynhans, N.3
Barnard, E.4
-
11
-
-
84910039499
-
Automatic generation of hyperlinks between audio and transcript
-
September
-
J. Robert-Ribes and R. Mukhtar, "Automatic Generation of Hyperlinks Between Audio and Transcript, " in Proc. Eurospeech, vol. 1997, no. September, 1997, pp. 903-906.
-
(1997)
Proc. Eurospeech
, vol.1997
, pp. 903-906
-
-
Robert-Ribes, J.1
Mukhtar, R.2
-
12
-
-
84885726863
-
A recursive algorithm for the forced alignment of very long audio segments
-
P. J. Moreno, C. Joerg, J.-m. Van Thong, and O. Glickman, "A Recursive Algorithm for the Forced Alignment of Very Long Audio Segments, " in Proc. ICSLP, 1998.
-
(1998)
Proc. ICSLP
-
-
Moreno, P.J.1
Joerg, C.2
Van Thong, J.-M.3
Glickman, O.4
-
13
-
-
84906260292
-
Text-to-speech alignment of long recordings using universal phone models
-
August
-
S. Hoffmann and B. Pfister, "Text-to-Speech Alignment of Long Recordings Using Universal Phone Models, " Proc. Inter Speech, no. August, pp. 1520-1524, 2013.
-
(2013)
Proc. Inter Speech
, pp. 1520-1524
-
-
Hoffmann, S.1
Pfister, B.2
-
14
-
-
84906264108
-
Technique for automatic sentence level alignment of long speech and transcripts
-
August
-
I. Ahmed, S. K. Kopparapu, T. C. S. Innovation, L. Mumbai, Y. Park, and T. West, "Technique for Automatic Sentence Level Alignment of Long Speech and Transcripts, " in Proc. Inter Speech, no. August, 2013, pp. 1516-1519.
-
(2013)
Proc. Inter Speech
, pp. 1516-1519
-
-
Ahmed, I.1
Kopparapu, S.K.2
Innovation, T.C.S.3
Mumbai, L.4
Park, Y.5
West, T.6
-
15
-
-
84865764419
-
Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training
-
August
-
N. T. Vu, F. Kraus, and T. Schultz, "Rapid building of an ASR system for Under-Resourced Languages based on Multilingual Unsupervised Training, " in Proc. Inter Speech, no. August, 2011, pp. 3145-3148.
-
(2011)
Proc. Inter Speech
, pp. 3145-3148
-
-
Vu, N.T.1
Kraus, F.2
Schultz, T.3
-
16
-
-
0036460908
-
Lightly supervised recognition for automatic alignment of large coherent speech recordings
-
N. Braunschweiler, M. J. F. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings, " Trans. Computer Speech and Language, vol. 16, no. 1, pp. 115-129, 2002.
-
(2002)
Trans. Computer Speech and Language
, vol.16
, Issue.1
, pp. 115-129
-
-
Braunschweiler, N.1
Gales, M.J.F.2
Buchholz, S.3
-
17
-
-
84858953642
-
The Kaldi speech recognition toolkit
-
D. Povey, A. Ghoshal, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlícek, T. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The Kaldi Speech Recognition Toolkit, " in Proc ASRU, 2011.
-
(2011)
Proc ASRU
-
-
Povey, D.1
Ghoshal, A.2
Burget, L.3
Glembek, O.4
Goel, N.5
Hannemann, M.6
Motlícek, P.7
Qian, T.8
Schwarz, P.9
Silovsky, J.10
Stemmer, G.11
Vesely, K.12
-
19
-
-
84906274473
-
An open-source state-of-the-art toolbox for broadcast news Diarization
-
[Online]
-
M. Rouvier, G. Dupuy, P. Gay, E. Khoury, T. Merlin, and S. Meignier, "An Open-source State-of-the-art Toolbox for Broadcast News Diarization, " in Proc. Inter Speech, 2013. [Online]. Available: Http://www-lium.univ-lemans.fr/diarization.
-
(2013)
Proc. Inter Speech
-
-
Rouvier, M.1
Dupuy, G.2
Gay, P.3
Khoury, E.4
Merlin, T.5
Meignier, S.6
-
21
-
-
85009230817
-
Grapheme based speech recognition
-
M. Killer, S. Stüker, and T. Schultz, "Grapheme Based Speech Recognition, " in Eurospeech, 2003, pp. 3141-3144.
-
(2003)
Eurospeech
, pp. 3141-3144
-
-
Killer, M.1
Stüker, S.2
Schultz, T.3
-
22
-
-
78049527800
-
The Cere voice characterful speech synthesiser SDK
-
Newcastle
-
M. P. Aylett and C. J. Pidcock, "The CereVoice Characterful Speech Synthesiser SDK, " in Proc. AISB, Newcastle, 2007, pp. 174-178.
-
(2007)
Proc. AISB
, pp. 174-178
-
-
Aylett, M.P.1
Pidcock, C.J.2
-
23
-
-
84976375912
-
CEUDEX: A data base oriented to context-dependent units training in Spanish for continuous speech recognition
-
September
-
C. de la Torre, L. Gernández-Gómez, and D. Tapias, "CEUDEX: A Data Base oriented to Context-Dependent Units Training in Spanish for Continuous Speech Recognition, " in Proc. Eurospeech, no. September, 1995, pp. 845-848.
-
(1995)
N Proc. Eurospeech
, pp. 845-848
-
-
De La Torre, C.1
Gernández-Gómez, L.2
Tapias, D.3
|