-
3
-
-
84946093268
-
Creative commons attribution 4.0 international public license
-
November
-
"Creative Commons Attribution 4.0 International Public License," https:/ /creativecommons. org/ licenses/by/4. 0/, November 2013.
-
(2013)
Https:/ /Creativecommons. Org/ licenses/by/4. 0
-
-
-
4
-
-
84858953642
-
The kaldi speech recognition toolkit
-
D. Povey, A. Ghoshal, et aI., "The Kaldi Speech Recognition Toolkit," in Proc. ASRU, 2011.
-
(2011)
Proc. ASRU
-
-
Povey, D.1
Ghoshal, A.2
-
6
-
-
34547521678
-
Automatic alignment aud error correction of humau generated trauscripts for long speech recordings
-
T J. Hazen, "Automatic alignment aud error correction of humau generated trauscripts for long speech recordings," in in Proc. interspeech, 2006.
-
(2006)
Proc. Interspeech
-
-
Hazen, T.J.1
-
7
-
-
84910072484
-
Audio-to-text alignment for speech recognition with very limited resources
-
X. Anguera, J. Luque, aud C. Gracia, "Audio-to-text alignment for speech recognition with very limited resources," in interspeech, 2014.
-
(2014)
Interspeech
-
-
Anguera, X.1
Luque, J.2
Gracia, C.3
-
8
-
-
0035412925
-
Normalization of non-staudard words
-
R. Sproat et aI., "Normalization of non-staudard words," Computer Speech &Language, vol. 15, no. 3, pp. 287-333, 2001.
-
(2001)
Computer Speech &Language
, vol.15
, Issue.3
, pp. 287-333
-
-
Sproat, R.1
-
9
-
-
0141589488
-
SRILM-an extensible lauguage modeling toolkit
-
A. Stolcke, "SRILM-An Extensible Lauguage Modeling Toolkit," in iCSLP, 2002.
-
(2002)
ICSLP
-
-
Stolcke, A.1
-
10
-
-
0026187945
-
The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression
-
I.H. Witten and TC. Bell, 'The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression," IEEE Transactions on information Theory, vol. 37, no. 4,1991.
-
(1991)
IEEE Transactions on Information Theory
, vol.37
, Issue.4
-
-
Witten, I.H.1
Bell, T.C.2
-
11
-
-
41049105254
-
Joint-sequence models for graphemeto-phoneme conversion
-
M. Bisani and H. Ney, "Joint-sequence models for graphemeto-phoneme conversion.," Speech Communication, vol. 50, no. 5, pp. 434-451, 2008.
-
(2008)
Speech Communication
, vol.50
, Issue.5
, pp. 434-451
-
-
Bisani, M.1
Ney, H.2
-
13
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
S. Davis aud P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," Acoustics, Speech and Signal Processing, iEEE Transactions on, vol. 28, no. 4, pp. 357-366, 1980.
-
(1980)
Acoustics, Speech and Signal Processing, IEEE Transactions on
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.1
Mermelstein, P.2
-
15
-
-
0019887799
-
Identification of common molecular subsequences
-
T Smith and M. Waterman, "Identification of common molecular subsequences," Journal of Molecular Biology, vol. 147, no. I, pp. 195-197, 1981.
-
(1981)
Journal of Molecular Biology
, vol.147
, Issue.1
, pp. 195-197
-
-
Smith, T.1
Waterman, M.2
-
16
-
-
0001116877
-
Binary codes capable of correcting deletions, insertions aud reversals
-
v.I. Levenshtein, "Binary Codes Capable of Correcting Deletions, Insertions aud Reversals," Soviet Physics Doklady, vol. 10, pp. 707, 1966.
-
(1966)
Soviet Physics Doklady
, vol.10
, pp. 707
-
-
Levenshtein, V.I.1
-
17
-
-
79959817774
-
Lightly supervised recognition for automatic alignment of large coherent speech recordings
-
ISCA
-
N. Braunschweiler, M. J. F. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings.," in lNTERSPEECH. 2010, pp. 2222-2225,ISCA.
-
(2010)
LNTERSPEECH
, pp. 2222-2225
-
-
Braunschweiler, N.1
Gales, M.J.F.2
Buchholz, S.3
-
18
-
-
0030263447
-
Mean and variance adaptation within the MLLR framework
-
M. J. F. Gales and P. C. Woodland, "Mean and Variance Adaptation Within the MLLR Framework," Computer Speech and Language, vol. 10, pp. 249-264, 1996.
-
(1996)
Computer Speech and Language
, vol.10
, pp. 249-264
-
-
Gales, M.J.F.1
Woodland, P.C.2
-
19
-
-
0030362995
-
A compact model for speaker-adaptive training
-
T Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A Compact Model for Speaker-Adaptive Training," in iCSLP, 1996.
-
(1996)
ICSLP
-
-
Anastasakos, T.1
McDonough, J.2
Schwartz, R.3
Makhoul, J.4
-
20
-
-
84946080079
-
-
in CMU SPUD Workshop, Dallas (Texas, U SA), March
-
S. Meignier and T Merlin, "UUM SpkDiarization: an open source toolkit for diarization," in CMU SPUD Workshop, Dallas (Texas, U SA), March 2010.
-
(2010)
UUM SpkDiarization: An Open Source Toolkit for Diarization
-
-
Meignier, S.1
Merlin, T.2
-
21
-
-
0028996876
-
Improved backing-off for m-gram lauguage modeling
-
R. Kneser aud H. Ney, "Improved backing-off for m-gram lauguage modeling," in iCASSP, 1995, vol. 1, pp. 181-184.
-
(1995)
ICASSP
, vol.1
, pp. 181-184
-
-
Kneser, R.1
Ney, H.2
-
23
-
-
84905239342
-
Improving deep neural network acoustic models using generalized maxout networks
-
Florence, Italy, May 4-9, 2014
-
X. Zhaug, J. Trmal, D. Povey, aud S. Khudaupur, "Improving deep neural network acoustic models using generalized maxout networks," in iEEE international Conference on Acoustics, Speech and Signal Processing, iCASSP 2014, Florence, italy, May 4-9,2014,2014, pp. 215-219.
-
(2014)
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014
, pp. 215-219
-
-
Zhaug, X.1
Trmal, J.2
Povey, D.3
Khudaupur, S.4
|