메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages 5206-5210

Librispeech: An ASR corpus based on public domain audio books

Author keywords

Corpus; LibriVox; Speech Recognition

Indexed keywords

AUDIO SIGNAL PROCESSING; COMPUTATIONAL LINGUISTICS; SPEECH; SPEECH COMMUNICATION;

EID: 84946015916     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178964     Document Type: Conference Paper
Times cited : (7226)

References (23)
  • 3
    • 84946093268 scopus 로고    scopus 로고
    • Creative commons attribution 4.0 international public license
    • November
    • "Creative Commons Attribution 4.0 International Public License," https:/ /creativecommons. org/ licenses/by/4. 0/, November 2013.
    • (2013) Https:/ /Creativecommons. Org/ licenses/by/4. 0
  • 4
    • 84858953642 scopus 로고    scopus 로고
    • The kaldi speech recognition toolkit
    • D. Povey, A. Ghoshal, et aI., "The Kaldi Speech Recognition Toolkit," in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Povey, D.1    Ghoshal, A.2
  • 6
    • 34547521678 scopus 로고    scopus 로고
    • Automatic alignment aud error correction of humau generated trauscripts for long speech recordings
    • T J. Hazen, "Automatic alignment aud error correction of humau generated trauscripts for long speech recordings," in in Proc. interspeech, 2006.
    • (2006) Proc. Interspeech
    • Hazen, T.J.1
  • 7
    • 84910072484 scopus 로고    scopus 로고
    • Audio-to-text alignment for speech recognition with very limited resources
    • X. Anguera, J. Luque, aud C. Gracia, "Audio-to-text alignment for speech recognition with very limited resources," in interspeech, 2014.
    • (2014) Interspeech
    • Anguera, X.1    Luque, J.2    Gracia, C.3
  • 8
    • 0035412925 scopus 로고    scopus 로고
    • Normalization of non-staudard words
    • R. Sproat et aI., "Normalization of non-staudard words," Computer Speech &Language, vol. 15, no. 3, pp. 287-333, 2001.
    • (2001) Computer Speech &Language , vol.15 , Issue.3 , pp. 287-333
    • Sproat, R.1
  • 9
    • 0141589488 scopus 로고    scopus 로고
    • SRILM-an extensible lauguage modeling toolkit
    • A. Stolcke, "SRILM-An Extensible Lauguage Modeling Toolkit," in iCSLP, 2002.
    • (2002) ICSLP
    • Stolcke, A.1
  • 10
    • 0026187945 scopus 로고
    • The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression
    • I.H. Witten and TC. Bell, 'The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression," IEEE Transactions on information Theory, vol. 37, no. 4,1991.
    • (1991) IEEE Transactions on Information Theory , vol.37 , Issue.4
    • Witten, I.H.1    Bell, T.C.2
  • 11
    • 41049105254 scopus 로고    scopus 로고
    • Joint-sequence models for graphemeto-phoneme conversion
    • M. Bisani and H. Ney, "Joint-sequence models for graphemeto-phoneme conversion.," Speech Communication, vol. 50, no. 5, pp. 434-451, 2008.
    • (2008) Speech Communication , vol.50 , Issue.5 , pp. 434-451
    • Bisani, M.1    Ney, H.2
  • 13
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. Davis aud P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," Acoustics, Speech and Signal Processing, iEEE Transactions on, vol. 28, no. 4, pp. 357-366, 1980.
    • (1980) Acoustics, Speech and Signal Processing, IEEE Transactions on , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 15
    • 0019887799 scopus 로고
    • Identification of common molecular subsequences
    • T Smith and M. Waterman, "Identification of common molecular subsequences," Journal of Molecular Biology, vol. 147, no. I, pp. 195-197, 1981.
    • (1981) Journal of Molecular Biology , vol.147 , Issue.1 , pp. 195-197
    • Smith, T.1    Waterman, M.2
  • 16
    • 0001116877 scopus 로고
    • Binary codes capable of correcting deletions, insertions aud reversals
    • v.I. Levenshtein, "Binary Codes Capable of Correcting Deletions, Insertions aud Reversals," Soviet Physics Doklady, vol. 10, pp. 707, 1966.
    • (1966) Soviet Physics Doklady , vol.10 , pp. 707
    • Levenshtein, V.I.1
  • 17
    • 79959817774 scopus 로고    scopus 로고
    • Lightly supervised recognition for automatic alignment of large coherent speech recordings
    • ISCA
    • N. Braunschweiler, M. J. F. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings.," in lNTERSPEECH. 2010, pp. 2222-2225,ISCA.
    • (2010) LNTERSPEECH , pp. 2222-2225
    • Braunschweiler, N.1    Gales, M.J.F.2    Buchholz, S.3
  • 18
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M. J. F. Gales and P. C. Woodland, "Mean and Variance Adaptation Within the MLLR Framework," Computer Speech and Language, vol. 10, pp. 249-264, 1996.
    • (1996) Computer Speech and Language , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 21
    • 0028996876 scopus 로고
    • Improved backing-off for m-gram lauguage modeling
    • R. Kneser aud H. Ney, "Improved backing-off for m-gram lauguage modeling," in iCASSP, 1995, vol. 1, pp. 181-184.
    • (1995) ICASSP , vol.1 , pp. 181-184
    • Kneser, R.1    Ney, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.