SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume 2015-August, Issue , 2015, Pages 5206-5210

Librispeech: An ASR corpus based on public domain audio books

(4) Panayotov, Vassil ᵃ; Chen, Guoguo ᵃ; Povey, Daniel ᵃ; Khudanpur, Sanjeev ᵃ

ᵃ JOHNS HOPKINS UNIVERSITY (United States)

Author keywords

Corpus; LibriVox; Speech Recognition

Indexed keywords

AUDIO SIGNAL PROCESSING; COMPUTATIONAL LINGUISTICS; SPEECH; SPEECH COMMUNICATION;

ACOUSTIC MODEL; CORPUS; CORPUS-BASED; LANGUAGE MODEL; LIBRIVOX; PUBLIC DOMAINS; SPEECH RECOGNITION SYSTEMS; WALL STREET JOURNAL;

SPEECH RECOGNITION;

EID: 84946015916 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178964 Document Type: Conference Paper

Times cited : (7226)

References (23)

1
- 84870566319
- Ph.D. Thesis, CMU, Pittsburgh
- K. Prahallad, Automatic building of synthetic voices from audio books, Ph.D. Thesis, CMU, Pittsburgh, 2010.
- (2010) Automatic Building of Synthetic Voices from Audio Books
- Prahallad, K.¹

2
- 84890516589
- The blizzard challenge 2012
- S. King and V. Karaiskos, "The Blizzard Challenge 2012," in Proceedings Blizzard Workshop, 2012.
- (2012) Proceedings Blizzard Workshop
- King, S.¹ Karaiskos, V.²

3
- 84946093268
- Creative commons attribution 4.0 international public license
- November
- "Creative Commons Attribution 4.0 International Public License," https://creativecommons.org/licenses/by/4.0/, November 2013.
- (2013) https://creativecommons.org/licenses/by/4.0/

4
- 84858953642
- The kaldi speech recognition toolkit
- D. Povey, A. Ghoshal, et al., "The Kaldi Speech Recognition Toolkit," in Proc. ASRU, 2011.
- (2011) Proc. ASRU
- Povey, D.¹ Ghoshal, A.²

5
- 0012330750
- The design for the wall street journal-based csr corpus
- D. B. Paul and J. M. Baker, "The design for the Wall Street Journal-based CSR corpus," in Proceedings of the workshop on Speech and Natural Language. Association for Computational Linguistics, 1992, pp. 357-362.
- (1992) Proceedings of the Workshop on Speech and Natural Language. Association for Computational Linguistics , pp. 357-362
- Paul, D.B.¹ Baker, J.M.²

6
- 34547521678
- Automatic alignment and error correction of human generated transcripts for long speech recordings
- T. J. Hazen, "Automatic alignment and error correction of human generated transcripts for long speech recordings," in Proc. Interspeech, 2006.
- (2006) Proc. Interspeech
- Hazen, T.J.¹

7
- 84910072484
- Audio-to-text alignment for speech recognition with very limited resources
- X. Anguera, J. Luque, and C. Gracia, "Audio-to-text alignment for speech recognition with very limited resources," in Interspeech, 2014.
- (2014) Interspeech
- Anguera, X.¹ Luque, J.² Gracia, C.³

8
- 0035412925
- Normalization of non-standard words
- R. Sproat et al., "Normalization of non-standard words," Computer Speech & Language, vol. 15, no. 3, pp. 287-333, 2001.
- (2001) Computer Speech & Language , vol.15 , Issue.3 , pp. 287-333
- Sproat, R.¹

9
- 0141589488
- SRILM-an extensible language modeling toolkit
- A. Stolcke, "SRILM-An Extensible Language Modeling Toolkit," in ICSLP, 2002.
- (2002) ICSLP
- Stolcke, A.¹

10
- 0026187945
- The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression
- I.H. Witten and T.C. Bell, "The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression," IEEE Transactions on Information Theory, vol. 37, no. 4, 1991.
- (1991) IEEE Transactions on Information Theory , vol.37 , Issue.4
- Witten, I.H.¹ Bell, T.C.²

11
- 41049105254
- Joint-sequence models for grapheme-to-phoneme conversion
- M. Bisani and H. Ney, "Joint-sequence models for grapheme-to-phoneme conversion," Speech Communication, vol. 50, no. 5, pp. 434-451, 2008.
- (2008) Speech Communication , vol.50 , Issue.5 , pp. 434-451
- Bisani, M.¹ Ney, H.²

12
- 51449120120
- Boosted MMI for feature and model space discriminative training
- D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "Boosted MMI for Feature and Model Space Discriminative Training," in ICASSP, 2008.
- (2008) ICASSP
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

13
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 28, no. 4, pp. 357-366, 1980.
- (1980) Acoustics, Speech and Signal Processing, IEEE Transactions on , vol.28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

14
- 0032638856
- Semi-tied covariance matrices for hidden Markov models
- M.J.F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Transactions on Speech and Audio Processing, vol. 7, pp. 272-281, 1999.
- (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 272-281
- Gales, M.J.F.¹

15
- 0019887799
- Identification of common molecular subsequences
- T. Smith and M. Waterman, "Identification of common molecular subsequences," Journal of Molecular Biology, vol. 147, no. 1, pp. 195-197, 1981.
- (1981) Journal of Molecular Biology , vol.147 , Issue.1 , pp. 195-197
- Smith, T.¹ Waterman, M.²

16
- 0001116877
- Binary codes capable of correcting deletions, insertions and reversals
- V.I. Levenshtein, "Binary Codes Capable of Correcting Deletions, Insertions and Reversals," Soviet Physics Doklady, vol. 10, pp. 707, 1966.
- (1966) Soviet Physics Doklady , vol.10 , pp. 707
- Levenshtein, V.I.¹

17
- 79959817774
- Lightly supervised recognition for automatic alignment of large coherent speech recordings
- ISCA
- N. Braunschweiler, M. J. F. Gales, and S. Buchholz, "Lightly supervised recognition for automatic alignment of large coherent speech recordings," in INTERSPEECH, 2010, pp. 2222-2225, ISCA.
- (2010) INTERSPEECH , pp. 2222-2225
- Braunschweiler, N.¹ Gales, M.J.F.² Buchholz, S.³

18
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M. J. F. Gales and P. C. Woodland, "Mean and Variance Adaptation Within the MLLR Framework," Computer Speech and Language, vol. 10, pp. 249-264, 1996.
- (1996) Computer Speech and Language , vol.10 , pp. 249-264
- Gales, M.J.F.¹ Woodland, P.C.²

19
- 0030362995
- A compact model for speaker-adaptive training
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A Compact Model for Speaker-Adaptive Training," in ICSLP, 1996.
- (1996) ICSLP
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

20
- 84946080079
- in CMU SPUD Workshop, Dallas (Texas, USA), March
- S. Meignier and T. Merlin, "LIUM SpkDiarization: an open source toolkit for diarization," in CMU SPUD Workshop, Dallas (Texas, USA), March 2010.
- (2010) LIUM SpkDiarization: An Open Source Toolkit for Diarization
- Meignier, S.¹ Merlin, T.²

21
- 0028996876
- Improved backing-off for m-gram language modeling
- R. Kneser and H. Ney, "Improved backing-off for m-gram language modeling," in ICASSP, 1995, vol. 1, pp. 181-184.
- (1995) ICASSP , vol.1 , pp. 181-184
- Kneser, R.¹ Ney, H.²

22
- 85024115120
- An empirical study of smoothing techniques for language modeling
- S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," in Proceedings of the 34th annual meeting on Association for Computational Linguistics. Association for Computational Linguistics, 1996, pp. 310-318.
- (1996) Proceedings of the 34th Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics , pp. 310-318
- Chen, S.F.¹ Goodman, J.²

23
- 84905239342
- Improving deep neural network acoustic models using generalized maxout networks
- Florence, Italy, May 4-9, 2014
- X. Zhang, J. Trmal, D. Povey, and S. Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks," in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, 2014, pp. 215-219.
- (2014) IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014 , pp. 215-219
- Zhang, X.¹ Trmal, J.² Povey, D.³ Khudanpur, S.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.