메뉴 건너뛰기




Volumn 2015-January, Issue , 2015, Pages 3586-3589

Audio augmentation for speech recognition

Author keywords

Data augmentation; Deep neural network; Speech recognition

Indexed keywords

SPEECH; SPEECH COMMUNICATION;

EID: 84959118622     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1345)

References (17)
  • 2
    • 77949375556 scopus 로고    scopus 로고
    • Support vector machines for noise robust asr
    • M. J. F. Gales, A. Ragni, H. AlDamarki, and C. Gautier, "Support vector machines for noise robust asr, " in ASRU, 2009, pp. 205-210.
    • (2009) ASRU , pp. 205-210
    • Gales, M.J.F.1    Ragni, A.2    AlDamarki, H.3    Gautier, C.4
  • 6
    • 84893642825 scopus 로고    scopus 로고
    • Elastic spectral distortion for low resource speech recognition with deep neural networks
    • N. Kanda, R. Takeda, and Y. Obuchi, "Elastic spectral distortion for low resource speech recognition with deep neural networks, " in ASRU, 2013.
    • (2013) ASRU
    • Kanda, N.1    Takeda, R.2    Obuchi, Y.3
  • 7
    • 84959115289 scopus 로고    scopus 로고
    • A time delay neural network architecture for efficient modeling of long temporal contexts
    • V. Peddinti, D. Povey, and S. Khudanpur, "A time delay neural network architecture for efficient modeling of long temporal contexts, " in Proceedings of INTERSPEECH, 2015.
    • (2015) Proceedings of INTERSPEECH
    • Peddinti, V.1    Povey, D.2    Khudanpur, S.3
  • 9
    • 84959085793 scopus 로고    scopus 로고
    • accessed March 25, 2015
    • SoX, audio manipulation tool, (accessed March 25, 2015). [Online]. Available: http: //sox. sourceforge. net/
    • Audio Manipulation Tool
  • 10
    • 84946076428 scopus 로고    scopus 로고
    • Ted-lium: An automatic speech recognition dedicated corpus
    • A. Rousseau, P. Deléglise, and Y. Estève, "Ted-lium: An automatic speech recognition dedicated corpus. " in LREC, 2012, pp. 125-129.
    • (2012) LREC , pp. 125-129
    • Rousseau, A.1    Deléglise, P.2    Estève, Y.3
  • 13
    • 0019053271 scopus 로고
    • Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 28, no. 4, pp. 357-366, 1980.
    • (1980) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.