2014, Pages 5582-5586

Data augmentation for deep neural network acoustic modeling

Author keywords

automatic speech recognition; data augmentation; deep neural networks; stochastic feature mapping; vocal tract length perturbation

Indexed keywords

MAPPING; NEURAL NETWORKS; SPEECH RECOGNITION; STOCHASTIC SYSTEMS

EID: 84905247925     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854671     Document Type: Conference Paper
Times cited: 66

References (17)
  • 1
    • 0032203257
    • Gradient-based learning applied to document recognition
    • Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P.
  • 5
    • 84865801985
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech, 2011, pp. 437-440.
    • (2011) Interspeech , pp. 437-440
    • Seide, F.; Li, G.; Yu, D.
  • 8
    • 84878379108
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
    • B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Interspeech, 2012.
    • (2012) Interspeech
    • Kingsbury, B.; Sainath, T.N.; Soltau, H.
  • 9
    • 84905249354
    • http://www.iarpa.gov/Programs/ia/Babel/babel.html.
  • 12
    • 0031647824
    • A frequency warping approach to speaker normalization
    • L. Lee and R. Rose, "A frequency warping approach to speaker normalization," IEEE Transactions on Speech and Audio Processing, vol. 6, no. 1, pp. 49-60, 1998.
    • (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.1 , pp. 49-60
    • Lee, L.; Rose, R.
  • 13
    • 0032638856
    • Semi-tied covariance matrices for hidden Markov models
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Transactions on Speech and Audio Processing, vol. 7, no. 3, pp. 272-281, 1999.
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.J.F.
  • 14
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , pp. 75-98
    • Gales, M.J.F.
  • 16
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, vol. 9, pp. 171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.; Woodland, P.C.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.