Volume 2015-August, 2015, Pages 4545-4549

Data augmentation for deep convolutional neural network acoustic modeling

Author keywords

bottleneck features; convolutional neural networks; data augmentation; stochastic feature mapping; vocal tract length perturbation

Indexed keywords

AUDIO SIGNAL PROCESSING; CONVOLUTION; MAPPING; NEURAL NETWORKS; SPEECH COMMUNICATION; STOCHASTIC MODELS; STOCHASTIC SYSTEMS

EID: 84933584545     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178831     Document Type: Conference Paper
Times cited: 53

References (15)
  • 1
    • 0032203257
    • Gradient-based learning applied to document recognition
    • Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998
    • (1998) Proceedings of the IEEE, vol. 86, Issue 11, pp. 2278-2324
    • LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P.
  • 5
    • 85009168223
    • Noise robustness in speech to speech translation
    • F.-H. Liu, Y. Gao, L. Gu, and M. Picheny, "Noise robustness in speech to speech translation," in Eurospeech, 2003
    • (2003) Eurospeech
    • Liu, F.-H.; Gao, Y.; Gu, L.; Picheny, M.
  • 6
    • 0024631285
    • Distance measures for speech recognition
    • M. J. Hunt and C. Lefebvre, "Distance measures for speech recognition," Aeronautical Note, NAE-AN-57, 1989
    • (1989) Aeronautical Note, NAE-AN-57
    • Hunt, M.J.; Lefebvre, C.
  • 8
    • 0031647824
    • A frequency warping approach to speaker normalization
    • L. Lee and R. Rose, "A frequency warping approach to speaker normalization," IEEE Transactions on Speech and Audio Processing, vol. 6, no. 1, pp. 49-60, 1998
    • (1998) IEEE Transactions on Speech and Audio Processing, vol. 6, Issue 1, pp. 49-60
    • Lee, L.; Rose, R.
  • 9
    • 0032638856
    • Semi-tied covariance matrices for hidden Markov models
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Transactions on Speech and Audio Processing, vol. 7, no. 3, pp. 272-281, 1999
    • (1999) IEEE Transactions on Speech and Audio Processing, vol. 7, Issue 3, pp. 272-281
    • Gales, M.J.F.
  • 11
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, vol. 9, pp. 171-185, 1995
    • (1995) Computer Speech and Language, vol. 9, pp. 171-185
    • Leggetter, C.J.; Woodland, P.C.
  • 12
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998
    • (1998) Computer Speech and Language, vol. 12, pp. 75-98
    • Gales, M.J.F.
  • 15
    • 84878379108
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
    • B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Interspeech, 2012
    • (2012) Interspeech
    • Kingsbury, B.; Sainath, T.N.; Soltau, H.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.