메뉴 건너뛰기




Volumn 2017-August, Issue , 2017, Pages 2411-2415

Dynamic layer normalization for adaptive neural acoustic modeling in speech recognition

Author keywords

Adaptive acoustic model; Dynamic layer normalization; Speech recognition

Indexed keywords

DEEP NEURAL NETWORKS; SPEECH; SPEECH COMMUNICATION;

EID: 85039151782     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: 10.21437/Interspeech.2017-556     Document Type: Conference Paper
Times cited : (48)

References (29)
  • 2
    • 0031573117 scopus 로고    scopus 로고
    • Long short-term memory
    • Nov.
    • S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Comput., Vol. 9, no. 8, pp. 1735-1780, Nov. 1997. [Online]. Available: http://dx.doi.org/10.1162/neco.1997.9.8.1735
    • (1997) Neural Comput. , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 4
    • 84910046405 scopus 로고    scopus 로고
    • Long short-term memory recurrent neural network architectures for large scale acoustic modeling
    • H. Sak, A. W. Senior, and F. Beaufays, "Long short-term memory recurrent neural network architectures for large scale acoustic modeling." in Interspeech, 2014, pp. 338-342.
    • (2014) Interspeech , pp. 338-342
    • Sak, H.1    Senior, A.W.2    Beaufays, F.3
  • 5
    • 84893691530 scopus 로고    scopus 로고
    • Speaker adaptation of neural network acoustic models using i-vectors
    • G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors." in ASRU, 2013, pp. 55-59.
    • (2013) ASRU , pp. 55-59
    • Saon, G.1    Soltau, H.2    Nahamoo, D.3    Picheny, M.4
  • 10
    • 84983119674 scopus 로고    scopus 로고
    • Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models
    • IEEE
    • P. Swietojanski and S. Renals, "Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models," in Spoken Language Technology Workshop (SLT), 2014 IEEE. IEEE, 2014, pp. 171-176.
    • (2014) Spoken Language Technology Workshop (SLT), 2014 IEEE , pp. 171-176
    • Swietojanski, P.1    Renals, S.2
  • 13
    • 85039174342 scopus 로고    scopus 로고
    • Layer Normalization
    • L. J. Ba, R. Kiros, and G. E. Hinton, "Layer normalization," CoRR, Vol. abs/1607.06450, 2016. [Online]. Available: http://arxiv.org/abs/1607.06450
    • (2016) CoRR
    • Ba, L.J.1    Kiros, R.2    Hinton, G.E.3
  • 14
    • 84990067826 scopus 로고    scopus 로고
    • Texture networks: Feed-forward synthesis of textures and stylized images
    • D. Ulyanov, V. Lebedev, A. Vedaldi, and V. S. Lempitsky, "Texture networks: Feed-forward synthesis of textures and stylized images," CoRR, Vol. abs/1603.03417, 2016. [Online]. Available: http://arxiv.org/abs/1603.03417
    • (2016) CoRR
    • Ulyanov, D.1    Lebedev, V.2    Vedaldi, A.3    Lempitsky, V.S.4
  • 15
    • 84990034290 scopus 로고    scopus 로고
    • Perceptual losses for real-time style transfer and super-resolution
    • J. Johnson, A. Alahi, and F. Li, "Perceptual losses for real-time style transfer and super-resolution," CoRR, Vol. abs/1603.08155, 2016. [Online]. Available: http://arxiv.org/abs/1603.08155
    • (2016) CoRR
    • Johnson, J.1    Alahi, A.2    Li, F.3
  • 16
    • 85039172195 scopus 로고    scopus 로고
    • Instance Normalization: The missing ingredient for fast stylization
    • D. Ulyanov, A. Vedaldi, and V. S. Lempitsky, "Instance normalization: The missing ingredient for fast stylization," CoRR, Vol. abs/1607.08022, 2016. [Online]. Available: http://arxiv.org/abs/1607.08022
    • (2016) CoRR
    • Ulyanov, D.1    Vedaldi, A.2    Lempitsky, V.S.3
  • 17
    • 85028600965 scopus 로고    scopus 로고
    • A learned representation for artistic style
    • V. Dumoulin, J. Shlens, and M. Kudlur, "A learned representation for artistic style," CoRR, Vol. abs/1610.07629, 2016. [Online]. Available: http://arxiv.org/abs/1610.07629
    • (2016) CoRR
    • Dumoulin, V.1    Shlens, J.2    Kudlur, M.3
  • 18
    • 0012330750 scopus 로고
    • The design for the wall street journal-based csr corpus
    • Association for Computational Linguistics
    • D. B. Paul and J. M. Baker, "The design for the wall street journal-based csr corpus," in Proceedings of the workshop on Speech and Natural Language. Association for Computational Linguistics, 1992, pp. 357-362.
    • (1992) Proceedings of the Workshop on Speech and Natural Language , pp. 357-362
    • Paul, D.B.1    Baker, J.M.2
  • 19
    • 85020205851 scopus 로고    scopus 로고
    • Enhancing the tedlium corpus with selected data for language modeling and more ted talks
    • A. Rousseau, P. Deléglise, and Y Estève, "Enhancing the tedlium corpus with selected data for language modeling and more ted talks." in LREC, 2014, pp. 3935-3939.
    • (2014) LREC , pp. 3935-3939
    • Rousseau, A.1    Deléglise, P.2    Estève, Y.3
  • 20
    • 84969584486 scopus 로고    scopus 로고
    • Batch Normalization: Accelerating deep network training by reducing internal covariate shift
    • F. R. Bach and D. M. Blei, Eds. JMLR.org
    • S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift." in ICML, ser. JMLR Workshop and Conference Proceedings, F. R. Bach and D. M. Blei, Eds., Vol. 37. JMLR.org, 2015, pp. 448-456.
    • (2015) ICML, Ser. JMLR Workshop and Conference Proceedings , vol.37 , pp. 448-456
    • Ioffe, S.1    Szegedy, C.2
  • 22
  • 24
    • 85083951076 scopus 로고    scopus 로고
    • Adam: A method for stochastic optimization
    • D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization," CoRR, Vol. abs/1412.6980, 2014. [Online]. Available: http://arxiv.org/abs/1412.6980
    • (2014) CoRR
    • Kingma, D.P.1    Ba, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.