메뉴 건너뛰기




Volumn , Issue , 2014, Pages 215-219

Improving deep neural network acoustic models using generalized maxout networks

Author keywords

Acoustic Modeling; Deep Learning; Maxout Networks; Speech Recognition

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; CONTROL NONLINEARITIES; SPEECH RECOGNITION;

EID: 84905239342     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6853589     Document Type: Conference Paper
Times cited : (272)

References (29)
  • 2
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • George E Dahl, Dong Yu, Li Deng, and Alex Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 3
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • Frank Seide, Gang Li, and Dong Yu, "Conversational speech transcription using context-dependent deep neural networks.," in INTERSPEECH, 2011, pp. 437-440.
    • (2011) INTERSPEECH , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 8
    • 84893701756 scopus 로고    scopus 로고
    • Deep maxout networks for low resource speech recognition
    • Y. Miao, S. Rawat, and F. Metze, "Deep maxout networks for low resource speech recognition," in Proc. ASRU, 2013.
    • (2013) Proc. ASRU
    • Miao, Y.1    Rawat, S.2    Metze, F.3
  • 9
    • 84893651518 scopus 로고    scopus 로고
    • Deep maxout neural networks for speech recognition
    • M. Cai, Y. Shi, and J. Liu, "Deep maxout neural networks for speech recognition," in Proc. ASRU, 2013.
    • (2013) Proc. ASRU
    • Cai, M.1    Shi, Y.2    Liu, J.3
  • 14
    • 84858953642 scopus 로고    scopus 로고
    • The kaldi speech recognition toolkit
    • D. Povey, A. Ghoshal, et al., "The Kaldi Speech Recognition Toolkit," in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Povey, D.1    Ghoshal, A.2
  • 15
    • 84906274730 scopus 로고    scopus 로고
    • Sequence-discriminative training of deep neural networks
    • Karel Veselỳ, Arnab Ghoshal, Lukás Burget, and Daniel Povey, "Sequence-discriminative training of deep neural networks," in Interspeech, 2013.
    • (2013) Interspeech
    • Veselỳ, K.1    Ghoshal, A.2    Burget, L.3    Povey, D.4
  • 17
    • 44949182698 scopus 로고    scopus 로고
    • Hypothesis spaces for minimum bayes risk training in large vocabulary speech recognition
    • Gibson M. and Hain T., "Hypothesis Spaces For Minimum Bayes Risk Training In Large Vocabulary Speech Recognition," in Interspeech, 2006.
    • (2006) Interspeech
    • Gibson, M.1    Hain, T.2
  • 18
    • 34547529083 scopus 로고    scopus 로고
    • Evaluation of proposed modifications to MPE for large scale discriminative training
    • Daniel Povey and Brian Kingsbury, "Evaluation of proposed modifications to MPE for large scale discriminative training," in ICASSP, 2007.
    • (2007) ICASSP
    • Povey, D.1    Kingsbury, B.2
  • 22
    • 0032629928 scopus 로고    scopus 로고
    • Statistical analysis of learning dynamics
    • Noboru Murata and Shun-ichi Amari, "Statistical analysis of learning dynamics," Signal Processing, vol. 74, no. 1, pp. 3-28, 1999.
    • (1999) Signal Processing , vol.74 , Issue.1 , pp. 3-28
    • Murata, N.1    Amari, S.2
  • 26
    • 78049502526 scopus 로고    scopus 로고
    • The subspace gaussian mixture model-a structured model for speech recognition
    • April
    • D. Povey, L. Burget, et al., "The Subspace Gaussian Mixture Model-A Structured Model for Speech Recognition," Computer Speech & Language, vol. 25, no. 2, pp. 404-439, April 2011.
    • (2011) Computer Speech & Language , vol.25 , Issue.2 , pp. 404-439
    • Povey, D.1    Burget, L.2
  • 27
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M. J. F. Gales and P. C.Woodland, "Mean and Variance Adaptation Within the MLLR Framework," Computer Speech and Language, vol. 10, pp. 249-264, 1996.
    • (1996) Computer Speech and Language , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 29
    • 84893672075 scopus 로고    scopus 로고
    • The fundamental frequency variation spectrum
    • Kornel Laskowski, Mattias Heldner, and Jens Edlund, "The fundamental frequency variation spectrum," in FONETIK, 2008.
    • (2008) FONETIK
    • Laskowski, K.1    Heldner, M.2    Edlund, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.