Volume , Issue , 2017, Pages

Outrageously large neural networks: The sparsely-gated mixture-of-experts layer

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL EFFICIENCY; COMPUTATIONAL LINGUISTICS; COMPUTER AIDED LANGUAGE TRANSLATION; MIXTURES; MODELING LANGUAGES; MULTILAYER NEURAL NETWORKS

EID: 85088226307     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1103)

References (44)
  • 10
    • 0036583160
    • A parallel mixture of SVMs for very large scale problems
    • Ronan Collobert, Samy Bengio, and Yoshua Bengio. A parallel mixture of SVMs for very large scale problems. Neural Computation, 2002.
    • (2002) Neural Computation
    • Collobert, R.1    Bengio, S.2    Bengio, Y.3
  • 12
    • 84969832855
    • Distributed Gaussian processes
    • Marc Peter Deisenroth and Jun Wei Ng. Distributed Gaussian processes. In ICML, 2015.
    • (2015) ICML
    • Deisenroth, M.P.1    Ng, J.W.2
  • 25
    • 0000262562
    • Hierarchical mixtures of experts and the EM algorithm
    • Michael I. Jordan and Robert A. Jacobs. Hierarchical mixtures of experts and the EM algorithm. Neural Computation, 1994.
    • (1994) Neural Computation
    • Jordan, M.I.1    Jacobs, R.A.2
  • 27
    • 85083951076
    • Adam: A method for stochastic optimization
    • Diederik Kingma and Jimmy Ba. Adam: A method for stochastic optimization. In ICLR, 2015.
    • (2015) ICLR
    • Kingma, D.1    Ba, J.2
  • 29
    • 84876231242
    • ImageNet classification with deep convolutional neural networks
    • Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. ImageNet classification with deep convolutional neural networks. In NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 32
    • 84959874994
    • Effective approaches to attention-based neural machine translation
    • Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. Effective approaches to attention-based neural machine translation. EMNLP, 2015a.
    • (2015) EMNLP
    • Luong, M.-T.1    Pham, H.2    Manning, C.D.3
  • 33
    • 84943804979
    • Addressing the rare word problem in neural machine translation
    • Minh-Thang Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, and Wojciech Zaremba. Addressing the rare word problem in neural machine translation. ACL, 2015b.
    • (2015) ACL
    • Luong, M.-T.1    Sutskever, I.2    Le, Q.V.3    Vinyals, O.4    Zaremba, W.5
  • 34
    • 84896062664
    • Infinite mixtures of Gaussian process experts
    • Carl Edward Rasmussen and Zoubin Ghahramani. Infinite mixtures of Gaussian process experts. NIPS, 2002.
    • (2002) NIPS
    • Rasmussen, C.E.1    Ghahramani, Z.2
  • 35
    • 84910046405
    • Long short-term memory recurrent neural network architectures for large scale acoustic modeling
    • Hasim Sak, Andrew W. Senior, and Françoise Beaufays. Long short-term memory recurrent neural network architectures for large scale acoustic modeling. In INTERSPEECH, pp. 338-342, 2014.
    • (2014) INTERSPEECH , pp. 338-342
    • Sak, H.1    Senior, A.W.2    Beaufays, F.3
  • 36
    • 84867608922
    • Japanese and Korean voice search
    • Mike Schuster and Kaisuke Nakajima. Japanese and Korean voice search. ICASSP, 2012.
    • (2012) ICASSP
    • Schuster, M.1    Nakajima, K.2
  • 37
    • 70349425847
    • Nonlinear models using Dirichlet process mixtures
    • Babak Shahbaba and Radford Neal. Nonlinear models using Dirichlet process mixtures. JMLR, 2009.
    • (2009) JMLR
    • Shahbaba, B.1    Neal, R.2
  • 38
    • 84928547704
    • Sequence to sequence learning with neural networks
    • Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
    • (2014) NIPS
    • Sutskever, I.1    Vinyals, O.2    Le, Q.V.3
  • 39
    • 84965111399
    • Generative image modeling using spatial LSTMs
    • Lucas Theis and Matthias Bethge. Generative image modeling using spatial LSTMs. In NIPS, 2015.
    • (2015) NIPS
    • Theis, L.1    Bethge, M.2
  • 40
    • 84898983832
    • Mixtures of Gaussian processes
    • Volker Tresp. Mixtures of Gaussian Processes. In NIPS, 2001.
    • (2001) NIPS
    • Tresp, V.1
  • 42
    • 84858727499
    • Hierarchical mixture of classification experts uncovers interactions between brain regions
    • Bangpeng Yao, Dirk Walther, Diane Beck, and Li Fei-Fei. Hierarchical mixture of classification experts uncovers interactions between brain regions. In NIPS, 2009.
    • (2009) NIPS
    • Yao, B.1    Walther, D.2    Beck, D.3    Li, F.-F.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.