-
1
-
-
84958264664
-
-
CoRR, abs/1603.04467
-
Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Gregory S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian J. Good-fellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Józefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mané, Rajat Monga, Sherry Moore, Derek Gordon Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul A. Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda B. Viégas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. CoRR, abs/1603.04467, 2016. URL http://arxiv.org/abs/1603.04467.
-
(2016)
Tensorflow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
-
-
Abadi, M.1
Agarwal, A.2
Barham, P.3
Brevdo, E.4
Chen, Z.5
Citro, C.6
Corrado, G.S.7
Davis, A.8
Dean, J.9
Devin, M.10
Ghemawat, S.11
Good-Fellow, I.J.12
Harp, A.13
Irving, G.14
Isard, M.15
Jia, Y.16
Józefowicz, R.17
Kaiser, L.18
Kudlur, M.19
Levenberg, J.20
Mané, D.21
Monga, R.22
Moore, S.23
Murray, D.G.24
Olah, C.25
Schuster, M.26
Shlens, J.27
Steiner, B.28
Sutskever, I.29
Talwar, K.30
Tucker, P.A.31
Vanhoucke, V.32
Vasudevan, V.33
Viégas, F.B.34
Vinyals, O.35
Warden, P.36
Wattenberg, M.37
Wicke, M.38
Yu, Y.39
Zheng, X.40
more..
-
3
-
-
85020209763
-
-
ArXiv e-prints, November
-
A. Almahairi, N. Ballas, T. Cooijmans, Y. Zheng, H. Larochelle, and A. Courville. Dynamic Capacity Networks. ArXiv e-prints, November 2015.
-
(2015)
Dynamic Capacity Networks
-
-
Almahairi, A.1
Ballas, N.2
Cooijmans, T.3
Zheng, Y.4
Larochelle, H.5
Courville, A.6
-
4
-
-
84971463350
-
-
arXiv preprint
-
Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jing-dong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Y. Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Y. Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yo-gatama, Jun Zhan, and Zhenyao Zhu. Deep speech 2: End-to-end speech recognition in english and mandarin. arXiv preprint arXiv:1512.02595, 2015.
-
(2015)
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
-
-
Amodei, D.1
Anubhai, R.2
Battenberg, E.3
Case, C.4
Casper, J.5
Catanzaro, B.6
Chen, J.-D.7
Chrzanowski, M.8
Coates, A.9
Diamos, G.10
Elsen, E.11
Engel, J.12
Fan, L.13
Fougner, C.14
Han, T.15
Hannun, A.Y.16
Jun, B.17
LeGresley, P.18
Lin, L.19
Narang, S.20
Ng, A.Y.21
Ozair, S.22
Prenger, R.23
Raiman, J.24
Satheesh, S.25
Seetapun, D.26
Sengupta, S.27
Wang, Y.28
Wang, Z.29
Wang, C.30
Xiao, B.31
Yo-Gatama, D.32
Zhan, J.33
Zhu, Z.34
more..
-
8
-
-
84943795466
-
-
arXiv preprint
-
Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Phillipp Koehn, and Tony Robinson. One billion word benchmark for measuring progress in statistical language modeling. arXiv preprint arXiv:1312.3005, 2013.
-
(2013)
One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling
-
-
Chelba, C.1
Mikolov, T.2
Schuster, M.3
Ge, Q.4
Brants, T.5
Koehn, P.6
Robinson, T.7
-
10
-
-
0036583160
-
A parallel mixture of SVMs for very large scale problems
-
Ronan Collobert, Samy Bengio, and Yoshua Bengio. A parallel mixture of SVMs for very large scale problems. Neural Computing, 2002.
-
(2002)
Neural Computing
-
-
Collobert, R.1
Bengio, S.2
Bengio, Y.3
-
12
-
-
84969832855
-
Distributed Gaussian processes
-
Marc Peter Deisenroth and Jun Wei Ng. Distributed Gaussian processes. In ICML, 2015.
-
(2015)
ICML
-
-
Deisenroth, M.P.1
Ng, J.W.2
-
18
-
-
85044255396
-
-
CoRR, abs/1606.03401
-
Audrunas Gruslys, Rémi Munos, Ivo Danihelka, Marc Lanctot, and Alex Graves. Memory-efficient backpropagation through time. CoRR, abs/1606.03401, 2016. URL http://arxiv.org/abs/1606.03401.
-
(2016)
Memory-Efficient Backpropagation through Time
-
-
Gruslys, A.1
Munos, R.2
Danihelka, I.3
Lanctot, M.4
Graves, A.5
-
20
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
Geoffrey Hinton, Li Deng, Dong Yu, George E. Dahl, Abdel-rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N. Sainath, et al. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Processing Magazine, 2012.
-
(2012)
IEEE Signal Processing Magazine
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
24
-
-
85014030168
-
-
CoRR, abs/1611.04558
-
Melvin Johnson, Mike Schuster, Quoc V. Le, Maxim Krikun, Yonghui Wu, Zhifeng Chen, Nikhil Thorat, Fernanda B. Viégas, Martin Wattenberg, Greg Corrado, Macduff Hughes, and Jeffrey Dean. Google's multilingual neural machine translation system: Enabling zero-shot translation. CoRR, abs/1611.04558, 2016. URL http://arxiv.org/abs/1611.04558.
-
(2016)
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
-
-
Johnson, M.1
Schuster, M.2
Le, Q.V.3
Krikun, M.4
Wu, Y.5
Chen, Z.6
Thorat, N.7
Viégas, F.B.8
Wattenberg, M.9
Corrado, G.10
Hughes, M.11
Dean, J.12
-
25
-
-
0000262562
-
Hierarchical mixtures of experts and the EM algorithm
-
Michael I. Jordan and Robert A. Jacobs. Hierarchical mixtures of experts and the EM algorithm. Neural Computing, 1994.
-
(1994)
Neural Computing
-
-
Jordan, M.I.1
Jacobs, R.A.2
-
26
-
-
84978840213
-
-
arXiv preprint
-
Rafal Jozefowicz, Oriol Vinyals, Mike Schuster, Noam Shazeer, and Yonghui Wu. Exploring the limits of language modeling. arXiv preprint arXiv:1602.02410, 2016.
-
(2016)
Exploring the Limits of Language Modeling
-
-
Jozefowicz, R.1
Vinyals, O.2
Schuster, M.3
Shazeer, N.4
Wu, Y.5
-
27
-
-
85083951076
-
ADaM: A method for stochastic optimization
-
Diederik Kingma and Jimmy Ba. Adam: A method for stochastic optimization. In ICLR, 2015.
-
(2015)
ICLR
-
-
Kingma, D.1
Ba, J.2
-
29
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
30
-
-
84867135575
-
Building high-level features using large scale unsupervised learning
-
Quoc V. Le, Marc'Aurelio Ranzato, Rajat Monga, Matthieu Devin, Kai Chen, Greg S. Corrado, Jeffrey Dean, and Andrew Y. Ng. Building high-level features using large scale unsupervised learning. In ICML, 2012.
-
(2012)
ICML
-
-
Le, Q.V.1
Ranzato, M.2
Monga, R.3
Devin, M.4
Chen, K.5
Corrado, G.S.6
Dean, J.7
Ng, A.Y.8
-
32
-
-
84959874994
-
Effective approaches to attention-based neural machine translation
-
Minh-Thang Luong, Hieu Pham, and Christopher D. Manning. Effective approaches to attention-based neural machine translation. EMNLP, 2015a.
-
(2015)
EMNLP
-
-
Luong, M.-T.1
Pham, H.2
Manning, C.D.3
-
33
-
-
84943804979
-
Addressing the rare word problem in neural machine translation
-
Minh-Thang Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, and Wojciech Zaremba. Addressing the rare word problem in neural machine translation. ACL, 2015b.
-
(2015)
ACL
-
-
Luong, M.-T.1
Sutskever, I.2
Le, Q.V.3
Vinyals, O.4
Zaremba, W.5
-
34
-
-
84896062664
-
Infinite mixtures of Gaussian process experts
-
Carl Edward Rasmussen and Zoubin Ghahramani. Infinite mixtures of Gaussian process experts. NIPS, 2002.
-
(2002)
NIPS
-
-
Rasmussen, C.E.1
Ghahramani, Z.2
-
35
-
-
84910046405
-
Long short-term memory recurrent neural network architectures for large scale acoustic modeling
-
Hasim Sak, Andrew W Senior, and Françoise Beaufays. Long short-term memory recurrent neural network architectures for large scale acoustic modeling. In INTERSPEECH, pp. 338-342, 2014.
-
(2014)
INTERSPEECH
, pp. 338-342
-
-
Sak, H.1
Senior, A.W.2
Beaufays, F.3
-
36
-
-
84867608922
-
Japanese and Korean voice search
-
Mike Schuster and Kaisuke Nakajima. Japanese and Korean voice search. ICASSP, 2012.
-
(2012)
ICASSP
-
-
Schuster, M.1
Nakajima, K.2
-
37
-
-
70349425847
-
Nonlinear models using dirichlet process mixtures
-
Babak Shahbaba and Radford Neal. Nonlinear models using dirichlet process mixtures. JMLR, 2009.
-
(2009)
JMLR
-
-
Shahbaba, B.1
Neal, R.2
-
38
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
-
(2014)
NIPS
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
39
-
-
84965111399
-
Generative image modeling using spatial LSTMs
-
Lucas Theis and Matthias Bethge. Generative image modeling using spatial LSTMs. In NIPS, 2015.
-
(2015)
NIPS
-
-
Theis, L.1
Bethge, M.2
-
40
-
-
84898983832
-
Mixtures of Gaussian processes
-
Volker Tresp. Mixtures of Gaussian Processes. In NIPS, 2001.
-
(2001)
NIPS
-
-
Tresp, V.1
-
41
-
-
85018271332
-
-
arXiv preprint
-
Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Łukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, and Jeffrey Dean. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144, 2016.
-
(2016)
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
-
-
Wu, Y.1
Schuster, M.2
Chen, Z.3
Le, Q.V.4
Norouzi, M.5
Macherey, W.6
Krikun, M.7
Cao, Y.8
Gao, Q.9
Macherey, K.10
Klingner, J.11
Shah, A.12
Johnson, M.13
Liu, X.14
Kaiser, Ł.15
Gouws, S.16
Kato, Y.17
Kudo, T.18
Kazawa, H.19
Stevens, K.20
Kurian, G.21
Patil, N.22
Wang, W.23
Young, C.24
Smith, J.25
Riesa, J.26
Rudnick, A.27
Vinyals, O.28
Corrado, G.29
Hughes, M.30
Dean, J.31
more..
-
42
-
-
84858727499
-
Hierarchical mixture of classification experts uncovers interactions between brain regions
-
Bangpeng Yao, Dirk Walther, Diane Beck, and Li Fei-fei. Hierarchical mixture of classification experts uncovers interactions between brain regions. In NIPS. 2009.
-
(2009)
NIPS
-
-
Yao, B.1
Walther, D.2
Beck, D.3
Li, F.-F.4
-
44
-
-
85040594930
-
-
arXiv preprint
-
Jie Zhou, Ying Cao, Xuguang Wang, Peng Li, and Wei Xu. Deep recurrent models with fast-forward connections for neural machine translation. arXiv preprint arXiv:1606.04199, 2016.
-
(2016)
Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation
-
-
Zhou, J.1
Cao, Y.2
Wang, X.3
Li, P.4
Xu, W.5
|