SCOPUS 정보 검색 플랫폼

6th International Conference on Learning Representations, ICLR 2018 - Conference Track Proceedings

Volumn , Issue , 2018, Pages

Flipout: Efficient pseudo-independent weight perturbations on mini-batches

(5) Wen, Yeming a Vicol, Paul a Ba, Jimmy a Tran, Dustin b Grosse, Roger a

a UNIVERSITY OF TORONTO (Canada)

b Och Spine at New York Presbyterian Hospitals (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COST REDUCTION; REINFORCEMENT LEARNING; STOCHASTIC SYSTEMS; WEB SERVICES;

AMAZON WEB SERVICES; CONVOLUTIONAL NETWORKS; CPU CORES; EVOLUTION STRATEGIES; FULLY CONNECTED NETWORKS; GAUSSIANS; VARIANCE REDUCTIONS; WEIGHT PERTURBATION;

NEURAL NETWORKS;

EID: 85083950100 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (294)

References (33)

1
- 84969752808
- Weight uncertainty in neural networks
- Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, and Daan Wierstra. Weight uncertainty in neural networks. In Proceedings of the 32nd International Conference on Machine Learning (ICML), pp. 1613–1622, 2015.
- (2015) Proceedings of the 32nd International Conference on Machine Learning (ICML) , pp. 1613-1622
- Blundell, C.¹ Cornebise, J.² Kavukcuoglu, K.³ Wierstra, D.⁴

2
- 85088227437
- Recurrent batch normalization
- Tim Cooijmans, Nicolas Ballas, César Laurent, Çaglar Gülçehre, and Aaron Courville. Recurrent batch normalization. In International Conference on Learning Representations (ICLR), 2017.
- (2017) International Conference on Learning Representations (ICLR)
- Cooijmans, T.¹ Ballas, N.² Laurent, C.³ Gülçehre, Ç.⁴ Courville, A.⁵

3
- 85046770231
- arXiv preprint
- Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, et al. Noisy networks for exploration. arXiv preprint arXiv:1706.10295, 2017.
- (2017) Noisy Networks for Exploration
- Fortunato, M.¹ Azar, M.G.² Piot, B.³ Menick, J.⁴ Osband, I.⁵ Graves, A.⁶ Mnih, V.⁷ Munos, R.⁸ Hassabis, D.⁹ Pietquin, O.¹⁰

4
- 85019171807
- A theoretically grounded application of dropout in recurrent neural networks
- Yarin Gal and Zoubin Ghahramani. A theoretically grounded application of dropout in recurrent neural networks. In Advances in Neural Information Processing Systems (NIPS), pp. 1019–1027, 2016.
- (2016) Advances in Neural Information Processing Systems (NIPS) , pp. 1019-1027
- Gal, Y.¹ Ghahramani, Z.²

5
- 85162557101
- Practical variational inference for neural networks
- Alex Graves. Practical variational inference for neural networks. In Advances in Neural Information Processing Systems (NIPS), pp. 2348–2356, 2011.
- (2011) Advances in Neural Information Processing Systems (NIPS) , pp. 2348-2356
- Graves, A.¹

6
- 85030994434
- arXiv preprint
- David Ha, Andrew Dai, and Quoc V Le. Hypernetworks. arXiv preprint arXiv:1609.09106, 2016.
- (2016) Hypernetworks
- Ha, D.¹ Dai, A.² Le, Q.V.³

7
- 0027803368
- Keeping the neural networks simple by minimizing the description length of the weights
- ACM
- Geoffrey E Hinton and Drew Van Camp. Keeping the neural networks simple by minimizing the description length of the weights. In Proceedings of the 6th Annual Conference on Computational Learning Theory, pp. 5–13. ACM, 1993.
- (1993) Proceedings of the 6th Annual Conference on Computational Learning Theory , pp. 5-13
- Hinton, G.E.¹ Van Camp, D.²

8
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- Sergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning (ICML), pp. 448–456, 2015.
- (2015) International Conference on Machine Learning (ICML) , pp. 448-456
- Ioffe, S.¹ Szegedy, C.²

9
- 85021155217
- Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, Rick Boyle, Pierre luc Cantin, Clifford Chao, Chris Clark, Jeremy Coriell, Mike Daley, Matt Dau, Jeffrey Dean, Ben Gelb, Tara Vazir Ghaemmaghami, Rajendra Gottipati, William Gulland, Robert Hagmann, C. Richard Ho, Doug Hogberg, John Hu, Robert Hundt, Dan Hurt, Julian Ibarz, Aaron Jaffey, Alek Jaworski, Alexander Kaplan, Harshit Khaitan, Andy Koch, Naveen Kumar, Steve Lacy, James Laudon, James Law, Diemthu Le, Chris Leary, Zhuyuan Liu, Kyle Lucke, Alan Lundin, Gordon MacKean, Adriana Maggiore, Maire Mahony, Kieran Miller, Rahul Nagarajan, Ravi Narayanaswami, Ray Ni, Kathy Nix, Thomas Norrie, Mark Omernick, Narayana Penukonda, Andy Phelps, and Jonathan Ross. In-datacenter performance analysis of a tensor processing unit. 2017. URL https://arxiv.org/pdf/1704.04760.pdf.
- (2017) In-Datacenter Performance Analysis of A Tensor Processing Unit
- Jouppi, N.P.¹ Young, C.² Patil, N.³ Patterson, D.⁴ Agrawal, G.⁵ Bajwa, R.⁶ Bates, S.⁷ Bhatia, S.⁸ Boden, N.⁹ Borchers, A.¹⁰ Boyle, R.¹¹ Cantin, P.¹² Chao, C.¹³ Clark, C.¹⁴ Coriell, J.¹⁵ Daley, M.¹⁶ Dau, M.¹⁷ Dean, J.¹⁸ Gelb, B.¹⁹ Ghaemmaghami, T.V.²⁰ more..

10
- 85083952489
- Auto-encoding variational Bayes
- Diederik P Kingma and Max Welling. Auto-encoding variational Bayes. In Proceedings of the 2nd International Conference on Learning Representations (ICLR), 2014.
- (2014) Proceedings of the 2nd International Conference on Learning Representations (ICLR)
- Kingma, D.P.¹ Welling, M.²

11
- 84965103544
- Variational dropout and the local reparameterization trick
- Diederik P Kingma, Tim Salimans, and Max Welling. Variational dropout and the local reparameterization trick. In Advances in Neural Information Processing Systems (NIPS), 2015.
- (2015) Advances in Neural Information Processing Systems (NIPS)
- Kingma, D.P.¹ Salimans, T.² Welling, M.³

12
- 77956002520
- Learning multiple layers of features from tiny images
- University of Toronto
- Alex Krizhevsky and Geoffrey Hinton. Learning multiple layers of features from tiny images. In Technical Report. University of Toronto, 2009.
- (2009) Technical Report
- Krizhevsky, A.¹ Hinton, G.²

13
- 85064840479
- Zoneout: Regularizing RNNs by randomly preserving hidden activations
- abs/1606.01305
- David Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Hugo Larochelle, Aaron C. Courville, and Chris Pal. Zoneout: Regularizing RNNs by randomly preserving hidden activations. CoRR, abs/1606.01305, 2016.
- (2016) CoRR
- Krueger, D.¹ Maharaj, T.² Kramár, J.³ Pezeshki, M.⁴ Ballas, N.⁵ Ke, N.R.⁶ Goyal, A.⁷ Bengio, Y.⁸ Larochelle, H.⁹ Courville, A.C.¹⁰ Pal, C.¹¹

14
- 84897549944
- Fastfood-approximating kernel expansions in loglinear time
- Quoc Le, Tamás Sarlós, and Alex Smola. Fastfood-approximating kernel expansions in loglinear time. In Proceedings of the International Conference on Machine Learning (ICLR), 2013.
- (2013) Proceedings of the International Conference on Machine Learning (ICLR)
- Le, Q.¹ Sarlós, T.² Smola, A.³

15
- 0032203257
- Gradient-based learning applied to document recognition
- Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
- (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

16
- 85048433285
- arXiv preprint
- Christos Louizos, Karen Ullrich, and Max Welling. Bayesian compression for deep learning. arXiv preprint arXiv:1705.08665, 2017.
- (2017) Bayesian Compression for Deep Learning
- Louizos, C.¹ Ullrich, K.² Welling, M.³

17
- 34249852033
- Building a large annotated corpus of English: The Penn Treebank
- Mitchell P Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313–330, 1993.
- (1993) Computational Linguistics , vol.19 , Issue.2 , pp. 313-330
- Marcus, M.P.¹ Marcinkiewicz, M.A.² Santorini, B.³

18
- 85048703118
- arXiv preprint
- Stephen Merity, Nitish S Keskar, and Richard Socher. Regularizing and optimizing LSTM language models. arXiv preprint arXiv:1708.02182, 2017.
- (2017) Regularizing and Optimizing LSTM Language Models
- Merity, S.¹ Keskar, N.S.² Socher, R.³

19
- 85064830994
- arXiv preprint
- Andrew C Miller, Nicholas J Foti, Alexander D’Amour, and Ryan P Adams. Reducing reparameterization gradient variance. arXiv preprint arXiv:1705.07880, 2017.
- (2017) Reducing Reparameterization Gradient Variance
- Miller, A.C.¹ Foti, N.J.² D’Amour, A.³ Adams, R.P.⁴

20
- 84937852305
- arXiv preprint
- Andriy Mnih and Karol Gregor. Neural variational inference and learning in belief networks. arXiv preprint arXiv:1402.0030, 2014.
- (2014) Neural Variational Inference and Learning in Belief Networks
- Mnih, A.¹ Gregor, K.²

21
- 85050620510
- arXiv preprint
- Matthias Plappert, Rein Houthooft, Prafulla Dhariwal, Szymon Sidor, Richard Y Chen, Xi Chen, Tamim Asfour, Pieter Abbeel, and Marcin Andrychowicz. Parameter space noise for exploration. arXiv preprint arXiv:1706.01905, 2017.
- (2017) Parameter Space Noise for Exploration
- Plappert, M.¹ Houthooft, R.² Dhariwal, P.³ Sidor, S.⁴ Chen, R.Y.⁵ Chen, X.⁶ Asfour, T.⁷ Abbeel, P.⁸ Andrychowicz, M.⁹

22
- 84955506831
- Black box variational inference
- Rajesh Ranganath, Sean Gerrish, and David Blei. Black box variational inference. In Artificial Intelligence and Statistics (AISTATS), pp. 814–822, 2014.
- (2014) Artificial Intelligence and Statistics (AISTATS) , pp. 814-822
- Ranganath, R.¹ Gerrish, S.² Blei, D.³

23
- 0003502414
- Friedrich Frommann Verlag, Stuttgart-Bad Cannstatt
- Ingo Rechenberg and Manfred Eigen. Evolutionsstrategie: Optimierung Technischer Systeme nach Prinzipien der Biologischen Evolution. Friedrich Frommann Verlag, Stuttgart-Bad Cannstatt, 1973.
- (1973) Evolutionsstrategie: Optimierung Technischer Systeme Nach Prinzipien Der Biologischen Evolution
- Rechenberg, I.¹ Eigen, M.²

24
- 85083952740
- On the convergence of Adam and beyond
- Sashank J. Reddi, Satyen Kale, and Sanjiv Kumar. On the convergence of Adam and beyond. In International Conference on Learning Representations (ICLR), 2018.
- (2018) International Conference on Learning Representations (ICLR)
- Reddi, S.J.¹ Kale, S.² Kumar, S.³

25
- 85071172176
- Sticking the landing: A simple, reduced-variance gradient estimator for variational inference
- Geoffrey Roeder, Yuhuai Wu, and David Duvenaud. Sticking the landing: A simple, reduced-variance gradient estimator for variational inference. In Advances in Approximate Bayesian Inference Workshop (NIPS), 2016.
- (2016) Advances in Approximate Bayesian Inference Workshop (NIPS)
- Roeder, G.¹ Wu, Y.² Duvenaud, D.³

26
- 85031121087
- arXiv preprint
- Tim Salimans, Jonathan Ho, Xi Chen, and Ilya Sutskever. Evolution strategies as a scalable alternative to reinforcement learning. arXiv preprint arXiv:1703.03864, 2017.
- (2017) Evolution Strategies as A Scalable Alternative to Reinforcement Learning
- Salimans, T.¹ Ho, J.² Chen, X.³ Sutskever, I.⁴

27
- 33847649288
- Training recurrent networks by evolino
- Jürgen Schmidhuber, Daan Wierstra, Matteo Gagliolo, and Faustino Gomez. Training recurrent networks by evolino. Neural Computation, 19(3):757–779, 2007.
- (2007) Neural Computation , vol.19 , Issue.3 , pp. 757-779
- Schmidhuber, J.¹ Wierstra, D.² Gagliolo, M.³ Gomez, F.⁴

28
- 85054958011
- Recurrent dropout without memory loss
- Stanislau Semeniuta, Aliaksei Severyn, and Erhardt Barth. Recurrent dropout without memory loss. In Proceedings of the 26th International Conference on Computational Linguistics (COLING), pp. 1757–1766, 2016.
- (2016) Proceedings of the 26th International Conference on Computational Linguistics (COLING) , pp. 1757-1766
- Semeniuta, S.¹ Severyn, A.² Barth, E.³

29
- 84925410541
- arXiv preprint
- Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
- (2014) Very Deep Convolutional Networks for Large-Scale Image Recognition
- Simonyan, K.¹ Zisserman, A.²

30
- 84904163933
- Dropout: A simple way to prevent neural networks from overfitting
- Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15:1929–1958, 2014.
- (2014) Journal of Machine Learning Research , vol.15 , pp. 1929-1958
- Srivastava, N.¹ Hinton, G.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

31
- 84897550107
- Regularization of neural networks using DropConnect
- Li Wan, Matthew Zeiler, Sixin Zhang, Yann L Cun, and Rob Fergus. Regularization of neural networks using DropConnect. In Proceedings of the 30th International Conference on Machine Learning (ICML), pp. 1058–1066, 2013.
- (2013) Proceedings of the 30th International Conference on Machine Learning (ICML) , pp. 1058-1066
- Wan, L.¹ Zeiler, M.² Zhang, S.³ Cun, Y.L.⁴ Fergus, R.⁵

32
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8(3-4):229–256, 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 229-256
- Williams, R.J.¹

33
- 84944053926
- arXiv preprint
- Wojciech Zaremba, Ilya Sutskever, and Oriol Vinyals. Recurrent neural network regularization. arXiv preprint arXiv:1409.2329, 2014.
- (2014) Recurrent Neural Network Regularization
- Zaremba, W.¹ Sutskever, I.² Vinyals, O.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.