SCOPUS 정보 검색 플랫폼

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volumn , Issue , 2017, Pages

Optimization as a model for few-shot learning

(2) Ravi, Sachin a Larochelle, Hugo a

a TWITTER INC (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DEEP NEURAL NETWORKS; ITERATIVE METHODS;

GRADIENT-BASED OPTIMIZATION; HIGH CAPACITY; LEARNING TASKS; META-LEARNING MODELS; METRIC LEARNING; NEURAL NETWORK CLASSIFIER; OPTIMIZATION ALGORITHMS; PARAMETRIZATIONS;

LONG SHORT-TERM MEMORY;

EID: 85041901997 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (2895)

References (31)

1
- 85037743325
- Learning to learn by gradient descent by gradient descent
- abs
- Marcin Andrychowicz, Misha Denil, Sergio Gomez, Matthew W. Hoffman, David Pfau, Tom Schaul, and Nando de Freitas. Learning to learn by gradient descent by gradient descent. CoRR, abs/1606.04474, 2016. URL http://arxiv.org/abs/1606.04474.
- (2016) CoRR
- Andrychowicz, M.¹ Denil, M.² Gomez, S.³ Hoffman, M.W.⁴ Pfau, D.⁵ Schaul, T.⁶ De Freitas, N.⁷

2
- 85070963863
- PhD thesis, Département d'Informatique et Recherche Opérationnelle. Université de Montréal
- Samy Bengio. Optimisation d'une régle d'apprentissage pour réseaux de neurones artificiels. PhD thesis, Département d'Informatique et Recherche Opérationnelle. Université de Montréal, 1993.
- (1993) Optimisation D'Une Régle D'Apprentissage pour Réseaux De Neurones Artificiels
- Bengio, S.¹

3
- 34249757641
- On the search for new learning rules for ANNs
- Samy Bengio, Yoshua Bengio, and Jocelyn Cloutier. On the search for new learning rules for ANNs. Neural Processing Letters, 2(4):26-30, 1995.
- (1995) Neural Processing Letters , vol.2 , Issue.4 , pp. 26-30
- Bengio, S.¹ Bengio, Y.² Cloutier, J.³

4
- 84921824478
- Université de Montréal, Département d'informatique et de recherche opérationnelle
- Yoshua Bengio, Samy Bengio, and Jocelyn Cloutier. Learning a synaptic learning rule. Université de Montréal, Département d'informatique et de recherche opérationnelle, 1990.
- (1990) Learning a Synaptic Learning Rule
- Bengio, Y.¹ Bengio, S.² Cloutier, J.³

5
- 84904548965
- Deep learning of representations for unsupervised and transfer learning
- Yoshua Bengio et al. Deep learning of representations for unsupervised and transfer learning. ICML Unsupervised and Transfer Learning, 27:17-36, 2012.
- (2012) ICML Unsupervised and Transfer Learning , vol.27 , pp. 17-36
- Bengio, Y.¹

6
- 85018918773
- Learning feed-forward one-shot learners
- abs
- Luca Bertinetto, João F. Henriques, Jack Valmadre, Philip H. S. Torr, and Andrea Vedaldi. Learning feed-forward one-shot learners. CoRR, abs/1606.05233, 2016. URL http://arxiv.org/abs/1606.05233.
- (2016) CoRR
- Bertinetto, L.¹ Henriques, J.F.² Valmadre, J.³ Torr, P.H.S.⁴ Vedaldi, A.⁵

7
- 85066436827
- Tom Bosc. Learning to learn neural networks.
- Learning to Learn Neural Networks
- Bosc, T.¹

8
- 85153936556
- Learning many related tasks at the same time with backpropagation
- Rich Caruana. Learning many related tasks at the same time with backpropagation. Advances in neural information processing systems, pp. 657-664, 1995.
- (1995) Advances in Neural Information Processing Systems , pp. 657-664
- Caruana, R.¹

9
- 84921940378
- Learning phrase representations using RNN encoder-decoder for statistical machine translation
- abs
- Kyunghyun Cho, Bart van Merrienboer, Çaglar Gülçehre, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using RNN encoder-decoder for statistical machine translation. CoRR, abs/1406.1078, 2014. URL http://arxiv.org/abs/1406.1078.
- (2014) CoRR
- Cho, K.¹ Van Merrienboer, B.² Gülçehre, Ç.³ Bougares, F.⁴ Schwenk, H.⁵ Bengio, Y.⁶

10
- 84904482223
- DeCAF: A deep convolutional activation feature for generic visual recognition
- abs
- Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. Decaf: A deep convolutional activation feature for generic visual recognition. CoRR, abs/1310.1531, 2013. URL http://arxiv.org/abs/1310.1531.
- (2013) CoRR
- Donahue, J.¹ Jia, Y.² Vinyals, O.³ Hoffman, J.⁴ Zhang, N.⁵ Tzeng, E.⁶ Darrell, T.⁷

11
- 80052250414
- Adaptive subgradient methods for online learning and stochastic optimization
- July ISSN
- John Duchi, Elad Hazan, and Yoram Singer. Adaptive subgradient methods for online learning and stochastic optimization. J. Mach. Learn. Res., 12:2121-2159, July 2011. ISSN 1532-4435. URL http://dl.acm.org/citation.cfm?id=1953048.2021068.
- (2011) J. Mach. Learn. Res. , vol.12 , pp. 2121-2159
- Duchi, J.¹ Hazan, E.² Singer, Y.³

12
- 84958589374
- Deep residual learning for image recognition
- abs/1512.03385
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. CoRR, abs/1512.03385, 2015. URL http://arxiv.org/abs/1512.03385.
- (2015) CoRR
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

13
- 0031573117
- Long short-term memory
- Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural computation, 9(8): 1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

14
- 84958985283
- Learning to learn using gradient descent
- Springer
- Sepp Hochreiter, A. Steven Younger, and Peter R. Conwell. Learning to learn using gradient descent. In IN LECTURE NOTES ON COMP. SCI. 2130, PROC. INTL. CONF. ON ARTI NEURAL NETWORKS ICANN-2001, pp. 87-94. Springer, 2001.
- (2001) Lecture Notes on Comp. Sci. 2130, Proc. Intl. Conf. On Arti Neural Networks (ICANN-2001) , pp. 87-94
- Hochreiter, S.¹ Steven Younger, A.² Conwell, P.R.³

15
- 84946590546
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- abs
- Sergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. CoRR, abs/1502.03167, 2015. URL http://arxiv.org/abs/1502.03167.
- (2015) CoRR
- Ioffe, S.¹ Szegedy, C.²

16
- 85083951076
- ADaM: A method for stochastic optimization
- abs
- Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. CoRR, abs/1412.6980, 2014. URL http://arxiv.org/abs/1412.6980.
- (2014) CoRR
- Kingma, D.P.¹ Ba, J.²

17
- 85020183301
- PhD thesis, University of Toronto
- Gregory Koch. Siamese neural networks for one-shot image recognition. PhD thesis, University of Toronto, 2015.
- (2015) Siamese Neural Networks for One-Shot Image Recognition
- Koch, G.¹

18
- 85019169098
- Building machines that learn and think like people
- Brenden M. Lake, Tomer D. Ullman, Joshua B. Tenenbaum, and Samuel J. Gershman. Building machines that learn and think like people. CoRR, abs/1604.00289, 2016. URL http://arxiv.org/abs/1604.00289.
- (2016) CoRR
- Lake, B.M.¹ Ullman, T.D.² Tenenbaum, J.B.³ Gershman, S.J.⁴

19
- 84989338543
- Gradient-based hyperparameter optimization through reversible learning
- Dougal Maclaurin, David Duvenaud, and Ryan P Adams. Gradient-based hyperparameter optimization through reversible learning. In Proceedings of the 32nd International Conference on Machine Learning, 2015.
- (2015) Proceedings of the 32nd International Conference on Machine Learning
- Maclaurin, D.¹ Duvenaud, D.² Adams, R.P.³

20
- 84904461107
- Yurii Nesterov. A method of solving a convex programming problem with convergence rate o (1/k2). 1983.
- (1983) A Method of Solving a Convex Programming Problem with Convergence Rate O (1/K2)
- Nesterov, Y.¹

21
- 85011070895
- arXiv preprint
- Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, and Koray Kavukcuoglu. Wavenet: A generative model for raw audio. arXiv preprint arXiv:1609.03499, 2016.
- (2016) Wavenet: A Generative Model for Raw Audio
- Van Den Oord, A.¹ Dieleman, S.² Zen, H.³ Simonyan, K.⁴ Vinyals, O.⁵ Graves, A.⁶ Kalchbrenner, N.⁷ Senior, A.⁸ Kavukcuoglu, K.⁹

22
- 85040946180
- Lillicrap. One-shot learning with memory-augmented neural networks
- abs
- Adam Santoro, Sergey Bartunov, Matthew Botvinick, Daan Wierstra, and Timothy P. Lillicrap. One-shot learning with memory-augmented neural networks. CoRR, abs/1605.06065, 2016. URL http://arxiv.org/abs/1605.06065.
- (2016) CoRR
- Santoro, A.¹ Bartunov, S.² Botvinick, M.³ Wierstra, D.⁴ Timothy, P.⁵

23
- 0346377064
- Learning to control fast-weight memories: An alternative to dynamic recurrent networks
- Jürgen Schmidhuber. Learning to control fast-weight memories: An alternative to dynamic recurrent networks. Neural Computation, 4(1):131-139, 1992.
- (1992) Neural Computation , vol.4 , Issue.1 , pp. 131-139
- Schmidhuber, J.¹

24
- 84943265976
- A neural network that embeds its own meta-levels
- Jürgen Schmidhuber. A neural network that embeds its own meta-levels. In Neural Networks, 1993., IEEE International Conference on, pp. 407-412. IEEE, 1993.
- (1993) Neural Networks, 1993., IEEE International Conference on , pp. 407-412
- Schmidhuber, J.¹

25
- 0031186687
- Shifting inductive bias with success-story algorithm, adaptive levin search, and incremental self-improvement
- Jürgen Schmidhuber, Jieyu Zhao, and Marco Wiering. Shifting inductive bias with success-story algorithm, adaptive levin search, and incremental self-improvement. Machine Learning, 28(1): 105-130, 1997.
- (1997) Machine Learning , vol.28 , Issue.1 , pp. 105-130
- Schmidhuber, J.¹ Zhao, J.² Wiering, M.³

26
- 0010687621
- Lifelong learning algorithms
- Springer
- Sebastian Thrun. Lifelong learning algorithms. In Learning to learn, pp. 181-209. Springer, 1998.
- (1998) Learning to Learn , pp. 181-209
- Thrun, S.¹

27
- 85030218957
- Matching networks for one shot learning
- abs
- Oriol Vinyals, Charles Blundell, Timothy P. Lillicrap, Koray Kavukcuoglu, and Daan Wierstra. Matching networks for one shot learning. CoRR, abs/1606.04080, 2016. URL http://arxiv.org/abs/1606.04080.
- (2016) CoRR
- Vinyals, O.¹ Blundell, C.² Lillicrap, T.P.³ Kavukcuoglu, K.⁴ Wierstra, D.⁵

28
- 85018271332
- arXiv preprint
- Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, et al. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144, 2016.
- (2016) Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
- Wu, Y.¹ Schuster, M.² Chen, Z.³ Le, Q.V.⁴ Norouzi, M.⁵ Macherey, W.⁶ Krikun, M.⁷ Cao, Y.⁸ Gao, Q.⁹ Macherey, K.¹⁰

29
- 84952032150
- How transferable are features in deep neural networks?
- abs
- Jason Yosinski, Jeff Clune, Yoshua Bengio, and Hod Lipson. How transferable are features in deep neural networks? CoRR, abs/1411.1792, 2014. URL http://arxiv.org/abs/1411.1792.
- (2014) CoRR
- Yosinski, J.¹ Clune, J.² Bengio, Y.³ Lipson, H.⁴

30
- 85010821099
- Wojciech Zaremba. An empirical exploration of recurrent network architectures. 2015.
- (2015) An Empirical Exploration of Recurrent Network Architectures
- Zaremba, W.¹

31
- 84905272120
- Adadelta: An adaptive learning rate method
- abs
- Matthew D. Zeiler. ADADELTA: an adaptive learning rate method. CoRR, abs/1212.5701, 2012. URL http://arxiv.org/abs/1212.5701.
- (2012) CoRR
- Zeiler, M.D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.