



Volume 2017-December, 2017, Pages 1088-1099

One-shot imitation learning

Author keywords

[No Author keywords available]

Indexed keywords

NEURAL NETWORKS;

EID: 85047012880     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (571)

References (66)
  • 9
    • 84921824478
    • Université de Montréal, Département d'informatique et de recherche opérationnelle
    • Yoshua Bengio, Samy Bengio, and Jocelyn Cloutier. Learning a synaptic learning rule. Université de Montréal, Département d'informatique et de recherche opérationnelle, 1990.
    • (1990) Learning a Synaptic Learning Rule
    • Bengio, Y.1    Bengio, S.2    Cloutier, J.3
  • 10
    • 0029509952
    • Neuro-dynamic programming: An overview
    • Proceedings of the 34th IEEE Conference on Decision and Control
    • Dimitri P Bertsekas and John N Tsitsiklis. Neuro-dynamic programming: an overview. In Decision and Control, 1995., Proceedings of the 34th IEEE Conference on, Volume 1, pages 560-564. IEEE, 1995.
    • (1995) Decision and Control, 1995 , vol.1 , pp. 560-564
    • Bertsekas, D.P.1    Tsitsiklis, J.N.2
  • 14
    • 84906332834
    • Decaf: A deep convolutional activation feature for generic visual recognition
    • Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. Decaf: A deep convolutional activation feature for generic visual recognition. In ICML, pages 647-655, 2014.
    • (2014) ICML , pp. 647-655
    • Donahue, J.1    Jia, Y.2    Vinyals, O.3    Hoffman, J.4    Zhang, N.5    Tzeng, E.6    Darrell, T.7
  • 26
    • 85020183301
    • Siamese neural networks for one-shot image recognition
    • Gregory Koch. Siamese neural networks for one-shot image recognition. ICML Deep Learning Workshop, 2015.
    • (2015) ICML Deep Learning Workshop
    • Koch, G.1
  • 33
    • 84973924037
    • Learning transferable features with deep adaptation networks
    • Mingsheng Long and Jianmin Wang. Learning transferable features with deep adaptation networks. CoRR, abs/1502.02791, 1:2 2015.
    • (2015) CoRR , vol.1 , pp. 2
    • Long, M.1    Wang, J.2
  • 38
    • 0141596576
    • Policy invariance under reward transformations: Theory and application to reward shaping
    • Andrew Y Ng, Daishi Harada, and Stuart Russell. Policy invariance under reward transformations: Theory and application to reward shaping. In ICML, Volume 99, pages 278-287, 1999.
    • (1999) ICML , vol.99 , pp. 278-287
    • Ng, A.Y.1    Harada, D.2    Russell, S.3
  • 39
    • 3042583887
    • Autonomous helicopter flight via reinforcement learning
    • Andrew Y Ng, H Jin Kim, Michael I Jordan, Shankar Sastry, and Shiv Ballianda. Autonomous helicopter flight via reinforcement learning. In NIPS, Volume 16, 2003.
    • (2003) NIPS , vol.16
    • Ng, A.Y.1    Kim, H.J.2    Jordan, M.I.3    Sastry, S.4    Ballianda, S.5
  • 40
    • 44949241322
    • Reinforcement learning of motor skills with policy gradients
    • Jan Peters and Stefan Schaal. Reinforcement learning of motor skills with policy gradients. Neural networks, 21(4):682-697, 2008.
    • (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
    • Peters, J.1    Schaal, S.2
  • 42
    • 85041901997
    • Optimization as a model for few-shot learning
    • Sachin Ravi and Hugo Larochelle. Optimization as a model for few-shot learning. In Under Review, ICLR, 2017.
    • (2017) Under Review, ICLR
    • Ravi, S.1    Larochelle, H.2
  • 44
    • 84867135104
    • A reduction of imitation learning and structured prediction to no-regret online learning
    • Stéphane Ross, Geoffrey J Gordon, and Drew Bagnell. A reduction of imitation learning and structured prediction to no-regret online learning. In AISTATS, Volume 1, page 6, 2011.
    • (2011) AISTATS , vol.1 , pp. 6
    • Ross, S.1    Gordon, G.J.2    Bagnell, D.3
  • 48
    • 0033151712
    • Is imitation learning the route to humanoid robots?
    • Stefan Schaal. Is imitation learning the route to humanoid robots? Trends in cognitive sciences, 3(6):233-242, 1999.
    • (1999) Trends in Cognitive Sciences , vol.3 , Issue.6 , pp. 233-242
    • Schaal, S.1
  • 49
    • 25944480439
    • (On learning how to learn: The meta-meta-⋯ hook.) Diploma thesis, Institut f. Informatik, Tech. Univ. Munich
    • Jürgen Schmidhuber. Evolutionary principles in self-referential learning. (On learning how to learn: The meta-meta-⋯ hook.) Diploma thesis, Institut f. Informatik, Tech. Univ. Munich, 1987.
    • (1987) Evolutionary Principles in Self-Referential Learning
    • Schmidhuber, J.1
  • 50
    • 0346377064
    • Learning to control fast-weight memories: An alternative to dynamic recurrent networks
    • Jürgen Schmidhuber. Learning to control fast-weight memories: An alternative to dynamic recurrent networks. Neural Computation, 1992.
    • (1992) Neural Computation
    • Schmidhuber, J.1
  • 57
    • 0029276036
    • Temporal difference learning and td-gammon
    • Gerald Tesauro. Temporal difference learning and td-gammon. Communications of the ACM, 38(3):58-68, 1995.
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1
  • 58
    • 0003901612
    • Springer Science & Business Media
    • Sebastian Thrun and Lorien Pratt. Learning to learn. Springer Science & Business Media, 1998.
    • (1998) Learning to Learn
    • Thrun, S.1    Pratt, L.2
  • 63
    • 84939821074
    • Show, attend and tell: Neural image caption generation with visual attention
    • Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C Courville, Ruslan Salakhutdinov, Richard S Zemel, and Yoshua Bengio. Show, attend and tell: Neural image caption generation with visual attention. In ICML, Volume 14, pages 77-81, 2015.
    • (2015) ICML , vol.14 , pp. 77-81
    • Xu, K.1    Ba, J.2    Kiros, R.3    Cho, K.4    Courville, A.C.5    Salakhutdinov, R.6    Zemel, R.S.7    Bengio, Y.8


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.