메뉴 건너뛰기




Volumn , Issue , 2016, Pages

Actor-mimic deep multitask and transfer reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

AUTONOMOUS AGENTS; MACHINE LEARNING; REINFORCEMENT LEARNING;

EID: 85083953433     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (262)

References (19)
  • 5
    • 84937779024 scopus 로고    scopus 로고
    • Deep learning for real-time atari game play using offline monte-carlo tree search planning
    • Guo, Xiaoxiao, Singh, Satinder, Lee, Honglak, Lewis, Richard L, and Wang, Xiaoshi. Deep learning for real-time atari game play using offline monte-carlo tree search planning. In Advances in Neural Information Processing Systems 27, pp. 3338–3346, 2014.
    • (2014) Advances in Neural Information Processing Systems , vol.27 , pp. 3338-3346
    • Guo, X.1    Singh, S.2    Lee, H.3    Lewis, R.L.4    Wang, X.5
  • 10
    • 85028018890 scopus 로고    scopus 로고
    • End-to-end training of deep visuomotor policies
    • Levine, Sergey, Finn, Chelsea, Darrell, Trevor, and Abbeel, Pieter. End-to-end training of deep visuomotor policies. CoRR, abs/1504.00702, 2015.
    • (2015) CoRR
    • Levine, S.1    Finn, C.2    Darrell, T.3    Abbeel, P.4
  • 16
    • 84862273266 scopus 로고    scopus 로고
    • A reduction of imitation learning and structured prediction to no-regret online learning
    • Ross, Stephane, Gordon, Geoffrey, and Bagnell, Andrew. A reduction of imitation learning and structured prediction to no-regret online learning. Journal of Machine Learning Research, 15: 627–635, 2011.
    • (2011) Journal of Machine Learning Research , vol.15 , pp. 627-635
    • Ross, S.1    Gordon, G.2    Bagnell, A.3
  • 17
    • 0037886159 scopus 로고
    • Sensitivity analysis, ergodicity coefficients, and rank-one updates for finite markov chains
    • Seneta, E. Sensitivity analysis, ergodicity coefficients, and rank-one updates for finite markov chains. Numerical solution of Markov chains, 8:121–129, 1991.
    • (1991) Numerical Solution of Markov Chains , vol.8 , pp. 121-129
    • Seneta, E.1
  • 19
    • 68949157375 scopus 로고    scopus 로고
    • Transfer learning for reinforcement learning domains: A survey
    • Taylor, Matthew E and Stone, Peter. Transfer learning for reinforcement learning domains: A survey. The Journal of Machine Learning Research, 10:1633–1685, 2009.
    • (2009) The Journal of Machine Learning Research , vol.10 , pp. 1633-1685
    • Taylor, M.E.1    Stone, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.