Volume WS-14-07, 2014, Pages 31-37

An automated measure of MDP similarity for transfer in reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; AUTONOMOUS AGENTS; LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES

EID: 84974799118     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 80

References (14)
  • 6
    • 84855578361
    • Bisimulation metrics for continuous Markov decision processes
    • Ferns, N.; Panangaden, P.; and Precup, D. 2011. Bisimulation metrics for continuous Markov decision processes. SIAM J. Computing 40(6):1662-1714.
    • (2011) SIAM J. Computing , vol.40 , Issue.6 , pp. 1662-1714
    • Ferns, N.1    Panangaden, P.2    Precup, D.3
  • 7
    • 0013344078
    • Training products of experts by minimizing contrastive divergence
    • Hinton, G. E. 2002. Training products of experts by minimizing contrastive divergence. Neural Computation 14(8):1771-1800.
    • (2002) Neural Computation , vol.14 , Issue.8 , pp. 1771-1800
    • Hinton, G.E.1
  • 9
    • 68949157375
    • Transfer learning for reinforcement learning domains: A survey
    • Taylor, M. E., and Stone, P. 2009. Transfer learning for reinforcement learning domains: A survey. Journal of Machine Learning Research 10(1):1633-1685.
    • (2009) Journal of Machine Learning Research , vol.10 , Issue.1 , pp. 1633-1685
    • Taylor, M.E.1    Stone, P.2
  • 10
    • 79955836081
    • Two distributed-state models for generating high-dimensional time series
    • Taylor, G. W.; Hinton, G. E.; and Roweis, S. T. 2011. Two distributed-state models for generating high-dimensional time series. Journal of Machine Learning Research 12:1025-1068.
    • (2011) Journal of Machine Learning Research , vol.12 , pp. 1025-1068
    • Taylor, G.W.1    Hinton, G.E.2    Roweis, S.T.3
  • 12
    • 34848816477
    • Transfer learning via inter-task mappings for temporal difference learning
    • Taylor, M. E.; Stone, P.; and Liu, Y. 2007. Transfer learning via inter-task mappings for temporal difference learning. Journal of Machine Learning Research 8(1):2125-2167.
    • (2007) Journal of Machine Learning Research , vol.8 , Issue.1 , pp. 2125-2167
    • Taylor, M.E.1    Stone, P.2    Liu, Y.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.