메뉴 건너뛰기




Volumn , Issue , 2016, Pages 1977-1985

Efficient PAC-optimal exploration in concurrent, continuous state MDPs with delayed updates

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS STATE; DISCRETE STATE; EXPLORATION ALGORITHMS; FINE GRAINED; NOCV1; PAC BOUNDS; REAL-TIME ENVIRONMENT; VALUE FUNCTIONS;

EID: 85007188953     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (24)

References (15)
  • 4
    • 84874698101 scopus 로고    scopus 로고
    • TEXPLORE: Real-Time sampleefficient reinforcement learning for robots
    • Hester, T., and Stone, P. 2013. TEXPLORE: Real-Time sampleefficient reinforcement learning for robots. Machine Learning 90(3):385-429.
    • (2013) Machine Learning , vol.90 , Issue.3 , pp. 385-429
    • Hester, T.1    Stone, P.2
  • 8
    • 23244466805 scopus 로고    scopus 로고
    • Ph.D. Dissertation, Gatsby Computational Neuroscience Unit, University College London
    • Kakade, S. M. 2003. On the sample complexity of reinforcement learning. Ph.D. Dissertation, Gatsby Computational Neuroscience Unit, University College London.
    • (2003) On the Sample Complexity of Reinforcement Learning
    • Kakade, S.M.1
  • 12
    • 84893414333 scopus 로고    scopus 로고
    • PAC optimal exploration in continuous space Markov decision processes
    • Pazis, J., and Parr, R. 2013. PAC optimal exploration in continuous space Markov decision processes. In AAAI Conference on Artificial Intelligence, 774-781.
    • (2013) AAAI Conference on Artificial Intelligence , pp. 774-781
    • Pazis, J.1    Parr, R.2
  • 15
    • 77956520676 scopus 로고    scopus 로고
    • Model-based reinforcement learning with nearly tight exploration complexity bounds
    • Szita, I., and Szepesvari, C. 2010. Model-based reinforcement learning with nearly tight exploration complexity bounds. In International Conference on Machine Learning, 1031-1038.
    • (2010) International Conference on Machine Learning , pp. 1031-1038
    • Szita, I.1    Szepesvari, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.