Volume , Issue , 2011, Pages 335-338

Sample-based planning for continuous action Markov decision processes

Author keywords

[No Author keywords available]

Indexed keywords

ACTION SPACES; BANDIT PROBLEMS; DISCRETIZATIONS; EMPIRICAL RESULTS; MARKOV DECISION PROCESSES;

EID: 80054835987     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (86)

References (10)
  • 1
    • Auer, P.; Cesa-Bianchi, N.; and Fischer, P. 2002. Finite-time analysis of the multi-armed bandit problem. Machine Learning 47.
  • 4
    • Kaelbling, L. 1994. Associative reinforcement learning: Functions in k-DNF. Machine Learning 15(3).
  • 5
    • Kearns, M.; Mansour, Y.; and Ng, A. 1999. A sparse sampling algorithm for near-optimal planning in large Markov decision processes. In IJCAI.
  • 10
    • Santamaría, J. C.; Sutton, R.; and Ram, A. 1998. Experiments with reinforcement learning in problems with continuous state and action spaces. Adaptive Behavior 6.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.