SCOPUS Information Retrieval Platform

Proceedings of the National Conference on Artificial Intelligence

Volume 2, 2012, Pages 977-983

Kernel-based reinforcement learning on representative states

(2) Kveton, Branislav a Theocharous, Georgios b

a Technicolor Labs (United States)

b YAHOO RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARBITRARY LEVELS; CONTINUOUS STATE; CONTINUOUS VARIABLES; CONTROL PROBLEMS; DECISION-MAKING PROBLEM; FIXED POINTS; MARKOV DECISION PROCESSES; OPTIMAL SOLUTIONS; TIME COMPLEXITY; TRAINING EXAMPLE;

ARTIFICIAL INTELLIGENCE; MARKOV PROCESSES;

REINFORCEMENT LEARNING;

EID: 84868289021
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited (21)

References (17)

1
- 85162341384
- Reinforcement learning using kernel-based stochastic factorization
- Barreto, A.; Precup, D.; and Pineau, J. 2011. Reinforcement learning using kernel-based stochastic factorization. In Advances in Neural Information Processing Systems 24, 720-728.
- (2011) Advances in Neural Information Processing Systems , vol.24 , pp. 720-728
- Barreto, A.¹ Precup, D.² Pineau, J.³

2
- 85012688561
- Princeton, NJ: Princeton University Press
- Bellman, R. 1957. Dynamic Programming. Princeton, NJ: Princeton University Press.
- (1957) Dynamic Programming
- Bellman, R.¹

3
- 0003487482
- Belmont, MA: Athena Scientific
- Bertsekas, D., and Tsitsiklis, J. 1996. Neuro-Dynamic Programming. Belmont, MA: Athena Scientific.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

4
- 33749260356
- Cover trees for nearest neighbor
- Beygelzimer, A.; Kakade, S.; and Langford, J. 2006. Cover trees for nearest neighbor. In Proceedings of the 23rd International Conference on Machine Learning, 97-104.
- (2006) Proceedings of the 23rd International Conference on Machine Learning , pp. 97-104
- Beygelzimer, A.¹ Kakade, S.² Langford, J.³

5
- 0041965975
- R-MAX - A general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R., and Tennenholtz, M. 2003. R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research 3:213-231.
- (2003) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.¹ Tennenholtz, M.²

6
- 0026206780
- An optimal one-way multigrid algorithm for discrete-time stochastic control
- Chow, C.-S., and Tsitsiklis, J. 1991. An optimal one-way multigrid algorithm for discrete-time stochastic control. IEEE Transactions on Automatic Control 36(8):898-914.
- (1991) IEEE Transactions on Automatic Control , vol.36 , Issue.8 , pp. 898-914
- Chow, C.-S.¹ Tsitsiklis, J.²

7
- 21844465127
- Tree-based batch mode reinforcement learning
- Ernst, D.; Geurts, P.; and Wehenkel, L. 2005. Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6:503-556.
- (2005) Journal of Machine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

8
- 0032184399
- Quantization
- Gray, R., and Neuhoff, D. 1998. Quantization. IEEE Transactions on Information Theory 44(6):2325-2383.
- (1998) IEEE Transactions on Information Theory , vol.44 , Issue.6 , pp. 2325-2383
- Gray, R.¹ Neuhoff, D.²

9
- 38049096465
- Kernel-based models for reinforcement learning
- Jong, N., and Stone, P. 2006. Kernel-based models for reinforcement learning. In ICML 2006 Workshop on Kernel Methods and Reinforcement Learning.
- (2006) ICML 2006 Workshop on Kernel Methods and Reinforcement Learning
- Jong, N.¹ Stone, P.²

10
- 84868285934
- Compositional models for reinforcement learning
- Jong, N., and Stone, P. 2009. Compositional models for reinforcement learning. In Proceeding of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases.
- (2009) Proceeding of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
- Jong, N.¹ Stone, P.²

11
- 77956002520
- Technical report, University of Toronto
- Krizhevsky, A. 2009. Learning multiple layers of features from tiny images. Technical report, University of Toronto.
- (2009) Learning Multiple Layers of Features from Tiny Images
- Krizhevsky, A.¹

12
- 33750586671
- Solving factored MDPs with hybrid state and action variables
- Kveton, B.; Hauskrecht, M.; and Guestrin, C. 2006. Solving factored MDPs with hybrid state and action variables. Journal of Artificial Intelligence Research 27:153-201.
- (2006) Journal of Artificial Intelligence Research , vol.27 , pp. 153-201
- Kveton, B.¹ Hauskrecht, M.² Guestrin, C.³

13
- 84880680664
- Variable resolution discretization for high-accuracy solutions of optimal control problems
- Munos, R., and Moore, A. 1999. Variable resolution discretization for high-accuracy solutions of optimal control problems. In Proceedings of the 16th International Joint Conference on Artificial Intelligence, 1348-1355.
- (1999) Proceedings of the 16th International Joint Conference on Artificial Intelligence , pp. 1348-1355
- Munos, R.¹ Moore, A.²

14
- 0036832956
- Kernel-based reinforcement learning
- Ormoneit, D., and Sen, S. 2002. Kernel-based reinforcement learning. Machine Learning 49:161-178.
- (2002) Machine Learning , vol.49 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

15
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- Pineau, J.; Gordon, G.; and Thrun, S. 2003. Point-based value iteration: An anytime algorithm for POMDPs. In Proceedings of the 18th International Joint Conference on Artificial Intelligence, 1025-1032.
- (2003) Proceedings of the 18th International Joint Conference on Artificial Intelligence , pp. 1025-1032
- Pineau, J.¹ Gordon, G.² Thrun, S.³

16
- 85102627959
- New York, NY: John Wiley & Sons
- Puterman, M. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York, NY: John Wiley & Sons.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.¹

17
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R., and Barto, A. 1998. Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.