SCOPUS 정보 검색 플랫폼

Proceedings of the American Control Conference

Volumn 2006, Issue , 2006, Pages 4159-4164

Reinforcement learning with supervision by combining multiple learnings and expert advices

(1) Chang, Hyeong Soo a

a Sogang University (South Korea)

Author keywords

[No Author keywords available]

Indexed keywords

BASE AGENTS; EXPERT ADVICES; REINFORCEMENT FUNCTION;

ADAPTIVE SYSTEMS; CONVERGENCE OF NUMERICAL METHODS; EXPERT SYSTEMS; FUNCTION EVALUATION; NUMERICAL METHODS;

REINFORCEMENT LEARNING;

EID: 34047195105 PISSN: 07431619 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (6)

References (28)

1
- 0036465258
- Expertness based cooperative Q-learning
- M. N. Ahmadabadi and M. Asadpour, "Expertness based cooperative Q-learning," IEEE Trans. an Systems, Man, and Cybernetics, part B. vol. 32, no. 1, pp. 66-76, 2002.
- (2002) IEEE Trans. an Systems, Man, and Cybernetics, part B , vol.32 , Issue.1 , pp. 66-76
- Ahmadabadi, M.N.¹ Asadpour, M.²

2
- 0004007508
- Reinforcement Learning
- J. Si, A. G. Barto, W. B. Powell, and D. Wunsch eds, pp, Wiley-IEEE Press, Piscataway, NJ
- A. G. Barto, "Reinforcement Learning," in Handbook of Learning and Approximate Dynamic Programming, J. Si, A. G. Barto, W. B. Powell, and D. Wunsch (eds.), pp. 804-809, Wiley-IEEE Press, Piscataway, NJ, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming , pp. 804-809
- Barto, A.G.¹

3
- 84979715630
- Supervised Actor-Critic Reinforcement Learning
- J. Si, A. G. Barto, W. B. Powell, and D. Wunsch eds, pp, Wiley-IEEE Press, Piscataway, NJ
- A. G. Barto and M. T. Rosenstein, "Supervised Actor-Critic Reinforcement Learning," in Handbook of Learning and Approximate Dynamic Programming, J. Si, A. G. Barto, W. B. Powell, and D. Wunsch (eds.), pp. 359-380, Wiley-IEEE Press, Piscataway, NJ, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming , pp. 359-380
- Barto, A.G.¹ Rosenstein, M.T.²

4
- 0003487482
- Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 15744397544
- An Ant System Based Exploration-Exploitation for Reinforcement Learning
- H. S. Chang, "An Ant System Based Exploration-Exploitation for Reinforcement Learning," in Proc. of the IEEE Conf. on Systems, Man, and Cybernetics, Vol. 4, 2004, pp. 3805-3810.
- (2004) Proc. of the IEEE Conf. on Systems, Man, and Cybernetics , vol.4 , pp. 3805-3810
- Chang, H.S.¹

6
- 0004033139
- D. Corne, Fl Glover, and M. Dorigo eds, McGraw-Hill
- D. Corne, Fl Glover, and M. Dorigo (eds.), New Ideas in Optimization, McGraw-Hill, 1999.
- (1999) New Ideas in Optimization

7
- 0043247546
- Accelerating Reinforcement Learning by Composing Solutions of Automatically Identified Subtasks
- C. Drummond, "Accelerating Reinforcement Learning by Composing Solutions of Automatically Identified Subtasks," J. of Artificial Intelligence Research, vol. 16, 2002, pp. 59-104.
- (2002) J. of Artificial Intelligence Research , vol.16 , pp. 59-104
- Drummond, C.¹

8
- 0002012598
- The ant colony optimization metaheuristic
- D. Corne, M. Dorigo eds, pp, McGraw-Hill, NY, USA
- M. Dorigo and G. Di Caro, "The ant colony optimization metaheuristic," New Ideas in Optimization, D. Corne, M. Dorigo (eds.), pp. 11-32, McGraw-Hill, NY, USA, 1999.
- (1999) New Ideas in Optimization , pp. 11-32
- Dorigo, M.¹ Di Caro, G.²

9
- 0013464438
- Integrating experimentation and guidance in relational reinforcement learning
- K. Driessens and S. Dzeroski, "Integrating experimentation and guidance in relational reinforcement learning," in Proc. of the 19th Int. Conf. on Machine Learning, 2002, pp. 115-122.
- (2002) Proc. of the 19th Int. Conf. on Machine Learning , pp. 115-122
- Driessens, K.¹ Dzeroski, S.²

10
- 0004222346
- Morgan Kaufmann
- R. C. Eberhart and J. Kennedy, Swarm Intelligence, Morgan Kaufmann, 2001.
- (2001) Swarm Intelligence
- Eberhart, R.C.¹ Kennedy, J.²

11
- 14344266002
- Learning rates for Q-learning
- E. Even-Dar and Y. Mansour, "Learning rates for Q-learning," J. of Machine Learning Research, vol. 5, 2003, pp. 1-25.
- (2003) J. of Machine Learning Research , vol.5 , pp. 1-25
- Even-Dar, E.¹ Mansour, Y.²

12
- 0002357911
- Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms
- J. D. Cowan, G. Tesauro, and J. Alspector eds, Morgan Kaufmann Publishers, Inc
- V. Gullapalli and A. G. Barto, "Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms", Advances in Neural Information Processing Systems, J. D. Cowan, G. Tesauro, and J. Alspector (eds.), Morgan Kaufmann Publishers, Inc., vol. 6, 1994, pp. 695-702.
- (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 695-702
- Gullapalli, V.¹ Barto, A.G.²

13
- 0029679044
- Reinforcement Learning; A Survey
- L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement Learning; A Survey," J. of Artificial Intelligence Research, vol. 4, 1996, pp. 237-285.
- (1996) J. of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

14
- 29344460680
- Using domain-configurable search control for probabilistic planning
- AAAI
- U. Kuter and D. Nau, "Using domain-configurable search control for probabilistic planning," in Proc of the National Conf. on Artificial Intelligence (AAAI), 2005.
- (2005) Proc of the National Conf. on Artificial Intelligence
- Kuter, U.¹ Nau, D.²

15
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L. J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Machine Learning, vol. 8, 1992, pp. 294-321.
- (1992) Machine Learning , vol.8 , pp. 294-321
- Lin, L.J.¹

16
- 0029732210
- Creating advice-taking reinforcement learners
- R. Maclin and J.W. Shavlik, "Creating advice-taking reinforcement learners," Machine Learning, vol. 22, 1996, pp. 251-282.
- (1996) Machine Learning , vol.22 , pp. 251-282
- Maclin, R.¹ Shavlik, J.W.²

17
- 0004255908
- McGraw Hill
- T. Mitchell, Machine Learning, McGraw Hill, 1997.
- (1997) Machine Learning
- Mitchell, T.¹

18
- 0003891507
- Englewood Cliffs, NJ: Prentice Hall
- K. Narendra and M. A. L. Thathachar, Learning Automata: An Introduction, Englewood Cliffs, NJ: Prentice Hall, 1989.
- (1989) Learning Automata: An Introduction
- Narendra, K.¹ Thathachar, M.A.L.²

19
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- A. Y. Ng, D. Harada, and S. Russell, "Policy invariance under reward transformations: theory and application to reward shaping," in Proc. of the 16th Int. Conf. on Machine Learning, 1999, pp. 278-287.
- (1999) Proc. of the 16th Int. Conf. on Machine Learning , pp. 278-287
- Ng, A.Y.¹ Harada, D.² Russell, S.³

20
- 85102627959
- Wiley, New York
- M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

21
- 8744269435
- Reinforcement learning with super- vision by a stable controller
- M. Rosenstein and A.G. Barto, "Reinforcement learning with super- vision by a stable controller," in Proc. of the American Control Conf., 2004, pp. 4517-4522.
- (2004) Proc. of the American Control Conf , pp. 4517-4522
- Rosenstein, M.¹ Barto, A.G.²

22
- 1942484759
- Q-decomposition for reinforcement learning agents
- S. Russell and A. L. Zimdars, "Q-decomposition for reinforcement learning agents," in Proc. of the 20th Int. Conf. on Machine Learning, 2003, pp. 278-287.
- (2003) Proc. of the 20th Int. Conf. on Machine Learning , pp. 278-287
- Russell, S.¹ Zimdars, A.L.²

23
- 0033901602
- Convergence results for single-step on-policy reinforcement learning algorithms
- S. Singh, T. Jaakkola, M. Littman, and C. Szepesvari, "Convergence results for single-step on-policy reinforcement learning algorithms," Machine Learning, vol. 38, pp. 287-308, 2000.
- (2000) Machine Learning , vol.38 , pp. 287-308
- Singh, S.¹ Jaakkola, T.² Littman, M.³ Szepesvari, C.⁴

24
- 0004007508
- MIT Press
- R. Sutton and A. Barto, Reinforcement Learning. MIT Press, 2000.
- (2000) Reinforcement Learning
- Sutton, R.¹ Barto, A.²

25
- 84947807317
- Open theoretical questions in reinforcement learning
- EuroCOLT'99, Nordkirchen, Germany
- R. Sutton, "Open theoretical questions in reinforcement learning," in Proc. of the 4th European Conference on Computational Learning Theory, EuroCOLT'99, Nordkirchen, Germany, 1999, pp. 11-17.
- (1999) Proc. of the 4th European Conference on Computational Learning Theory , pp. 11-17
- Sutton, R.¹

26
- 0028497630
- Asynchronous stochastic approximation and Q-learning
- J. N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Machine Learning, vol. 16, pp. 185-202, 1994.
- (1994) Machine Learning , vol.16 , pp. 185-202
- Tsitsiklis, J.N.¹

27
- 0039967456
- Analysis of some incremental variants of policy iteration: First steps toward understanding actor-critic learning systems
- Tech. Rep. NU-CCS-93-11
- R. J. Williams and L. C. Baird, "Analysis of some incremental variants of policy iteration: first steps toward understanding actor-critic learning systems," Tech. Rep. NU-CCS-93-11. 1993.
- (1993)
- Williams, R.J.¹ Baird, L.C.²

28
- 0029488746
- Using case-based reasoning as a reinforcement learning framework for optimization with changing criteria
- D. Zeng and K. Sycara, "Using case-based reasoning as a reinforcement learning framework for optimization with changing criteria," in Proc. of the 7th Int. Conf. on Tools with Artificial Intelligence, (ICTAI'95), pp. 56-62, 1995.
- (1995) Proc. of the 7th Int. Conf. on Tools with Artificial Intelligence, (ICTAI'95) , pp. 56-62
- Zeng, D.¹ Sycara, K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.