SCOPUS 정보 검색 플랫폼

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)

Volumn 1864, Issue , 2000, Pages 26-44

An overview of MAXQ hierarchical reinforcement learning

(1) Dietterich, Thomas G a

a OREGON STATE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACTING; DECISION MAKING; PROBLEM SOLVING; STOCHASTIC SYSTEMS;

HIERARCHICAL METHOD; HIERARCHICAL REINFORCEMENT LEARNING; OPTIMAL POLICIES; PROBLEM SPACE; REWARD FUNCTION; SEQUENTIAL DECISION MAKING; STATE ABSTRACTION; VALUE FUNCTION DECOMPOSITION;

REINFORCEMENT LEARNING;

EID: 84942867726 PISSN: 03029743 EISSN: None Source Type: Conference Proceeding
DOI: 10.1007/3-540-44914-0_2 Document Type: Conference Paper

Times cited : (61)

References (15)

1
- 0003351108
- Neuro-dynamic programming
- Belmont, MA
- Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-Dynamic Programming. Athena Scientific, Belmont, MA.
- (1996) Athena Scientific
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

2
- 85156187730
- Improving elevator performance using reinforcement learning
- San Francisco, CA. Morgan Kaufmann
- Crites, R. H., & Barto, A. G. (1995). Improving elevator performance using reinforcement learning. In Advances in Neural Information Processing Systems, Vol. 8, pp. 1017-1023 San Francisco, CA. Morgan Kaufmann.
- (1995) Advances in Neural Information Processing Systems , vol.8 , pp. 1017-1023
- Crites, R.H.¹ Barto, A.G.²

3
- 0001234682
- Feudal reinforcement learning
- Morgan Kaufmann, San Francisco, CA
- Dayan, P., & Hinton, G. (1993). Feudal reinforcement learning. In Advances in Neural Information Processing Systems, 5, pp. 271-278. Morgan Kaufmann, San Francisco, CA.
- (1993) Advances in Neural Information Processing Systems , vol.5 , pp. 271-278
- Dayan, P.¹ Hinton, G.²

4
- 0006424007
- Tech. rep. CS-95-10 Department of Computer Science, Brown University, Providence, Rhode Island
- Dean, T., & Lin, S.-H. (1995). Decomposition techniques for planning in stochastic domains. Tech. rep. CS-95-10, Department of Computer Science, Brown University, Providence, Rhode Island.
- (1995) Decomposition Techniques for Planning in Stochastic Domains
- Dean, T.¹ Lin, S.-H.²

5
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research. To appear.
- (2000) Journal of Artificial Intelligence Research
- Dietterich, T.G.¹

6
- 85143168613
- Hierarchical reinforcement learning: Preliminary results
- San Francisco, CA. Morgan Kaufmann
- Kaelbling, L. P. (1993). Hierarchical reinforcement learning: Preliminary results. In Proceedings of the Tenth International Conference on Machine Learning, pp. 167-173 San Francisco, CA. Morgan Kaufmann.
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 167-173
- Kaelbling, L.P.¹

7
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- Moore, A. W., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103.
- (1993) Machine Learning , vol.13 , pp. 103
- Moore, A.W.¹ Atkeson, C.G.²

8
- 0003989214
- Ph. D. Thesis, University of California, Berkeley, California
- Parr, R. (1998). Hierarchical control and learning for Markov decision processes. Ph. D. Thesis, University of California, Berkeley, California.
- (1998) Hierarchical Control and Learning for Markov Decision Processes
- Parr, R.¹

9
- 84898956770
- Reinforcement learning with hierarchies of machines
- Cambridge, MA. MIT Press
- Parr, R., & Russell, S. (1998). Reinforcement learning with hierarchies of machines. In Advances in Neural Information Processing Systems, Vol. 10, pp. 1043-1049 Cambridge, MA. MIT Press.
- (1998) Advances in Neural Information Processing Systems , vol.10 , pp. 1043-1049
- Parr, R.¹ Russell, S.²

10
- 0346087506
- Tech. rep., University of Colorado, Department of Computer Science, Boulder, CO. To appear in Machine Learning
- Singh, S., Jaakkola, T., Littman, M. L., & Szepesvári, C. (1998). Convergence results for single-step on-policy reinforcement-learning algorithms. Tech. rep., University of Colorado, Department of Computer Science, Boulder, CO. To appear in Machine Learning.
- (1998) Convergence Results for Single-step On-policy Reinforcement-learning Algorithms
- Singh, S.¹ Jaakkola, T.² Littman, M.L.³ Szepesvári, C.⁴

11
- 0001027894
- Transfer of learning by composing solutions of elemental sequential tasks
- Singh, S. P. (1992). Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8, 323.
- (1992) Machine Learning , vol.8 , pp. 323
- Singh, S.P.¹

12
- 0003420416
- MIT Press, Cambridge, MA.
- Sutton, R., & Barto, A. G. (1998). Introduction to Reinforcement Learning. MIT Press, Cambridge, MA.
- (1998) Introduction to Reinforcement Learning
- Sutton, R.¹ Barto, A.G.²

13
- 0003899594
- Tech. rep., University of Massachusetts, Department of Computer and Information Sciences, Amherst, MA. To appear in Artificial Intelligence
- Sutton, R. S., Precup, D., & Singh, S. (1998). Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales. Tech. rep., University of Massachusetts, Department of Computer and Information Sciences, Amherst, MA. To appear in Artificial Intelligence.
- (1998) Between MDPs and Semi-MDPs: Learning, Planning, and Representing Knowledge at Multiple Temporal Scales
- Sutton, R.S.¹ Precup, D.² Singh, S.³

14
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro, G. (1995). Temporal difference learning and TD-Gammon. Communications of the ACM, 28 (3), 58-68.
- (1995) Communications of the ACM , vol.28 , Issue.3 , pp. 58-68
- Tesauro, G.¹

15
- 84918834208
- A reinforcement learning approach to job-shop scheduling
- Morgan Kaufmann, San Francisco, CA
- Zhang, W., & Dietterich, T. G. (1995). A reinforcement learning approach to job-shop scheduling. In 1995 International Joint Conference on Artificial Intelligence, pp. 1114-1120. Morgan Kaufmann, San Francisco, CA.
- (1995) 1995 International Joint Conference on Artificial Intelligence , pp. 1114-1120
- Zhang, W.¹ Dietterich, T.G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.