SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Advances in Neural Information Processing Systems

Volumn , Issue , 1998, Pages 1043-1049

Reinforcement learning with hierarchies of machines

(2) Parr, Ronald a Russell, Stuart a

a UNIVERSITY OF CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIOR-BASED; CONVERGENT ALGORITHMS; LEARNING PROCESS; NEW APPROACHES; PRIOR KNOWLEDGE; SEARCH SPACES;

REINFORCEMENT LEARNING;

EID: 84898956770 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (525)

References (19)

1
- 1142305413
- Reacting, planning and learning in an autonomous agent
- K. Furukawa, D. Michie, and S. Muggleton, editors, Oxford University Press, Oxford
- S. Benson and N. Nilsson. Reacting, planning and learning in an autonomous agent. In K. Furukawa, D. Michie, and S. Muggleton, editors, Machine Intelligence 14. Oxford University Press, Oxford, 1995.
- (1995) Machine Intelligence , vol.14
- Benson, S.¹ Nilsson, N.²

2
- 0003636164
- Prentice-Hall, Englewood Cliffs, New Jersey
- D. C. Bertsekas and J. N. Tsitsiklis. Parallel and Distributed Computation: Numerical Methods. Prentice-Hall, Englewood Cliffs, New Jersey, 1989.
- (1989) Parallel and Distributed Computation: Numerical Methods
- Bertsekas, D.C.¹ Tsitsiklis, J.N.²

3
- 0000409272
- Reinforcement learning methods for continuous-time markov decision problems
- Denver, Colorado, December, MIT Press
- S. J. Bradtke and M. O. Duff. Reinforcement learning methods for continuous-time Markov decision problems. In Advances in Neural Information Processing Systems 7: Proc. of the 1994 Conference, Denver, Colorado, December 1995. MIT Press.
- (1995) Advances in Neural Information Processing Systems 7: Proc. of the 1994 Conference
- Bradtke, S.J.¹ Duff, M.O.²

4
- 0022688781
- A robust layered control system for a mobile robot
- R. A. Brooks. A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation, 2, 1986.
- (1986) IEEE Journal of Robotics and Automation , vol.2
- Brooks, R.A.¹

5
- 0026255231
- O-plan: The open planning architecture
- November
- K. W. Currie and A. Tate. O-Plan: The Open Planning Architecture. Artificial Intelligence, 52(1), November 1991.
- (1991) Artificial Intelligence , vol.52 , Issue.1
- Currie, K.W.¹ Tate, A.²

6
- 0001234682
- Feudal reinforcement learning
- Stephen Jose Hanson, Jack D. Cowan, and C. Lee Giles, editors, San Mateo, California, Morgan Kaufman
- P. Dayan and G. E. Hinton. Feudal reinforcement learning. In Stephen Jose Hanson, Jack D. Cowan, and C. Lee Giles, editors, Neural Information Processing Systems 5, San Mateo, California, 1993. Morgan Kaufman.
- (1993) Neural Information Processing Systems , vol.5
- Dayan, P.¹ Hinton, G.E.²

7
- 0000746330
- Model reduction techniques for computing approximately optimal solutions for markov decision processes
- Providence, Rhode Island, August, Morgan Kaufmann
- T. Dean, R. Givan, and S. Leach. Model reduction techniques for computing approximately optimal solutions for markov decision processes. In Proc. of the Thirteenth Conference on Un-certainty in Artificial Intelligence, Providence, Rhode Island, August 1997. Morgan Kaufmann.
- (1997) Proc. of the Thirteenth Conference on Uncertainty in Artificial Intelligence
- Dean, T.¹ Givan, R.² Leach, S.³

8
- 85168151397
- Decomposition techniques for planning in stochastic domains
- Montreal, Canada, August, Morgan Kaufmann
- T. Dean and S.-H. Lin. Decomposition techniques for planning in stochastic domains. In Proc. of the Fourteenth Int. Joint Conference on Artificial Intelligence, Montreal, Canada, August 1995. Morgan Kaufmann.
- (1995) Proc. of the Fourteenth Int. Joint Conference on Artificial Intelligence
- Dean, T.¹ Lin, S.-H.²

9
- 0344074989
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Oregon State University, Corvallis, Oregon
- Thomas G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Technical report, Department of Computer Science, Oregon State University, Corvallis, Oregon, 1997.
- (1997) Technical Report, Department of Computer Science
- Dietterich, T.G.¹

10
- 84898980271
- Synthesizing efficient agents from partial programs
- Proc. Charlotte, North Carolina, October, Springer-Verlag
- Y.-J. Hsu. Synthesizing efficient agents from partial programs. In Methodologies for Intelligent Systems: 6th Int. Symposium, 1SM1S '91, Proc., Charlotte, North Carolina, October 1991. Springer-Verlag.
- (1991) Methodologies for Intelligent Systems: 6th Int. Symposium, 1SM1S '91
- Hsu, Y.-J.¹

11
- 0000439891
- On the convergence of stochastic iterative dynamic programming algorithms
- T. Jaakkola, M.I. Jordan, and S.P. Singh. On the convergence of stochastic iterative dynamic programming algorithms. Neural Computation, 6(6), 1994.
- (1994) Neural Computation , vol.6 , Issue.6
- Jaakkola, T.¹ Jordan, M.I.² Singh, S.P.³

12
- 0003673017
- PhD thesis, Computer Science Department, Carnegie-Mellon University, Pittsburgh, Pennsylvania
- L.-J. Lin. Reinforcement Learning for Robots Using Neural Networks. PhD thesis, Computer Science Department, Carnegie-Mellon University, Pittsburgh, Pennsylvania, 1993.
- (1993) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.-J.¹

13
- 0347369287
- PhD thesis, Computer Science Department, Brown University, Providence, Rhode Island
- Shieu-Hong Lin. Exploiting Structure for Planning and Control. PhD thesis, Computer Science Department, Brown University, Providence, Rhode Island, 1997.
- (1997) Exploiting Structure for Planning and Control
- Lin, S.-H.¹

14
- 0002654557
- Roles of macro-actions in accelerating reinforcement learning
- A. McGovern, R. S. Sutton, and A. H. Fagg. Roles of macro-actions in accelerating reinforcement learning. In 1997 Grace Hopper Celebration of Women in Computing, 1997.
- (1997) 1997 Grace Hopper Celebration of Women in Computing
- Mcgovern, A.¹ Sutton, R.S.² Fagg, A.H.³

15
- 84943322357
- In This Volume
- D. Precup and R. S. Sutton. Multi-time models for temporally abstract planning. In This Volume.
- Multi-time Models for Temporally Abstract Planning
- Precup, D.¹ Sutton, R.S.²

16
- 0002876837
- Scaling reinforcement learning algorithms by learning variable temporal resolution models
- Aberdeen, July, Morgan Kaufmann
- S. P. Singh. Scaling reinforcement learning algorithms by learning variable temporal resolution models. In Proceedings of the Ninth International Conference on Machine Learning, Aberdeen, July 1992. Morgan Kaufmann.
- (1992) Proceedings of the Ninth International Conference on Machine Learning
- Singh, S.P.¹

17
- 0001027894
- Transfer of learning by composing solutions of elemental sequential tasks
- May
- S. P. Singh. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8(3), May 1992.
- (1992) Machine Learning , vol.8 , Issue.3
- Singh, S.P.¹

18
- 0000224681
- Reinforcement learning with soft state aggregation
- G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Cambridge, Massachusetts, MIT Press
- S. P. Singh, T. Jaakola, and M. I. Jordan. Reinforcement learning with soft state aggregation. In G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Neural Information Processing Systems 7, Cambridge, Massachusetts, 1995. MIT Press.
- (1995) Neural Information Processing Systems , vol.7
- Singh, S.P.¹ Jaakola, T.² Jordan, M.I.³

19
- 84898991560
- Temporal abstraction in reinforcement learning
- Tahoe City, CA, July, Morgan Kaufmann
- R. S. Sutton. Temporal abstraction in reinforcement learning. In Proc. of the Twelfth Int. Conference on Machine Learning, Tahoe City, CA, July 1995. Morgan Kaufmann.
- (1995) Proc. of the Twelfth Int. Conference on Machine Learning
- Sutton, R.S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.