1. D. Precup and R. S. Sutton, "Multi-time models for temporally abstract planning," in NIPS 10, The MIT Press, 1998.
2. R. S. Sutton, D. Precup, and S. Singh, "Between MDPs and semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales," tech. rep., Univ. Mass., Dept. Comp. Inf. Sci., Amherst, MA, 1998.
3. R. Parr and S. Russell, "Reinforcement learning with hierarchies of machines," in NIPS 10, The MIT Press, 1998.
4. S. P. Singh, "Transfer of learning by composing solutions of elemental sequential tasks," Machine Learning, vol. 8, p. 323, 1992.
5. L. P. Kaelbling, "Hierarchical reinforcement learning: Preliminary results," in Proceedings ICML-10, pp. 167-173, Morgan Kaufmann, 1993.
6. M. Hauskrecht, N. Meuleau, C. Boutilier, L. Kaelbling, and T. Dean, "Hierarchical solution of Markov decision processes using macro-actions," tech. rep., Brown Univ., Dept. Comp. Sci., Providence, RI, 1998.
7. P. Dayan and G. Hinton, "Feudal reinforcement learning," in NIPS 5, pp. 271-278, San Francisco, CA: Morgan Kaufmann, 1993.
8. T. G. Dietterich, "The MAXQ method for hierarchical reinforcement learning," in ICML-15, Morgan Kaufmann, 1998.
9. S. Singh, T. Jaakkola, M. L. Littman, and C. Szepesvari, "Convergence results for single-step on-policy reinforcement-learning algorithms," tech. rep., Univ. Colorado, Dept. Comp. Sci., Boulder, CO, 1998.
11. T. Jaakkola, M. I. Jordan, and S. P. Singh, "On the convergence of stochastic iterative dynamic programming algorithms," Neural Computation, vol. 6, no. 6, pp. 1185-1201, 1994.
12. C. Boutilier, R. Dearden, and M. Goldszmidt, "Exploiting structure in policy construction," in Proceedings IJCAI-95, pp. 1104-1111, 1995.