Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
Doya, K., Samejima, K., Katagiri, K., & Kawato, M. (2002). Multiple model-based reinforcement learning. To appear in Neural Computation.
Givan, R., Dean, T., & Greig, M. (2003). Equivalence notions and model minimization in Markov decision processes. To appear in Artificial Intelligence.
Givan, R., Leach, S., & Dean, T. (2000). Bounded-parameter Markov decision processes. Artificial Intelligence, 122, 71-109.
Haruno, M., Wolpert, D. M., & Kawato, M. (2001). MOSAIC model for sensorimotor learning and control. Neural Computation, 13, 2201-2220.
Iba, G. A. (1989). A heuristic approach to the discovery of macro-operators. Machine Learning, 3, 285-317.
Jonsson, A., & Barto, A. G. (2001). Automated state abstraction for options using the U-tree algorithm. Advances in Neural Information Processing Systems 13 (pp. 1054-1060). Cambridge, MA: MIT Press.
Ravindran, B., & Barto, A. G. (2002). Model minimization in hierarchical reinforcement learning. Proceedings of the Fifth Symposium on Abstraction, Reformulation and Approximation (SARA 2002) (pp. 196-211). New York, NY: Springer-Verlag.
Sutton, R. S., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.