SCOPUS Information Search Platform

Proceedings of the Eighteenth International Florida Artificial Intelligence Research Society Conference, FLAIRS 2005 - Recent Advances in Artificial Intelligence

Volume , Issue , 2005, Pages 461-466

Toward a topological theory of relational reinforcement learning for navigation tasks

(2) Lane, Terran a; Wilson, Andrew b

a MSC01 1070 (United States)

b SANDIA NATIONAL LABORATORIES (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CLOSED-FORM ENVELOPE; MARKOV DECISION PROCESS (MDP); RELATIONAL REINFORCEMENT LEARNING;

DECISION THEORY; MARKOV PROCESSES; NAVIGATION;

LEARNING SYSTEMS;

EID: 32844454618; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited : (8)

References (17)

1
- 0242456818
- Relational Markov models and their application to adaptive web navigation
- Edmonton, Alberta, Canada: ACM SIGKDD
- Anderson, C. R.; Domingos, P.; and Weld, D. S. 2002. Relational Markov models and their application to adaptive web navigation. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2002). Edmonton, Alberta, Canada: ACM SIGKDD.
- (2002) Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2002)
- Anderson, C.R.¹ Domingos, P.² Weld, D.S.³

2
- 0034248853
- Stochastic dynamic programming with factored representations
- Boutilier, C.; Dearden, R.; and Goldszmidt, M. 2000. Stochastic dynamic programming with factored representations. Artificial Intelligence 121(1-2):49-107.
- (2000) Artificial Intelligence , vol.121 , Issue.1-2 , pp. 49-107
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

3
- 0031370386
- Model minimization in Markov decision processes
- Providence, RI: AAAI Press/MIT Press
- Dean, T., and Givan, R. 1997. Model minimization in Markov decision processes. In Proceedings of the Fourteenth National Conference on Artificial Intelligence (AAAI-97), 106-111. Providence, RI: AAAI Press/MIT Press.
- (1997) Proceedings of the Fourteenth National Conference on Artificial Intelligence (AAAI-97) , pp. 106-111
- Dean, T.¹ Givan, R.²

4
- 0029332887
- Planning under time constraints in stochastic domains
- Dean, T.; Kaelbling, L. P.; Kirman, J.; and Nicholson, A. 1995. Planning under time constraints in stochastic domains. Artificial Intelligence 76.
- (1995) Artificial Intelligence , vol.76
- Dean, T.¹ Kaelbling, L.P.² Kirman, J.³ Nicholson, A.⁴

5
- 84942867726
- An overview of MAXQ hierarchical reinforcement learning
- Choueiry, B. Y., and Walsh, T., eds., Lecture Notes in Artificial Intelligence. New York: Springer Verlag
- Dietterich, T. G. 2000. An overview of MAXQ hierarchical reinforcement learning. In Choueiry, B. Y., and Walsh, T., eds., Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA 2000), Lecture Notes in Artificial Intelligence. New York: Springer Verlag.
- (2000) Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA 2000)
- Dietterich, T.G.¹

6
- 33749683139
- The thing that we tried didn't work very well: Deictic representation in reinforcement learning
- Finney, S.; Gardiol, N. H.; Kaelbling, L. P.; and Oates, T. 2002. The thing that we tried didn't work very well: Deictic representation in reinforcement learning. In Proceedings of the Eighteenth International Conference on Uncertainty in Artificial Intelligence (UAI-2002).
- (2002) Proceedings of the Eighteenth International Conference on Uncertainty in Artificial Intelligence (UAI-2002)
- Finney, S.¹ Gardiol, N.H.² Kaelbling, L.P.³ Oates, T.⁴

7
- 0041779094
- Learning Probabilistic Relational Models
- Dzeroski, S. and Lavrac, N., eds. Springer-Verlag
- Getoor, L.; Friedman, N.; Koller, D.; and Pfeffer, A. 2001. Learning Probabilistic Relational Models. In Dzeroski, S. and Lavrac, N., eds., Relational Data Mining. Springer-Verlag.
- (2001) Relational Data Mining
- Getoor, L.¹ Friedman, N.² Koller, D.³ Pfeffer, A.⁴

8
- 0006419533
- Hierarchical solution of Markov decision processes using macro-actions
- Cooper, G. F., and Moral, S., eds. Morgan Kaufmann
- Hauskrecht, M.; Meuleau, N.; Boutilier, C.; Kaelbling, L. P.; and Dean, T. 1998. Hierarchical solution of Markov decision processes using macro-actions. In Cooper, G. F., and Moral, S., eds., Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98). Morgan Kaufmann.
- (1998) Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98)
- Hauskrecht, M.¹ Meuleau, N.² Boutilier, C.³ Kaelbling, L.P.⁴ Dean, T.⁵

9
- 0036931070
- Nearly deterministic abstractions of Markov decision processes
- Edmonton, Canada: AAAI Press
- Lane, T., and Kaelbling, L. P. 2002. Nearly deterministic abstractions of Markov decision processes. In Proceedings of the Eighteenth National Conference on Artificial Intelligence (AAAI-02), 260-266. Edmonton, Canada: AAAI Press.
- (2002) Proceedings of the Eighteenth National Conference on Artificial Intelligence (AAAI-02) , pp. 260-266
- Lane, T.¹ Kaelbling, L.P.²

10
- 0346738900
- Flexible decomposition algorithms for weakly coupled Markov decision problems
- Cooper, G. F., and Moral, S., eds. Morgan Kaufmann
- Parr, R. 1998a. Flexible decomposition algorithms for weakly coupled Markov decision problems. In Cooper, G. F., and Moral, S., eds., Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98). Morgan Kaufmann.
- (1998) Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98)
- Parr, R.¹

11
- 0003989214
- Ph.D. Dissertation, University of California at Berkeley
- Parr, R. E. 1998b. Hierarchical Control and Learning for Markov Decision Processes. Ph.D. Dissertation, University of California at Berkeley.
- (1998) Hierarchical Control and Learning for Markov Decision Processes
- Parr, R.E.¹

12
- 0003392384
- Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA
- Precup, D. 2000. Temporal Abstraction in Reinforcement Learning. Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA.
- (2000) Temporal Abstraction in Reinforcement Learning
- Precup, D.¹

13
- 85102627959
- New York: John Wiley & Sons
- Puterman, M. L. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York: John Wiley & Sons.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

14
- 1942514642
- Model minimization in hierarchical reinforcement learning
- Holte, R., ed.
- Ravindran, B., and Barto, A. G. 2002. Model minimization in hierarchical reinforcement learning. In Holte, R., ed., Proceedings of the 2002 Symposium on Abstraction, Reformulation, and Approximation (SARA-2002).
- (2002) Proceedings of the 2002 Symposium on Abstraction, Reformulation, and Approximation (SARA-2002)
- Ravindran, B.¹ Barto, A.G.²

15
- 1942484796
- Relativized options: Choosing the right transformation
- Fawcett, T., and Mishra, N., eds. Washington, DC: AAAI Press
- Ravindran, B., and Barto, A. G. 2003. Relativized options: Choosing the right transformation. In Fawcett, T., and Mishra, N., eds., Proceedings of the Twentieth International Conference on Machine Learning, 608-615. Washington, DC: AAAI Press.
- (2003) Proceedings of the Twentieth International Conference on Machine Learning , pp. 608-615
- Ravindran, B.¹ Barto, A.G.²

16
- 32844454706
- Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA
- Ravindran, B. 2004. An Algebraic Approach to Abstraction in Reinforcement Learning. Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA.
- (2004) An Algebraic Approach to Abstraction in Reinforcement Learning
- Ravindran, B.¹

17
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., and Barto, A. G. 1998. Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.