2002, Pages 260-266

Nearly deterministic abstractions of Markov decision processes

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; MARKOV PROCESSES; NAVIGATION; OPTIMIZATION; PROBLEM SOLVING;

EID: 0036931070     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (14)

References (23)
  • 3
    • Boutilier, C.; Dearden, R.; and Goldszmidt, M. 2000. Stochastic dynamic programming with factored representations. Artificial Intelligence 121(1-2):49-107.
  • 9
    • Johnson, D. S., and McGeoch, L. A. 2001. The Traveling Salesman Problem and its Variations, chapter Experimental Analysis of Heuristics for the STSP. Kluwer Academic Publishers. To appear.
  • 18
    • Mundhenk, M.; Goldsmith, J.; Lusena, C.; and Allender, E. 2000. Complexity of finite-horizon Markov decision process problems. Journal of the ACM 47(4):681-720.
  • 21
    • Precup, D. 2000. Temporal Abstraction in Reinforcement Learning. Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst.
  • 23
    • Sutton, R. S.; Precup, D.; and Singh, S. 1999. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112:181-211.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.