SCOPUS Information Retrieval Platform

Artificial Intelligence

Volume 101, Issue 1-2, 1998, Pages 267-284

Utility-based on-line exploration for repeated navigation in an embedded graph

(3) Argamon Engelson, Shlomo a; Kraus, Sarit a,b; Sina, Sigalit a

a BAR ILAN UNIVERSITY (Israel)

b UNIVERSITY OF MARYLAND (United States)

Author keywords

Exploration versus exploitation; Navigation on embedded graphs; Repeated tasks; Utility based search

Indexed keywords

ALGORITHMS; GRAPH THEORY; LEARNING SYSTEMS; NAVIGATION; RECURSIVE FUNCTIONS;

EMBEDDED GRAPHS; REPEATED TASKS; UTILITY BASED SEARCH;

ARTIFICIAL INTELLIGENCE;

EID: 0032072803; PISSN: 00043702; EISSN: None; Source Type: Journal
DOI: 10.1016/s0004-3702(98)00014-9; Document Type: Article

Times cited : (7)

References (29)

1
- 34247234783
- C.B. Barber and H. Huhdanpaa, Qhull software package (1995). http://www.geom.umn.edu/software/qhull.
- (1995) Qhull Software Package
- Barber, C.B.¹ Huhdanpaa, H.²

2
- 0042157618
- Technical Report CS-89-27, Brown University, Department of Computer Science
- K. Basye, T. Dean and J.S. Vitter, Coping with uncertainty in map learning, Technical Report CS-89-27, Brown University, Department of Computer Science, 1989.
- (1989) Coping with Uncertainty in Map Learning
- Basye, K.¹ Dean, T.² Vitter, J.S.³

3
- 0042658634
- Dynamic Programming, Princeton Univ. Press, Princeton
- R. Bellman, Dynamic Programming, Princeton Univ. Press, Princeton, 1957.
- (1957)
- Bellman, R.¹

4
- 0004181906
- Chapman and Hall, London
- D.A. Berry and B. Fristedt, Bandit Problems: Sequential Allocation of Experiments, Chapman and Hall, London, 1985.
- (1985) Bandit Problems: Sequential Allocation of Experiments
- Berry, D.A.¹ Fristedt, B.²

5
- 0003565779
- Prentice-Hall, Englewood Cliffs, NJ
- D.P. Bertsekas, Dynamic Programming: Deterministic and Stochastic Models, Prentice-Hall, Englewood Cliffs, NJ, 1987.
- (1987) Dynamic Programming: Deterministic and Stochastic Models
- Bertsekas, D.P.¹

6
- 0029256026
- Piecemeal learning of an unknown environment
- M. Betke, R.L. Rivest and M. Singh, Piecemeal learning of an unknown environment, Machine Learning 18 (2-3) (1995) 231-254.
- (1995) Machine Learning , vol.18 , Issue.2-3 , pp. 231-254
- Betke, M.¹ Rivest, R.L.² Singh, M.³

7
- 0027837675
- An on-line algorithm for improving performance in navigation
- A. Blum and P. Chalasani, An on-line algorithm for improving performance in navigation, in: Proceedings Symposium on Foundations of Computer Science, 1993, pp. 2-11.
- (1993) Proceedings Symposium on Foundations of Computer Science , pp. 2-11
- Blum, A.¹ Chalasani, P.²

8
- 0028601246
- The trailblazer search: A new method for searching and capturing moving targets
- Seattle, WA
- F. Chimura and M. Tokoro, The trailblazer search: a new method for searching and capturing moving targets, in: Proceedings AAAI-94, Seattle, WA, 1994, pp. 1347-1352.
- (1994) Proceedings AAAI-94 , pp. 1347-1352
- Chimura, F.¹ Tokoro, M.²

9
- 0001032516
- Learning in navigation: Goal finding in graphs
- P. Cucka, N.S. Netanyahu and A. Rosenfeld, Learning in navigation: goal finding in graphs, Internat. J. Pattern Recognition and Artificial Intelligence 10 (5) (1996) 429-446.
- (1996) Internat. J. Pattern Recognition and Artificial Intelligence , vol.10 , Issue.5 , pp. 429-446
- Cucka, P.¹ Netanyahu, N.S.² Rosenfeld, A.³

10
- 0029207679
- Inferring finite automata with stochastic output functions and an application to map learning
- T. Dean, D. Angluin, K. Basye, S. Engelson, L. Kaelbling, E. Kokkevis and O. Maron, Inferring finite automata with stochastic output functions and an application to map learning, Machine Learning 18 (1) (1995) 81-108.
- (1995) Machine Learning , vol.18 , Issue.1 , pp. 81-108
- Dean, T.¹ Angluin, D.² Basye, K.³ Engelson, S.⁴ Kaelbling, L.⁵ Kokkevis, E.⁶ Maron, O.⁷

11
- 0029332887
- Planning under time constraints in stochastic domains
- T. Dean, L.P. Kaelbling, J. Kirman and A. Nicholson, Planning under time constraints in stochastic domains, Artificial Intelligence 76 (1-2) (1995) 35-74.
- (1995) Artificial Intelligence , vol.76 , Issue.1-2 , pp. 35-74
- Dean, T.¹ Kaelbling, L.P.² Kirman, J.³ Nicholson, A.⁴

12
- 0026376345
- How to learn in an unknown environment
- IEEE Computer Society Press, Los Alamitos, CA
- X. Deng, T. Kameda and C. Papadimitriou, How to learn in an unknown environment, in: Proceedings 32nd Symposium on the Foundations of Computer Science, IEEE Computer Society Press, Los Alamitos, CA, 1991, pp. 298-303.
- (1991) Proceedings 32nd Symposium on the Foundations of Computer Science , pp. 298-303
- Deng, X.¹ Kameda, T.² Papadimitriou, C.³

13
- 0025742530
- Exploring an unknown graph
- IEEE Computer Society Press
- X. Deng and C.H. Papadimitriou, Exploring an unknown graph, in: Proceedings 31st Annual Symposium on Foundations of Computer Science, IEEE Computer Society Press, 1990, pp. 355-361.
- (1990) Proceedings 31st Annual Symposium on Foundations of Computer Science , pp. 355-361
- Deng, X.¹ Papadimitriou, C.H.²

14
- 0003435471
- Brooks/Cole Publishing Company, Pacific Grove, CA
- J.L. Devore, Probability and Statistics for Engineering and Sciences, Brooks/Cole Publishing Company, Pacific Grove, CA, 1991.
- (1991) Probability and Statistics for Engineering and Sciences
- Devore, J.L.¹

15
- 0026151568
- Embedding decision-analytic control in a learning architecture
- O. Etzioni, Embedding decision-analytic control in a learning architecture, Artificial Intelligence 49 (1991) 129-159.
- (1991) Artificial Intelligence , vol.49 , pp. 129-159
- Etzioni, O.¹

16
- 0005177760
- Efficient decision-theoretic planning: Techniques and empirical analysis
- P. Haddawy, A. Doan and R. Goodwin, Efficient decision-theoretic planning: techniques and empirical analysis, in: Proceedings Conference on Uncertainty in Artificial Intelligence, 1995, pp. 229-236.
- (1995) Proceedings Conference on Uncertainty in Artificial Intelligence , pp. 229-236
- Haddawy, P.¹ Doan, A.² Goodwin, R.³

17
- 0029326195
- A moving target search: A real-time search for changing goals
- T. Ishida and R. Korf, A moving target search: a real-time search for changing goals, IEEE Trans. Pattern Analysis and Machine Intelligence 17 (6) (1995) 609-619.
- (1995) IEEE Trans. Pattern Analysis and Machine Intelligence , vol.17 , Issue.6 , pp. 609-619
- Ishida, T.¹ Korf, R.²

18
- 0030362555
- Improving the learning efficiencies of realtime search
- Portland, OR
- T. Ishida and M. Shimbo, Improving the learning efficiencies of realtime search, in: Proceedings AAAI-96, Portland, OR, 1996, pp. 305-310.
- (1996) Proceedings AAAI-96 , pp. 305-310
- Ishida, T.¹ Shimbo, M.²

19
- 0004280606
- The MIT Press, Cambridge, MA
- L.P. Kaelbling, Learning in Embedded Systems, The MIT Press, Cambridge, MA, 1993.
- (1993) Learning in Embedded Systems
- Kaelbling, L.P.¹

20
- 0029679044
- Reinforcement learning: A survey
- L.P. Kaelbling, M.L. Littman and A.W. Moore, Reinforcement learning: a survey, J. Artif. Intell. Res. 4 (1996) 237-285.
- (1996) J. Artif. Intell. Res. , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

21
- 0041912178
- Probabilistic exploration in planning while learning
- P. Besnard and S. Hanks (Eds.)
- G.I. Karakoulas, Probabilistic exploration in planning while learning, in: P. Besnard and S. Hanks (Eds.), Eleventh Annual Conference on Uncertainty in Artificial Intelligence, 1995, pp. 352-361.
- (1995) Eleventh Annual Conference on Uncertainty in Artificial Intelligence , pp. 352-361
- Karakoulas, G.I.¹

22
- 0025400088
- Real-time heuristic search
- R. Korf, Real-time heuristic search, Artificial Intelligence 42 (2-3) (1990) 189-211.
- (1990) Artificial Intelligence , vol.42 , Issue.2-3 , pp. 189-211
- Korf, R.¹

23
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less real time
- A.W. Moore and C.G. Atkeson, Prioritized sweeping: reinforcement learning with less data and less real time, Machine Learning 13 (1993).
- (1993) Machine Learning , vol.13
- Moore, A.W.¹ Atkeson, C.G.²

24
- 0026190127
- Shortest paths without a map
- C.H. Papadimitriou and M. Yannakakis, Shortest paths without a map, Theoret. Comput. Sci. 84 (1) (1991) 127-150.
- (1991) Theoret. Comput. Sci. , vol.84 , Issue.1 , pp. 127-150
- Papadimitriou, C.H.¹ Yannakakis, M.²

25
- 85102627959
- John Wiley & Sons, New York
- M.L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming, John Wiley & Sons, New York, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

26
- 0024862996
- Inference of finite automata using homing sequences
- Seattle, WA
- R.L. Rivest and R.E. Schapire, Inference of finite automata using homing sequences (extended abstract), in: Proceedings Twenty-First Annual ACM Symposium on Theory of Computing, Seattle, WA, 1989, pp. 411-420.
- (1989) Proceedings Twenty-first Annual ACM Symposium on Theory of Computing , pp. 411-420
- Rivest, R.L.¹ Schapire, R.E.²

27
- 0030372124
- Efficient goal-directed exploration
- Portland, OR
- Y. Smirnov, S. Koenig, M.M. Veloso and R.G. Simmons, Efficient goal-directed exploration, in: Proceedings AAAI-96, Portland, OR, 1996, pp. 292-297.
- (1996) Proceedings AAAI-96 , pp. 292-297
- Smirnov, Y.¹ Koenig, S.² Veloso, M.M.³ Simmons, R.G.⁴

28
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- Austin, TX, Morgan Kaufmann, San Mateo, CA
- R.S. Sutton, Integrated architectures for learning, planning, and reacting based on approximating dynamic programming, in: Proceedings Seventh International Conference on Machine Learning, Austin, TX, Morgan Kaufmann, San Mateo, CA, 1990, pp. 216-224.
- (1990) Proceedings Seventh International Conference on Machine Learning , pp. 216-224
- Sutton, R.S.¹

29
- 0002210775
- The role of exploration in learning control
- Van Nostrand Reinhold, Florence, KY
- S. Thrun, The role of exploration in learning control, in: Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches, Van Nostrand Reinhold, Florence, KY, 1992.
- (1992) Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches
- Thrun, S.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.