메뉴 건너뛰기




Volumn 101, Issue 1-2, 1998, Pages 267-284

Utility-based on-line exploration for repeated navigation in an embedded graph

Author keywords

Exploration versus exploitation; Navigation on embedded graphs; Repeated tasks; Utility based search

Indexed keywords

ALGORITHMS; GRAPH THEORY; LEARNING SYSTEMS; NAVIGATION; RECURSIVE FUNCTIONS;

EID: 0032072803     PISSN: 00043702     EISSN: None     Source Type: Journal    
DOI: 10.1016/s0004-3702(98)00014-9     Document Type: Article
Times cited : (7)

References (29)
  • 3
    • 0042658634 scopus 로고
    • Dynamic Programming, Princeton Univ. Press, Princeton
    • R. Bellman, Dynamic Programming, Princeton Univ. Press, Princeton, 1957.
    • (1957)
    • Bellman, R.1
  • 6
    • 0029256026 scopus 로고
    • Piecemeal learning of an unknown environment
    • M. Betke, R.L. Rivest and M. Singh, Piecemeal learning of an unknown environment, Machine Learning 18 (2-3) (1995) 231-254.
    • (1995) Machine Learning , vol.18 , Issue.2-3 , pp. 231-254
    • Betke, M.1    Rivest, R.L.2    Singh, M.3
  • 8
    • 0028601246 scopus 로고
    • The trailblazer search: A new method for searching and capturing moving targets
    • Seattle, WA
    • F. Chimura and M. Tokoro, The trailblazer search: a new method for searching and capturing moving targets, in: Proceedings AAAI-94, Seattle, WA, 1994, pp. 1347-1352.
    • (1994) Proceedings AAAI-94 , pp. 1347-1352
    • Chimura, F.1    Tokoro, M.2
  • 10
    • 0029207679 scopus 로고
    • Inferring finite automata with stochastic output functions and an application to map learning
    • T. Dean, D. Angluin, K. Basye, S. Engelson, L. Kaelbling, E. Kokkevis and O. Maron, Inferring finite automata with stochastic output functions and an application to map learning, Machine Learning 18 (1) (1995) 81-108.
    • (1995) Machine Learning , vol.18 , Issue.1 , pp. 81-108
    • Dean, T.1    Angluin, D.2    Basye, K.3    Engelson, S.4    Kaelbling, L.5    Kokkevis, E.6    Maron, O.7
  • 15
    • 0026151568 scopus 로고
    • Embedding decision-analytic control in a learning architecture
    • O. Etzioni, Embedding decision-analytic control in a learning architecture, Artificial Intelligence 49 (1991) 129-159.
    • (1991) Artificial Intelligence , vol.49 , pp. 129-159
    • Etzioni, O.1
  • 18
    • 0030362555 scopus 로고    scopus 로고
    • Improving the learning efficiencies of realtime search
    • Portland, OR
    • T. Ishida and M. Shimbo, Improving the learning efficiencies of realtime search, in: Proceedings AAAI-96, Portland, OR, 1996, pp. 305-310.
    • (1996) Proceedings AAAI-96 , pp. 305-310
    • Ishida, T.1    Shimbo, M.2
  • 22
    • 0025400088 scopus 로고
    • Real-time heuristic search
    • R. Korf, Real-time heuristic search, Artificial Intelligence 42 (2-3) (1990) 189-211.
    • (1990) Artificial Intelligence , vol.42 , Issue.2-3 , pp. 189-211
    • Korf, R.1
  • 23
    • 0027684215 scopus 로고
    • Prioritized sweeping: Reinforcement learning with less data and less real time
    • A.W. Moore and C.G. Atkeson, Prioritized sweeping: reinforcement learning with less data and less real time, Machine Learning 13 (1993).
    • (1993) Machine Learning , vol.13
    • Moore, A.W.1    Atkeson, C.G.2
  • 28
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Austin, TX, Morgan Kaufmann, San Mateo, CA
    • R.S. Sutton, Integrated architectures for learning, planning, and reacting based on approximating dynamic programming, in: Proceedings Seventh International Conference on Machine Learning, Austin, TX, Morgan Kaufmann, San Mateo, CA, 1990, pp. 216-224.
    • (1990) Proceedings Seventh International Conference on Machine Learning , pp. 216-224
    • Sutton, R.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.