메뉴 건너뛰기




Volumn 3809 LNAI, Issue , 2005, Pages 164-175

Structural abstraction experiments in reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SCIENCE; HIERARCHICAL SYSTEMS; MULTI AGENT SYSTEMS; PROBLEM SOLVING; SOFTWARE AGENTS; STORAGE ALLOCATION (COMPUTER);

EID: 33745586802     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11589990_19     Document Type: Conference Paper
Times cited : (18)

References (16)
  • 2
    • 0030854548 scopus 로고    scopus 로고
    • Trading spaces: Computation, representation, and the limits of uninformed learning
    • Clark, A., Thornton, C.: Trading spaces: Computation, representation, and the limits of uninformed learning. Behavioral and Brain Sciences 20 (1997) 57-66
    • (1997) Behavioral and Brain Sciences , vol.20 , pp. 57-66
    • Clark, A.1    Thornton, C.2
  • 5
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, T.G.: Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research 13 (2000) 227-303
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 8
    • 84956854078 scopus 로고    scopus 로고
    • Model minimization in hierarchical reinforcement learning
    • Fifth Symposium on Abstraction, Reformulation and Approximation (SARA 2002) Springer Verlag
    • Ravindran, B., Barto, A.G.: Model minimization in hierarchical reinforcement learning. In: Fifth Symposium on Abstraction, Reformulation and Approximation (SARA 2002). LNCS, Springer Verlag (2002) 196-211
    • (2002) LNCS , pp. 196-211
    • Ravindran, B.1    Barto, A.G.2
  • 9
    • 0031370386 scopus 로고    scopus 로고
    • Model minimization in markov decision processes
    • Dean, T., Givan, R.: Model minimization in markov decision processes. In: AAAI/IAAI. (1997) 106-111
    • (1997) AAAI/IAAI , pp. 106-111
    • Dean, T.1    Givan, R.2
  • 10
    • 0034272032 scopus 로고    scopus 로고
    • Bounded-parameter markov decision processes
    • Givan, R., Leach, S.M., Dean, T.: Bounded-parameter markov decision processes. Artificial Intelligence 122 (2000) 71-109
    • (2000) Artificial Intelligence , vol.122 , pp. 71-109
    • Givan, R.1    Leach, S.M.2    Dean, T.3
  • 11
    • 0032208335 scopus 로고    scopus 로고
    • Elevator group control using multiple reinforcement learning agents
    • Crites, R.H., Barto, A.G.: Elevator group control using multiple reinforcement learning agents. Machine Learning 33 (1998) 235-262
    • (1998) Machine Learning , vol.33 , pp. 235-262
    • Crites, R.H.1    Barto, A.G.2
  • 12
    • 0004320981 scopus 로고    scopus 로고
    • An introduction to collective intelligence
    • NASA Ames Research Center, CA
    • Wolpert, D., Turner, K.: An introduction to collective intelligence. Technical Report NASA-ARC-IC-99-63, NASA Ames Research Center, CA (1999)
    • (1999) Technical Report , vol.NASA-ARC-IC-99-63
    • Wolpert, D.1    Turner, K.2
  • 13
    • 34250513249 scopus 로고
    • Über ein Paradoxon der Verkehrsplanung
    • Braess, D.: Über ein Paradoxon der Verkehrsplanung. Unternehmensforschung 12 (1968) 258-268
    • (1968) Unternehmensforschung , vol.12 , pp. 258-268
    • Braess, D.1
  • 14
    • 85156265058 scopus 로고    scopus 로고
    • Learning to take concurrent actions
    • Rohanimanesh, K., Mahadevan, S.: Learning to take concurrent actions. In: NIPS. (2002) 1619-1626
    • (2002) NIPS , pp. 1619-1626
    • Rohanimanesh, K.1    Mahadevan, S.2
  • 15
    • 0013465036 scopus 로고    scopus 로고
    • Discovering hierarchy in reinforcement learning with HEXQ
    • Sammut, C., Hoffmann, A., eds., Morgan-Kaufman
    • Hengst, B.: Discovering hierarchy in reinforcement learning with HEXQ. In Sammut, C., Hoffmann, A., eds.: Proceedings of the Nineteenth International Conference on Machine Learning, Morgan-Kaufman (2002) 243-250
    • (2002) Proceedings of the Nineteenth International Conference on Machine Learning , pp. 243-250
    • Hengst, B.1
  • 16
    • 85143168613 scopus 로고
    • Hierarchical learning in stochastic domains: Preliminary results
    • San Mateo, CA, Morgan Kaufmann
    • Kaelbling, L.P.: Hierarchical learning in stochastic domains: Preliminary results. In: Machine Learning Proceedings of the Tenth International Conference, San Mateo, CA, Morgan Kaufmann (1993) 167-173
    • (1993) Machine Learning Proceedings of the Tenth International Conference , pp. 167-173
    • Kaelbling, L.P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.