메뉴 건너뛰기




Volumn 2, Issue , 2004, Pages 980-987

Unifying temporal and structural credit assignment problems

Author keywords

[No Author keywords available]

Indexed keywords

Q-LEARNING; SINGLE AGENT SYSTEMS; SINGLE AGENTS; STRUCTURAL CREDIT ASSIGNMENT;

EID: 4544292959     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (77)

References (16)
  • 2
    • 85156187730 scopus 로고    scopus 로고
    • Improving elevator performance using reinforcement learning
    • D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, MIT Press
    • R. H. Crites and A. G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems - 8, pages 1017-1023. MIT Press, 1996.
    • (1996) Advances in Neural Information Processing Systems - 8 , pp. 1017-1023
    • Crites, R.H.1    Barto, A.G.2
  • 7
    • 0034205975 scopus 로고    scopus 로고
    • Multiagent systems: A survey from a machine learning perspective
    • P. Stone and M. Veloso. Multiagent systems: A survey from a machine learning perspective. Autonomous Robots, 8(3), 2000.
    • (2000) Autonomous Robots , vol.8 , Issue.3
    • Stone, P.1    Veloso, M.2
  • 8
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • R. S. Sutton. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 10
    • 0001046225 scopus 로고
    • Practical issues in temporal difference learning
    • G. Tesauro. Practical issues in temporal difference learning. Machine Learning, 8:33-53, 1992.
    • (1992) Machine Learning , vol.8 , pp. 33-53
    • Tesauro, G.1
  • 16
    • 0034635650 scopus 로고    scopus 로고
    • Collective intelligence for control of distributed dynamical systems
    • March
    • D. H. Wolpert, K. Wheeler, and K. Turner. Collective intelligence for control of distributed dynamical systems. Europhysics Letters, 49(6), March 2000.
    • (2000) Europhysics Letters , vol.49 , Issue.6
    • Wolpert, D.H.1    Wheeler, K.2    Turner, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.