Volume 1747, 1999, Pages 417-428

Q-learning in continuous state and action spaces

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SCIENCE; COMPUTERS;

EID: 84957629024; PISSN: 03029743; EISSN: 16113349; Source Type: Book Series
DOI: 10.1007/3-540-46695-9_35; Document Type: Conference Paper
Times cited: 96

References (17)
  • 1
    • 0016556021
    • A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
    • [1] J. S. Albus. A new approach to manipulator control: the cerebellar model articulation controller (CMAC). J. Dynamic Systems, Measurement and Control, 97:220-227, 1975.
    • (1975) J. Dynamic Systems, Measurement and Control, vol. 97, pp. 220-227
    • Albus, J.S.
  • 2
    • 0003477315
    • Reinforcement learning with high-dimensional, continuous actions
    • Wright Laboratory
    • [2] Leemon C. Baird and A. Harry Klopf. Reinforcement learning with high-dimensional, continuous actions. Technical Report WL-TR-93-1147, Wright Laboratory, 1993.
    • (1993) Technical Report WL-TR-93-1147
    • Baird, L.C.; Klopf, A.H.
  • 3
    • 0001934093
    • An introduction to connectionist learning control systems
    • Van Nostrand Reinhold
    • [3] W. Baker and J. Farrell. An introduction to connectionist learning control systems. In D. A. White and D. A. Sofge, editors, Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches. Van Nostrand Reinhold, 1992.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
    • Baker, W.; Farrell, J.
  • 10
    • 0000123778
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • [10] Long-Ji Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning Journal, 8(3/4), 1992.
    • (1992) Machine Learning Journal, vol. 8, issue 3-4
    • Lin, L.-J.
  • 11
    • 0003677359
    • Problem solving with reinforcement learning
    • Cambridge University
    • [11] Gavin Adrian Rummery. Problem solving with reinforcement learning. PhD thesis, Cambridge University, 1995.
    • (1995) PhD Thesis
    • Rummery, G.A.
  • 12
    • 0031231885
    • Experiments with reinforcement learning in problems with continuous state and action spaces
    • [12] Juan C. Santamaria, Richard S. Sutton, and Ashwin Ram. Experiments with reinforcement learning in problems with continuous state and action spaces. Adaptive Behaviour, 6(2):163-218, 1998.
    • (1998) Adaptive Behaviour, vol. 6, issue 2, pp. 163-218
    • Santamaria, J.C.; Sutton, R.S.; Ram, A.
  • 13
    • 0003963062
    • Contribution to the study and design of reinforcement functions
    • Universidad de Buenos Aires, Universite d'Aix-Marseille 3
    • [13] Juan Miguel Santos. Contribution to the study and design of reinforcement functions. PhD thesis, Universidad de Buenos Aires, Universite d'Aix-Marseille 3, 1999.
    • (1999) PhD Thesis
    • Santos, J.M.
  • 15
    • 0031341345
    • Neural reinforcement learning for behaviour synthesis
    • [15] Claude F. Touzet. Neural reinforcement learning for behaviour synthesis. Robotics and Autonomous Systems, 22(3-4):251-81, 1997.
    • (1997) Robotics and Autonomous Systems, vol. 22, issue 3-4, pp. 251-281
    • Touzet, C.F.
  • 16
    • 84883071723
    • Learning from Delayed Rewards
    • University of Cambridge
    • [16] Christopher J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, University of Cambridge, 1989.
    • (1989) PhD Thesis
    • Watkins, C.J.C.H.
  • 17
    • 0002031779
    • Approximate dynamic programming for real-time control and neural modeling
    • Van Nostrand Reinhold
    • [17] Paul J. Werbos. Approximate dynamic programming for real-time control and neural modeling. In D. A. White and D. A. Sofge, editors, Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches. Van Nostrand Reinhold, 1992.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
    • Werbos, P.J.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.