



Volume 1, 2003, Pages 680-685

Neural Q-learning control architectures for a wall-following behavior

Author keywords

[No Author keywords available]

Indexed keywords

CONVERGENCE OF NUMERICAL METHODS; DATA REDUCTION; LEARNING ALGORITHMS; NAVIGATION SYSTEMS; NEURAL NETWORKS; ROBOTICS; SENSORS

EID: 0348041519     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 6

References (12)
  • 1
    • 0030149709
    • Purposive behavior acquisition for a real robot by vision-based reinforcement learning
    • May/June
    • M. Asada, S. Noda, S. Tawaratsumida, and K. Hosoda. Purposive behavior acquisition for a real robot by vision-based reinforcement learning. Machine Learning, May/June 1996.
    • (1996) Machine Learning
    • Asada, M.1    Noda, S.2    Tawaratsumida, S.3    Hosoda, K.4
  • 3
    • 84942750244
    • Feedforward neural networks in reinforcement learning applied to high-dimensional motor control
    • Masayuki Numao, Nicolò Cesa-Bianchi, and Rüdiger Reischuk, editors, Springer
    • Rémi Coulom. Feedforward neural networks in reinforcement learning applied to high-dimensional motor control. In Masayuki Numao, Nicolò Cesa-Bianchi, and Rüdiger Reischuk, editors, Proceedings of the 13th International Conference on Algorithmic Learning Theory, pages 402-413. Springer, 2002.
    • (2002) Proceedings of the 13th International Conference on Algorithmic Learning Theory , pp. 402-413
    • Coulom, R.1
  • 5
    • 0030171602
    • Rapid, safe, and incremental learning of navigation strategies
    • J. del R. Millan. Rapid, safe, and incremental learning of navigation strategies. Sys. Man and Cybernetics, 26(3), 1996.
    • (1996) Sys. Man and Cybernetics , vol.26 , Issue.3
    • Millan, J.D.R.1
  • 6
    • 0000123778
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • L.J. Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8:293-321, 1992.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.J.1
  • 9
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • R. Sutton. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.1
  • 10
    • 0029276036
    • Temporal difference learning and TD-Gammon
    • Gerald Tesauro. Temporal difference learning and TD-Gammon. Communications of the ACM, 38(3):58-68, 1995.
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.