Volume 1, 2004, Pages 48-52

Mobile robot navigation using neural Q-Learning

Author keywords

Markov decision; Mobile robot; Neural network; Q learning; Value function approximation

Indexed keywords

INFRARED SENSORS; Q-LEARNING; ROBOT NAVIGATION; VALUE FUNCTION-APPROXIMATION

EID: 6344250977     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 30

References (12)
  • 5
    • 0030149709
    • Purposive behavior acquisition for a real robot by vision-based reinforcement learning
    • Asada M., Noda S., and Hosoda K., "Purposive behavior acquisition for a real robot by vision-based reinforcement learning", Machine Learning, 23, 163-187, 1996.
    • (1996) Machine Learning , vol.23 , pp. 163-187
    • Asada, M.; Noda, S.; Hosoda, K.
  • 7
    • 6344290932
    • Reinforcement learning for the behavior navigation of a mobile robot
    • Beijing, China
    • th IFAC World Congress, Beijing, China, 157-162, 1999.
    • (1999) th IFAC World Congress , pp. 157-162
    • Zalama, E.
  • 10
    • 0031143730
    • An analysis of temporal difference learning with function approximation
    • Tsitsiklis J., and Van Roy B., "An analysis of temporal difference learning with function approximation", IEEE Trans. on Automatic Control, 42(5), 674-690, 1997.
    • (1997) IEEE Trans. on Automatic Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.; Van Roy, B.
  • 12
    • 0024861871
    • Approximation by superpositions of a sigmoidal function
    • Cybenko G., "Approximation by superpositions of a sigmoidal function", Mathematics of Control, Signals, and Systems, 2, 303-314, 1989.
    • (1989) Mathematics of Control, Signals, and Systems , vol.2 , pp. 303-314
    • Cybenko, G.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.