메뉴 건너뛰기




Volumn 1, Issue , 2003, Pages 430-435

A Robot that Reinforcement-Learns to Identify and Memorize Important Previous Observations

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER NETWORKS; EXTRAPOLATION; INFORMATION RETRIEVAL; LEARNING ALGORITHMS; MARKOV PROCESSES; NEURAL NETWORKS; ROBOT LEARNING; SENSOR DATA FUSION; TRAJECTORIES; VECTORS;

EID: 0346149797     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (49)

References (12)
  • 1
    • 84899015857 scopus 로고    scopus 로고
    • Reinforcement learning with Long Short-Term Memory
    • B. Bakker. Reinforcement learning with Long Short-Term Memory. In NIPS 14. 2002.
    • (2002) NIPS , vol.14
    • Bakker, B.1
  • 2
    • 0346242070 scopus 로고    scopus 로고
    • Reinforcement learning in partially observable mobile robot domains using unsupervised event extraction
    • B. Bakker, F. Linåker, and J. Schmidhuber. Reinforcement learning in partially observable mobile robot domains using unsupervised event extraction. In Proc. IROS'02, 2002.
    • (2002) Proc. IROS'02
    • Bakker, B.1    Linåker, F.2    Schmidhuber, J.3
  • 6
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning, and teaching
    • L.-J. Lin. Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning, 8:293-321, 1992.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.-J.1
  • 8
    • 84880884411 scopus 로고    scopus 로고
    • Mobile robot learning of delayed response tasks through event extraction: A solution to the road sign problem and beyond
    • F. Linåker and H. Jacobsson. Mobile robot learning of delayed response tasks through event extraction: A solution to the road sign problem and beyond. In Proc. IJCAI'2001, 2001.
    • (2001) Proc. IJCAI'2001
    • Linåker, F.1    Jacobsson, H.2
  • 11
    • 0034293945 scopus 로고    scopus 로고
    • Embedding connectionist autonomous agents in time: The road sign problem
    • R. Rylatt and C. Czamecki. Embedding connectionist autonomous agents in time: The road sign problem. Neural Processing Letters, 12:145-158, 2000.
    • (2000) Neural Processing Letters , vol.12 , pp. 145-158
    • Rylatt, R.1    Czamecki, C.2
  • 12
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Richard S. Sutton. Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proc. ICML 7, 1990.
    • (1990) Proc. ICML , vol.7
    • Sutton, R.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.