Volume 2684, 2003, Pages 179-200

Forward and bidirectional planning based on reinforcement learning and neural networks in a simulated robot.

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; DYNAMIC PROGRAMMING; LEARNING SYSTEMS; MULTI AGENT SYSTEMS; NEURAL NETWORKS; NUMERICAL METHODS; REINFORCEMENT; ARTIFICIAL INTELLIGENCE; CHAINS; FORECASTING; INTELLIGENT BUILDINGS; INTELLIGENT SYSTEMS; PLANNING; ROBOT PROGRAMMING; STOCHASTIC SYSTEMS;

EID: 23144433803     PISSN: 03029743     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1007/978-3-540-45002-3_11     Document Type: Conference Paper
Times cited: 21

References (15)
  • 3
    • 23144449644
    • Planning with neural networks and reinforcement learning
    • University of Essex Ph.D. Thesis
    • Baldassarre, G.: Planning with neural networks and reinforcement learning. Computer Science Department, University of Essex (2002) Ph.D. Thesis.
    • (2002) Computer Science Department
    • Baldassarre, G.1
  • 4
    • 0842276977
    • A modular neural-network model of the basal ganglia's role in learning and selecting motor behaviors
    • Baldassarre G.: A modular neural-network model of the basal ganglia's role in learning and selecting motor behaviors. Cognitive Systems Research. 3 (2002) 5-13.
    • (2002) Cognitive Systems Research , vol.3 , pp. 5-13
    • Baldassarre, G.1
  • 6
  • 8
    • 2842560201
    • STRIPS: A new approach to the application of theorem proving to problem solving
    • Fikes R.E., Nilsson N.J.: STRIPS: a new approach to the application of theorem proving to problem solving. Artificial Intelligence. 2 (1971) 189-208.
    • (1971) Artificial Intelligence , vol.2 , pp. 189-208
    • Fikes, R.E.1    Nilsson, N.J.2
  • 10
    • 85153938292
    • Reinforcement learning algorithm for partially observable Markov decision problems
    • In: Tesauro G., Touretzky D.S., Leen T.K. (eds.), The MIT Press, Cambridge Mass.
    • Jaakkola T., Singh S.P., Jordan M.I.: Reinforcement learning algorithm for partially observable Markov decision problems. In: Tesauro G., Touretzky D.S., Leen T.K. (eds.): Advances in Neural Information Processing Systems 7. The MIT Press, Cambridge Mass. (1995) 345-352.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 345-352
    • Jaakkola, T.1    Singh, S.P.2    Jordan, M.I.3
  • 11
    • 0020068152
    • Self-organized formation of topologically correct feature maps
    • Kohonen T.: Self-organized formation of topologically correct feature maps. Biological Cybernetics. 43 (1982) 59-69.
    • (1982) Biological Cybernetics , vol.43 , pp. 59-69
    • Kohonen, T.1
  • 12
    • 0006972915
    • Optimal path finding algorithms
    • In: Kanal L.N., Kumar V. (eds.), Springer-Verlag, Berlin
    • Korf, R.E.: Optimal path finding algorithms. In: Kanal L.N., Kumar V. (eds.): Search in Artificial Intelligence. Springer-Verlag, Berlin (1988) 223-267.
    • (1988) Search in Artificial Intelligence , pp. 223-267
    • Korf, R.E.1
  • 14
    • 0000123778
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • Lin, L.J.: Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning. 8 (1992) 293-321.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.