메뉴 건너뛰기




Volumn 10, Issue 4, 1998, Pages 289-294

TD Learning with Neural Networks

Author keywords

Environmental change; Multiple outputs; Neural networks; TD learning

Indexed keywords

ARTIFICIAL INTELLIGENCE; VECTORS;

EID: 0342407739     PISSN: 09153942     EISSN: 18838049     Source Type: Journal    
DOI: 10.20965/jrm.1998.p0289     Document Type: Article
Times cited : (2)

References (10)
  • 1
    • 0024750455 scopus 로고
    • Neural Network Architectures for Robot Applications
    • S.Y. Kung and JN. Hwang. "Neural Network Architectures for Robot Applications," IEEE Trans. Robotics and Automation, 5, 641-657, (1989).
    • (1989) IEEE Trans. Robotics and Automation , vol.5 , pp. 641-657
    • Kung, S.Y.1    Hwang, JN.2
  • 2
    • 0003948559 scopus 로고
    • A Real-Time Learning Neural Robot Controller
    • North-Holland
    • P. Smags and B.J. Krose, "A Real-Time Learning Neural Robot Controller," Artificial Neural Networks, North-Holland, 351-356, (1991).
    • (1991) Artificial Neural Networks , pp. 351-356
    • Smags, P.1    Krose, B.J.2
  • 3
    • 0031269483 scopus 로고    scopus 로고
    • Neural-Network-Based Robust Fault Diagnosis in Robotic Systems
    • A.T. Vemuri and M.M. Polycarpou, "Neural-Network-Based Robust Fault Diagnosis in Robotic Systems," IEEE Trans. Neural Networks, 8, 1410-1420, (1997).
    • (1997) IEEE Trans. Neural Networks , vol.8 , pp. 1410-1420
    • Vemuri, A.T.1    Polycarpou, M.M.2
  • 4
    • 85165970657 scopus 로고
    • Application of neural technology to robots
    • Babu, Ozawa, "Application of neural technology to robots," J. of RSJ, 11, 341-362, (1992).
    • (1992) J. of RSJ , vol.11 , pp. 341-362
    • Babu, Ozawa1
  • 5
    • 33847202724 scopus 로고
    • Learning to Predict by the Methods of Temporal Differences
    • R.S. Sution, "Learning to Predict by the Methods of Temporal Differences," Machine Learning, 3. 9-44, (1988).
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sution, R.S.1
  • 6
    • 0029276036 scopus 로고
    • Temporal Difference Learning and TD - Gammon
    • G. Tesauro, "Temporal Difference Learning and TD - Gammon," Communications of the ACM, 38, 58-68, (1995).
    • (1995) Communications of the ACM , vol.38 , pp. 58-68
    • Tesauro, G.1
  • 7
    • 0000430514 scopus 로고
    • The Convergence of TD(2) for General 2
    • P. Dayan, "The Convergence of TD(2) for General 2," Machine Learning. 8, 341-362, (1992).
    • (1992) Machine Learning , vol.8 , pp. 341-362
    • Dayan, P.1
  • 8
    • 0028388685 scopus 로고
    • TD(2) converges with Probability 1
    • P. Dayan and T1 Sejnowski, "TD(2) converges with Probability 1," Machine Leaming 14, 295-301, (1994).
    • (1994) Machine Leaming , vol.14 , pp. 295-301
    • Dayan, P.1    Sejnowski, T12


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.