메뉴 건너뛰기




Volumn , Issue , 2009, Pages

Reinforcement learning control of robot manipulators in uncertain environments

Author keywords

[No Author keywords available]

Indexed keywords

AVERAGE ERRORS; CONTROL APPROACH; CONTROL EFFORT; DYNAMIC FUZZY Q-LEARNING; FUNCTION APPROXIMATORS; FUZZY-Q-LEARNING; MAXIMUM ERROR; PARAMETER VARIATION; PARAMETERIZED; REINFORCEMENT LEARNING CONTROL; ROBOT MANIPULATOR; ROBUST TRACKING; ROBUSTNESS PROPERTIES; SIMULATION RESULT; STABLE CONTROLLERS; UNCERTAIN ENVIRONMENTS;

EID: 67650330019     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICIT.2009.4939504     Document Type: Conference Paper
Times cited : (10)

References (14)
  • 2
    • 0026852362 scopus 로고
    • Reinforcement learning is direct adaptive optimal control
    • R. S. Sutton, A. G. Barto, and R. J. Williams, "Reinforcement learning is direct adaptive optimal control," IEEE Control Syst. Mag., vol. 12, no. 2, 1992, pp. 19-22.
    • (1992) IEEE Control Syst. Mag , vol.12 , Issue.2 , pp. 19-22
    • Sutton, R.S.1    Barto, A.G.2    Williams, R.J.3
  • 3
    • 0004049893 scopus 로고
    • Learning from delayed rewards
    • University of Cambridge, England
    • C. H. Watkins, "Learning from delayed rewards," Thesis, University of Cambridge, England, 1989.
    • (1989) Thesis
    • Watkins, C.H.1
  • 5
    • 1442302546 scopus 로고    scopus 로고
    • Reinforcement learning with decision tree
    • Applied Informatics, Austria
    • L. D. Pyeatt, "Reinforcement learning with decision tree," Proc. 21st IASTED Int. Conf., Applied Informatics, Austria, 2003, pp. 26-31.
    • (2003) Proc. 21st IASTED Int. Conf , pp. 26-31
    • Pyeatt, L.D.1
  • 6
    • 47149095559 scopus 로고    scopus 로고
    • Value approximation with least square support vector machines in reinforcement learning system
    • X. Wang, X. Tian, and Y. Cheng, "Value approximation with least square support vector machines in reinforcement learning system," Journal of Computational and Theoretical Nanoscience, vol. 4, no. 7-8, 2007, pp. 1290-1294.
    • (2007) Journal of Computational and Theoretical Nanoscience , vol.4 , Issue.7-8 , pp. 1290-1294
    • Wang, X.1    Tian, X.2    Cheng, Y.3
  • 7
    • 0032140718 scopus 로고    scopus 로고
    • Fuzzy inference system learning by reinforcement methods
    • L. Jouffe, "Fuzzy inference system learning by reinforcement methods," IEEE Trans. Syst., Man, and Cybernetics, Part C, vol. 28, no. 3, 1998, pp. 338-355.
    • (1998) IEEE Trans. Syst., Man, and Cybernetics, Part C , vol.28 , Issue.3 , pp. 338-355
    • Jouffe, L.1
  • 9
    • 40549092708 scopus 로고    scopus 로고
    • A Markov game-adaptive fuzzy controller for robot manipulators
    • R. Sharma, and M. Gopal, "A Markov game-adaptive fuzzy controller for robot manipulators," IEEE Trans. on Fuzzy Systems, vol. 16, no.1, 2007, pp. 171-186.
    • (2007) IEEE Trans. on Fuzzy Systems , vol.16 , Issue.1 , pp. 171-186
    • Sharma, R.1    Gopal, M.2
  • 10
    • 2942574444 scopus 로고    scopus 로고
    • Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning
    • Part B
    • M. J. Er, and C. Deng, "Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning," IEEE Trans on Systems, Man, and Cybernetics, Part B, vol. 34, no. 3, 2004, pp. 1478-1489.
    • (2004) IEEE Trans on Systems, Man, and Cybernetics , vol.34 , Issue.3 , pp. 1478-1489
    • Er, M.J.1    Deng, C.2
  • 12
    • 67650295105 scopus 로고    scopus 로고
    • W. T. B. Uther, and M. M. Veloso, Tree based discretization for continuous state space reinforcement learning, In proc. of 16th national conference on Artificial Intelligence (AAAI-98), Madison, 1998.
    • W. T. B. Uther, and M. M. Veloso, "Tree based discretization for continuous state space reinforcement learning," In proc. of 16th national conference on Artificial Intelligence (AAAI-98), Madison, 1998.
  • 14
    • 0032638628 scopus 로고    scopus 로고
    • Least square support vector machine classifiers
    • J. A. K. Suykens, and J. Vandewalle, "Least square support vector machine classifiers", Neural processing letters, vol. 9, 1999, pp. 293-300.
    • (1999) Neural processing letters , vol.9 , pp. 293-300
    • Suykens, J.A.K.1    Vandewalle, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.