Volume 4, 2016, Pages 2439-2449

Gait Balance and Acceleration of a Biped Robot Based on Q-Learning

Author keywords

Biped Robot; Continuous Action Space; Reinforcement Learning; Zero Moment Point

Indexed keywords

MEMORY ARCHITECTURE; ROBOTICS; ROBOTS

EID: 84979846290     PISSN: None     EISSN: 2169-3536     Source Type: Journal
DOI: 10.1109/ACCESS.2016.2570255     Document Type: Article
Times cited: 45

References (24)
  • 1
    • 21244464782
    • Reinforcement learning method-based stable gait synthesis for biped robot
    • H. Lingyun and S. Zengqi, "Reinforcement learning method-based stable gait synthesis for biped robot," in Proc. 8th Int. Conf. Control, Autom., Robot. Vis., vol. 2. Dec. 2004, pp. 1017-1022.
    • (2004) Proc. 8th Int. Conf. Control , vol.2 , pp. 1017-1022
    • Lingyun, H.1    Zengqi, S.2
  • 2
    • 84969858853
    • Learning to adjust and refine gait patterns for a biped robot
    • K.-S. Hwang, J.-L. Lin, and K.-H. Yeh, "Learning to adjust and refine gait patterns for a biped robot," IEEE Trans. Syst., Man, Cybern., Syst., vol. 45, no. 12, pp. 1481-1490, Dec. 2015.
    • (2015) IEEE Trans. Syst., Man, Cybern., Syst , vol.45 , Issue.12 , pp. 1481-1490
    • Hwang, K.-S.1    Lin, J.-L.2    Yeh, K.-H.3
  • 4
    • 79957534331
    • Walking motion generation, synthesis, and control for biped robot by using PGRL, LPI, and fuzzy logic
    • T.-H. S. Li, Y.-T. Su, S.-W. Lai, and J.-J. Hu, "Walking motion generation, synthesis, and control for biped robot by using PGRL, LPI, and fuzzy logic," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 41, no. 3, pp. 736-748, Jun. 2011.
    • (2011) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.41 , Issue.3 , pp. 736-748
    • Li, T.-H.S.1    Su, Y.-T.2    Lai, S.-W.3    Hu, J.-J.4
  • 6
    • 78951475737
    • Human-like walking: Optimal motion of a bipedal robot with toe-rotation motion
    • D. Tlalolini, C. Chevallereau, and Y. Aoustin, "Human-like walking: Optimal motion of a bipedal robot with toe-rotation motion," IEEE/ASME Trans. Mechatronics, vol. 16, no. 2, pp. 310-320, Apr. 2011.
    • (2011) IEEE/ASME Trans. Mechatronics , vol.16 , Issue.2 , pp. 310-320
    • Tlalolini, D.1    Chevallereau, C.2    Aoustin, Y.3
  • 7
    • 34250743596
    • Fuzzy posture control for biped walking robot based on force sensor for ZMP
    • K.-C. Choi, H.-J. Lee, and M. C. Lee, "Fuzzy posture control for biped walking robot based on force sensor for ZMP," in Proc. SICE-ICASE Int. Joint Conf., Oct. 2006, pp. 1185-1189.
    • (2006) Proc. SICE-ICASE Int. Joint Conf , pp. 1185-1189
    • Choi, K.-C.1    Lee, H.-J.2    Lee, M.C.3
  • 10
    • 84861590200
    • Gyroscope integrated environmental mode compliance control for biped robot
    • T. Sato, H. Ono, and K. Ohnishi, "Gyroscope integrated environmental mode compliance control for biped robot," in Proc. IEEE Int. Workshop Adv. Motion Control, Mar. 2012, pp. 1-6.
    • (2012) Proc. IEEE Int. Workshop Adv. Motion Control , pp. 1-6
    • Sato, T.1    Ono, H.2    Ohnishi, K.3
  • 12
    • 84901253272
    • Postural balance strategies in response to disturbances in the frontal plane and their implementation with a humanoid robot
    • Y. Yoshida, K. Takeuchi, Y. Miyamoto, D. Sato, and D. Nenchev, "Postural balance strategies in response to disturbances in the frontal plane and their implementation with a humanoid robot," IEEE Trans. Syst., Man, Cybern., Syst., vol. 44, no. 6, pp. 692-704, Jun. 2014.
    • (2014) IEEE Trans. Syst., Man, Cybern., Syst , vol.44 , Issue.6 , pp. 692-704
    • Yoshida, Y.1    Takeuchi, K.2    Miyamoto, Y.3    Sato, D.4    Nenchev, D.5
  • 14
    • 84961081359
    • A biped gait learning algorithm for humanoid robots based on environmental impact assessed artificial bee colony
    • T.-H. S. Li, P.-H. Kuo, Y.-F. Ho, M.-C. Kao, and L.-H. Tai, "A biped gait learning algorithm for humanoid robots based on environmental impact assessed artificial bee colony," IEEE Access, vol. 3, pp. 13-26, 2015.
    • (2015) IEEE Access , vol.3 , pp. 13-26
    • Li, T.-H.S.1    Kuo, P.-H.2    Ho, Y.-F.3    Kao, M.-C.4    Tai, L.-H.5
  • 15
    • 0025600638
    • A stochastic reinforcement learning algorithm for learning real-valued functions
    • V. Gullapalli, "A stochastic reinforcement learning algorithm for learning real-valued functions," Neural Netw., vol. 3, no. 6, pp. 671-692, 1990.
    • (1990) Neural Netw , vol.3 , Issue.6 , pp. 671-692
    • Gullapalli, V.1
  • 16
    • 0026376960
    • Associative reinforcement learning of real-valued functions
    • V. Gullapalli, "Associative reinforcement learning of real-valued functions," in Proc. IEEE Int. Conf. Syst., Man, Cybern., vol. 3. Charlottesville, VA, USA, Oct. 1991, pp. 1453-1458.
    • (1991) Proc. IEEE Int. Conf. Syst., Man, Cybern , vol.3 , pp. 1453-1458
    • Gullapalli, V.1
  • 17
    • 0004049893
    • Learning from delayed rewards
    • C. J. C. H. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Dept. Psychol., Univ. Cambridge, Cambridge, U.K., 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.J.C.H.1
  • 18
    • 34249833101
    • Q-learning
    • C. J. C. H. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, no. 3, pp. 279-292, May 1992.
    • (1992) Mach. Learn , vol.8 , Issue.3 , pp. 279-292
    • Watkins, C.J.C.H.1    Dayan, P.2
  • 19
    • 33745838668
    • Q-learning for robot
    • C. F. Touzet, "Q-learning for robot," in The Handbook of Brain Theory and Neural Networks, M. A. Arbib, Ed. Cambridge, MA, USA: MIT Press, 2003, pp. 934-937.
    • (2003) The Handbook of Brain Theory and Neural Networks , pp. 934-937
    • Touzet, C.F.1
  • 20
    • 79953906172
    • Self-organizing state aggregation for architecture design of Q-learning
    • K.-S. Hwang, H.-Y. Lin, Y.-P. Hsu, and H.-H. Yu, "Self-organizing state aggregation for architecture design of Q-learning," Inf. Sci., vol. 181, no. 13, pp. 2813-2822, Jul. 2011.
    • (2011) Inf. Sci , vol.181 , Issue.13 , pp. 2813-2822
    • Hwang, K.-S.1    Lin, H.-Y.2    Hsu, Y.-P.3    Yu, H.-H.4
  • 21
    • 0031636218
    • Tree based discretization for continuous state space reinforcement learning
    • W. T. B. Uther and M. M. Veloso, "Tree based discretization for continuous state space reinforcement learning," in Proc. 15th Nat. Conf. Artif. Intell. (AAAI), Madison, WI, USA, 1998, pp. 769-774.
    • (1998) Proc. 15th Nat. Conf. Artif. Intell. (AAAI) , pp. 769-774
    • Uther, W.T.B.1    Veloso, M.M.2
  • 23
    • 84979834555
    • Biped balance control by reinforcement learning
    • K.-S. Hwang, J.-L. Lin, and J.-S. Li, "Biped balance control by reinforcement learning," J. Inf. Sci. Eng., to be published.
    • J. Inf. Sci. Eng
    • Hwang, K.-S.1    Lin, J.-L.2    Li, J.-S.3
  • 24
    • 84979812602
    • Biped Robot Walk and Experience Transfer Demo
    • IRIS. (Feb. 11, 2014). Biped Robot Walk and Experience Transfer Demo. IRIS Lab., National Sun Yat-sen University, Kaohsiung, Taiwan, accessed Nov. 16, 2015. [Online]. Available: https://www.youtube.com/watch?v=mVahCHBFWyo
    • (2014) Biped Robot Walk and Experience Transfer Demo


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.