Volume 6, Issue 3, 2005, Pages 285-293

An approach to tune fuzzy controllers based on reinforcement learning for autonomous vehicle control

Author keywords

Autonomous vehicles; Fuzzy controllers; Longitudinal control; Reinforcement learning

Indexed keywords

AUTONOMOUS VEHICLE CONTROL; FUZZY CONTROLLERS; REINFORCEMENT LEARNING

EID: 27744536933     PISSN: 15249050     EISSN: None     Source Type: Journal    
DOI: 10.1109/TITS.2005.853698     Document Type: Article
Times cited : (103)

References (36)
  • 3
    • 0027591320
    • Stable adaptive fuzzy control of nonlinear systems
    • May
    • _, "Stable adaptive fuzzy control of nonlinear systems," IEEE Trans. Fuzzy Syst., vol. 1, no. 2, pp. 146-155, May 1993.
    • (1993) IEEE Trans. Fuzzy Syst. , vol.1 , Issue.2 , pp. 146-155
  • 4
    • 0026852362
    • Reinforcement learning is direct adaptive optimal control
    • Apr.
    • R. S. Sutton, A. G. Barto, and R. J. Williams, "Reinforcement learning is direct adaptive optimal control," IEEE Control Syst. Mag., vol. 12, no. 2, pp. 19-22, Apr. 1992.
    • (1992) IEEE Control Syst. Mag. , vol.12 , Issue.2 , pp. 19-22
    • Sutton, R.S.1    Barto, A.G.2    Williams, R.J.3
  • 6
    • 0031211975
    • A self-learning fuzzy logic controller using genetic algorithms with reinforcements
    • Jun.
    • C.-K. Chiang, H.-Y. Chung, and J.-J. Lin, "A self-learning fuzzy logic controller using genetic algorithms with reinforcements," IEEE Trans. Fuzzy Syst., vol. 5, no. 3, pp. 460-467, Jun. 1997.
    • (1997) IEEE Trans. Fuzzy Syst. , vol.5 , Issue.3 , pp. 460-467
    • Chiang, C.-K.1    Chung, H.-Y.2    Lin, J.-J.3
  • 7
    • 0004049895
    • Ph.D. dissertation, Psychology Dept., Cambridge Univ., U.K.
    • C. J. C. H. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Psychology Dept., Cambridge Univ., U.K., 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.J.C.H.1
  • 8
    • 0028497289
    • The best approximation to C² function and its error bounds using regular-center Gaussian networks
    • Sep.
    • B. Liu and J. Si, "The best approximation to C² function and its error bounds using regular-center Gaussian networks," IEEE Trans. Neural Netw., vol. 5, no. 5, pp. 845-847, Sep. 1994.
    • (1994) IEEE Trans. Neural Netw. , vol.5 , Issue.5 , pp. 845-847
    • Liu, B.1    Si, J.2
  • 9
    • 0035026413
    • Direct adaptive longitudinal control of vehicle platoons
    • Jan.
    • D. Swaroop, J. K. Hedrick, and S. B. Choi, "Direct adaptive longitudinal control of vehicle platoons," IEEE Trans. Veh. Technol., vol. 50, no. 1, pp. 150-161, Jan. 2001.
    • (2001) IEEE Trans. Veh. Technol. , vol.50 , Issue.1 , pp. 150-161
    • Swaroop, D.1    Hedrick, J.K.2    Choi, S.B.3
  • 10
  • 11
    • 0001898381
    • Practical reinforcement learning in continuous spaces
    • Stanford, CA
    • W. D. Smart and L. P. Kaelbling, "Practical reinforcement learning in continuous spaces," in Proc. 17th Int. Conf. Machine Learning, Stanford, CA, 2000, pp. 903-910.
    • (2000) Proc. 17th Int. Conf. Machine Learning , pp. 903-910
    • Smart, W.D.1    Kaelbling, L.P.2
  • 12
    • 0001133021
    • Generalization in reinforcement learning: Safely approximating the value function
    • G. Tesauro, D. S. Touretzky, and T. K. Leen, Eds. Cambridge, MA: MIT Press
    • J. A. Boyan and A. W. Moore, "Generalization in reinforcement learning: Safely approximating the value function," in Advances in Neural Information Processing Systems 7, G. Tesauro, D. S. Touretzky, and T. K. Leen, Eds. Cambridge, MA: MIT Press, 1995.
    • (1995) Advances in Neural Information Processing Systems , vol.7
    • Boyan, J.A.1    Moore, A.W.2
  • 13
    • 0000985504
    • TD-Gammon, a self-teaching backgammon program, achieves master-level play
    • G. Tesauro, "TD-Gammon, a self-teaching backgammon program, achieves master-level play," Neural Comput., vol. 6, no. 2, pp. 215-219, 1994.
    • (1994) Neural Comput. , vol.6 , Issue.2 , pp. 215-219
    • Tesauro, G.1
  • 14
    • 0000683869
    • Gradient following without backpropagation in layered networks
    • San Diego, CA
    • A. G. Barto and M. I. Jordan, "Gradient following without backpropagation in layered networks," in Proc. IEEE 1st Annu. Conf. Neural Networks, San Diego, CA, 1987, pp. II629-II636.
    • (1987) Proc. IEEE 1st Annu. Conf. Neural Networks
    • Barto, A.G.1    Jordan, M.I.2
  • 15
    • 0030147547
    • Reinforcement learning for an ART-based fuzzy adaptive learning control network
    • May
    • C.-J. Lin and C.-T. Lin, "Reinforcement learning for an ART-based fuzzy adaptive learning control network," IEEE Trans. Neural Netw., vol. 7, no. 3, pp. 709-731, May 1996.
    • (1996) IEEE Trans. Neural Netw. , vol.7 , Issue.3 , pp. 709-731
    • Lin, C.-J.1    Lin, C.-T.2
  • 16
    • 0033280159
    • A reinforcement neuro-fuzzy combiner for multiobjective control
    • Dec.
    • C.-T. Lin and I.-F. Chung, "A reinforcement neuro-fuzzy combiner for multiobjective control," IEEE Trans. Syst., Man, Cybern. B, vol. 29, no. 6, pp. 726-744, Dec. 1999.
    • (1999) IEEE Trans. Syst., Man, Cybern. B , vol.29 , Issue.6 , pp. 726-744
    • Lin, C.-T.1    Chung, I.-F.2
  • 17
    • 0033685792
    • A new reinforcement learning vehicle control architecture for vision-based road following
    • May
    • S.-Y. Oh, J.-H. Lee, and D.-H. Choi, "A new reinforcement learning vehicle control architecture for vision-based road following," IEEE Trans. Veh. Technol., vol. 49, no. 3, pp. 997-1005, May 2000.
    • (2000) IEEE Trans. Veh. Technol. , vol.49 , Issue.3 , pp. 997-1005
    • Oh, S.-Y.1    Lee, J.-H.2    Choi, D.-H.3
  • 18
    • 0004049893
    • Ph.D. dissertation, Psychology Dept., Cambridge Univ., U.K.
    • C. J. C. H. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Psychology Dept., Cambridge Univ., U.K., 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.J.C.H.1
  • 19
    • 0028369322
    • Reinforcement structure/parameter learning for neural-network-based fuzzy logic control systems
    • Feb.
    • C.-T. Lin and C. S. G. Lee, "Reinforcement structure/parameter learning for neural-network-based fuzzy logic control systems," IEEE Trans. Fuzzy Syst., vol. 2, no. 1, pp. 46-63, Feb. 1994.
    • (1994) IEEE Trans. Fuzzy Syst. , vol.2 , Issue.1 , pp. 46-63
    • Lin, C.-T.1    Lee, C.S.G.2
  • 20
    • 0035273403
    • Online learning control by association and reinforcement
    • Mar.
    • J. Si and Y.-T. Wang, "Online learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
    • (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.-T.2
  • 21
    • 0033308175
    • Multiple state estimation reinforcement learning for driving model: Driver model of automobile
    • Tokyo, Japan
    • Y. Koike and K. Doya, "Multiple state estimation reinforcement learning for driving model: Driver model of automobile," in Proc. IEEE Int. Conf. Systems, Man, and Cybernetics (SMC), Tokyo, Japan, 1999, vol. 5, pp. 504-509.
    • (1999) Proc. IEEE Int. Conf. Systems, Man, and Cybernetics (SMC) , vol.5 , pp. 504-509
    • Koike, Y.1    Doya, K.2
  • 22
    • 0032140718
    • Fuzzy inference system learning by reinforcement methods
    • Aug.
    • L. Jouffe, "Fuzzy inference system learning by reinforcement methods," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 28, no. 3, pp. 338-355, Aug. 1998.
    • (1998) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev. , vol.28 , Issue.3 , pp. 338-355
    • Jouffe, L.1
  • 23
    • 0025600638
    • A stochastic reinforcement learning algorithm for learning real-valued functions
    • V. Gullapalli, "A stochastic reinforcement learning algorithm for learning real-valued functions," Neural Netw., vol. 3, no. 6, pp. 671-692, 1990.
    • (1990) Neural Netw. , vol.3 , Issue.6 , pp. 671-692
    • Gullapalli, V.1
  • 27
    • 0026114636
    • Automated vehicle control developments in the PATH program
    • Feb.
    • S. Shladover et al., "Automated vehicle control developments in the PATH program," IEEE Trans. Veh. Technol., vol. 40, no. 1, pp. 114-130, Feb. 1991.
    • (1991) IEEE Trans. Veh. Technol. , vol.40 , Issue.1 , pp. 114-130
    • Shladover, S.1
  • 28
    • 0026372729
    • The development of autonomously controlled vehicle (PVS)
    • Dearborn, MI
    • M. Taniguchi et al., "The development of autonomously controlled vehicle (PVS)," in Proc. Vehicle Navigation and Information Syst. Conf., Dearborn, MI, 1991, pp. 1137-1141.
    • (1991) Proc. Vehicle Navigation and Information Syst. Conf. , pp. 1137-1141
    • Taniguchi, M.1
  • 29
    • 84888911588
    • Connectionist-nonconnectionist fusion architecture for high speed road following
    • Sep.
    • D. H. Choi, S. Y. Oh, and K. Kim, "Connectionist-nonconnectionist fusion architecture for high speed road following," Neural Parallel Sci. Comput., vol. 4, no. 3, pp. 367-386, Sep. 1996.
    • (1996) Neural Parallel Sci. Comput. , vol.4 , Issue.3 , pp. 367-386
    • Choi, D.H.1    Oh, S.Y.2    Kim, K.3
  • 30
    • 0030414704
    • Fuzzy throttle and brake control for platoons of smart cars
    • Dec.
    • H. M. Kim, J. Dickerson, and B. Kosko, "Fuzzy throttle and brake control for platoons of smart cars," Fuzzy Sets Syst., vol. 84, no. 23, pp. 209-234, Dec. 1996.
    • (1996) Fuzzy Sets Syst. , vol.84 , Issue.23 , pp. 209-234
    • Kim, H.M.1    Dickerson, J.2    Kosko, B.3
  • 31
    • 0033339376
    • Use of neural fuzzy networks with mixed genetic/gradient algorithm in automated vehicle control
    • Dec.
    • S. Huang and W. Ren, "Use of neural fuzzy networks with mixed genetic/gradient algorithm in automated vehicle control," IEEE Trans. Ind. Electron., vol. 46, no. 6, pp. 1090-1102, Dec. 1999.
    • (1999) IEEE Trans. Ind. Electron. , vol.46 , Issue.6 , pp. 1090-1102
    • Huang, S.1    Ren, W.2
  • 32
    • 0035388695
    • An ANFIS controller for the car-following collision prevention system
    • Jul.
    • J. Mar and F.-J. Lin, "An ANFIS controller for the car-following collision prevention system," IEEE Trans. Veh. Technol., vol. 50, no. 4, pp. 1106-1113, Jul. 2001.
    • (2001) IEEE Trans. Veh. Technol. , vol.50 , Issue.4 , pp. 1106-1113
    • Mar, J.1    Lin, F.-J.2
  • 33
    • 0000380758
    • A transportable neural-network approach to autonomous vehicle following
    • May
    • N. Kehtarnavaz, N. Groswold, K. Miller, and P. Lascoe, "A transportable neural-network approach to autonomous vehicle following," IEEE Trans. Veh. Technol., vol. 47, no. 2, pp. 694-702, May 1998.
    • (1998) IEEE Trans. Veh. Technol. , vol.47 , Issue.2 , pp. 694-702
    • Kehtarnavaz, N.1    Groswold, N.2    Miller, K.3    Lascoe, P.4
  • 34
    • 0030685520
    • Fuzzy Q-learning for autonomous robot systems
    • Houston, TX
    • I. H. Sun, J. H. Kim, and F. C.-H. Rhee, "Fuzzy Q-learning for autonomous robot systems," in Proc. Int. Conf. Neural Networks, Houston, TX, 1997, vol. 3, pp. 1738-1743.
    • (1997) Proc. Int. Conf. Neural Networks , vol.3 , pp. 1738-1743
    • Sun, I.H.1    Kim, J.H.2    Rhee, F.C.-H.3
  • 36
    • 84898958374
    • Gradient descent for general reinforcement learning
    • Cambridge, MA: MIT Press
    • L. Baird and A. Moore, "Gradient descent for general reinforcement learning," in Advances in Neural Information Processing Systems 11. Cambridge, MA: MIT Press, 1999.
    • (1999) Advances in Neural Information Processing Systems , vol.11
    • Baird, L.1    Moore, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.