SCOPUS 정보 검색 플랫폼

IEEE Transactions on Neural Networks

Volumn 17, Issue 6, 2006, Pages 1511-1531

Neural networks for continuous online learning and control

(3) Choy, Min Chee a Srinivasan, Dipti a Cheu, Ruey Long a

a NATIONAL UNIVERSITY OF SINGAPORE (Singapore)

Author keywords

Distributed control; Hybrid model; Neural control; Online learning; Traffic signal control

Indexed keywords

COMPUTER SIMULATION; EVOLUTIONARY ALGORITHMS; LEARNING ALGORITHMS; LEARNING SYSTEMS; MATHEMATICAL MODELS; ONLINE SYSTEMS;

HYBRID MODEL; NEURAL CONTROL; ONLINE LEARNING; TRAFFIC SIGNAL CONTROL;

NEURAL NETWORKS;

ALGORITHM; ARTICLE; ARTIFICIAL INTELLIGENCE; ARTIFICIAL NEURAL NETWORK; AUTOMATED PATTERN RECOGNITION; INFORMATION RETRIEVAL; METHODOLOGY; SIGNAL PROCESSING; TRAFFIC AND TRANSPORT;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; INFORMATION STORAGE AND RETRIEVAL; NEURAL NETWORKS (COMPUTER); PATTERN RECOGNITION, AUTOMATED; SIGNAL PROCESSING, COMPUTER-ASSISTED; TRANSPORTATION;

EID: 34447547289 PISSN: 10459227 EISSN: None Source Type: Journal
DOI: 10.1109/TNN.2006.881710 Document Type: Article

Times cited : (65)

References (32)

1
- 0037288370
- "Recent advances in hierarchical reinforcement learning"
- A. G. Barto and S. Mahadevan, "Recent advances in hierarchical reinforcement learning," Special Issue Reinforcement Learn., Discrete Event Syst. J., vol. 13, pp. 41-77, 2003.
- (2003) Special Issue Reinforcement Learn., Discrete Event Syst. J. , vol.13 , pp. 41-77
- Barto, A.G.¹ Mahadevan, S.²

2
- 0012257655
- "Near-optimal reinforcement learning in polynomial time"
- M. Kearns and S. Singh, "Near-optimal reinforcement learning in polynomial time," in Proc. Int. Conf. Mach. Learn., 1999, pp. 260-268.
- (1999) Proc. Int. Conf. Mach. Learn. , pp. 260-268
- Kearns, M.¹ Singh, S.²

3
- 0001961616
- "A generalized reinforcement learning model: Convergence and applications"
- M. Littman and C. Szepesvari, "A generalized reinforcement learning model: Convergence and applications," in Proc. 13th Int. Conf. Mach. Learn., 1996, pp. 310-318.
- (1996) Proc. 13th Int. Conf. Mach. Learn. , pp. 310-318
- Littman, M.¹ Szepesvari, C.²

4
- 0003989214
- "Hierarchical control and learning for Markov decision processes"
- Ph.D. dissertation, Univ. California Berkeley, Berkeley, CA
- R. E. Parr, "Hierarchical control and learning for Markov decision processes," Ph.D. dissertation, Univ. California Berkeley, Berkeley, CA, 1998.
- (1998)
- Parr, R.E.¹

5
- 0003636089
- Online Q-learning using connectionist system
- Cambridge Univ., Eng. Dept., Tech. Rep. CUED/F-INFENG/TR 166
- G. Rummery and M. Niranjan, Online Q-learning using connectionist system Cambridge Univ., Eng. Dept., Tech. Rep. CUED/F-INFENG/TR 166, 1994.
- (1994)
- Rummery, G.¹ Niranjan, M.²

6
- 0033686132
- "On-line connectionist Q-learning produces unreliable performance with a synonym finding task"
- in Jul
- I. Johnson and M. D. Plumbley, "On-line connectionist Q-learning produces unreliable performance with a synonym finding task," in Proc. Int. Joint Conf. Neural Netw., Jul. 2000, pp. 24-27.
- (2000) Proc. Int. Joint Conf. Neural Netw. , pp. 24-27
- Johnson, I.¹ Plumbley, M.D.²

7
- 0003806984
- Reinforcement learning based on back propagation for mobile robot navigation
- Computational Intelligence Group, Dept. Cybern. Artif. Intell., Technical Univ., Kosice, Slovakia
- R. Jaksa, P. Majernik, and P. Sincak, Reinforcement learning based on back propagation for mobile robot navigation Computational Intelligence Group, Dept. Cybern. Artif. Intell., Technical Univ., Kosice, Slovakia, 2000.
- (2000)
- Jaksa, R.¹ Majernik, P.² Sincak, P.³

8
- 0027810452
- "Self-organizing traffic control via fuzzy logic"
- S. Chiu and S. Chand, "Self-organizing traffic control via fuzzy logic," in Proc. 32nd IEEE Conf. Decision Control, 1993, pp. 1987-1902.
- (1993) Proc. 32nd IEEE Conf. Decision Control , pp. 1902-1987
- Chiu, S.¹ Chand, S.²

9
- 0030392629
- "Fuzzy sets in distributed traffic control"
- G. Nakamiti and F. Gomide, "Fuzzy sets in distributed traffic control," in Proc. 5th IEEE Int. Conf. Fuzzy Syst., 1996, pp. 1617-1623.
- (1996) Proc. 5th IEEE Int. Conf. Fuzzy Syst. , pp. 1617-1623
- Nakamiti, G.¹ Gomide, F.²

10
- 0028574934
- "Genetic reinforcement learning for cooperative traffic signal control"
- S. Mikami and Y. Kakazu, "Genetic reinforcement learning for cooperative traffic signal control," in Proc. 1st IEEE Conf. Evol. Comput., 1994, vol. 1, pp. 223-228.
- (1994) Proc. 1st IEEE Conf. Evol. Comput. , vol.1 , pp. 223-228
- Mikami, S.¹ Kakazu, Y.²

11
- 0036456219
- "FL-FN based traffic signal control"
- in May
- W. Wei and Y. Zhang, "FL-FN based traffic signal control," in Proc. 2002 IEEE Int. Conf. Fuzzy Syst., May 2002, vol. 1, no. 12-17, pp. 296-300.
- (2002) Proc. 2002 IEEE Int. Conf. Fuzzy Syst. , vol.1 , Issue.12-17 , pp. 296-300
- Wei, W.¹ Zhang, Y.²

12
- 0345880144
- "Traffic-responsive signal timing for system-wide traffic control"
- J. C. Spall and D. C. Chin, "Traffic-responsive signal timing for system-wide traffic control," Transpn. Res.- C, vol. 5, no. 3/4, pp. 153-163, 1997.
- (1997) Transpn. Res.- C , vol.5 , Issue.3-4 , pp. 153-163
- Spall, J.C.¹ Chin, D.C.²

13
- 0035372090
- "Reinforcement learning in neural fuzzy traffic signal control"
- E. Bingham, "Reinforcement learning in neural fuzzy traffic signal control," Euro. J. Operation Res., vol. 131, no. 2, pp. 232-241, 2001.
- (2001) Euro. J. Operation Res. , vol.131 , Issue.2 , pp. 232-241
- Bingham, E.¹

14
- 0004175911
- 2nd ed. Boston, MA: PWS-Kent
- N. J. Garber and L. A. Hoel, Traffic and Highway Engineering, 2nd ed. Boston, MA: PWS-Kent, 1997, pp. 281-329.
- (1997) Traffic and Highway Engineering , pp. 281-329
- Garber, N.J.¹ Hoel, L.A.²

15
- 0035330108
- "Distributed-information neural control: The case of dynamic routing in traffic networks"
- May
- M. Baglietto, T. Parisini, and R. Zoppoli, "Distributed-information neural control: The case of dynamic routing in traffic networks," IEEE Trans. Neural Netw., vol. 12, no. 3, pp. 485-502, May 2001.
- (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.3 , pp. 485-502
- Baglietto, M.¹ Parisini, T.² Zoppoli, R.³

16
- 0032096305
- "Nonlinear stabilization by receding-horizon neural regulators"
- T. Parisini, M. Sanguineti, and R. Zoppoli, "Nonlinear stabilization by receding-horizon neural regulators," Int. J. Control, vol. 70, no. 3, pp. 341-362, 1998.
- (1998) Int. J. Control , vol.70 , Issue.3 , pp. 341-362
- Parisini, T.¹ Sanguineti, M.² Zoppoli, R.³

17
- 0028729520
- "Hybrid fuzzy neural nets are universal approximators"
- in Jun
- J. J. Buckley and U. Hayashi, "Hybrid fuzzy neural nets are universal approximators," in Proc. 3rd IEEE Conf. Fuzzy Syst. IEEE World Congr. Comput. Intell., Jun. 1994, vol. 1, pp. 238-243.
- (1994) Proc. 3rd IEEE Conf. Fuzzy Syst. IEEE World Congr. Comput. Intell. , Issue.1 , pp. 238-243
- Buckley, J.J.¹ Hayashi, U.²

18
- 0033285473
- "On the use of simultaneous perturbation stochastic approximation for neural network training"
- A. V. Wouwer, C. Renotte, and M. Remy, "On the use of simultaneous perturbation stochastic approximation for neural network training," in Proc. Amer. Control Conf., 1999, pp. 388-392.
- (1999) Proc. Amer. Control Conf. , pp. 388-392
- Wouwer, A.V.¹ Renotte, C.² Remy, M.³

19
- 0036075615
- "Stochastic learning control for nonlinear systems"
- in May
- E. Gomez-Ramirez, P. L. Najim, and E. Ikonen, "Stochastic learning control for nonlinear systems," in Proc. Int. Joint Conf. Neural Netw. (IJCNN'02), May 2002, vol. 1, pp. 171-176.
- (2002) Proc. Int. Joint Conf. Neural Netw. (IJCNN'02) , vol.1 , pp. 171-176
- Gomez-Ramirez, E.¹ Najim, P.L.² Ikonen, E.³

20
- 0000439891
- "On the convergence of stochastic iterative dynamic programming algorithms"
- T. Jaakkola, M. I. Jordan, and S. P. Singh, "On the convergence of stochastic iterative dynamic programming algorithms," Neural Comput., vol. 6, no. 6, pp. 1185-1201, 1994.
- (1994) Neural Comput. , vol.6 , Issue.6 , pp. 1185-1201
- Jaakkola, T.¹ Jordan, M.I.² Singh, S.P.³

21
- 0001961616
- "A generalized reinforcement learning model: Convergence and applications"
- M. Littman and C. Szepesvari, "A generalized reinforcement learning model: Convergence and applications," in Proc. 13th Int. Conf. Mach. Learn., 1996, pp. 310-318.
- (1996) Proc. 13th Int. Conf. Mach. Learn. , pp. 310-318
- Littman, M.¹ Szepesvari, C.²

22
- 0026839090
- "Multivariate stochastic approximation using a simultaneous perturbation gradient approximation"
- Mar
- J. C. Spall, "Multivariate stochastic approximation using a simultaneous perturbation gradient approximation," IEEE Trans. Autom. Control, vol. 37, no. 3, pp. 332-341, Mar. 1992.
- (1992) IEEE Trans. Autom. Control , vol.37 , Issue.3 , pp. 332-341
- Spall, J.C.¹

23
- 0000016172
- "A stochastic approximation method"
- H. Robbins and S. Monro, "A stochastic approximation method," Ann. Math. Statist., vol. 25, pp. 382-386, 1951.
- (1951) Ann. Math. Statist. , vol.25 , pp. 382-386
- Robbins, H.¹ Monro, S.²

24
- 0001079593
- "Stochastic estimation of a regression function"
- J. Kiefer and J. Wolfowitz, "Stochastic estimation of a regression function," Ann. Math. Stat., vol. 23, pp. 462-466, 1952.
- (1952) Ann. Math. Stat. , vol.23 , pp. 462-466
- Kiefer, J.¹ Wolfowitz, J.²

25
- 37949008637
- "Reinforcement learning based on back propagation for mobile robot navigation"
- in Vienna, Austria
- R. Jaksa, P. Majernik, and P. Sincak, "Reinforcement learning based on back propagation for mobile robot navigation," in Proc. Comput. Intell. Modeling, Control, Autom. (CIMCA), Vienna, Austria, 1999.
- (1999) Proc. Comput. Intell. Modeling, Control, Autom. (CIMCA)
- Jaksa, R.¹ Majernik, P.² Sincak, P.³

26
- 0004102479
- Cambridge, MA: MIT Press
- R. S. Sutton and A. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.²

27
- 0024137490
- "Increased rates of convergence through learning rate adaptation"
- R. A. Jacobs, "Increased rates of convergence through learning rate adaptation," Neural Netw., vol. 1, pp. 295-307, 1988.
- (1988) Neural Netw. , vol.1 , pp. 295-307
- Jacobs, R.A.¹

28
- 0001518167
- "On the convergence of the LMS algorithm with adaptive learning rate for linear feedforward networks"
- Z. Luo, "On the convergence of the LMS algorithm with adaptive learning rate for linear feedforward networks," Neural Comput., vol. 3, pp. 226-245, 1991.
- (1991) Neural Comput. , vol.3 , pp. 226-245
- Luo, Z.¹

29
- 0003410791
- 2nd ed. Berlin, Germany: Springer-Verlag
- T. Kohonen, Self-Organizing Maps, 2nd ed. Berlin, Germany: Springer-Verlag, 1997.
- (1997) Self-Organizing Maps
- Kohonen, T.¹

30
- 0034862807
- "Coordination of exploration and exploitation in a dynamic environment"
- G. Yan, F. Yang, T. Hickey, and M. Goldstein, "Coordination of exploration and exploitation in a dynamic environment," in Proc. Int. Joint Conf. Neural Netw. (IJCNN), 2001, pp. 1014-1018.
- (2001) Proc. Int. Joint Conf. Neural Netw. (IJCNN) , pp. 1014-1018
- Yan, G.¹ Yang, F.² Hickey, T.³ Goldstein, M.⁴

31
- 0004273476
- Quadstone, Edinburgh, U.K.: Quadstone Ltd
- Quadstone, PARAMICS Modeller v4.0 User Guide and Reference Manual. Edinburgh, U.K.: Quadstone Ltd., 2002.
- (2002) PARAMICS Modeller V4.0 User Guide and Reference Manual

32
- 0032628023
- "Analysis of intersection delay under real-time adaptive signal control"
- Feb
- P. B. Wolshon and W. C. Taylor, "Analysis of intersection delay under real-time adaptive signal control," Transportation Res. Part C, Emerging Technol., vol. 7C, no. 1, pp. 53-72, Feb. 1999.
- (1999) Transportation Res. Part C, Emerging Technol. , vol.7 C , Issue.1 , pp. 53-72
- Wolshon, P.B.¹ Taylor, W.C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.