SCOPUS 정보 검색 플랫폼

IEEE/CAA Journal of Automatica Sinica

Volumn 4, Issue 1, 2017, Pages 1-5

PDP: Parallel dynamic programming

(5) Wang, Fei Yue a Zhang, Jie a Wei, Qinglai a Zheng, Xinhu b Li, Li c

a INSTITUTE OF AUTOMATION (China)

b University of Minnesota (United States)

c TSINGHUA UNIVERSITY (China)

Author keywords

Adaptive dynamic programming; Artificial intelligence; Deep learning; Dynamic programming; Neural networks; Parallel dynamic programming; Reinforcement learning

Indexed keywords

ARTIFICIAL INTELLIGENCE; DEEP LEARNING; DEEP NEURAL NETWORKS; INTELLIGENT COMPUTING; LEARNING SYSTEMS; NEURAL NETWORKS; REINFORCEMENT LEARNING;

ADAPTIVE DYNAMIC PROGRAMMING; PARALLEL DYNAMIC PROGRAMMING; PRINCIPLE OF OPTIMALITY; REINFORCEMENT LEARNING METHOD;

DYNAMIC PROGRAMMING;

EID: 85010049728 PISSN: 23299266 EISSN: 23299274 Source Type: Journal
DOI: 10.1109/JAS.2017.7510310 Document Type: Article

Times cited : (95)

References (46)

1
- 84963949906
- Mastering the game of Go with deep neural networks and tree search
- D. Silver et al., "Mastering the game of Go with deep neural networks and tree search, " Nature 529.7587, pp. 484-489, 2016.
- (2016) Nature , vol.529 , Issue.7587 , pp. 484-489
- Silver, D.¹

2
- 0003787146
- Princeton, NJ: Princeton University Press
- R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton University Press, 1957.
- (1957) Dynamic Programming
- Bellman, R.E.¹

3
- 0002557583
- Advanced forecasting methods for global crisis warning and models of intelligence
- P. J. Werbos, "Advanced forecasting methods for global crisis warning and models of intelligence, " General Syst. Yearbook, vol. 22, 1977.
- (1977) General Syst. Yearbook , vol.22
- Werbos, P.J.¹

4
- 0002011091
- A menu of designs for reinforcement learning over time
- W. T. Miller, R. S. Sutton and P. J. Werbos (Eds.), ambridge MIT Press
- P. J. Werbos, "A menu of designs for reinforcement learning over time, " in Neural Networks for Control, W. T. Miller, R. S. Sutton and P. J. Werbos (Eds.), Cambridge: MIT Press, 1991, pp. 67-95.
- (1991) Neural Networks for Control , pp. 67-95
- Werbos, P.J.¹

5
- 84975322654
- Where does AlphaGo go: From church-Turing
- thesis to AlphaGo thesis and beyond April
- F.-Y. Wang, et al., "Where does AlphaGo go: from church-Turing thesis to AlphaGo thesis and beyond", IEEE/CAA J. Autom. Sinica, vol. 3, no. 2, pp. 113-120, April 2016.
- (2016) IEEE/CAA J. Autom. Sinica , vol.3 , Issue.2 , pp. 113-120
- Wang, F.-Y.¹

6
- 84869402639
- A big-data perspective on ai: Newton, merton, and analytics intelligence
- F.-Y. Wang, "A big-data perspective on AI: Newton, Merton, and analytics intelligence", IEEE Intell. Syst., vol. 27, no. 5, pp. 2-4, 2012.
- (2012) IEEE Intell. Syst , vol.27 , Issue.5 , pp. 2-4
- Wang, F.-Y.¹

7
- 85017294169
- Parallel learning-A new framework for machine learning
- (in Chinese)
- L. Li, Y.-L. Lin, D.-P. Cao, N.-N. Zheng, and F.-Y. Wang, "Parallel learning-A new framework for machine learning, " Acta Autom. Sinica, vol. 43, no. 1, pp. 1-8, 2017 (in Chinese).
- (2017) Acta Autom. Sinica , vol.43 , Issue.1 , pp. 1-8
- Li, L.¹ Lin, Y.-L.² Cao, D.-P.³ Zheng, N.-N.⁴ Wang, F.-Y.⁵

8
- 84960456924
- Efficient video stitching based on fast structure deformation
- article in press
- J. Li, W. Xu, J. Zhang, M. Zhang, Z. Wang, and X. Li, "Efficient video stitching based on fast structure deformation, " IEEE Trans. Cybern., article in press, 2015. DOI: 10.1109/TCYB.2014.2381774.
- (2015) IEEE Trans. Cybern
- Li, J.¹ Xu, W.² Zhang, J.³ Zhang, M.⁴ Wang, Z.⁵ Li, X.⁶

9
- 84949845142
- Stochastic dynamic programming in the real-world control of hybrid electric vehicles
- Mar
- C. Vagg, S. Akehurst, C. J. Brace, and L. Ash, "Stochastic dynamic programming in the real-world control of hybrid electric vehicles, " IEEE Trans. Control Syst. Technol., vol. 24, no. 3, pp. 853-866, Mar. 2016.
- (2016) IEEE Trans. Control Syst. Technol , vol.24 , Issue.3 , pp. 853-866
- Vagg, C.¹ Akehurst, S.² Brace, C.J.³ Ash, L.⁴

10
- 84982757712
- Motion planning for continuous-Time stochastic processes: A dynamic programming approach
- P. M. Esfahani, D. Chatterjee, and J. Lygeros, "Motion planning for continuous-Time stochastic processes: A dynamic programming approach, " IEEE Trans. Autom. Control, vol. 61, pp. 2155-2170, 2016.
- (2016) IEEE Trans. Autom. Control , vol.61 , pp. 2155-2170
- Esfahani, P.M.¹ Chatterjee, D.² Lygeros, J.³

11
- 0002031779
- Approximate dynamic programming for real-Time control and neural modeling
- D.A. White and D.A. Sofge (Eds.), New York Van Nostrand Reinhold ch. 13
- P. J. Werbos, "Approximate dynamic programming for real-Time control and neural modeling, " in Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, D.A. White and D.A. Sofge (Eds.), New York: Van Nostrand Reinhold, 1992, ch. 13.
- (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
- Werbos, P.J.¹

12
- 0003487482
- Belmont, MA: Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

13
- 0031236002
- Adaptive critic designs
- Sep
- D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs, " IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
- (1997) IEEE Trans. Neural Netw , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch, D.C.²

14
- 84960091952
- Adaptive critic design-based dynamic stochastic optimal control design for a microgrid with multiple renewable resources
- Jun
- J. Han, S. Khushalani-Solanki, J. Solanki, and J. Liang, "Adaptive critic design-based dynamic stochastic optimal control design for a microgrid with multiple renewable resources, " IEEE Trans. Smart Grid, vol. 6, no. 6, pp. 2694-2703, Jun. 2015.
- (2015) IEEE Trans. Smart Grid , vol.6 , Issue.6 , pp. 2694-2703
- Han, J.¹ Khushalani-Solanki, S.² Solanki, J.³ Liang, J.⁴

15
- 0004102479
- MA MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction. Cambridge
- Sutton, R.S.¹ Barto, A.G.²

16
- 0036588686
- Adaptive dynamic programming
- May
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming, " IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 32, no. 2, pp. 140-153, May 2002.
- (2002) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

17
- 85046690719
- Discrete-Time local value Iteration adaptive dynamic programming: Convergence analysis
- article in press
- Q. Wei, F. L. Lewis, D. Liu, R. Song, and H. Lin, "Discrete-Time local value Iteration adaptive dynamic programming: Convergence analysis, " IEEE Trans. Syst., Man, Cybern. A, Syst., article in press, 2016. DOI: 10.1109/TSMC.2016.2623766.
- (2016) IEEE Trans. Syst., Man, Cybern. A, Syst
- Wei, Q.¹ Lewis, F.L.² Liu, D.³ Song, R.⁴ Lin, H.⁵

18
- 84963604827
- Discrete-Time deterministic Q-learning: A novel convergence analysis
- article in press
- Q. Wei, F. L. Lewis, Q. Sun, P. Yan, and R. Song, "Discrete-Time deterministic Q-learning: A novel convergence analysis, " IEEE Trans. Cybern., article in press, 2016. DOI: 10.1109/TCYB.2016.2542923.
- (2016) IEEE Trans. Cybern
- Wei, Q.¹ Lewis, F.L.² Sun, Q.³ Yan, P.⁴ Song, R.⁵

19
- 84924872284
- A novel dual iterative Q-learning method for optimal battery management in smart residential environments
- Apr
- Q. Wei, D. Liu, and G. Shi, "A novel dual iterative Q-learning method for optimal battery management in smart residential environments, " IEEE Trans. Ind. Electron., vol. 62, no. 4, pp. 2509-2518, Apr. 2015.
- (2015) IEEE Trans. Ind. Electron , vol.62 , Issue.4 , pp. 2509-2518
- Wei, Q.¹ Liu, D.² Shi, G.³

20
- 84908658175
- A novel iterative-Adaptive dynamic programming for discrete-Time nonlinear systems
- Oct
- Q. Wei and D. Liu, "A novel iterative-Adaptive dynamic programming for discrete-Time nonlinear systems, " IEEE Trans. Autom. Sci. Eng., vol. 11, no. 4, pp. 1176-1190, Oct. 2014.
- (2014) IEEE Trans. Autom. Sci. Eng , vol.11 , Issue.4 , pp. 1176-1190
- Wei, Q.¹ Liu, D.²

21
- 84978818227
- Discrete-Time optimal control via local policy iteration adaptive dynamic programming
- article in press
- Q. Wei, D. Liu, Q. Lin, and R. Song, "Discrete-Time optimal control via local policy iteration adaptive dynamic programming, " IEEE Trans. Cybern., article in press, 2016. DOI: 10.1109/TCYB.2016.2586082.
- (2016) IEEE Trans. Cybern
- Wei, Q.¹ Liu, D.² Lin, Q.³ Song, R.⁴

22
- 0043026775
- Helicopter trimming and tracking control using direct neural dynamic programming
- Aug
- R. Enns and J. Si, "Helicopter trimming and tracking control using direct neural dynamic programming, " IEEE Trans. Neural Netw., vol. 14, no. 4, pp. 929-939, Aug. 2003.
- (2003) IEEE Trans. Neural Netw , vol.14 , Issue.4 , pp. 929-939
- Enns, R.¹ Si, J.²

23
- 84921346879
- Concurrent learningbased approximate feedback-Nash equilibrium solution of N-player nonzero-sum differential games
- Jul
- R. Kamalapurkar, J. R. Klotz, and W. E. Dixon, "Concurrent learningbased approximate feedback-Nash equilibrium solution of N-player nonzero-sum differential games, " IEEE/CAA J. Autom. Sinica, vol. 1, no. 3, pp. 239-247, Jul. 2014.
- (2014) IEEE/CAA J. Autom. Sinica , vol.1 , Issue.3 , pp. 239-247
- Kamalapurkar, R.¹ Klotz, J.R.² Dixon, W.E.³

24
- 84981727630
- Discrete-Time local iterative adaptive dynamic programming: Terminations and admissibility analysis
- article in press
- Q. Wei, D. Liu, and Q. Lin, "Discrete-Time local iterative adaptive dynamic programming: Terminations and admissibility analysis, " IEEE Trans. Neural Netw. Learn. Syst., article in press, 2016. DOI: 10.1109/TNNLS.2016.2593743.
- (2016) IEEE Trans. Neural Netw. Learn. Syst
- Wei, Q.¹ Liu, D.² Lin, Q.³

25
- 84939617304
- Data-driven zero-sum neuro-optimal control for a class of continuous-Time unknown nonlinear systems with disturbance using ADP
- Feb
- Q. Wei, R. Song, and P. Yan, "Data-driven zero-sum neuro-optimal control for a class of continuous-Time unknown nonlinear systems with disturbance using ADP, " IEEE Trans. Neural Netw. Learn. Syst., vol. 27, no. 2, pp. 444-458, Feb. 2016.
- (2016) IEEE Trans. Neural Netw. Learn. Syst , vol.27 , Issue.2 , pp. 444-458
- Wei, Q.¹ Song, R.² Yan, P.³

26
- 84912048350
- Online adaptive policy learning algorithm for H1 state feedback control of unknown affine nonlinear discrete-Time systems
- Dec
- H. Zhang, C. Qin, B. Jiang, and Y. Luo, "Online adaptive policy learning algorithm for H1 state feedback control of unknown affine nonlinear discrete-Time systems, " IEEE Trans. Cybern., vol. 44, no. 12, pp. 2706-2718, Dec. 2014.
- (2014) IEEE Trans. Cybern , vol.44 , Issue.12 , pp. 2706-2718
- Zhang, H.¹ Qin, C.² Jiang, B.³ Luo, Y.⁴

27
- 34047218055
- Suboptimal control for nonlinear stochastic systems
- F.-Y. Wang and G. N. Saridis, "Suboptimal control for nonlinear stochastic systems, " Proc. 31st IEEE Conf. Decision Control, 1992.
- (1992) Proc. 31st IEEE Conf. Decision Control
- Wang, F.-Y.¹ Saridis, G.N.²

28
- 0039434283
- Suboptimal control of nonlinear stochastic systems
- G. N. Saridis and F.-Y. Wang, "Suboptimal control of nonlinear stochastic systems, " Control Theory and Advanced Technology, vol. 10, no. 4, pp. 847-871, 1994.
- (1994) Control Theory and Advanced Technology , vol.10 , Issue.4 , pp. 847-871
- Saridis, G.N.¹ Wang, F.-Y.²

29
- 85027953921
- Infinite horizon self-learning optimal control of nonaffine discrete-Time nonlinear systems
- Apr
- Q. Wei, D. Liu, and X. Yang, "Infinite horizon self-learning optimal control of nonaffine discrete-Time nonlinear systems, " IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 4, pp. 866-879, Apr. 2015.
- (2015) IEEE Trans. Neural Netw. Learn. Syst , vol.26 , Issue.4 , pp. 866-879
- Wei, Q.¹ Liu, D.² Yang, X.³

30
- 85017654846
- Optimal constrained self-learning battery sequential management in microgrid via adaptive dynamic programming
- article in press
- Q. Wei, D. Liu, Y. Liu, and R. Song, "Optimal constrained self-learning battery sequential management in microgrid via adaptive dynamic programming, " IEEE/CAA J. Autom. Sinica, article in press, 2016. DOI: 10.1109/JAS.2016.7510262.
- (2016) IEEE/CAA J. Autom. Sinica
- Wei, Q.¹ Liu, D.² Liu, Y.³ Song, R.⁴

31
- 84921361841
- Near optimal output feedback control of nonlinear discrete-Time systems based on reinforcement neural network learning
- Oct
- Q. Zhao, H. Xu, and S. Jagannathan, "Near optimal output feedback control of nonlinear discrete-Time systems based on reinforcement neural network learning, " IEEE/CAA J. Autom. Sinica, vol. 1, no. 4, pp. 372-384, Oct. 2014.
- (2014) IEEE/CAA J. Autom. Sinica , vol.1 , Issue.4 , pp. 372-384
- Zhao, Q.¹ Xu, H.² Jagannathan, S.³

32
- 84930506123
- Optimal multi-battery coordination control for home energy management systems via distributed iterative adaptive dynamic programming
- Jul
- Q. Wei, D. Liu, G. Shi, and Y. Liu, "Optimal multi-battery coordination control for home energy management systems via distributed iterative adaptive dynamic programming, " IEEE Trans. Ind. Electron., vol. 42, no. 7, pp. 4203-4214, Jul. 2015.
- (2015) IEEE Trans. Ind. Electron , vol.42 , Issue.7 , pp. 4203-4214
- Wei, Q.¹ Liu, D.² Shi, G.³ Liu, Y.⁴

33
- 84946811900
- Value iteration adaptive dynamic programming for optimal control of discrete-Time nonlinear systems
- Mar
- Q. Wei, D. Liu, and H. Lin, "Value iteration adaptive dynamic programming for optimal control of discrete-Time nonlinear systems, " IEEE Trans. Cybern., vol. 46, no. 3, pp. 840-853, Mar. 2016.
- (2016) IEEE Trans. Cybern , vol.46 , Issue.3 , pp. 840-853
- Wei, Q.¹ Liu, D.² Lin, H.³

34
- 84912122528
- Finite-Approximation-error based discrete-Time iterative adaptive dynamic programming
- Dec
- Q. Wei, F. Wang, D. Liu, and X. Yang, "Finite-Approximation-error based discrete-Time iterative adaptive dynamic programming, " IEEE Trans. Cybern., vol. 44, no. 12, pp. 2820-2833, Dec. 2014.
- (2014) IEEE Trans. Cybern , vol.44 , Issue.12 , pp. 2820-2833
- Wei, Q.¹ Wang, F.² Liu, D.³ Yang, X.⁴

35
- 84878421441
- Optimal control for discrete-Time affine non-linear systems using general value iteration
- Dec
- H. Li and D. Liu, "Optimal control for discrete-Time affine non-linear systems using general value iteration, " IET Control Theory Appl., vol. 6, no. 18, pp. 2725-2736, Dec. 2012.
- (2012) IET Control Theory Appl , vol.6 , Issue.18 , pp. 2725-2736
- Li, H.¹ Liu, D.²

36
- 85005983375
- Adaptive dynamic programming and adaptive optimal output regulation of linear systems
- Dec
- W. Gao and Z.-P. Jiang, "Adaptive dynamic programming and adaptive optimal output regulation of linear systems, " IEEE Trans. Autom. Control, vol. 61, no. 12, pp. 4164-4169, Dec. 2016.
- (2016) IEEE Trans. Autom. Control , vol.61 , Issue.12 , pp. 4164-4169
- Gao, W.¹ Jiang, Z.-P.²

37
- 84971473616
- Deep learning for control: The state of the art and prospects
- Y. Duan, Y. Lv, J. Zhang, X. Zhao, and F.-Y. Wang, "Deep learning for control: The state of the art and prospects, " Acta Autom. Sinica, vol 42, no. 5, pp. 643-654, 2016.
- (2016) Acta Autom. Sinica , vol.42 , Issue.5 , pp. 643-654
- Duan, Y.¹ Lv, Y.² Zhang, J.³ Zhao, X.⁴ Wang, F.-Y.⁵

38
- 0742300845
- Building knowledge structure in neural nets using fuzzy logic
- M. Jamshidi (Eds.), New York, NY, ASME (American Society of Mechanical Engineers) Press
- F.-Y. Wang, "Building knowledge structure in neural nets using fuzzy logic, " Robotics and Manufacturing: Recent Trends in Research Education and Applications, M. Jamshidi (Eds.), New York, NY, ASME (American Society of Mechanical Engineers) Press, 1992.
- (1992) Robotics and Manufacturing: Recent Trends in Research Education and Applications
- Wang, F.-Y.¹

39
- 84974753844
- Implementing adaptive fuzzy logic controllers with neural networks: A design paradigm
- F.-Y. Wang and H.-A. Kim, "Implementing adaptive fuzzy logic controllers with neural networks: A design paradigm, " J. Intell. Fuzzy Syst., vol. 3, no. 2, pp. 165-180, 1995.
- (1995) J. Intell. Fuzzy Syst , vol.3 , Issue.2 , pp. 165-180
- Wang, F.-Y.¹ Kim, H.-A.²

40
- 77956341661
- The emergence of intelligent enterprises: From CPS to CPSS
- F.-Y. Wang, "The emergence of intelligent enterprises: From CPS to CPSS, " IEEE Intell. Syst., vol. 25, no. 4, pp. 85-88, 2010.
- (2010) IEEE Intell. Syst , vol.25 , Issue.4 , pp. 85-88
- Wang, F.-Y.¹

41
- 84901070536
- Predictive analytics white paper
- C. Nyce, "Predictive analytics white paper, " American Institute for Chartered Property Casualty Underwriters/Insurance Institute of America, 2007.
- (2007) American Institute for Chartered Property Casualty Underwriters/Insurance Institute of America
- Nyce, C.¹

42
- 85010072911
- Extending the value of your data warehousing investment
- W. Eckerson, "Extending the value of your data warehousing investment, " The Data Warehouse Institute, USA, 2007.
- (2007) The Data Warehouse Institute, USA
- Eckerson, W.¹

43
- 84900352446
- Business analytics: The next frontier for decision sciences
- Mar
- J. R. Evans and C. H. Lindner, "Business analytics: The next frontier for decision sciences, " Decision Line, vol. 43, no. 2, pp. 1-4, Mar. 2012.
- (2012) Decision Line , vol.43 , Issue.2 , pp. 1-4
- Evans, J.R.¹ Lindner, C.H.²

44
- 85010028505
- Parallel dynammic programming with an average-greedy mechanism for discrete systems
- ASIA, Beijing, China
- J. Zhang, Q. Wei, and F.-Y. Wang, "Parallel dynammic programming with an average-greedy mechanism for discrete systems, " SKLMCCS/QAII Tech Report 01-09-2016, ASIA, Beijing, China.
- SKLMCCS/QAII Tech Report 01-09-2016
- Zhang, J.¹ Wei, Q.² Wang, F.-Y.³

45
- 84877350550
- Parallel control: A method for data-driven and computational control
- F.-Y. Wang, "Parallel control: A method for data-driven and computational control, " Acta Autom.a Sinica, vol.39, no. 2, pp. 293-302, 2013.
- (2013) Acta Autom a Sinica , vol.39 , Issue.2 , pp. 293-302
- Wang, F.-Y.¹

46
- 84978955280
- Control 5.0: From Newton to merton in poppers cyber-social-physical spaces
- F.-Y. Wang, "Control 5.0: From Newton to Merton in Poppers Cyber-Social-Physical Spaces, " IEEE/CAA J. Autom. Sinica, vol. 3, no. 3, pp. 233-234, 2016.
- (2016) IEEE/CAA J. Autom. Sinica , vol.3 , Issue.3 , pp. 233-234
- Wang, F.-Y.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.