Volume 4, Issue 1, 2017, Pages 1-5

PDP: Parallel dynamic programming

Author keywords

Adaptive dynamic programming; Artificial intelligence; Deep learning; Dynamic programming; Neural networks; Parallel dynamic programming; Reinforcement learning

Indexed keywords

ARTIFICIAL INTELLIGENCE; DEEP LEARNING; DEEP NEURAL NETWORKS; INTELLIGENT COMPUTING; LEARNING SYSTEMS; NEURAL NETWORKS; REINFORCEMENT LEARNING;

EID: 85010049728     PISSN: 23299266     EISSN: 23299274     Source Type: Journal    
DOI: 10.1109/JAS.2017.7510310     Document Type: Article
Times cited: 95

References (46)
  • 1
    • 84963949906
    • Mastering the game of Go with deep neural networks and tree search
    • D. Silver et al., "Mastering the game of Go with deep neural networks and tree search," Nature, vol. 529, no. 7587, pp. 484-489, 2016.
    • (2016) Nature , vol.529 , Issue.7587 , pp. 484-489
    • Silver, D.1
  • 2
    • 0003787146
    • Princeton, NJ: Princeton University Press
    • R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton University Press, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 3
    • 0002557583
    • Advanced forecasting methods for global crisis warning and models of intelligence
    • P. J. Werbos, "Advanced forecasting methods for global crisis warning and models of intelligence," General Syst. Yearbook, vol. 22, 1977.
    • (1977) General Syst. Yearbook , vol.22
    • Werbos, P.J.1
  • 4
    • 0002011091
    • A menu of designs for reinforcement learning over time
    • W. T. Miller, R. S. Sutton and P. J. Werbos (Eds.), Cambridge: MIT Press
    • P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton and P. J. Werbos (Eds.), Cambridge: MIT Press, 1991, pp. 67-95.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 5
    • 84975322654
    • Where does AlphaGo go: From Church-Turing thesis to AlphaGo thesis and beyond
    • Apr
    • F.-Y. Wang et al., "Where does AlphaGo go: From Church-Turing thesis to AlphaGo thesis and beyond," IEEE/CAA J. Autom. Sinica, vol. 3, no. 2, pp. 113-120, Apr. 2016.
    • (2016) IEEE/CAA J. Autom. Sinica , vol.3 , Issue.2 , pp. 113-120
    • Wang, F.-Y.1
  • 6
    • 84869402639
    • A big-data perspective on AI: Newton, Merton, and analytics intelligence
    • F.-Y. Wang, "A big-data perspective on AI: Newton, Merton, and analytics intelligence," IEEE Intell. Syst., vol. 27, no. 5, pp. 2-4, 2012.
    • (2012) IEEE Intell. Syst , vol.27 , Issue.5 , pp. 2-4
    • Wang, F.-Y.1
  • 7
    • 85017294169
    • Parallel learning-A new framework for machine learning
    • (in Chinese)
    • L. Li, Y.-L. Lin, D.-P. Cao, N.-N. Zheng, and F.-Y. Wang, "Parallel learning-A new framework for machine learning," Acta Autom. Sinica, vol. 43, no. 1, pp. 1-8, 2017 (in Chinese).
    • (2017) Acta Autom. Sinica , vol.43 , Issue.1 , pp. 1-8
    • Li, L.1    Lin, Y.-L.2    Cao, D.-P.3    Zheng, N.-N.4    Wang, F.-Y.5
  • 8
    • 84960456924
    • Efficient video stitching based on fast structure deformation
    • article in press
    • J. Li, W. Xu, J. Zhang, M. Zhang, Z. Wang, and X. Li, "Efficient video stitching based on fast structure deformation," IEEE Trans. Cybern., article in press, 2015. DOI: 10.1109/TCYB.2014.2381774.
    • (2015) IEEE Trans. Cybern
    • Li, J.1    Xu, W.2    Zhang, J.3    Zhang, M.4    Wang, Z.5    Li, X.6
  • 9
    • 84949845142
    • Stochastic dynamic programming in the real-world control of hybrid electric vehicles
    • Mar
    • C. Vagg, S. Akehurst, C. J. Brace, and L. Ash, "Stochastic dynamic programming in the real-world control of hybrid electric vehicles," IEEE Trans. Control Syst. Technol., vol. 24, no. 3, pp. 853-866, Mar. 2016.
    • (2016) IEEE Trans. Control Syst. Technol , vol.24 , Issue.3 , pp. 853-866
    • Vagg, C.1    Akehurst, S.2    Brace, C.J.3    Ash, L.4
  • 10
    • 84982757712
    • Motion planning for continuous-time stochastic processes: A dynamic programming approach
    • P. M. Esfahani, D. Chatterjee, and J. Lygeros, "Motion planning for continuous-time stochastic processes: A dynamic programming approach," IEEE Trans. Autom. Control, vol. 61, pp. 2155-2170, 2016.
    • (2016) IEEE Trans. Autom. Control , vol.61 , pp. 2155-2170
    • Esfahani, P.M.1    Chatterjee, D.2    Lygeros, J.3
  • 11
    • 0002031779
    • Approximate dynamic programming for real-time control and neural modeling
    • D. A. White and D. A. Sofge (Eds.), New York: Van Nostrand Reinhold, ch. 13
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, D. A. White and D. A. Sofge (Eds.), New York: Van Nostrand Reinhold, 1992, ch. 13.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
    • Werbos, P.J.1
  • 13
    • 0031236002
    • Adaptive critic designs
    • Sep
    • D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
    • (1997) IEEE Trans. Neural Netw , vol.8 , Issue.5 , pp. 997-1007
    • Prokhorov, D.V.1    Wunsch, D.C.2
  • 14
    • 84960091952
    • Adaptive critic design-based dynamic stochastic optimal control design for a microgrid with multiple renewable resources
    • Jun
    • J. Han, S. Khushalani-Solanki, J. Solanki, and J. Liang, "Adaptive critic design-based dynamic stochastic optimal control design for a microgrid with multiple renewable resources," IEEE Trans. Smart Grid, vol. 6, no. 6, pp. 2694-2703, Jun. 2015.
    • (2015) IEEE Trans. Smart Grid , vol.6 , Issue.6 , pp. 2694-2703
    • Han, J.1    Khushalani-Solanki, S.2    Solanki, J.3    Liang, J.4
  • 17
    • 85046690719
    • Discrete-time local value iteration adaptive dynamic programming: Convergence analysis
    • article in press
    • Q. Wei, F. L. Lewis, D. Liu, R. Song, and H. Lin, "Discrete-time local value iteration adaptive dynamic programming: Convergence analysis," IEEE Trans. Syst., Man, Cybern. A, Syst., article in press, 2016. DOI: 10.1109/TSMC.2016.2623766.
    • (2016) IEEE Trans. Syst., Man, Cybern. A, Syst
    • Wei, Q.1    Lewis, F.L.2    Liu, D.3    Song, R.4    Lin, H.5
  • 18
    • 84963604827
    • Discrete-time deterministic Q-learning: A novel convergence analysis
    • article in press
    • Q. Wei, F. L. Lewis, Q. Sun, P. Yan, and R. Song, "Discrete-time deterministic Q-learning: A novel convergence analysis," IEEE Trans. Cybern., article in press, 2016. DOI: 10.1109/TCYB.2016.2542923.
    • (2016) IEEE Trans. Cybern
    • Wei, Q.1    Lewis, F.L.2    Sun, Q.3    Yan, P.4    Song, R.5
  • 19
    • 84924872284
    • A novel dual iterative Q-learning method for optimal battery management in smart residential environments
    • Apr
    • Q. Wei, D. Liu, and G. Shi, "A novel dual iterative Q-learning method for optimal battery management in smart residential environments," IEEE Trans. Ind. Electron., vol. 62, no. 4, pp. 2509-2518, Apr. 2015.
    • (2015) IEEE Trans. Ind. Electron , vol.62 , Issue.4 , pp. 2509-2518
    • Wei, Q.1    Liu, D.2    Shi, G.3
  • 20
    • 84908658175
    • A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems
    • Oct
    • Q. Wei and D. Liu, "A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 4, pp. 1176-1190, Oct. 2014.
    • (2014) IEEE Trans. Autom. Sci. Eng , vol.11 , Issue.4 , pp. 1176-1190
    • Wei, Q.1    Liu, D.2
  • 21
    • 84978818227
    • Discrete-time optimal control via local policy iteration adaptive dynamic programming
    • article in press
    • Q. Wei, D. Liu, Q. Lin, and R. Song, "Discrete-time optimal control via local policy iteration adaptive dynamic programming," IEEE Trans. Cybern., article in press, 2016. DOI: 10.1109/TCYB.2016.2586082.
    • (2016) IEEE Trans. Cybern
    • Wei, Q.1    Liu, D.2    Lin, Q.3    Song, R.4
  • 22
    • 0043026775
    • Helicopter trimming and tracking control using direct neural dynamic programming
    • Aug
    • R. Enns and J. Si, "Helicopter trimming and tracking control using direct neural dynamic programming," IEEE Trans. Neural Netw., vol. 14, no. 4, pp. 929-939, Aug. 2003.
    • (2003) IEEE Trans. Neural Netw , vol.14 , Issue.4 , pp. 929-939
    • Enns, R.1    Si, J.2
  • 23
    • 84921346879
    • Concurrent learning-based approximate feedback-Nash equilibrium solution of N-player nonzero-sum differential games
    • Jul
    • R. Kamalapurkar, J. R. Klotz, and W. E. Dixon, "Concurrent learning-based approximate feedback-Nash equilibrium solution of N-player nonzero-sum differential games," IEEE/CAA J. Autom. Sinica, vol. 1, no. 3, pp. 239-247, Jul. 2014.
    • (2014) IEEE/CAA J. Autom. Sinica , vol.1 , Issue.3 , pp. 239-247
    • Kamalapurkar, R.1    Klotz, J.R.2    Dixon, W.E.3
  • 24
    • 84981727630
    • Discrete-time local iterative adaptive dynamic programming: Terminations and admissibility analysis
    • article in press
    • Q. Wei, D. Liu, and Q. Lin, "Discrete-time local iterative adaptive dynamic programming: Terminations and admissibility analysis," IEEE Trans. Neural Netw. Learn. Syst., article in press, 2016. DOI: 10.1109/TNNLS.2016.2593743.
    • (2016) IEEE Trans. Neural Netw. Learn. Syst
    • Wei, Q.1    Liu, D.2    Lin, Q.3
  • 25
    • 84939617304
    • Data-driven zero-sum neuro-optimal control for a class of continuous-time unknown nonlinear systems with disturbance using ADP
    • Feb
    • Q. Wei, R. Song, and P. Yan, "Data-driven zero-sum neuro-optimal control for a class of continuous-time unknown nonlinear systems with disturbance using ADP," IEEE Trans. Neural Netw. Learn. Syst., vol. 27, no. 2, pp. 444-458, Feb. 2016.
    • (2016) IEEE Trans. Neural Netw. Learn. Syst , vol.27 , Issue.2 , pp. 444-458
    • Wei, Q.1    Song, R.2    Yan, P.3
  • 26
    • 84912048350
    • Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems
    • Dec
    • H. Zhang, C. Qin, B. Jiang, and Y. Luo, "Online adaptive policy learning algorithm for H∞ state feedback control of unknown affine nonlinear discrete-time systems," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2706-2718, Dec. 2014.
    • (2014) IEEE Trans. Cybern , vol.44 , Issue.12 , pp. 2706-2718
    • Zhang, H.1    Qin, C.2    Jiang, B.3    Luo, Y.4
  • 28
    • 0039434283
    • Suboptimal control of nonlinear stochastic systems
    • G. N. Saridis and F.-Y. Wang, "Suboptimal control of nonlinear stochastic systems," Control Theory and Advanced Technology, vol. 10, no. 4, pp. 847-871, 1994.
    • (1994) Control Theory and Advanced Technology , vol.10 , Issue.4 , pp. 847-871
    • Saridis, G.N.1    Wang, F.-Y.2
  • 29
    • 85027953921
    • Infinite horizon self-learning optimal control of nonaffine discrete-time nonlinear systems
    • Apr
    • Q. Wei, D. Liu, and X. Yang, "Infinite horizon self-learning optimal control of nonaffine discrete-time nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 4, pp. 866-879, Apr. 2015.
    • (2015) IEEE Trans. Neural Netw. Learn. Syst , vol.26 , Issue.4 , pp. 866-879
    • Wei, Q.1    Liu, D.2    Yang, X.3
  • 30
    • 85017654846
    • Optimal constrained self-learning battery sequential management in microgrid via adaptive dynamic programming
    • article in press
    • Q. Wei, D. Liu, Y. Liu, and R. Song, "Optimal constrained self-learning battery sequential management in microgrid via adaptive dynamic programming," IEEE/CAA J. Autom. Sinica, article in press, 2016. DOI: 10.1109/JAS.2016.7510262.
    • (2016) IEEE/CAA J. Autom. Sinica
    • Wei, Q.1    Liu, D.2    Liu, Y.3    Song, R.4
  • 31
    • 84921361841
    • Near optimal output feedback control of nonlinear discrete-time systems based on reinforcement neural network learning
    • Oct
    • Q. Zhao, H. Xu, and S. Jagannathan, "Near optimal output feedback control of nonlinear discrete-time systems based on reinforcement neural network learning," IEEE/CAA J. Autom. Sinica, vol. 1, no. 4, pp. 372-384, Oct. 2014.
    • (2014) IEEE/CAA J. Autom. Sinica , vol.1 , Issue.4 , pp. 372-384
    • Zhao, Q.1    Xu, H.2    Jagannathan, S.3
  • 32
    • 84930506123
    • Optimal multi-battery coordination control for home energy management systems via distributed iterative adaptive dynamic programming
    • Jul
    • Q. Wei, D. Liu, G. Shi, and Y. Liu, "Optimal multi-battery coordination control for home energy management systems via distributed iterative adaptive dynamic programming," IEEE Trans. Ind. Electron., vol. 62, no. 7, pp. 4203-4214, Jul. 2015.
    • (2015) IEEE Trans. Ind. Electron , vol.62 , Issue.7 , pp. 4203-4214
    • Wei, Q.1    Liu, D.2    Shi, G.3    Liu, Y.4
  • 33
    • 84946811900
    • Value iteration adaptive dynamic programming for optimal control of discrete-time nonlinear systems
    • Mar
    • Q. Wei, D. Liu, and H. Lin, "Value iteration adaptive dynamic programming for optimal control of discrete-time nonlinear systems," IEEE Trans. Cybern., vol. 46, no. 3, pp. 840-853, Mar. 2016.
    • (2016) IEEE Trans. Cybern , vol.46 , Issue.3 , pp. 840-853
    • Wei, Q.1    Liu, D.2    Lin, H.3
  • 34
    • 84912122528
    • Finite-approximation-error based discrete-time iterative adaptive dynamic programming
    • Dec
    • Q. Wei, F. Wang, D. Liu, and X. Yang, "Finite-approximation-error based discrete-time iterative adaptive dynamic programming," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2820-2833, Dec. 2014.
    • (2014) IEEE Trans. Cybern , vol.44 , Issue.12 , pp. 2820-2833
    • Wei, Q.1    Wang, F.2    Liu, D.3    Yang, X.4
  • 35
    • 84878421441
    • Optimal control for discrete-time affine non-linear systems using general value iteration
    • Dec
    • H. Li and D. Liu, "Optimal control for discrete-time affine non-linear systems using general value iteration," IET Control Theory Appl., vol. 6, no. 18, pp. 2725-2736, Dec. 2012.
    • (2012) IET Control Theory Appl , vol.6 , Issue.18 , pp. 2725-2736
    • Li, H.1    Liu, D.2
  • 36
    • 85005983375
    • Adaptive dynamic programming and adaptive optimal output regulation of linear systems
    • Dec
    • W. Gao and Z.-P. Jiang, "Adaptive dynamic programming and adaptive optimal output regulation of linear systems," IEEE Trans. Autom. Control, vol. 61, no. 12, pp. 4164-4169, Dec. 2016.
    • (2016) IEEE Trans. Autom. Control , vol.61 , Issue.12 , pp. 4164-4169
    • Gao, W.1    Jiang, Z.-P.2
  • 37
    • 84971473616
    • Deep learning for control: The state of the art and prospects
    • Y. Duan, Y. Lv, J. Zhang, X. Zhao, and F.-Y. Wang, "Deep learning for control: The state of the art and prospects," Acta Autom. Sinica, vol. 42, no. 5, pp. 643-654, 2016.
    • (2016) Acta Autom. Sinica , vol.42 , Issue.5 , pp. 643-654
    • Duan, Y.1    Lv, Y.2    Zhang, J.3    Zhao, X.4    Wang, F.-Y.5
  • 38
    • 0742300845
    • Building knowledge structure in neural nets using fuzzy logic
    • M. Jamshidi (Ed.), New York, NY: ASME (American Society of Mechanical Engineers) Press
    • F.-Y. Wang, "Building knowledge structure in neural nets using fuzzy logic," in Robotics and Manufacturing: Recent Trends in Research Education and Applications, M. Jamshidi (Ed.), New York, NY: ASME (American Society of Mechanical Engineers) Press, 1992.
    • (1992) Robotics and Manufacturing: Recent Trends in Research Education and Applications
    • Wang, F.-Y.1
  • 39
    • 84974753844
    • Implementing adaptive fuzzy logic controllers with neural networks: A design paradigm
    • F.-Y. Wang and H.-A. Kim, "Implementing adaptive fuzzy logic controllers with neural networks: A design paradigm," J. Intell. Fuzzy Syst., vol. 3, no. 2, pp. 165-180, 1995.
    • (1995) J. Intell. Fuzzy Syst , vol.3 , Issue.2 , pp. 165-180
    • Wang, F.-Y.1    Kim, H.-A.2
  • 40
    • 77956341661
    • The emergence of intelligent enterprises: From CPS to CPSS
    • F.-Y. Wang, "The emergence of intelligent enterprises: From CPS to CPSS," IEEE Intell. Syst., vol. 25, no. 4, pp. 85-88, 2010.
    • (2010) IEEE Intell. Syst , vol.25 , Issue.4 , pp. 85-88
    • Wang, F.-Y.1
  • 42
    • 85010072911
    • Extending the value of your data warehousing investment
    • W. Eckerson, "Extending the value of your data warehousing investment," The Data Warehouse Institute, USA, 2007.
    • (2007) The Data Warehouse Institute, USA
    • Eckerson, W.1
  • 43
    • 84900352446
    • Business analytics: The next frontier for decision sciences
    • Mar
    • J. R. Evans and C. H. Lindner, "Business analytics: The next frontier for decision sciences," Decision Line, vol. 43, no. 2, pp. 1-4, Mar. 2012.
    • (2012) Decision Line , vol.43 , Issue.2 , pp. 1-4
    • Evans, J.R.1    Lindner, C.H.2
  • 44
    • 85010028505
    • Parallel dynamic programming with an average-greedy mechanism for discrete systems
    • ASIA, Beijing, China
    • J. Zhang, Q. Wei, and F.-Y. Wang, "Parallel dynamic programming with an average-greedy mechanism for discrete systems," SKLMCCS/QAII Tech Report 01-09-2016, ASIA, Beijing, China.
    • SKLMCCS/QAII Tech Report 01-09-2016
    • Zhang, J.1    Wei, Q.2    Wang, F.-Y.3
  • 45
    • 84877350550
    • Parallel control: A method for data-driven and computational control
    • F.-Y. Wang, "Parallel control: A method for data-driven and computational control," Acta Autom. Sinica, vol. 39, no. 2, pp. 293-302, 2013.
    • (2013) Acta Autom. Sinica , vol.39 , Issue.2 , pp. 293-302
    • Wang, F.-Y.1
  • 46
    • 84978955280
    • Control 5.0: From Newton to Merton in Popper's cyber-social-physical spaces
    • F.-Y. Wang, "Control 5.0: From Newton to Merton in Popper's cyber-social-physical spaces," IEEE/CAA J. Autom. Sinica, vol. 3, no. 3, pp. 233-234, 2016.
    • (2016) IEEE/CAA J. Autom. Sinica , vol.3 , Issue.3 , pp. 233-234
    • Wang, F.-Y.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.