SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Proceedings of the 2007 IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning, ADPRL 2007

Volumn , Issue , 2007, Pages 268-271

Toward effective combination of off-line and on-line training in ADP framework

(1) Prokhorov, Danil a

a TOYOTA TECHNICAL CENTER (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL METHODS; KALMAN FILTERS; REAL TIME SYSTEMS; RECURRENT NEURAL NETWORKS; REINFORCEMENT LEARNING; ROBUSTNESS (CONTROL SYSTEMS);

MULTISTREAM KALMAN FILTER METHOD; OFF-LINE METHODS; RESEMBLING REINFORCEMENT;

DYNAMIC PROGRAMMING;

EID: 34548777734 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2007.368198 Document Type: Conference Paper

Times cited : (12)

References (16)

1
- 0344592212
- Enhanced multi-stream Kalman filter training for recurrent networks
- J. Suykens and J. Vandewalle eds, Kluwer Academic Publishers
- L. A. Feldkamp, D. V. Prokhorov, C. F. Eagen, and F. Yuan, "Enhanced multi-stream Kalman filter training for recurrent networks," in J. Suykens and J. Vandewalle (eds), Nonlinear Modeling: Advanced Black-Box Techniques, Kluwer Academic Publishers, 1998., pp. 29-53.
- (1998) Nonlinear Modeling: Advanced Black-Box Techniques , pp. 29-53
- Feldkamp, L.A.¹ Prokhorov, D.V.² Eagen, C.F.³ Yuan, F.⁴

2
- 0005906870
- D. V. Prokhorov, G. V. Puskorius, and L. A. Feldkamp, Dynamical neural networks for control, see in, J. Kolen and S. Kremer Eds, IEEE Press
- D. V. Prokhorov, G. V. Puskorius, and L. A. Feldkamp, "Dynamical neural networks for control," see in A Field Guide to Dynamical Recurrent Networks, J. Kolen and S. Kremer (Eds.), IEEE Press, 2001, pp. 257-289.
- (2001) A Field Guide to Dynamical Recurrent Networks , pp. 257-289

3
- 34547129948
- Training Recurrent Neurocontrollers for Robustness with Derivative-Free Kalman Filter
- November
- D. Prokhorov, "Training Recurrent Neurocontrollers for Robustness with Derivative-Free Kalman Filter," IEEE Trans. Neural Networks, November 2006, pp. 1606-1616.
- (2006) IEEE Trans. Neural Networks , pp. 1606-1616
- Prokhorov, D.¹

4
- 34547133026
- Training Recurrent Neurocontrollers for Real-Time Applications
- to appear
- D. Prokhorov, "Training Recurrent Neurocontrollers for Real-Time Applications," IEEE Trans. Neural Networks, to appear.
- IEEE Trans. Neural Networks
- Prokhorov, D.¹

5
- 0023996866
- Hierarchical neural network model for voluntary movement with application to robotics
- April
- M. Kawato, Y Uno, M. Isobe, and R. Suzuki, "Hierarchical neural network model for voluntary movement with application to robotics," IEEE Control Systems Magazine, vol. 8, no. 2, April 1988, pp. 8-15.
- (1988) IEEE Control Systems Magazine , vol.8 , Issue.2 , pp. 8-15
- Kawato, M.¹ Uno, Y.² Isobe, M.³ Suzuki, R.⁴

6
- 0003455467
- Kluwer Academic
- J. Suykens, J. Vandewalle, and B. De Moor, Artificial Neural Networks for Modeling and Control of Non-Linear Systems, Kluwer Academic, 1996.
- (1996) Artificial Neural Networks for Modeling and Control of Non-Linear Systems
- Suykens, J.¹ Vandewalle, J.² De Moor, B.³

7
- 34548733572
- Publications of Paul J. Werbos at www.werbos.com.
- Publications of Paul J. Werbos at www.werbos.com.

8
- 0012331016
- Memory Approaches To Reinforcement Learning In Non-Markovian Domains
- Technical Report CMU-CS-92-138, School of Computer Science, Carnegie Mellon University, May
- L.-J. Lin and T. Mitchell, Memory Approaches To Reinforcement Learning In Non-Markovian Domains, Technical Report CMU-CS-92-138, School of Computer Science, Carnegie Mellon University, May 1992.
- (1992)
- Lin, L.-J.¹ Mitchell, T.²

9
- 0033750123
- Neurocontroller Alternatives for Fuzzy Ball-and-Beam Systems with Nonlinear, Nonuniform Friction
- March
- P. Eaton, D. Prokhorov, and D. Wunsch, "Neurocontroller Alternatives for Fuzzy Ball-and-Beam Systems with Nonlinear, Nonuniform Friction," IEEE Trans. on Neural Networks, March 2000, pp. 423-435.
- (2000) IEEE Trans. on Neural Networks , pp. 423-435
- Eaton, P.¹ Prokhorov, D.² Wunsch, D.³

10
- 84954240138
- Modeling Reward Functions for Incomplete State Representations via Echo State Networks
- Montreal, Canada, August 1-4
- K. Bush and C. Anderson, "Modeling Reward Functions for Incomplete State Representations via Echo State Networks," Proceedings of the International Joint Conference on Neural Networks, Montreal, Canada, August 1-4, 2005.
- (2005) Proceedings of the International Joint Conference on Neural Networks
- Bush, K.¹ Anderson, C.²

11
- 34548771976
- Reinforcement learning by backpropagation through an LSTM model/critic
- Honolulu, Hawaii, April 1-5
- B. Bakker, "Reinforcement learning by backpropagation through an LSTM model/critic," Proceedings of the 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, Honolulu, Hawaii, April 1-5, 2007.
- (2007) Proceedings of the 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
- Bakker, B.¹

12
- 0031332005
- Observations on the Practical Use of Derivative Adaptive Critics
- Orlando, FL, October
- L. Feldkamp and D. Prokhorov, "Observations on the Practical Use of Derivative Adaptive Critics," Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (SMC), Orlando, FL, October 1997, pp. 3061-3066.
- (1997) Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (SMC) , pp. 3061-3066
- Feldkamp, L.¹ Prokhorov, D.²

13
- 34548772284
- Backpropagation through time and derivative adaptive critics: A common framework for comparison
- Chapter 15, J. Si et al, eds, IEEE Press
- D. Prokhorov, "Backpropagation through time and derivative adaptive critics: a common framework for comparison," Chapter 15 in Handbook of Learning and Approximate Dynamic Programming, J. Si et al. (eds), IEEE Press, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Prokhorov, D.¹

14
- 1842421269
- Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless telecommunications
- April 2
- H. Jaeger and H. Haas, "Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless telecommunications," Science, April 2, 2004, pp. 78-80.
- (2004) Science , pp. 78-80
- Jaeger, H.¹ Haas, H.²

15
- 0004834437
- CRC Press
- E. Micheli-Tzanakou, Supervised and Unsupervised Pattern Recognition: Feature Extraction in Computational Intelligence, CRC Press, 2000.
- (2000) Supervised and Unsupervised Pattern Recognition: Feature Extraction in Computational Intelligence
- Micheli-Tzanakou, E.¹

16
- 0032164973
- Model-free control of nonlinear stochastic systems with discrete-time measurements
- September
- J. C. Spall and J. A. Cristion, "Model-free control of nonlinear stochastic systems with discrete-time measurements," IEEE Trans. Automatic Control, vol. 43, no. 9, September 1998, pp. 1198-1210.
- (1998) IEEE Trans. Automatic Control , vol.43 , Issue.9 , pp. 1198-1210
- Spall, J.C.¹ Cristion, J.A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.