SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems 20 - Proceedings of the 2007 Conference

Volumn , Issue , 2008, Pages

Random sampling of states in dynamic programming

(2) Atkeson, Christopher G a Stephens, Benjamin a

a CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE DYNAMIC PROGRAMMING; DETERMINISTICS; LOCAL MODEL; OPTIMIZERS; RANDOM SAMPLING; STATE POLICY; STATE-VALUE FUNCTIONS; STEADY STATE; TIME INVARIANTS; VALUE FUNCTIONS;

DYNAMIC PROGRAMMING;

EID: 85162005214 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (7)

References (19)

1
- 0001509947
- Using randomization to break the curse of dimensionality
- J. Rust. Using randomization to break the curse of dimensionality. Econometrica, 65(3):487-516, 1997. (Pubitemid 127466157)
- (1997) Econometrica , vol.65 , Issue.3 , pp. 487-516
- Rust, J.¹

2
- 0031385618
- Incremental methods for computing bounds in partially observable Markov decision processes
- Providence, Rhode Island. AAAI Press / MIT Press
- M. Hauskrecht. Incremental methods for computing bounds in partially observable Markov decision processes. In Proceedings of the 14th National Conference on Artificial Intelligence (AAAI-97), pages 734-739, Providence, Rhode Island, 1997. AAAI Press / MIT Press.
- (1997) Proceedings of the 14th National Conference on Artificial Intelligence (AAAI-97) , pp. 734-739
- Hauskrecht, M.¹

3
- 0036374229
- Speeding up the convergence of value iteration in partially observable Markov decision processes
- N.L. Zhang and W. Zhang. Speeding up the convergence of value iteration in partially observable Markov decision processes. JAIR, 14:29-51, 2001. (Pubitemid 33738058)
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
- Zhang, N.L.¹ Zhang, W.²

4
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In International Joint Conference on Artificial Intelligence (IJCAI), 2003.
- (2003) International Joint Conference on Artificial Intelligence (IJCAI)
- Pineau, J.¹ Gordon, G.² Thrun, S.³

5
- 31144465830
- Heuristic search value iteration for POMDPs
- T. Smith and R. Simmons. Heuristic search value iteration for POMDPs. In Uncertainty in Artificial Intelligence, 2004.
- (2004) Uncertainty in Artificial Intelligence
- Smith, T.¹ Simmons, R.²

6
- 3042527666
- A point-based POMDP algorithm for robot planning
- New Orleans, Louisiana, April
- M.T.J. Spaan and Nikos V. A point-based POMDP algorithm for robot planning. In Proceedings of the IEEE International Conference on Robotics and Automation, pages 2399-2404, New Orleans, Louisiana, April 2004.
- (2004) Proceedings of the IEEE International Conference on Robotics and Automation , pp. 2399-2404
- Spaan, M.T.J.¹ Nikos, V.²

7
- 84898978676
- Monte carlo POMDPs
- S.A. Solla, T.K. Leen, and K.-R.Müller, editors. MIT Press
- S. Thrun. Monte Carlo POMDPs. In S.A. Solla, T.K. Leen, and K.-R.Müller, editors, Advances in Neural Information Processing 12, pages 1064-1070. MIT Press, 2000.
- (2000) Advances in Neural Information Processing , vol.12 , pp. 1064-1070
- Thrun, S.¹

8
- 0039816976
- Using local trajectory optimizers to speed up global optimization in dynamic programming
- Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors. Morgan Kaufmann Publishers, Inc.
- C. G. Atkeson. Using local trajectory optimizers to speed up global optimization in dynamic programming. In Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors, Advances in Neural Information Processing Systems, volume 6, pages 663-670. Morgan Kaufmann Publishers, Inc., 1994.
- (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 663-670
- Atkeson, C.G.¹

9
- 0004163205
- Wiley-Interscience
- F. L. Lewis and V. L. Syrmos. Optimal Control, 2nd Edition. Wiley-Interscience, 1995.
- (1995) Optimal Control, 2nd Edition
- Lewis, F.L.¹ Syrmos, V.L.²

10
- 0034759906
- Efficient approximate planning in continuous space Markovian decision problems
- C. Szepesvári. Efficient approximate planning in continuous space Markovian decision problems. AI Communications, 13(3):163-176, 2001. (Pubitemid 33018021)
- (2001) AI Communications , vol.14 , Issue.3 , pp. 163-176
- Szepesvari, C.¹

11
- 0035391083
- Regression methods for pricing complex American-style options
- DOI 10.1109/72.935083, PII S1045922701050238
- J. N. Tsitsiklis and Van B. Roy. Regression methods for pricing complex American-style options. IEEE-NN, 12:694-703, July 2001. (Pubitemid 32732812)
- (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.4 , pp. 694-703
- Tsitsiklis, J.N.¹ Van Roy, B.²

12
- 84862393559
- V. D. Blondel and J. N. Tsitsiklis. A survey of computational complexity results in systems and control, 2000.
- (2000) A Survey of Computational Complexity Results in Systems and Control
- Blondel, V.D.¹ Tsitsiklis, J.N.²

13
- 0036832953
- Variable resolution discretization in optimal control
- DOI 10.1023/A:1017992615625
- R. Munos and A. W. Moore. Variable resolution discretization in optimal control. Machine Learning Journal, 49:291-323, 2002. (Pubitemid 34325691)
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 291-323
- Munos, R.¹ Moore, A.²

14
- 77952010176
- Cambridge University Press
- S. M. LaValle. Planning Algorithms. Cambridge University Press, 2006.
- (2006) Planning Algorithms
- Lavalle, S.M.¹

15
- 34548784023
- Randomly sampling actions in dynamic programming
- C. G. Atkeson. Randomly sampling actions in dynamic programming. In 2007 IEEE In- ternational Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL), 2007.
- (2007) 2007 IEEE In- Ternational Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL)
- Atkeson, C.G.¹

16
- 84898983672
- Nonparametric representation of a policies and value functions: A trajectory based approach
- MIT Press
- C. G. Atkeson and J. Morimoto. Nonparametric representation of a policies and value functions: A trajectory based approach. In Advances in Neural Information Processing Systems 15. MIT Press, 2003.
- (2003) Advances in Neural Information Processing Systems , vol.15
- Atkeson, C.G.¹ Morimoto, J.²

17
- 0004276055
- Academic Press, New York, NY
- P. Dyer and S. R. McReynolds. The Computation and Theory of Optimal Control. Academic Press, New York, NY, 1970.
- (1970) The Computation and Theory of Optimal Control
- Dyer, P.¹ McReynolds, S.R.²

18
- 0004291983
- Elsevier, New York, NY
- D. H. Jacobson and D. Q. Mayne. Differential Dynamic Programming. Elsevier, New York, NY, 1970.
- (1970) Differential Dynamic Programming
- Jacobson, D.H.¹ Mayne, D.Q.²

19
- 67649736510
- Multiple balance strategies from one optimization criterion
- C. G. Atkeson and B. Stephens. Multiple balance strategies from one optimization criterion. In Humanoids, 2007.
- (2007) Humanoids
- Atkeson, C.G.¹ Stephens, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.