-
1
-
-
0001509947
-
Using randomization to break the curse of dimensionality
-
J. Rust. Using randomization to break the curse of dimensionality. Econometrica, 65(3):487-516, 1997. (Pubitemid 127466157)
-
(1997)
Econometrica
, vol.65
, Issue.3
, pp. 487-516
-
-
Rust, J.1
-
2
-
-
0031385618
-
Incremental methods for computing bounds in partially observable Markov decision processes
-
Providence, Rhode Island. AAAI Press / MIT Press
-
M. Hauskrecht. Incremental methods for computing bounds in partially observable Markov decision processes. In Proceedings of the 14th National Conference on Artificial Intelligence (AAAI-97), pages 734-739, Providence, Rhode Island, 1997. AAAI Press / MIT Press.
-
(1997)
Proceedings of the 14th National Conference on Artificial Intelligence (AAAI-97)
, pp. 734-739
-
-
Hauskrecht, M.1
-
3
-
-
0036374229
-
Speeding up the convergence of value iteration in partially observable Markov decision processes
-
N.L. Zhang and W. Zhang. Speeding up the convergence of value iteration in partially observable Markov decision processes. JAIR, 14:29-51, 2001. (Pubitemid 33738058)
-
(2001)
Journal of Artificial Intelligence Research
, vol.14
, pp. 29-51
-
-
Zhang, N.L.1
Zhang, W.2
-
7
-
-
84898978676
-
Monte carlo POMDPs
-
S.A. Solla, T.K. Leen, and K.-R.Müller, editors. MIT Press
-
S. Thrun. Monte Carlo POMDPs. In S.A. Solla, T.K. Leen, and K.-R.Müller, editors, Advances in Neural Information Processing 12, pages 1064-1070. MIT Press, 2000.
-
(2000)
Advances in Neural Information Processing
, vol.12
, pp. 1064-1070
-
-
Thrun, S.1
-
8
-
-
0039816976
-
Using local trajectory optimizers to speed up global optimization in dynamic programming
-
Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors. Morgan Kaufmann Publishers, Inc.
-
C. G. Atkeson. Using local trajectory optimizers to speed up global optimization in dynamic programming. In Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors, Advances in Neural Information Processing Systems, volume 6, pages 663-670. Morgan Kaufmann Publishers, Inc., 1994.
-
(1994)
Advances in Neural Information Processing Systems
, vol.6
, pp. 663-670
-
-
Atkeson, C.G.1
-
10
-
-
0034759906
-
Efficient approximate planning in continuous space Markovian decision problems
-
C. Szepesvári. Efficient approximate planning in continuous space Markovian decision problems. AI Communications, 13(3):163-176, 2001. (Pubitemid 33018021)
-
(2001)
AI Communications
, vol.14
, Issue.3
, pp. 163-176
-
-
Szepesvari, C.1
-
11
-
-
0035391083
-
Regression methods for pricing complex American-style options
-
DOI 10.1109/72.935083, PII S1045922701050238
-
J. N. Tsitsiklis and Van B. Roy. Regression methods for pricing complex American-style options. IEEE-NN, 12:694-703, July 2001. (Pubitemid 32732812)
-
(2001)
IEEE Transactions on Neural Networks
, vol.12
, Issue.4
, pp. 694-703
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
13
-
-
0036832953
-
Variable resolution discretization in optimal control
-
DOI 10.1023/A:1017992615625
-
R. Munos and A. W. Moore. Variable resolution discretization in optimal control. Machine Learning Journal, 49:291-323, 2002. (Pubitemid 34325691)
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 291-323
-
-
Munos, R.1
Moore, A.2
-
16
-
-
84898983672
-
Nonparametric representation of a policies and value functions: A trajectory based approach
-
MIT Press
-
C. G. Atkeson and J. Morimoto. Nonparametric representation of a policies and value functions: A trajectory based approach. In Advances in Neural Information Processing Systems 15. MIT Press, 2003.
-
(2003)
Advances in Neural Information Processing Systems
, vol.15
-
-
Atkeson, C.G.1
Morimoto, J.2
-
19
-
-
67649736510
-
Multiple balance strategies from one optimization criterion
-
C. G. Atkeson and B. Stephens. Multiple balance strategies from one optimization criterion. In Humanoids, 2007.
-
(2007)
Humanoids
-
-
Atkeson, C.G.1
Stephens, B.2
|