-
1
-
-
85151728371
-
Residual algorithms: Reinforcement learning with function approximation
-
Prieditis and Russell, Eds. San Francisco, CA: Morgan Kaufman
-
L. C. Baird, "Residual algorithms: Reinforcement learning with function approximation," in Machine Learning: Proc. Twelfth Int. Conf., Prieditis and Russell, Eds. San Francisco, CA: Morgan Kaufman, 1995.
-
(1995)
Machine Learning: Proc. Twelfth Int. Conf.
-
-
Baird, L.C.1
-
7
-
-
85015692260
-
The pricing of options and corporate liabilities
-
F. Black and M. Scholes, "The pricing of options and corporate liabilities," J. Political Economy, vol. 81, pp. 637-654, 1973.
-
(1973)
J. Political Economy
, vol.81
, pp. 637-654
-
-
Black, F.1
Scholes, M.2
-
8
-
-
0031590025
-
Pricing american-style securities using simulation
-
M. Broadie and P. Glasserman, "Pricing american-style securities using simulation," J. Economic Dynam. Contr., vol. 21, pp. 1323-1352, 1997.
-
(1997)
J. Economic Dynam. Contr.
, vol.21
, pp. 1323-1352
-
-
Broadie, M.1
Glasserman, P.2
-
10
-
-
38649141305
-
Martingales and arbitrage in multiperiod securities markets
-
J. M. Harrison and D. Kreps, "Martingales and arbitrage in multiperiod securities markets," J. Economic Theory, vol. 20, pp. 381-408, 1979.
-
(1979)
J. Economic Theory
, vol.20
, pp. 381-408
-
-
Harrison, J.M.1
Kreps, D.2
-
11
-
-
41649091143
-
Martingales and stochastic integrals in the theory of continuous trading
-
J. M. Harrison and S. Pliska, "Martingales and stochastic integrals in the theory of continuous trading," Stochastic Processes and Their Appl., vol. 11, pp. 215-260, 1981.
-
(1981)
Stochastic Processes and Their Appl.
, vol.11
, pp. 215-260
-
-
Harrison, J.M.1
Pliska, S.2
-
14
-
-
33646480726
-
Exotics enter the mainstream
-
Mar.
-
M. Parsley, "Exotics enter the mainstream," Euromoney, pp. 127-130, Mar. 1997.
-
(1997)
Euromoney
, pp. 127-130
-
-
Parsley, M.1
-
15
-
-
11344285070
-
Using randomization to break the curse of dimensionality
-
J. Rust, "Using randomization to break the curse of dimensionality," Econometrica, 1996.
-
(1996)
Econometrica
-
-
Rust, J.1
-
17
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol. 3, pp. 9-44, 1988.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
18
-
-
33746944751
-
On the virtues of linear learning and trajectory distributions
-
Boyan, Moore, and Sutton, Eds. Technical Rep. CMU-CS-95-206, Carnegie Mellon Univ., Pittsburgh, PA 15213
-
_, "On the virtues of linear learning and trajectory distributions," in Proc. Workshop on Value Function Approximation, Machine Learning Conf. 1995, Boyan, Moore, and Sutton, Eds. Technical Rep. CMU-CS-95-206, Carnegie Mellon Univ., Pittsburgh, PA 15213, 1995, p. 85.
-
(1995)
Proc. Workshop on Value Function Approximation, Machine Learning Conf. 1995
, pp. 85
-
-
-
19
-
-
0031143730
-
An analysis of temporal-difference learning with function approximation
-
May
-
J. N. Tsitsiklis and B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Trans. Automat. Contr., May 1997.
-
(1997)
IEEE Trans. Automat. Contr.
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
20
-
-
84898980025
-
Approximate solutions to optimal stopping problems
-
M. C. Mozer, M. I. Jordan, and T. Petsche, Eds. Cambridge, MA: MIT Press
-
_, "Approximate solutions to optimal stopping problems," in Advances in Neural Information Processing Systems 9, M. C. Mozer, M. I. Jordan, and T. Petsche, Eds. Cambridge, MA: MIT Press, 1997.
-
(1997)
Advances in Neural Information Processing Systems 9
-
-
-
22
-
-
0004049893
-
-
Doctoral dissertation, Univ. Cambridge, Cambridge, U.K.
-
C. J. C. H. Watkins, "Learning from delayed rewards," Doctoral dissertation, Univ. Cambridge, Cambridge, U.K., 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1
|