SCOPUS 정보 검색 플랫폼

Computational Economics

Volumn 27, Issue 4, 2006, Pages 433-452

Approximate policy optimization and adaptive control in regression models

(3) Han, Jiarui a Lai, Tze Leung b Spivakovsky, Viktor c

a Federal Service for Surveillance on Consumer Rights Protection and Human Well Being (United States)

b STANFORD UNIVERSITY (United States)

c Citadel Investment Group (United States)

Author keywords

Dynamic programming; Learning by doing; Monte Carlo; Policy iteration; Rollout

Indexed keywords

EID: 33745450286 PISSN: 09277099 EISSN: 15729974 Source Type: Journal
DOI: 10.1007/s10614-005-9007-1 Document Type: Article

Times cited : (6)

References (27)

1
- 0041656119
- Optimal learning byexperimentation
- Aghion, P., Bolton, P. and Harris, C. (1991). Optimal learning byexperimentation. Review of Economic Studies 58, 621-654.
- (1991) Review of Economic Studies , vol.58 , pp. 621-654
- Aghion, P.¹ Bolton, P.² Harris, C.³

2
- 0006231196
- Some experimental resultson the statistical properties of least squares estimates in controlproblems
- Anderson, T.W. and Taylor, J. (1976). Some experimental resultson the statistical properties of least squares estimates in controlproblems. Econometrica 44, 1289-1302.
- (1976) Econometrica , vol.44 , pp. 1289-1302
- Anderson, T.W.¹ Taylor, J.²

3
- 0020815940
- Theory and applications of adaptive control - A survey
- Åstrom, K.J. (1983). Theory and applications of adaptive control - a survey. Automatica 19, 471-486.
- (1983) Automatica , vol.19 , pp. 471-486
- Åstrom, K.J.¹

4
- 85012688561
- Princeton, NJ:Princeton University Press
- Bellman, R. (1957). Dynamic Programming. Princeton, NJ:PRinceton University Press.
- (1957) Dynamic Programming
- Bellman, R.¹

5
- 0003565783
- 2nd edition. Belmont, MA: Athena Scientific
- Bertsekas, D.P. (2000). Dynamic Programming and Optimal Control, 2nd edition. Belmont, MA: Athena Scientific.
- (2000) Dynamic Programming and Optimal Control
- Bertsekas, D.P.¹

6
- 0003487482
- Belmont, MA: Athena Scientific
- Bertsekas, D.P. and Tsitsiklis, J.N. (1996). Neuro-Dynamic Programming. Belmont, MA: Athena Scientific.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

7
- 0031272681
- Rollout algorithms for combinatorial optimization
- Bertsekas, D.P. Tsitsiklis, J.N. and Wu, C. (1997). Rollout algorithms for combinatorial optimization. Journal ofHeuristics 3, 245-262.
- (1997) Journal OfHeuristics , vol.3 , pp. 245-262
- Bertsekas, D.P.¹ Tsitsiklis, J.N.² Wu, C.³

8
- 0002930340
- Rational expectationsequilibrium: An alternative approach
- Blume, L. and Easley, D. (1984). Rational expectationsequilibrium: An alternative approach. Journal of EconomicTheory 34, 116-129.
- (1984) Journal of EconomicTheory , vol.34 , pp. 116-129
- Blume, L.¹ Easley, D.²

9
- 0003532687
- New York: Wiley
- Box, G.E.P. and Tiao, G.C. (1973). Bayesian Inference in Statistical Analysis. New York: Wiley.
- (1973) Bayesian Inference in Statistical Analysis
- Box, G.E.P.¹ Tiao, G.C.²

10
- 0347697424
- Optimal learning withendogenous data
- Easley, D. and Kiefer, N.M. (1989). Optimal learning withendogenous data. International Economic Review 30, 963-978.
- (1989) International Economic Review , vol.30 , pp. 963-978
- Easley, D.¹ Kiefer, N.M.²

11
- 0004008325
- New York: McGraw-Hill
- Kendrick, D. (1981). Stochastic Control for Economic Models. New York: McGraw-Hill.
- (1981) Stochastic Control for Economic Models
- Kendrick, D.¹

12
- 37849017798
- A value function arising in the economics ofinformation
- Kiefer, N.M. (1989). A value function arising in the economics ofinformation. Journal of Economic Dynamics and Control 13, 201-223.
- (1989) Journal of Economic Dynamics and Control , vol.13 , pp. 201-223
- Kiefer, N.M.¹

13
- 0000878355
- Adaptive design and stochastic approximation
- Lai, T.L. and Robbins, H. (1979). Adaptive design and stochastic approximation. Annals of Statistics 7, 1196-1221.
- (1979) Annals of Statistics , vol.7 , pp. 1196-1221
- Lai, T.L.¹ Robbins, H.²

14
- 0011531259
- Iterated least squares inmultiperiod control
- Lai, T.L. and Robbins, H. (1982). Iterated least squares inmultiperiod control. Advances in Applied Mathematics 3, 50-73.
- (1982) Advances in Applied Mathematics , vol.3 , pp. 50-73
- Lai, T.L.¹ Robbins, H.²

15
- 1842587086
- Valuation of American options via basis functions
- Lai, T.L. and Wong, S.P. (2004). Valuation of American options via basis functions. IEEE Transactions onAutomatic Control 49 374-385.
- (2004) IEEE Transactions OnAutomatic Control , vol.49 , pp. 374-385
- Lai, T.L.¹ Wong, S.P.²

16
- 0035578679
- Valuing American options by simulation: A simple least-squaresapproach
- Longstaff, F.A. and Schwartz, E.S. (2001). Valuing American options by simulation: A simple least-squaresapproach. Review of Financial Studies 14, 113-147.
- (2001) Review of Financial Studies , vol.14 , pp. 113-147
- Longstaff, F.A.¹ Schwartz, E.S.²

17
- 0036832961
- Building a basic block instruction scheduler using reinforcement learning and rollouts
- MacGovern, A., Moss, E. and Barto, A. (2002). Building a basic block instruction scheduler using reinforcement learning and rollouts. Machine Learning 49, 141-160.
- (2002) Machine Learning , vol.49 , pp. 141-160
- MacGovern, A.¹ Moss, E.² Barto, A.³

18
- 0000464775
- The multi-period control problem under uncertainty
- Prescott, E. (1972). The multi-period control problem under uncertainty. Econometrica 40, 1043-1058.
- (1972) Econometrica , vol.40 , pp. 1043-1058
- Prescott, E.¹

19
- 0001509947
- Using randomization to break the curse of dimensionality
- Rust, J. (1997). Using randomization to break the curse of dimensionality. Econometrica 65, 487-516.
- (1997) Econometrica , vol.65 , pp. 487-516
- Rust, J.¹

20
- 0035463693
- A rollout policy for the vehicle routing problem with stochastic demands
- Secomandi, N. (2001). A rollout policy for the vehicle routing problem with stochastic demands. Operations Research, 49, 796-802.
- (2001) Operations Research , vol.49 , pp. 796-802
- Secomandi, N.¹

21
- 0142157191
- Analysis of a rollout approach to sequencing problems with stochastic routing applications
- Secomandi, N. (2003). Analysis of a rollout approach to sequencing problems with stochastic routing applications. Journal of Heuristics 9, 321-352.
- (2003) Journal of Heuristics , vol.9 , pp. 321-352
- Secomandi, N.¹

22
- 0003538072
- Cambridge, MA: HarvardUniversity Press
- Stokey, N. and Lucas, R.E. (1989). Recursive Methods in Economic Dynamics. Cambridge, MA: HarvardUniversity Press.
- (1989) Recursive Methods in Economic Dynamics
- Stokey, N.¹ Lucas, R.E.²

23
- 84898992015
- On-line policy improvement using Monte-Carlo search
- Cambridge, MA: MIT Press
- Tesauro, G. and Galperin, G. (1996). On-line policy improvement using Monte-Carlo search. In Advances inNeural Information Processing Systems 9, 1068-1074. Cambridge, MA: MIT Press.
- (1996) Advances in Neural Information Processing Systems , vol.9 , pp. 1068-1074
- Tesauro, G.¹ Galperin, G.²

24
- 0035391083
- Regression methods for pricing complex American-style options
- Tsitsiklis, J.N. and Van Roy, B. (2001). Regression methods for pricing complex American-style options. IEEE Transactions on Neural Networks 12, 694-703.
- (2001) IEEE Transactions on Neural Networks , vol.12 , pp. 694-703
- Tsitsiklis, J.N.¹ Van Roy, B.²

25
- 0042933390
- Optimal control with unknown parameters - A study of optimal learning strategies with anapplication to monetary policy
- Ph.D. Thesis, Stanford University
- Wieland, V. (1995). Optimal control with unknown parameters - a study of optimal learning strategies with anapplication to monetary policy. Ph.D. Thesis, Stanford University.
- (1995)
- Wieland, V.¹

26
- 0006250951
- Learning by doing and the value of optimal experimentation
- Wieland, V. (2000). Learning by doing and the value of optimal experimentation. Journal of Economic Dynamics and Control 24, 501-534.
- (2000) Journal of Economic Dynamics and Control , vol.24 , pp. 501-534
- Wieland, V.¹

27
- 33645415150
- Solitaire: Man versus machine
- in press. Cambridge, MA: MIT Press
- Yan, X., Diaconis, P., Rusmevichientong, P. and Van Roy, B. (2005). Solitaire: Man versus machine. In Advances in Neural Information Processing Systems 17, in press. Cambridge, MA: MIT Press
- (2005) Advances in Neural Information Processing Systems , vol.17
- Yan, X.¹ Diaconis, P.² Rusmevichientong, P.³ Van Roy, B.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.