



Volume , Issue , 2008, Pages

Random sampling of states in dynamic programming

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE DYNAMIC PROGRAMMING; DETERMINISTICS; LOCAL MODEL; OPTIMIZERS; RANDOM SAMPLING; STATE POLICY; STATE-VALUE FUNCTIONS; STEADY STATE; TIME INVARIANTS; VALUE FUNCTIONS;

EID: 85162005214     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (7)

References (19)
  • 1
    • 0001509947
    • Using randomization to break the curse of dimensionality
    • J. Rust. Using randomization to break the curse of dimensionality. Econometrica, 65(3):487-516, 1997. (Pubitemid 127466157)
    • (1997) Econometrica , vol.65 , Issue.3 , pp. 487-516
    • Rust, J.1
  • 2
    • 0031385618
    • Incremental methods for computing bounds in partially observable Markov decision processes
    • Providence, Rhode Island. AAAI Press / MIT Press
    • M. Hauskrecht. Incremental methods for computing bounds in partially observable Markov decision processes. In Proceedings of the 14th National Conference on Artificial Intelligence (AAAI-97), pages 734-739, Providence, Rhode Island, 1997. AAAI Press / MIT Press.
    • (1997) Proceedings of the 14th National Conference on Artificial Intelligence (AAAI-97) , pp. 734-739
    • Hauskrecht, M.1
  • 3
    • 0036374229
    • Speeding up the convergence of value iteration in partially observable Markov decision processes
    • N.L. Zhang and W. Zhang. Speeding up the convergence of value iteration in partially observable Markov decision processes. JAIR, 14:29-51, 2001. (Pubitemid 33738058)
    • (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
    • Zhang, N.L.1    Zhang, W.2
  • 7
    • 84898978676
    • Monte Carlo POMDPs
    • S.A. Solla, T.K. Leen, and K.-R. Müller, editors. MIT Press
    • S. Thrun. Monte Carlo POMDPs. In S.A. Solla, T.K. Leen, and K.-R. Müller, editors, Advances in Neural Information Processing Systems 12, pages 1064-1070. MIT Press, 2000.
    • (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1064-1070
    • Thrun, S.1
  • 8
    • 0039816976
    • Using local trajectory optimizers to speed up global optimization in dynamic programming
    • Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors. Morgan Kaufmann Publishers, Inc.
    • C. G. Atkeson. Using local trajectory optimizers to speed up global optimization in dynamic programming. In Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors, Advances in Neural Information Processing Systems, volume 6, pages 663-670. Morgan Kaufmann Publishers, Inc., 1994.
    • (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 663-670
    • Atkeson, C.G.1
  • 10
    • 0034759906
    • Efficient approximate planning in continuous space Markovian decision problems
    • C. Szepesvári. Efficient approximate planning in continuous space Markovian decision problems. AI Communications, 13(3):163-176, 2001. (Pubitemid 33018021)
    • (2001) AI Communications , vol.14 , Issue.3 , pp. 163-176
    • Szepesvari, C.1
  • 11
    • 0035391083
    • Regression methods for pricing complex American-style options
    • DOI 10.1109/72.935083, PII S1045922701050238
    • J. N. Tsitsiklis and B. Van Roy. Regression methods for pricing complex American-style options. IEEE-NN, 12:694-703, July 2001. (Pubitemid 32732812)
    • (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.4 , pp. 694-703
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 13
    • 0036832953
    • Variable resolution discretization in optimal control
    • DOI 10.1023/A:1017992615625
    • R. Munos and A. W. Moore. Variable resolution discretization in optimal control. Machine Learning Journal, 49:291-323, 2002. (Pubitemid 34325691)
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 291-323
    • Munos, R.1    Moore, A.2
  • 16
    • 84898983672
    • Nonparametric representation of policies and value functions: A trajectory based approach
    • MIT Press
    • C. G. Atkeson and J. Morimoto. Nonparametric representation of policies and value functions: A trajectory based approach. In Advances in Neural Information Processing Systems 15. MIT Press, 2003.
    • (2003) Advances in Neural Information Processing Systems , vol.15
    • Atkeson, C.G.1    Morimoto, J.2
  • 19
    • 67649736510
    • Multiple balance strategies from one optimization criterion
    • C. G. Atkeson and B. Stephens. Multiple balance strategies from one optimization criterion. In Humanoids, 2007.
    • (2007) Humanoids
    • Atkeson, C.G.1    Stephens, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.