Volume , Issue , 2010, Pages

Basis construction from power series expansions of value functions

Author keywords

[No Author keywords available]

Indexed keywords

INVERSE PROBLEMS; MATRIX ALGEBRA;

EID: 85161990353     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (20)

References (15)
  • 1
    • 0024680419
    • Adaptive aggregation methods for infinite horizon dynamic programming
    • D. Bertsekas and D. Castañon. Adaptive aggregation methods for infinite horizon dynamic programming. IEEE Transactions on Automatic Control, 34:589-598, 1989.
    • (1989) IEEE Transactions on Automatic Control , vol.34 , pp. 589-598
    • Bertsekas, D.1    Castañon, D.2
  • 4
    • 85162034137
    • An investigation of basis construction from power series expansions of value functions
    • Amherst
    • B. Liu and S. Mahadevan. An investigation of basis construction from power series expansions of value functions. Technical report, University of Massachusetts, Amherst, 2010.
    • (2010) Technical Report, University of Massachusetts
    • Liu, B.1    Mahadevan, S.2
  • 6
    • 70349322784
    • Learning representation and control in Markov Decision Processes: New frontiers
    • S. Mahadevan. Learning representation and control in Markov Decision Processes: New frontiers. Foundations and Trends in Machine Learning, 1(4):403-565, 2009.
    • (2009) Foundations and Trends in Machine Learning , vol.1 , Issue.4 , pp. 403-565
    • Mahadevan, S.1
  • 7
    • 35748957806
    • Proto-value functions: A Laplacian framework for learning representation and control in Markov Decision Processes
    • S. Mahadevan and M. Maggioni. Proto-value functions: A Laplacian framework for learning representation and control in Markov Decision Processes. Journal of Machine Learning Research, 8:2169-2231, 2007.
    • (2007) Journal of Machine Learning Research , vol.8 , pp. 2169-2231
    • Mahadevan, S.1    Maggioni, M.2
  • 13
    • 85152626183
    • A reinforcement learning method for maximizing undiscounted rewards
    • Morgan Kaufmann, San Francisco, CA
    • A. Schwartz. A reinforcement learning method for maximizing undiscounted rewards. In Proc. 10th International Conf. on Machine Learning. Morgan Kaufmann, San Francisco, CA, 1993.
    • (1993) Proc. 10th International Conf. on Machine Learning
    • Schwartz, A.1
  • 14
    • 0346922977
    • Numerical methods for computing stationary distributions of finite irreducible Markov chains
    • Kluwer Academic Publishers
    • William J. Stewart. Numerical methods for computing stationary distributions of finite irreducible Markov chains. In Advances in Computational Probability. Kluwer Academic Publishers, 1997.
    • (1997) Advances in Computational Probability
    • Stewart, W.J.1
  • 15
    • 0012841228
    • Successive matrix squaring algorithm for computing the Drazin inverse
    • Y. Wei. Successive matrix squaring algorithm for computing the Drazin inverse. Applied Mathematics and Computation, 108:67-75, 2000.
    • (2000) Applied Mathematics and Computation , vol.108 , pp. 67-75
    • Wei, Y.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.