메뉴 건너뛰기




Volumn , Issue , 2004, Pages 568-575

Bias and variance in value function estimation

Author keywords

Bayesian Estimation; Bias; Markov Processes; Reinforcement Learning; Variance

Indexed keywords

BAYESIAN ESTIMATION; BIAS; REINFORCEMENT LEARNING; VARIANCE;

EID: 14344261137     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (34)

References (11)
  • 1
    • 0030241987 scopus 로고    scopus 로고
    • Mailing decisions in the catalog sales industry
    • Bitran, G. R., & Mondschein, S. V. (1996). Mailing decisions in the catalog sales industry. Management Science, 42, 1364-1381.
    • (1996) Management Science , vol.42 , pp. 1364-1381
    • Bitran, G.R.1    Mondschein, S.V.2
  • 2
    • 21844509404 scopus 로고
    • Optimal selection for direct mail
    • Bult, J., & Wansbeek, T. (1995). Optimal selection for direct mail. Marketing Science, 14, 378-394.
    • (1995) Marketing Science , vol.14 , pp. 378-394
    • Bult, J.1    Wansbeek, T.2
  • 6
    • 0032154071 scopus 로고    scopus 로고
    • Optimal mailing of catalogs: A new methodology using estimable structural dynamic programming models
    • Gönül, F., & Shi, M. (1998). Optimal mailing of catalogs: A new methodology using estimable structural dynamic programming models. Management Science, 44, 1249-1262.
    • (1998) Management Science , vol.44 , pp. 1249-1262
    • Gönül, F.1    Shi, M.2
  • 8
    • 0036832954 scopus 로고    scopus 로고
    • Near-optimal reinforcement learning in polynomial time
    • Kearns, M., & Singh, S. (2002). Near-optimal reinforcement learning in polynomial time. Machine Learning, 49, 209-232.
    • (2002) Machine Learning , vol.49 , pp. 209-232
    • Kearns, M.1    Singh, S.2
  • 10
    • 0020279968 scopus 로고
    • The variance of discounted Markov decision process
    • Sobel, M. J. (1982). The variance of discounted Markov decision process. Journal of Applied Probability, 19, 794-802.
    • (1982) Journal of Applied Probability , vol.19 , pp. 794-802
    • Sobel, M.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.