메뉴 건너뛰기




Volumn , Issue , 2002, Pages 259-268

Sequential cost-sensitive decision making with reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; ARTIFICIAL INTELLIGENCE; DATA MINING; DECISION THEORY; OPTIMIZATION;

EID: 0242540456     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (65)

References (17)
  • 2
    • 0003704318 scopus 로고    scopus 로고
    • UCI KDD archive
    • Department of Information and Computer Sciences, University of California, Irvine
    • S. D. Bay. UCI KDD archive. Department of Information and Computer Sciences, University of California, Irvine, 2000, http://kdd.ics.uci.edu/.
    • (2000)
    • Bay, S.D.1
  • 8
    • 0001845032 scopus 로고    scopus 로고
    • Bootstrap methods for the cost-sensitive evaluation of classifiers
    • Morgan Kaufmann, San Francisco, CA
    • D. D. Margineantu and T. G. Dietterich. Bootstrap methods for the cost-sensitive evaluation of classifiers. In Proc. 17th International Conf. on Machine Learning, pages 583-590. Morgan Kaufmann, San Francisco, CA, 2000.
    • (2000) Proc. 17th International Conf. on Machine Learning , pp. 583-590
    • Margineantu, D.D.1    Dietterich, T.G.2
  • 10
    • 0003636089 scopus 로고
    • On-line q-learning using connectionist systems
    • Technical Report CUED/F-INFENG/TR 166, Cambridge University Engineering Department; Ph.D. thesis
    • G. A. Rummery and M. Niranjan. On-line q-learning using connectionist systems. Technical Report CUED/F-INFENG/TR 166, Cambridge University Engineering Department, 1994. Ph.D. thesis.
    • (1994)
    • Rummery, G.A.1    Niranjan, M.2
  • 12
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temporal difference learning with function approximation
    • J. N. Tsitsiklis and B. V. Roy. An analysis of temporal difference learning with function approximation. IEEE Transactions on Automatic Control, 42(5):674-690, 1997.
    • (1997) IEEE Transactions on Automatic Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Roy, B.V.2
  • 13
    • 0003558537 scopus 로고    scopus 로고
    • Cost-sensitive learning bibliography
    • Institute for Information Technology, National Research Council, Ottawa, Canada
    • P. Turney. Cost-sensitive learning bibliography. Institute for Information Technology, National Research Council, Ottawa, Canada, 2000. http://extractor.iit.nrc.ca/bibliographies/cost-sensitive.html.
    • (2000)
    • Turney, P.1
  • 15
    • 0004049893 scopus 로고
    • Learning from delayed rewards
    • PhD thesis, Cambridge University, Cambridge
    • C. J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, Cambridge University, Cambridge, 1989.
    • (1989)
    • Watkins, C.J.C.H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.