메뉴 건너뛰기




Volumn 4739, Issue , 2002, Pages 40-47

Reinforcement learning and design of nonparametric sequential decision networks

Author keywords

Dynamic programming; Neural networks; Reinforcement learning; Sequential detection

Indexed keywords

APPROXIMATION THEORY; DATA REDUCTION; DECISION SUPPORT SYSTEMS; DYNAMIC PROGRAMMING; LEARNING SYSTEMS; OPTIMIZATION; PROBABILITY DISTRIBUTIONS; SEQUENTIAL MACHINES;

EID: 0036408665     PISSN: 0277786X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1117/12.458718     Document Type: Conference Paper
Times cited : (2)

References (23)
  • 1
    • 0026846207 scopus 로고
    • M-ary sequential hypothesis tests for automatic target recognition
    • I. Jouny and F. Garber, "M-ary sequential hypothesis tests for automatic target recognition," IEEE Trans. Aerospace Electron. Syst. 28, pp. 473-483, 1992.
    • (1992) IEEE Trans. Aerospace Electron. Syst. , vol.28 , pp. 473-483
    • Jouny, I.1    Garber, F.2
  • 3
    • 0031095258 scopus 로고    scopus 로고
    • Temporal difference learning applied to sequential detection
    • March
    • C. Guo and A. Kuh, "Temporal difference learning applied to sequential detection," IEEE Transactions on Neural Networks 8, pp. 278-287, March 1997.
    • (1997) IEEE Transactions on Neural Networks , vol.8 , pp. 278-287
    • Guo, C.1    Kuh, A.2
  • 10
    • 33847202724 scopus 로고
    • Learning to predict by the method of temporal differences
    • R. S. Sutton, "Learning to predict by the method of temporal differences," Machine Learning 3, pp. 9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 13
    • 0023169119 scopus 로고
    • Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research
    • P. J. Werbos, "Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research," IEEE Transactions on Systems, Man, and Cybernetics 17, pp. 7-20, 1987.
    • (1987) IEEE Transactions on Systems, Man, and Cybernetics , vol.17 , pp. 7-20
    • Werbos, P.J.1
  • 16
    • 0011115920 scopus 로고
    • An extention of Wald's theory of statistical decision functions
    • L. LeCam, "An extention of Wald's theory of statistical decision functions," Ann. Math. Statist 26, pp. 69-81, 1955.
    • (1955) Ann. Math. Statist , vol.26 , pp. 69-81
    • LeCam, L.1
  • 22
    • 0025670892 scopus 로고
    • The multilayer perceptron as an approximation to a bayes optimal discriminant function
    • Dec
    • D. Ruck, S. Rogers, M. Kabrisky, M. Oxley, and B. Suter, "The multilayer perceptron as an approximation to a bayes optimal discriminant function," IEEE Transactions on Neural Networks 1, pp. 296-298, Dec 1990.
    • (1990) IEEE Transactions on Neural Networks , vol.1 , pp. 296-298
    • Ruck, D.1    Rogers, S.2    Kabrisky, M.3    Oxley, M.4    Suter, B.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.