메뉴 건너뛰기




Volumn 1778, Issue , 2000, Pages 333-347

Supplementing neural reinforcement learning with symbolic methods

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTERS;

EID: 84942843866     PISSN: 03029743     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1007/10719871_23     Document Type: Conference Paper
Times cited : (4)

References (22)
  • 1
    • 85153940465 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • J. Tesauro, and D. Touretzky, and T. Leen, (eds.) MIT Press, Cambridge, MA
    • J. Boyan and A. Moore, (1995). Generalization in reinforcement learning: safely approximating the value function. in: J. Tesauro, and D. Touretzky, and T. Leen, (eds.) Neural Information Processing Systems, 369-376, MIT Press, Cambridge, MA.
    • (1995) Neural Information Processing Systems , pp. 369-376
    • Boyan, J.1    Moore, A.2
  • 4
    • 0000262562 scopus 로고
    • Hierarchical mixtures of experts and the em algorithm
    • M. Jordan and R. Jacobs, (1994). Hierarchical mixtures of experts and the EM algorithm. Neural Computation. 6, 181-214.
    • (1994) Neural Computation. , vol.6 , pp. 181-214
    • Jordan, M.1    Jacobs, R.2
  • 6
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning
    • L. Lin, (1992). Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning. Vol.8, pp.293-321.
    • (1992) Planning, and Teaching. Machine Learning , vol.8 , pp. 293-321
    • Lin, L.1
  • 9
    • 21144465016 scopus 로고
    • On variable binding in connectionist networks
    • 1992
    • R. Sun, (1992). On variable binding in connectionist networks. Connection Science, Vol.4, No.2, pp.93-124. 1992.
    • (1992) Connection Science , vol.4 , Issue.2 , pp. 93-124
    • Sun, R.1
  • 10
    • 0031258480 scopus 로고    scopus 로고
    • Learning, action, and consciousness: A hybrid approach towards modeling consciousness
    • R. Sun, (1997). Learning, action, and consciousness: a hybrid approach towards modeling consciousness. Neural Networks, 10 (7), pp.1317-1331
    • (1997) Neural Networks , vol.10 , Issue.7 , pp. 1317-1331
    • Sun, R.1
  • 12
    • 0032207548 scopus 로고    scopus 로고
    • Autonomous learning of sequential tasks: Experiments and analyses
    • R. Sun and T. Peterson, (1998). Autonomous learning of sequential tasks: experiments and analyses. IEEE Transactions on Neural Networks, Vol.9, No.6, pp.12171234.
    • (1998) IEEE Transactions on Neural Networks , vol.9 , Issue.6 , pp. 12171234
    • Sun, R.1    Peterson, T.2
  • 13
    • 0032772352 scopus 로고    scopus 로고
    • Multi-agent reinforcement learning: Weighting and partitioning
    • R. Sun and T. Peterson, (1999). Multi-agent reinforcement learning: weighting and partitioning. Neural Networks, Vol.12 No.4-5. pp.127-153.
    • (1999) Neural Networks , vol.12 , Issue.4-5 , pp. 127-153
    • Sun, R.1    Peterson, T.2
  • 14
    • 0033165135 scopus 로고    scopus 로고
    • A hybrid architecture for situated learning of reactive sequential decision making
    • in press
    • R. Sun, T. Peterson, and E. Merrill, (1999). A hybrid architecture for situated learning of reactive sequential decision making. Applied Intelligence, in press.
    • (1999) Applied Intelligence
    • Sun, R.1    Peterson, T.2    Merrill, E.3
  • 18
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Morgan Kaufmann, San Meteo, CA
    • R. Sutton, (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proc. of Seventh International Conference on Machine Learning. Morgan Kaufmann, San Meteo, CA.
    • (1990) Proc. of Seventh International Conference on Machine Learning
    • Sutton, R.1
  • 19
    • 0001046225 scopus 로고
    • Practical issues in temporal difference learning
    • T. Tesauro, (1992). Practical issues in temporal difference learning. Machine Learning. Vol.8, 257-277.
    • (1992) Machine Learning , vol.8 , pp. 257-277
    • Tesauro, T.1
  • 20
    • 0027678679 scopus 로고
    • Extracting refined rules from Knowledge-Based Neural Networks
    • G. Towell and J. Shavlik, (1993). Extracting refined rules from Knowledge-Based Neural Networks, Machine Learning. 13 (1), 71-101.
    • (1993) Machine Learning. , vol.13 , Issue.1 , pp. 71-101
    • Towell, G.1    Shavlik, J.2
  • 21
    • 0004049895 scopus 로고
    • Ph.D Thesis, Cambridge University, Cambridge, UK
    • C. Watkins, (1989). Learning with Delayed Rewards. Ph.D Thesis, Cambridge University, Cambridge, UK.
    • (1989) Learning with Delayed Rewards
    • Watkins, C.1
  • 22
    • 85158158334 scopus 로고
    • A complexity analysis of cooperative mechanisms in reinforcement learning
    • Morgan Kaufmann, San Francisco, CA
    • S. Whitehead, (1993). A complexity analysis of cooperative mechanisms in reinforcement learning. Proc. of the National Conference on Artificial Intelligence (AAAI'93), 607-613. Morgan Kaufmann, San Francisco, CA.
    • (1993) Proc. of the National Conference on Artificial Intelligence (AAAI'93 , pp. 607-613
    • Whitehead, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.