SCOPUS Information Retrieval Platform

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)

Volume 1778, 2000, Pages 333-347

Supplementing neural reinforcement learning with symbolic methods

(1) Sun, Ron a

a UNIVERSITY OF MISSOURI (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTERS;

AUTONOMOUS LEARNING; DOMAIN-SPECIFIC KNOWLEDGE; SYMBOLIC METHODS;

REINFORCEMENT LEARNING;

EID: 84942843866 PISSN: 03029743 EISSN: None Source Type: Conference Proceeding
DOI: 10.1007/10719871_23 Document Type: Conference Paper

Times cited : (4)

References (22)

1
- 85153940465
- Generalization in reinforcement learning: Safely approximating the value function
- G. Tesauro, D. Touretzky, and T. Leen, (eds.) MIT Press, Cambridge, MA
- J. Boyan and A. Moore, (1995). Generalization in reinforcement learning: safely approximating the value function. In: G. Tesauro, D. Touretzky, and T. Leen, (eds.) Neural Information Processing Systems, 369-376, MIT Press, Cambridge, MA.
- (1995) Neural Information Processing Systems , pp. 369-376
- Boyan, J.¹ Moore, A.²

2
- 0004291566
- Wadsworth, Belmont, CA
- L. Breiman, L. Friedman, and P. Stone, (1984). Classification and Regression. Wadsworth, Belmont, CA.
- (1984) Classification and Regression
- Breiman, L.¹ Friedman, L.² Stone, P.³

3
- 0001940458
- Adaptive mixtures of local experts
- R. Jacobs, M. Jordan, S. Nowlan, and G. Hinton, (1991). Adaptive mixtures of local experts. Neural Computation. 3, 79-87.
- (1991) Neural Computation. , vol.3 , pp. 79-87
- Jacobs, R.¹ Jordan, M.² Nowlan, S.³ Hinton, G.⁴

4
- 0000262562
- Hierarchical mixtures of experts and the em algorithm
- M. Jordan and R. Jacobs, (1994). Hierarchical mixtures of experts and the EM algorithm. Neural Computation. 6, 181-214.
- (1994) Neural Computation. , vol.6 , pp. 181-214
- Jordan, M.¹ Jacobs, R.²

5
- 0004080766
- Ellis Horwood, New York
- N. Lavrac and S. Dzeroski, (1994). Inductive Logic Programming. Ellis Horwood, New York.
- (1994) Inductive Logic Programming
- Lavrac, N.¹ Dzeroski, S.²

6
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning, and teaching
- L. Lin, (1992). Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning. Vol.8, pp.293-321.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.¹

7
- 0028566290
- Incorporating advice into agents that learn from reinforcements
- Morgan Kaufmann, San Mateo, CA
- R. Maclin and J. Shavlik, (1994). Incorporating advice into agents that learn from reinforcements. Proc. of the National Conference on Artificial Intelligence (AAAI-94). Morgan Kaufmann, San Mateo, CA.
- (1994) Proc. of the National Conference on Artificial Intelligence (AAAI-94)
- Maclin, R.¹ Shavlik, J.²

8
- 0003824303
- Ph.D Thesis, University of Massachusetts, Amherst, MA
- S. Singh, (1994). Learning to Solve Markovian Decision Processes. Ph.D Thesis, University of Massachusetts, Amherst, MA.
- (1994) Learning to Solve Markovian Decision Processes
- Singh, S.¹

9
- 21144465016
- On variable binding in connectionist networks
- 1992
- R. Sun, (1992). On variable binding in connectionist networks. Connection Science, Vol.4, No.2, pp.93-124. 1992.
- (1992) Connection Science , vol.4 , Issue.2 , pp. 93-124
- Sun, R.¹

10
- 0031258480
- Learning, action, and consciousness: A hybrid approach towards modeling consciousness
- R. Sun, (1997). Learning, action, and consciousness: a hybrid approach towards modeling consciousness. Neural Networks, 10 (7), pp.1317-1331
- (1997) Neural Networks , vol.10 , Issue.7 , pp. 1317-1331
- Sun, R.¹

11
- 0030675538
- A hybrid model for learning sequential navigation
- IEEE Press, Piscataway, NJ
- R. Sun and T. Peterson, (1997). A hybrid model for learning sequential navigation. Proc. of IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA'97). Monterey, CA. pp.234-239. IEEE Press, Piscataway, NJ.
- (1997) IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA'97). Monterey, CA. , pp. 234-239
- Sun, R.¹ Peterson, T.²

12
- 0032207548
- Autonomous learning of sequential tasks: Experiments and analyses
- R. Sun and T. Peterson, (1998). Autonomous learning of sequential tasks: experiments and analyses. IEEE Transactions on Neural Networks, Vol.9, No.6, pp.1217-1234.
- (1998) IEEE Transactions on Neural Networks , vol.9 , Issue.6 , pp. 1217-1234
- Sun, R.¹ Peterson, T.²

13
- 0032772352
- Multi-agent reinforcement learning: Weighting and partitioning
- R. Sun and T. Peterson, (1999). Multi-agent reinforcement learning: weighting and partitioning. Neural Networks, Vol.12 No.4-5. pp.127-153.
- (1999) Neural Networks , vol.12 , Issue.4-5 , pp. 127-153
- Sun, R.¹ Peterson, T.²

14
- 0033165135
- A hybrid architecture for situated learning of reactive sequential decision making
- in press
- R. Sun, T. Peterson, and E. Merrill, (1999). A hybrid architecture for situated learning of reactive sequential decision making. Applied Intelligence, in press.
- (1999) Applied Intelligence
- Sun, R.¹ Peterson, T.² Merrill, E.³

15
- 84900180782
- Extracting plans from reinforcement learners
- eds. L. Xu, L. Chan, I. King, and A. Fu. Springer-Verlag, Heidelberg
- R. Sun and C. Sessions, (1998a). Extracting plans from reinforcement learners. Proceedings of the 1998 International Symposium on Intelligent Data Engineering and Learning (IDEAL'98). pp.243-248. eds. L. Xu, L. Chan, I. King, and A. Fu. Springer-Verlag, Heidelberg.
- (1998) Proceedings of the 1998 International Symposium on Intelligent Data Engineering and Learning (IDEAL'98). , pp. 243-248
- Sun, R.¹ Sessions, C.²

16
- 0031641936
- Learning to plan probabilistically from neural networks
- IEEE Press, Piscataway, NJ
- R. Sun and C. Sessions, (1998b). Learning to plan probabilistically from neural networks. Proceedings of IEEE International Joint Conference on Neural Networks, pp.1-6. IEEE Press, Piscataway, NJ.
- (1998) Proceedings of IEEE International Joint Conference on Neural Networks , pp. 1-6
- Sun, R.¹ Sessions, C.²

17
- 0033332125
- Self segmentation of sequences
- R. Sun and C. Sessions, (1999). Self segmentation of sequences. Proceedings of IEEE International Joint Conference on Neural Networks, IEEE Press, Piscataway, NJ.
- (1999) Proceedings of IEEE International Joint Conference on Neural Networks, IEEE Press, Piscataway, NJ
- Sun, R.¹ Sessions, C.²

18
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- Morgan Kaufmann, San Mateo, CA
- R. Sutton, (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proc. of Seventh International Conference on Machine Learning. Morgan Kaufmann, San Mateo, CA.
- (1990) Proc. of Seventh International Conference on Machine Learning
- Sutton, R.¹

19
- 0001046225
- Practical issues in temporal difference learning
- G. Tesauro, (1992). Practical issues in temporal difference learning. Machine Learning. Vol.8, 257-277.
- (1992) Machine Learning , vol.8 , pp. 257-277
- Tesauro, G.¹

20
- 0027678679
- Extracting refined rules from Knowledge-Based Neural Networks
- G. Towell and J. Shavlik, (1993). Extracting refined rules from Knowledge-Based Neural Networks, Machine Learning. 13 (1), 71-101.
- (1993) Machine Learning. , vol.13 , Issue.1 , pp. 71-101
- Towell, G.¹ Shavlik, J.²

21
- 0004049895
- Ph.D Thesis, Cambridge University, Cambridge, UK
- C. Watkins, (1989). Learning with Delayed Rewards. Ph.D Thesis, Cambridge University, Cambridge, UK.
- (1989) Learning with Delayed Rewards
- Watkins, C.¹

22
- 85158158334
- A complexity analysis of cooperative mechanisms in reinforcement learning
- Morgan Kaufmann, San Francisco, CA
- S. Whitehead, (1993). A complexity analysis of cooperative mechanisms in reinforcement learning. Proc. of the National Conference on Artificial Intelligence (AAAI'93), 607-613. Morgan Kaufmann, San Francisco, CA.
- (1993) Proc. of the National Conference on Artificial Intelligence (AAAI'93) , pp. 607-613
- Whitehead, S.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.