SCOPUS 정보 검색 플랫폼

Proceedings of the 9th International Workshop on Machine Learning, ICML 1992

Volumn , Issue , 1992, Pages 316-321

Using Transitional Proximity for Faster Reinforcement Learning

(1) McCallum, R Andrew a

a UNIVERSITY OF ROCHESTER (United States)

Author keywords

[No Author keywords available]

Indexed keywords

'CURRENT; CURRENT PATHS; FAST LEARNING; INTERNAL STATE; KOHONEN NETWORK; LEARN+; LEARNING NETWORK; OPTIMAL POLICIES; Q-LEARNING; REINFORCEMENT LEARNINGS;

REINFORCEMENT LEARNING;

EID: 84957622922 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1016/B978-1-55860-247-2.50045-0 Document Type: Conference Paper

Times cited : (13)

References (12)

1
- 0003787146
- [Bellman, 1957] Princeton University Press, Princeton, NJ
- [Bellman, 1957] R. E. Bellman. Dynamic Programming. Princeton University Press, Princeton, NJ, 1957.
- (1957) Dynamic Programming
- Bellman, R. E.¹

2
- 0002192119
- Learning from delayed reinforcement in a complex domain
- [Chapman and Kaelbling, 1991]
- [Chapman and Kaelbling, 1991] David Chapman and Leslie Pack Kaelbling. Learning from delayed reinforcement in a complex domain. In Proceedings of IJCAI, 1991.
- (1991) Proceedings of IJCAI
- Chapman, David¹ Kaelbling, Leslie Pack²

3
- 0003882939
- [Chapman, 1990] PhD thesis, MIT Artificial Intelligence Laboratory
- [Chapman, 1990] David Chapman. Vision, Instruction, and action. PhD thesis, MIT Artificial Intelligence Laboratory, 1990.
- (1990) Vision, Instruction, and action
- Chapman, David¹

4
- 0003979924
- [Hertz et al., l#9l] Addison-Wesley, Redwood City, California
- [Hertz et al., l#9l] John Hertz, Anders Krogh, and Richard Palmer. Introduction to the Theory of Neural Computation. Addison-Wesley, Redwood City, California, 1991.
- (1991) Introduction to the Theory of Neural Computation
- Hertz, John¹ Krogh, Anders² Palmer, Richard³

5
- 0003527079
- [Kohonen, 1989] (3rd ed). Springer-Verlag
- [Kohonen, 1989] Teuvo Kohonen. Self-Organization and Associative Memory (3rd ed.). Springer-Verlag, 1989.
- (1989) Self-Organization and Associative Memory
- Kohonen, Teuvo¹

6
- 85151437138
- Programming robots using reinforcement learning and teaching
- 1991]
- [Lin, 1991] Long-Ji Lin. Programming robots using reinforcement learning and teaching. AAAI, 1991.
- (1991) AAAI
- Lin, Long-Ji¹

7
- 0000742931
- [Martinetz and Schulten, 1991] (unpub) Beckman Institute, UI-UC
- [Martinetz and Schulten, 1991] T. Martinetz and K. Schulten. A "neural gas" network for vector quantization and learning of unknown topologies, (unpub.) Beckman Institute, UI-UC, 1991.
- (1991) A "neural gas" network for vector quantization and learning of unknown topologies
- Martinetz, T.¹ Schulten, K.²

8
- 85152516416
- [McCallum, 1992] Technical report, Department of Computer Science, University of Rochester
- [McCallum, 1992] R. Andrew McCallum. Using transitional proximity for faster reinforcement learning. Technical report, Department of Computer Science, University of Rochester, 1992.
- (1992) Using transitional proximity for faster reinforcement learning
- Andrew McCallum, R.¹

9
- 80055012083
- First results with DYNA, an integrated architecture for learning, planning, and reacting
- [Sutton, 1990]
- [Sutton, 1990] Richard S. Sutton. First results with DYNA, an integrated architecture for learning, planning, and reacting. In Proceedings of the AAAI Spring Symposium on Planning in Uncertain, Unpredictable, or Changing Environments, 1990.
- (1990) Proceedings of the AAAI Spring Symposium on Planning in Uncertain, Unpredictable, or Changing Environments
- Sutton, Richard S.¹

10
- 84916567474
- Cost sensitive reinforcement learning for adaptive classification and control
- [Tan, 1991]
- [Tan, 1991] Ming Tan. Cost sensitive reinforcement learning for adaptive classification and control. In AAAI, 1991.
- (1991) AAAI
- Tan, Ming¹

11
- 0004049893
- [Watkins, 1989] PhD thesis, Cambridge University
- [Watkins, 1989] Chris Watkins. Learning from delayed rewards. PhD thesis, Cambridge University, 1989.
- (1989) Learning from delayed rewards
- Watkins, Chris¹

12
- 0003619736
- [Whitehead, 1992] PhD thesis, Department of Computer Science, University of Rochester
- [Whitehead, 1992] Steven Whitehead. Reinforcement Learning for the Adaptive Control of Perception and Action. PhD thesis, Department of Computer Science, University of Rochester, 1992.
- (1992) Reinforcement Learning for the Adaptive Control of Perception and Action
- Whitehead, Steven¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.