SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Proceedings of the International Joint Conference on Neural Networks

Volumn 4, Issue , 2003, Pages 2910-2915

Tabu Search Exploration for on-Policy Reinforcement Learning

(2) Abramson, Myriam a Wechsler, Harry b

a GEORGE MASON UNIVERSITY (United States)

b George Mason University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DYNAMIC PROGRAMMING; LEARNING ALGORITHMS; PROBLEM SOLVING; VECTOR QUANTIZATION;

TABU SEARCH (TS) EXPLORATION;

LEARNING SYSTEMS;

EID: 0141704192 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (20)

1
- 0034860530
- Competitive reinforcement learning for combinatorial problems
- M. Abramson and H. Wechsler, Competitive reinforcement learning for combinatorial problems. In International Joint Conference on Neural Networks, 2001.
- (2001) International Joint Conference on Neural Networks
- Abramson, M.¹ Wechsler, H.²

2
- 24944480025
- Honte, a go-playing program using neural nets
- F. A. Dahl. Honte, a go-playing program using neural nets. In Workshop on Machine learning in Game Playing, 1999.
- (1999) Workshop on Machine Learning in Game Playing
- Dahl, F.A.¹

3
- 0000430514
- The convergence of td(lambda) for general lambda
- P. Dayan. The convergence of td(lambda) for general lambda. Machine Learning, (8):341-362, 1992.
- (1992) Machine Learning , Issue.8 , pp. 341-362
- Dayan, P.¹

4
- 0028388685
- Td(lambda) converges with probability 1
- P. Dayan and T. J. Sejnowski. Td(lambda) converges with probability 1. Machine Learning, (14):295-301, 1994.
- (1994) Machine Learning , Issue.14 , pp. 295-301
- Dayan, P.¹ Sejnowski, T.J.²

5
- 0141629190
- Hidden strengths and limitations: An empirical investigation of reinforcement learning
- Morgan Kaufmann
- G. DeJong. Hidden strengths and limitations: an empirical investigation of reinforcement learning. In International Conference on Machine Learning. Morgan Kaufmann, 2000.
- (2000) International Conference on Machine Learning
- DeJong, G.¹

6
- 0004215426
- Kluwer Academic Publishers
- F. Glover and M. Laguna. Tabu Search. Kluwer Academic Publishers, 1997.
- (1997) Tabu Search
- Glover, F.¹ Laguna, M.²

7
- 0000928452
- The tabu search metaheuristic: How we use it
- A. Hertz and D. de Werra. The tabu search metaheuristic: how we use it. Annals of Mathematics and Artificial Intelligence, 1:111-121, 1990.
- (1990) Annals of Mathematics and Artificial Intelligence , vol.1 , pp. 111-121
- Hertz, A.¹ De Werra, D.²

8
- 0003463297
- MIT Press, 2nd edition
- J. H. Holland. Adaptation in Natural and Artificial Systems. MIT Press, 2nd edition, 1992.
- (1992) Adaptation in Natural and Artificial Systems
- Holland, J.H.¹

9
- 0029751419
- The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithm
- S. Koenig and R. G. Simmons. The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithm. Machine Learning, 22:227-250, 1996.
- (1996) Machine Learning , vol.22 , pp. 227-250
- Koenig, S.¹ Simmons, R.G.²

10
- 0003410791
- Springer, 2nd edition
- T. Kohonen. Self-Organizing Maps. Springer, 2nd edition, 1997.
- (1997) Self-Organizing Maps
- Kohonen, T.¹

11
- 84976681619
- Go is polynominal-space hard
- D. Lichtenstein and M. Sipser. Go is polynominal-space hard. Journal of the Association for Computing Machinery, 27(2):393-401, 1980.
- (1980) Journal of the Association for Computing Machinery , vol.27 , Issue.2 , pp. 393-401
- Lichtenstein, D.¹ Sipser, M.²

12
- 0003636089
- On-line q-learning using connectionist systems
- Cambridge University Engineering Dept.
- G. A. Rummery and M. Niranjan. On-line q-learning using connectionist systems. Technical report, Cambridge University Engineering Dept., 1994.
- (1994) Technical Report
- Rummery, G.A.¹ Niranjan, M.²

13
- 0033189324
- Similarity measures
- S. Santini and R. Jain. Similarity measures. IEEE Transactions on Pattern Analysis and Machine Intelligence, 21(9), 1999.
- (1999) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.21 , Issue.9
- Santini, S.¹ Jain, R.²

14
- 0033901602
- Convergence results for single-step on-policy reinforcement learning algorithms
- S. Singh, T. Jaakkola, M. L. Littman, and Csaba Szepesvari. Convergence results for single-step on-policy reinforcement learning algorithms. Machine Learning, 38(3):287-308, 2000.
- (2000) Machine Learning , vol.38 , Issue.3 , pp. 287-308
- Singh, S.¹ Jaakkola, T.² Littman, M.L.³ Szepesvari, C.⁴

15
- 0003725332
- Harcourt Brace Jovanovich
- J. E. R. Staddon and R. H. Ettinger. An Introduction to the Principles of Adaptive Behavior. Harcourt Brace Jovanovich, 1989.
- (1989) An Introduction to the Principles of Adaptive Behavior
- Staddon, J.E.R.¹ Ettinger, R.H.²

16
- 0004102479
- MIT Press, Cambridge, MA
- R. S. Sutton and A. Barto. Reinforcement Learning: an Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.²

17
- 0003411271
- Efficient exploration in reinforcement learning
- Carnegie Mellon University
- S. B. Thrun. Efficient exploration in reinforcement learning. Technical Report TR CMU-CS-92-102, Carnegie Mellon University, 1992.
- (1992) Technical Report , vol.TR CMU-CS-92-102
- Thrun, S.B.¹

18
- 0004049893
- PhD thesis, King's College
- C. Watkins. Learning from Delayed Rewards. PhD thesis, King's College, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

19
- 34249833101
- Q-learning
- C. Watkins and P. Dayan. Q-learning. Machine Learning, (8):279-292, 1992.
- (1992) Machine Learning , Issue.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

20
- 0003445905
- PhD thesis, University of Wisconsin
- L. Zobrist. Feature Extraction and Representation for Pattern Recognition and the Game of Go. PhD thesis, University of Wisconsin, 1970.
- (1970) Feature Extraction and Representation for Pattern Recognition and the Game of Go
- Zobrist, L.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.