SCOPUS 정보 검색 플랫폼

1
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- Barto, A. G., Sutton, R. S., & Anderson, C. W. (1983). Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Trans. Syst., Man. & Cybern., 13, 834-846.
- (1983) IEEE Trans. Syst., Man. & Cybern. , vol.13 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

2
- 0030284259
- Perfect recall and pruning in games with imperfect information
- Blair, J. R. S., Mutchler, D., & Lent, M. (1995). Perfect recall and pruning in games with imperfect information. Computational Intelligence, 12, 131-154.
- (1995) Computational Intelligence , vol.12 , pp. 131-154
- Blair, J.R.S.¹ Mutchler, D.² Lent, M.³

3
- 0003456153
- Ph.D. thesis, University of Massachusetts, Amherst
- Crites, R. H. (1996). Large-scale dynamic optimization using teams of reinforcement learning agents. Ph.D. thesis, University of Massachusetts, Amherst.
- (1996) Large-scale Dynamic Optimization Using Teams of Reinforcement Learning Agents
- Crites, R.H.¹

4
- 0032208335
- Elevator group control using multiple reinforcement learning agents
- Crites, R. H., & Barto, A. G. (1996). Elevator group control using multiple reinforcement learning agents. Machine Learning, 33, 235-262.
- (1996) Machine Learning , vol.33 , pp. 235-262
- Crites, R.H.¹ Barto, A.G.²

5
- 0036374294
- Gib: Imperfect information in a computationally challenging fame
- Ginsberg, M. (2001). Gib: Imperfect information in a computationally challenging fame. Journal of Artificial Intelligence Research, 14, 303-358.
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 303-358
- Ginsberg, M.¹

6
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- Hu, J., & Wellman, M. P. (1998). Multiagent reinforcement learning: Theoretical framework and an algorithm. In Proceedings of the Fifteenth International Conference on Machine learning (pp. 242-250).
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

7
- 0036592028
- Control of exploitation-exploration meta-parameter in reinforcement learning
- Ishii, S., Yoshida, W., & Yoshimoto, J. (2002). Control of exploitation-exploration meta-parameter in reinforcement learning. Neural Networks, 15, 665-687.
- (2002) Neural Networks , vol.15 , pp. 665-687
- Ishii, S.¹ Yoshida, W.² Yoshimoto, J.³

8
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L. P., Littman, M. L., & Cassandra, A. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.³

9
- 0012331016
- Memory approaches to reinforcement learning in non-markovian domains
- Lin, L.-J., & Mitchell, T. (1992). Memory approaches to reinforcement learning in non-markovian domains. Tech. rep., CMU-CS-92-138.
- (1992) Tech. Rep. , vol.CMU-CS-92-138
- Lin, L.-J.¹ Mitchell, T.²

10
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th International Conference on Machine Learning (pp. 157-163).
- (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

11
- 0034819983
- A multi-agent reinforcement learning method for a partially-observable competitive game
- Matsuno, Y., Yamazaki, T., Matsuda, J., & Ishii, S. (2001). A multi-agent reinforcement learning method for a partially-observable competitive game. In Proceedings of the Fifth International Conference on Autonomous Agents (pp. 39-40).
- (2001) Proceedings of the Fifth International Conference on Autonomous Agents , pp. 39-40
- Matsuno, Y.¹ Yamazaki, T.² Matsuda, J.³ Ishii, S.⁴

12
- 0003932121
- Ph. D. thesis, Univercity of Rochester
- McCallum, A. (1995). Reinforcement learning with selective perception and hidden state. Ph. D. thesis, Univercity of Rochester.
- (1995) Reinforcement Learning with Selective Perception and Hidden State
- McCallum, A.¹

13
- 0000672424
- Fast learning in networks of locally-tuned processing units
- Moody, J., & Darken, C. J. (1989). Fast learning in networks of locally-tuned processing units. Neural Computation, 1, 281-294.
- (1989) Neural Computation , vol.1 , pp. 281-294
- Moody, J.¹ Darken, C.J.²

14
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less real time
- Moore, A., & Atkeson, C. (1993). Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, 13, 103-130.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.¹ Atkeson, C.²

15
- 0001435241
- Multi-agent reinforcement learning: An approach based on the other agent's internal model
- Nagayuki, Y., Ishii, S., & Doya, K. (2000). Multi-agent reinforcement learning: An approach based on the other agent's internal model. In Proceedings of the Fourth International Conference on MultiAgent Systems (pp. 215-221).
- (2000) Proceedings of the Fourth International Conference on MultiAgent Systems , pp. 215-221
- Nagayuki, Y.¹ Ishii, S.² Doya, K.³

16
- 0032208296
- Learning team strategies: Soccer case studies
- Salustowicz, K. P., Wiering, M. A., & Schmidhuber, J. (1998). Learning team strategies: Soccer case studies. Machine Learning, 33, 263-282.
- (1998) Machine Learning , vol.33 , pp. 263-282
- Salustowicz, K.P.¹ Wiering, M.A.² Schmidhuber, J.³

17
- 0030050933
- Multiagent reinforcement learning in the iterated prisoner's dilemma
- Sandholm, T. W., & Crites, R. H. (1995). Multiagent reinforcement learning in the iterated prisoner's dilemma, Biosystems, 37, 147-166.
- (1995) Biosystems , vol.37 , pp. 147-166
- Sandholm, T.W.¹ Crites, R.H.²

18
- 0034131785
- On-line em algorithm for the normalized gaussian network
- Sato, M., & Ishii, S. (2000). On-line em algorithm for the normalized gaussian network. Neural Computation, 12, 407-432.
- (2000) Neural Computation , vol.12 , pp. 407-432
- Sato, M.¹ Ishii, S.²

19
- 0028555752
- Learning to coordinate without sharing information
- Sen, S., Sekaran, M., & Hale, J. (1994). learning to coordinate without sharing information. In Proceedings of the Twelfth National Conference on Artificial Intelligence (pp. 426-431).
- (1994) Proceedings of the Twelfth National Conference on Artificial Intelligence , pp. 426-431
- Sen, S.¹ Sekaran, M.² Hale, J.³

20
- 0004102479
- MIT Press
- Sutton, R., & Barto, A. (Eds.). (1998). Reinforcement learning: An introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

21
- 85152198941
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- Tan, M. (1993). Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning (pp. 330-337).
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 330-337
- Tan, M.¹

22
- 0000985504
- Td-gammon, a self-teaching backgammon program, achieves masterlevel play
- Tesauro, G. J. (1994). Td-gammon, a self-teaching backgammon program, achieves masterlevel play. Neural Computation, 6, 215-219.
- (1994) Neural Computation , vol.6 , pp. 215-219
- Tesauro, G.J.¹

23
- 0029250080
- Reinforcement learning of non-markov decision processes
- Whitehead, S., & Lin, L.-J. (1995). Reinforcement learning of non-markov decision processes. Artificial Intelligence, 73, 271-306.
- (1995) Artificial Intelligence , vol.73 , pp. 271-306
- Whitehead, S.¹ Lin, L.-J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.