SCOPUS 정보 검색 플랫폼

International Journal of Robotics Research

Volumn 16, Issue 4, 1997, Pages 217-226

Learning in large cooperative multi-robot domains

(2) Fernandez F a,b Parker, L E a

a OAK RIDGE NATIONAL LABORATORY (United States)

b UNIVERSIDAD CARLOS III DE MADRID (Spain)

Author keywords

Cooperative robotics; Multi robot systems; Reinforcement learning; State space representation

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING ALGORITHMS; MATHEMATICAL MODELS; MOTION CONTROL; ROBOT LEARNING; STATE SPACE METHODS; VECTOR QUANTIZATION;

COOPERATIVE MULTI-ROBOT DOMAINS; COOPERATIVE ROBOTICS; REINFORCEMENT LEARNING; STATE-SPACE REPRESENTATION;

ROBOTICS;

EID: 5644261272 PISSN: 02783649 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited : (33)

References (28)

1
- 0003782395
- Berlin: Springer
- G. Weiss & S. Sen (Eds.), Adaption and learning in multi-agent systems (Berlin: Springer, 1996).
- (1996) Adaption and Learning in Multi-agent Systems
- Weiss, G.¹ Sen, S.²

2
- 0029679044
- Reinforcement learning: A survey
- L. Kaelbling, M.L. Littman, & A.W. Moore, Reinforcement learning: A survey, Journal of Artificial Intelligence Research, 4, 1996, 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.¹ Littman, M.L.² Moore, A.W.³

3
- 33747997674
- Variable resolution dynamic programming: Efficiently learning action maps in multivariate real-valued spaces
- Evanston, IL
- A.W. Moore, Variable resolution dynamic programming: Efficiently learning action maps in multivariate real-valued spaces, Proc. 8th Int. Conf. on Machine Learning, Evanston, IL, 1991, 333-337.
- (1991) Proc. 8th Int. Conf. on Machine Learning , pp. 333-337
- Moore, A.W.¹

4
- 84880680664
- Variable resolution discretization for high-accuracy solutions of optimal control problems
- Stockholm, Sweden
- R. Munos & A. Moore, Variable resolution discretization for high-accuracy solutions of optimal control problems, Proc. 16th Int. Joint Conf. on Artificial Intelligence, 2, Stockholm, Sweden, 1999, 1348-1355.
- (1999) Proc. 16th Int. Joint Conf. on Artificial Intelligence , vol.2 , pp. 1348-1355
- Munos, R.¹ Moore, A.²

5
- 0002192119
- Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
- Sydney, Australia
- D. Chapman & L.P. Kaelbling, Input generalization in delayed reinforcement learning: An algorithm and performance comparisons, Proc. Int. Joint Conf. on Artificial Intelligence, Sydney, Australia, 1991, 726-731.
- (1991) Proc. Int. Joint Conf. on Artificial Intelligence , pp. 726-731
- Chapman, D.¹ Kaelbling, L.P.²

6
- 0030380251
- Action-based sensor space categorization for robot learning
- Osaka, Japan
- M. Asada, S. Noda, & K. Hosoda, Action-based sensor space categorization for robot learning, Proc. 1996 IEEE/RSJ Int. Conf. on Intelligence Robots and Systems (IROS'96), 3, Osaka, Japan, 1996, 1502-1509.
- (1996) Proc. 1996 IEEE/RSJ Int. Conf. on Intelligence Robots and Systems (IROS'96) , vol.3 , pp. 1502-1509
- Asada, M.¹ Noda, S.² Hosoda, K.³

7
- 0030395609
- Simultaneous learning of situation classification based on rewards and behavior selection based on the situation
- Osaka, Japan
- A. Ueno, K. Hori, & S. Nakasuda, Simultaneous learning of situation classification based on rewards and behavior selection based on the situation, Proc. 1996 IEEE/RSJ Int. Conf. on Intelligence Robots and Systems (IROS'96), 3, Osaka, Japan, 1996, 1510-1517.
- (1996) Proc. 1996 IEEE/RSJ Int. Conf. on Intelligence Robots and Systems (IROS'96) , vol.3 , pp. 1510-1517
- Ueno, A.¹ Hori, K.² Nakasuda, S.³

8
- 0033346697
- Autonomous action-mode change in a two-mobile robotics system: S-temperature based on-line learning
- Kyongju, Korea
- T. Sawada, S. Ichikawa, & F. Hara. Autonomous action-mode change in a two-mobile robotics system: S-temperature based on-line learning, Proc. 1999 IEEE/RSJ Int. Conf. on Intelligence Robots and Systems (IROS'99), 1, Kyongju, Korea, 1999, 393-399.
- (1999) Proc. 1999 IEEE/RSJ Int. Conf. on Intelligence Robots and Systems (IROS'99) , vol.1 , pp. 393-399
- Sawada, T.¹ Ichikawa, S.² Hara, F.³

9
- 84947709855
- Adaptive state-space quantization for reinforcement learning of collision-free navigation
- Raleigh, NC
- B.J.A. Krose & J.W.M. van Dam, Adaptive state-space quantization for reinforcement learning of collision-free navigation, Proc. 1992 IEEE/RSJ Int. Conf. on Intelligence Robots and Systems (IROS'92), 2, Raleigh, NC, 1992, 1327-1332.
- (1992) Proc. 1992 IEEE/RSJ Int. Conf. on Intelligence Robots and Systems (IROS'92) , vol.2 , pp. 1327-1332
- Krose, B.J.A.¹ Van Dam, J.W.M.²

10
- 0008364145
- Reinforcement learning using functional approximation for generalization and their application to cart centering and fractal compression
- Stockholm, Sweden
- C. Claussen, S. Gutta, & H. Wechsler, Reinforcement learning using functional approximation for generalization and their application to cart centering and fractal compression, Proc. 16th Int. Joint Conf. on Artificial Intelligence, 2, Stockholm, Sweden, 1999, 1362-1367
- (1999) Proc. 16th Int. Joint Conf. on Artificial Intelligence , vol.2 , pp. 1362-1367
- Claussen, C.¹ Gutta, S.² Wechsler, H.³

11
- 0003505613
- Report TKK-F-A601, Helsinki University of Technology, Espoo, Finland
- T. Kohonen, Learning vector quantization for pattern recognition, Report TKK-F-A601, Helsinki University of Technology, Espoo, Finland, 1986.
- (1986) Learning Vector Quantization for Pattern Recognition
- Kohonen, T.¹

12
- 0004049893
- doctoral diss., King's College, Cambridge, UK
- C.J.C.H. Watkins, Learning from delayed rewards, doctoral diss., King's College, Cambridge, UK, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

13
- 0031341345
- Neural reinforcement learning for behaviour synthesis
- C. Touzet, Neural reinforcement learning for behaviour synthesis, Robotics and Autonomous Systems. 22(3-4), 1997, 251-282.
- (1997) Robotics and Autonomous Systems. , vol.22 , Issue.3-4 , pp. 251-282
- Touzet, C.¹

14
- 0003527079
- Berlin, Heidelberg: Springer
- T. Kohonen, Self-organization and associative memory, 3rd ed., (Berlin, Heidelberg: Springer, 1989).
- (1989) Self-organization and Associative Memory, 3rd Ed.
- Kohonen, T.¹

15
- 0003356379
- VQQL: Applying vector quantization to reinforcement learning
- Stockholm, Sweden: Springer Verlag
- F. Fernández and D. Borrajo, VQQL: Applying vector quantization to reinforcement learning, RoboCup-99: Robot Soccer World Cup III (Stockholm, Sweden: Springer Verlag, 2000).
- (2000) RoboCup-99: Robot Soccer World Cup III
- Fernández, F.¹ Borrajo, D.²

16
- 0020102027
- Least squares quantization in pcm
- S.P. Lloyd, Least squares quantization in pcm, IEEE Trans. on Information Theory, 28, 1982, 127-135.
- (1982) IEEE Trans. on Information Theory , vol.28 , pp. 127-135
- Lloyd, S.P.¹

17
- 0018918171
- An algorithm for vector quantizer design
- Com-28
- Y. Linde, A. Buzo, & R.M. Gray, An algorithm for vector quantizer design, IEEE Trans. on Communications, 1 (1), Com-28, 1980, 84-95.
- (1980) IEEE Trans. on Communications , vol.1 , Issue.1 , pp. 84-95
- Linde, Y.¹ Buzo, A.² Gray, R.M.³

18
- 0033312347
- A ease study for life-long learning and adaptation in cooperative robot teams
- Boston, MA
- L.E. Parker, A ease study for life-long learning and adaptation in cooperative robot teams, Proc. SPIE Sensor Fusion and Decentralized Control in Robotic Systems II, 3839, Boston, MA, 1999, 92-101.
- (1999) Proc. SPIE Sensor Fusion and Decentralized Control in Robotic Systems II , vol.3839 , pp. 92-101
- Parker, L.E.¹

19
- 0001790234
- Broadcast of local eligibility for multi-target observation
- L.E. Parker, G. Bekey, & J. Barhem (Eds.), Tokyo, Japan: Springer
- B.B. Werger & M. Matarić, Broadcast of local eligibility for multi-target observation, in L.E. Parker, G. Bekey, & J. Barhem (Eds.), Distributed Autonomous Robotic Systems, 4, (Tokyo, Japan: Springer, 2000), 347-356.
- (2000) Distributed Autonomous Robotic Systems , vol.4 , pp. 347-356
- Werger, B.B.¹ Matarić, M.²

20
- 0008303956
- Ultrafast neural network training for robot learning from uncertain data
- Tokyo
- J. Barhen & V. Protopopescu, Ultrafast neural network training for robot learning from uncertain data, Distributed Autonomous Robotic Systems, 4, Tokyo, 2000, 347-356.
- (2000) Distributed Autonomous Robotic Systems , vol.4 , pp. 347-356
- Barhen, J.¹ Protopopescu, V.²

21
- 0001534236
- Multi-robot learning in a cooperative observation task
- L.E. Parker, G. Bekey, & J. Barhem (Eds.), Tokyo, Japan: Springer
- L. Parker & C. Touzet, Multi-robot learning in a cooperative observation task, in L.E. Parker, G. Bekey, & J. Barhem (Eds.), Distributed Autonomous Robotic Systems, 4 (Tokyo, Japan: Springer, 2000) 391-401.
- (2000) Distributed Autonomous Robotic Systems , vol.4 , pp. 391-401
- Parker, L.¹ Touzet, C.²

22
- 5644301833
- Iterative VQQL for learning skills
- Leganés, Madrid, Spain
- F. Fernández & D. Borrajo, Iterative VQQL for learning skills, Proc. of Learning '00, Leganés, Madrid, Spain, 2000.
- (2000) Proc. of Learning '00
- Fernández, F.¹ Borrajo, D.²

23
- 34249833101
- Q-learning
- C.J.C.H. Watkins & P. Dayan, Q-learning, Machine learning, 8, 1992, 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

24
- 0004491880
- Robocup: The robot world cup initiative
- Montreal, Canada
- H. Kitano, M. Asada, Y. Kuniyoshi, I. Noda, & E. Osawa, Robocup: The robot world cup initiative, Proc. IJCAI-95 Workshop on Learning Robots, Montreal, Canada, 1995, 19-24.
- (1995) Proc. IJCAI-95 Workshop on Learning Robots , pp. 19-24
- Kitano, H.¹ Asada, M.² Kuniyoshi, Y.³ Noda, I.⁴ Osawa, E.⁵

25
- 0003356517
- Soccer server: A simulator of robocup
- Tokyo, Japan
- I. Noda, Soccer server: A simulator of robocup, Proc. 4th Int. Symposium'95, Tokyo, Japan, 1995, 29-34.
- (1995) Proc. 4th Int. Symposium'95 , pp. 29-34
- Noda, I.¹

26
- 0004267735
- Dordrecht: Kluwer
- D. Aha, ed., Lazy learning (Dordrecht: Kluwer, 1997).
- (1997) Lazy Learning
- Aha, D.¹

27
- 0032000094
- Multiple-prototype classifier design
- J.C. Bezdek, T.R. Rechherzer, G.S. Lim, & Y. Attikiouzel, Multiple-prototype classifier design, IEEE Trans. on Systems, Man and Cybernetics, 28(1), 1998, 67-79.
- (1998) IEEE Trans. on Systems, Man and Cybernetics , vol.28 , Issue.1 , pp. 67-79
- Bezdek, J.C.¹ Rechherzer, T.R.² Lim, G.S.³ Attikiouzel, Y.⁴

28
- 0028748949
- Growing cell structures: A self-organizing network for unsupervised and supervised learning
- B. Fritzke, Growing cell structures: A self-organizing network for unsupervised and supervised learning, Neural Networks, 7(9), 1994, 1441-1460.
- (1994) Neural Networks , vol.7 , Issue.9 , pp. 1441-1460
- Fritzke, B.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.