SCOPUS Information Retrieval Platform

International Journal of Robotics Research

Volume 27, Issue 3-4, 2008, Pages 505-526

Automated design of adaptive controllers for modular robots using reinforcement learning

(3) Varshavskaya, Paulina (a); Kaelbling, Leslie Pack (a); Rus, Daniela (a)

a MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Animation and simulation; Cellular and modular robots; Learning and adaptive systems

Indexed keywords

ALGORITHMS; AUTOMATION; REINFORCEMENT LEARNING; ROBOTS; SYSTEMS ANALYSIS;

CELLULAR AND MODULAR ROBOTS; HUMAN DESIGNERS;

ADAPTIVE CONTROL SYSTEMS;

EID: 40049102683 PISSN: 02783649 EISSN: 17413176 Source Type: Journal
DOI: 10.1177/0278364907084983 Document Type: Article

Times cited : (38)

References (37)

1
- 0013465186
- Leen, T. K., Dietterich, T. G. and Tresp, V. (eds). Cambridge, MA, MIT Press.
- Andre, D. and Russell, S. (2000). Programmable reinforcement learning agents. Advances in Neural Information Processing Systems Leen, T. K., Dietterich, T. G. and Tresp, V. (eds). Cambridge, MA, MIT Press.
- (2000) Programmable Reinforcement Learning Agents. Advances in Neural Information Processing Systems
- Andre, D.¹ Russell, S.²

2
- 84898962948
- Policy search by dynamic programming
- Cambridge, MA, MIT Press.
- Bagnell, J.A. et al. (2004). Policy search by dynamic programming. Advances in Neural Information Processing Systems, Vol. 16, Thrun, S., Saul, L. K. and Schölkopf, B. (eds). Cambridge, MA, MIT Press.
- (2004) Advances in Neural Information Processing Systems , vol.16
- Bagnell, J.A.¹

3
- 0013535965
- Infinite-horizon gradient-based policy search
- Baxter, J. and Bartlett, P.L. (2001). Infinite-horizon gradient-based policy search. Journal of Artificial Intelligence Research, 15: 319 - 350.
- (2001) Journal of Artificial Intelligence Research , vol.15 , pp. 319-350
- Baxter, J.¹ Bartlett, P.L.²

4
- 0003565783
- Belmont, MA, Athena Scientific.
- Bertsekas, D.P. (1995). Dynamic Programming and Optimal Control. Belmont, MA, Athena Scientific.
- (1995) Dynamic Programming and Optimal Control
- Bertsekas, D.P.¹

5
- 40049085465
- Cellular automata for decentralized control of self-reconfigurable robots
- Butler, Z. et al. (2001). Cellular automata for decentralized control of self-reconfigurable robots. Proceedings of the International Conference on Intelligent Robots and Systems, Wailea, HI.
- Proceedings of the International Conference on Intelligent Robots and Systems
- Butler, Z.¹

6
- 4444267710
- Generic distributed control for locomotion with self-reconfiguring robots
- Butler, Z. et al. (2004). Generic distributed control for locomotion with self-reconfiguring robots. International Journal of Robotics Research, 23 (9). 919 - 938.
- (2004) International Journal of Robotics Research , vol.23 , Issue.9 , pp. 919-938
- Butler, Z.¹

7
- 84899032145
- All learning is local: Multi-agent learning in global reward games
- Cambridge, MA, MIT Press.
- Chang, Y.-H., Ho, T. and Kaelbling, L.P. (2004). All learning is local: multi-agent learning in global reward games. Advances in Neural Information Processing Systems, Vol. 16, Thrun, S., Saul, L. K. and Schölkopf, B. (eds). Cambridge, MA, MIT Press.
- (2004) Advances in Neural Information Processing Systems , vol.16
- Chang, Y.-H.¹ Ho, T.² Kaelbling, L.P.³

8
- 5644261272
- Learning in large cooperative multi-robot domains
- Fernandez, F. and Parker, L.E. (2001). Learning in large cooperative multi-robot domains. International Journal of Robotics and Automation, 16 (4). 217 - 226.
- (2001) International Journal of Robotics and Automation , vol.16 , Issue.4 , pp. 217-226
- Fernandez, F.¹ Parker, L.E.²

9
- 40049097431
- A million-module march
- Fitch, R. and Butler, Z. (2006). A million-module march. Digital Proceedings of the RSS Workshop on Self-Reconfigurable Modular Robotics, Philadelphia, PA.
- Digital Proceedings of the RSS Workshop on Self-Reconfigurable Modular Robotics
- Fitch, R.¹ Butler, Z.²

10
- 0347410594
- Using policy gradient reinforcement learning on autonomous robot controllers
- Grudic, G., Kumar, V. and Ungar, L.H. (2003). Using policy gradient reinforcement learning on autonomous robot controllers. Proceedings of the International Conference on Intelligent Robots and Systems, Las Vegas, NV.
- Proceedings of the International Conference on Intelligent Robots and Systems
- Grudic, G.¹ Kumar, V.² Ungar, L.H.³

11
- 84899028010
- Multiagent planning with factored MDPs
- Cambridge, MA, MIT Press.
- Guestrin, C., Koller, D. and Parr, R. (2002). Multiagent planning with factored MDPs. Advances in Neural Information Processing Systems, Vol. 14 Dietterich, T. G., Becker, S. and Ghahramani, Z. (eds). Cambridge, MA, MIT Press.
- (2002) Advances in Neural Information Processing Systems , vol.14
- Guestrin, C.¹ Koller, D.² Parr, R.³

12
- 14044276980
- Distributed adaptive locomotion by a modular robotic system, M-TRAN II: from local adaptation to global coordinated motion using CPG controllers
- Kamimura, A. et al. (2004). Distributed adaptive locomotion by a modular robotic system, M-TRAN II: from local adaptation to global coordinated motion using CPG controllers. Proceedings of the International Conference on Intelligent Robots and Systems, Sendai, Japan.
- Proceedings of the International Conference on Intelligent Robots and Systems
- Kamimura, A.¹

13
- 0030655611
- RoboCup: The robot world cup initiative
- Kitano, H. et al. (1997). RoboCup: The robot world cup initiative. Proceedings of the 1st International Conference on Autonomous Agents (Agents'97), Johnson, W. L. and Hayes-Roth, B. (eds). New York, ACM Press, pp. 340- 347.
- Proceedings of the 1st International Conference on Autonomous Agents (Agents'97)
- Kitano, H.¹

14
- 33748543203
- Collaborative multiagent reinforcement learning by payoff propagation
- Kok, J.R. and Vlassis, N. (2006). Collaborative multiagent reinforcement learning by payoff propagation. Journal of Machine Learning Research, 7: 1789 - 1828.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1789-1828
- Kok, J.R.¹ Vlassis, N.²

15
- 33846134048
- Efficient locomotion for a self-reconfiguring robot
- Kotay, K. and Rus, D. (2005). Efficient locomotion for a self-reconfiguring robot. Proceedings of the International Conference on Robotics and Automation, Barcelona, Spain.
- Proceedings of the International Conference on Robotics and Automation
- Kotay, K.¹ Rus, D.²

16
- 0005598418
- Collaborating with a genetic programming system to generate modular robotic code
- Kubica, J. and Rieffel, E. (2002). Collaborating with a genetic programming system to generate modular robotic code. GECCO 2002: Proceedings of the Genetic and Evolutionary Computation Conference, W. et al. (ed). New York, Morgan Kaufmann, pp. 804-811.
- GECCO 2002: Proceedings of the Genetic and Evolutionary Computation Conference
- Kubica, J.¹ Rieffel, E.²

17
- 4644323293
- Least-squares policy iteration
- Lagoudakis, M.G. and Parr, R. (2003). Least-squares policy iteration. Journal of Machine Learning Research, 4: 1107 - 1149.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

18
- 40049092002
- Cambridge, MA, MIT Media Laboratory.
- Martin, M. (2004). The essential dynamics algorithm: fast policy search in continuous worlds. Vision and Modeling Technical Report 582, Cambridge, MA, MIT Media Laboratory.
- (2004) The Essential Dynamics Algorithm: Fast Policy Search in Continuous Worlds. Vision and Modeling Technical Report 582
- Martin, M.¹

19
- 0030647149
- Reinforcement learning in the multi-robot domain
- Mataric, M.J. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4 (1). 73 - 83.
- (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 73-83
- Mataric, M.J.¹

20
- 33845547298
- Designed and evolved blueprints for physical self-replicating machines
- Mytilinaios, E. et al. (2004). Designed and evolved blueprints for physical self-replicating machines. Proceedings of the 9th International Conference on Artificial Life (ALIFE IX), Boston, MA.
- Proceedings of the 9th International Conference on Artificial Life (ALIFE IX)
- Mytilinaios, E.¹

21
- 0036355195
- Programmable self-assembly using biologically-inspired multiagent control
- Nagpal, R. (2002). Programmable self-assembly using biologically-inspired multiagent control. Proceedings of the 1st International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), Bologna, Italy.
- Proceedings of the 1st International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS)
- Nagpal, R.¹

22
- 33744488034
- Autonomous inverted helicopter flight via reinforcement learning
- Ng, A.Y. et al. (2004). Autonomous inverted helicopter flight via reinforcement learning. Proceedings of the International Symposium on Experimental Robotics (ISER), Singapore.
- Proceedings of the International Symposium on Experimental Robotics (ISER)
- Ng, A.Y.¹

23
- 0141819580
- PEGASUS: A policy search method for large MDPs and POMDPs
- Ng, A.Y. and Jordan, M. (2000). PEGASUS: a policy search method for large MDPs and POMDPs. Proceedings of the International Conference on Uncertainty in AI (UAI), Stanford, CA.
- Proceedings of the International Conference on Uncertainty in AI (UAI)
- Ng, A.Y.¹ Jordan, M.²

24
- 0005943267
- Department of Computer Science.
- Peshkin, L. (2001). Reinforcement learning by policy search. Ph.D. Thesis, Brown University, Department of Computer Science.
- (2001) Reinforcement Learning by Policy Search. Ph.D. Thesis, Brown University
- Peshkin, L.¹

25
- 34250620650
- Learning movement primitives
- Schaal, S. et al. (2003). Learning movement primitives. Proceedings of the International Symposium on Robotics Research (ISRR), Siena, Italy.
- Proceedings of the International Symposium on Robotics Research (ISRR)
- Schaal, S.¹

26
- 0001395498
- Distributed value functions
- Schneider, J. et al. (1999). Distributed value functions. Proceedings of the International Conference on Machine Learning, Bled, Slovenia.
- Proceedings of the International Conference on Machine Learning
- Schneider, J.¹

27
- 4544279348
- Palo Alto, CA, Stanford University.
- Shoham, Y., Powers, R. and Grenager, T. (2003). Multi-agent reinforcement learning: a critical survey. Technical Report, Palo Alto, CA, Stanford University.
- (2003) Multi-agent Reinforcement Learning: A Critical Survey. Technical Report
- Shoham, Y.¹ Powers, R.² Grenager, T.³

28
- 0034205975
- Multiagent systems: A survey from a machine learning perspective
- Stone, P. and Veloso, M.M. (2000). Multiagent systems: a survey from a machine learning perspective. Autonomous Robots, 8 (3). 345 - 383.
- (2000) Autonomous Robots , vol.8 , Issue.3 , pp. 345-383
- Stone, P.¹ Veloso, M.M.²

29
- 85156221438
- Sutton, R.S. (1995). Generalization in reinforcement learning: successful examples using sparse coarse coding. Advances in Neural Information Processing Systems, Touretzky, D. S., Mozer, M. C. and Hasselmo, M. E. (eds). Cambridge, MA, MIT Press, pp. 1038 - 1044.
- (1995) Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding. Advances in Neural Information Processing Systems , pp. 1038-1044
- Sutton, R.S.¹

30
- 0004007508
- Cambridge, MA, MIT Press.
- Sutton, R.S. and Barto, A.G. (1998). Reinforcement Learning. Cambridge, MA, MIT Press.
- (1998) Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

31
- 84898939480
- Leen, T. K., Dietterich, T. G. and Tresp, V. (eds), Vol. 12. Cambridge, MA, MIT Press.
- Sutton, R.S. et al. (2000). Policy gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems, Leen, T. K., Dietterich, T. G. and Tresp, V. (eds), Vol. 12. Cambridge, MA, MIT Press.
- (2000) Policy Gradient Methods for Reinforcement Learning with Function Approximation. Advances in Neural Information Processing Systems
- Sutton, R.S.¹

32
- 34250679869
- Learning to walk in 20 minutes
- Tedrake, R., Zhang, T.W. and Seung, H.S. (2005). Learning to walk in 20 minutes. Proceedings of the 14th Yale Workshop on Adaptive and Learning Systems, New Haven, CT.
- Proceedings of the 14th Yale Workshop on Adaptive and Learning Systems
- Tedrake, R.¹ Zhang, T.W.² Seung, H.S.³

33
- 40049087029
- Distributed learning for modular robots
- Varshavskaya, P., Kaelbling, L.P. and Rus, D. (2004). Distributed learning for modular robots. Proceedings of the International Conference on Intelligent Robots and Systems, Sendai, Japan.
- Proceedings of the International Conference on Intelligent Robots and Systems
- Varshavskaya, P.¹ Kaelbling, L.P.² Rus, D.³

34
- 40049101475
- On scalability issues in reinforcement learning for self-reconfiguring modular robots
- Varshavskaya, P., Kaelbling, L.P. and Rus, D. (2006). On scalability issues in reinforcement learning for self-reconfiguring modular robots. Digital Proceedings of RSS Workshop on Self-Reconfigurable Modular Robotics, Philadelphia, PA.
- Digital Proceedings of RSS Workshop on Self-Reconfigurable Modular Robotics
- Varshavskaya, P.¹ Kaelbling, L.P.² Rus, D.³

35
- 34249833101
- Q-learning
- Watkins, C.J.C.H. and Dayan, P. (1992). Q-learning. Machine Learning, 8 (3 - 4). 279 - 292.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

36
- 40049106373
- Using interaction-based learning to construct an adaptive and fault-tolerant multi-link floating robot
- Yu, W. et al. (2002). Using interaction-based learning to construct an adaptive and fault-tolerant multi-link floating robot. Proceedings of the International Workshop on Distributed Autonomous Robotic Systems (DARS), Vol. 5, Asama, H., Arai, T., Fukuda, T. and Hasegawa, T. (eds). Berlin, Springer, pp. 455-464.
- Proceedings of the International Workshop on Distributed Autonomous Robotic Systems (DARS)
- Yu, W.¹

37
- 18744399581
- Self-reproducing machines
- Zykov, V. et al. (2005). Self-reproducing machines. Nature, 435 (7038). 163 - 164.
- (2005) Nature , vol.435 , Issue.7038 , pp. 163-164
- Zykov, V.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.