-
2
-
-
0000494894
-
Computationally feasible bounds for partially observed Markov decision processes
-
Jan./Feb
-
W. S. Lovejoy, "Computationally feasible bounds for partially observed Markov decision processes," Oper. Res., vol. 39, no. 1, pp. 162-175, Jan./Feb. 1991.
-
(1991)
Oper. Res
, vol.39
, Issue.1
, pp. 162-175
-
-
Lovejoy, W.S.1
-
3
-
-
10944258804
-
FALCON: A fusion architecture for learning, cognition, and navigation
-
Budapest, Hungary
-
A. H. Tan, "FALCON: A fusion architecture for learning, cognition, and navigation," in Proc. IJCNN, Budapest, Hungary, 2004, pp. 3297-3302.
-
(2004)
Proc. IJCNN
, pp. 3297-3302
-
-
Tan, A.H.1
-
4
-
-
33846312864
-
Self-organizing cognitive agents and reinforcement learning in a multi-agent environment
-
A.-H. Tan and D. Xiao, "Self-organizing cognitive agents and reinforcement learning in a multi-agent environment," in Proc. IEEE/ACM/WIC Int. Conf. Intell. Agent Technol., 2005, pp. 351-357.
-
(2005)
Proc. IEEE/ACM/WIC Int. Conf. Intell. Agent Technol
, pp. 351-357
-
-
Tan, A.-H.1
Xiao, D.2
-
5
-
-
40549121994
-
Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback
-
to be published
-
A. H. Tan, N. Lu, and D. Xiao, "Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback," IEEE Trans. Neural Netw., to be published.
-
IEEE Trans. Neural Netw
-
-
Tan, A.H.1
Lu, N.2
Xiao, D.3
-
7
-
-
0035740527
-
From implicit skills to explicit knowledge: A bottom-up model of skill learning
-
Mar
-
R. Sun, E. Merrill, and T. Peterson, "From implicit skills to explicit knowledge: A bottom-up model of skill learning," Cogn. Sci., vol. 25, no. 2, pp. 203-244, Mar. 2001.
-
(2001)
Cogn. Sci
, vol.25
, Issue.2
, pp. 203-244
-
-
Sun, R.1
Merrill, E.2
Peterson, T.3
-
8
-
-
36749051994
-
-
M. Riedmiller and H. Braun, "RPROP - A fast adaptive learning algorithm," Univ. Karlsruhe, Karlsruhe, Germany, 1992. Tech. Rep. (Also Proc. of ISCIS VII).
-
-
-
-
9
-
-
84943274699
-
A direct adaptive method for faster back-propagation learning: The RPROP algorithm
-
San Francisco, CA
-
M. Riedmiller and H. Braun, "A direct adaptive method for faster back-propagation learning: The RPROP algorithm," in Proc. IEEE Int. Conf. Neural Netw., San Francisco, CA, 1993, pp. 586-591.
-
(1993)
Proc. IEEE Int. Conf. Neural Netw
, pp. 586-591
-
-
Riedmiller, M.1
Braun, H.2
-
10
-
-
36749072878
-
-
M. Brenda, V. Jagannathan, and R. Dodhiawala, "On optimal cooperation of knowledge sources - An empirical investigation," Boeing Adv. Technol. Center, Boeing Comput. Services, Seattle, WA, Tech. Rep. BCS-G2010-28, 1986.
-
-
-
-
11
-
-
27144475166
-
-
George Mason Univ, Fairfax, VA, Tech. Rep. GMU-CS-TR-2003-1
-
L. Panait and S. Luke, "Cooperative multi-agent learning: The state of the art," George Mason Univ., Fairfax, VA, Tech. Rep. GMU-CS-TR-2003-1, 2003.
-
(2003)
Cooperative multi-agent learning: The state of the art
-
-
Panait, L.1
Luke, S.2
-
12
-
-
31744441013
-
Analysis of a master-slave architecture for distributed evolutionary computations
-
Feb
-
M. Dubreuil, C. Gagne, and M. Parizeau, "Analysis of a master-slave architecture for distributed evolutionary computations," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 36, no. 1, pp. 229-235, Feb. 2006.
-
(2006)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.36
, Issue.1
, pp. 229-235
-
-
Dubreuil, M.1
Gagne, C.2
Parizeau, M.3
-
14
-
-
1842535228
-
Modular fuzzy-reinforcement learning approach with internal model capabilities for multiagent systems
-
Apr
-
M. Kaya and R. Alhajj, "Modular fuzzy-reinforcement learning approach with internal model capabilities for multiagent systems," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34, no. 2, pp. 1210-1223, Apr. 2004.
-
(2004)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.34
, Issue.2
, pp. 1210-1223
-
-
Kaya, M.1
Alhajj, R.2
-
15
-
-
17444385973
-
Fuzzy OLAP association rules mining-based modular reinforcement learning approach for multiagent systems
-
Apr
-
M. Kaya and R. Alhajj, "Fuzzy OLAP association rules mining-based modular reinforcement learning approach for multiagent systems," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 2, pp. 326-338, Apr. 2005.
-
(2005)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.35
, Issue.2
, pp. 326-338
-
-
Kaya, M.1
Alhajj, R.2
-
16
-
-
34249833101
-
Q-learning
-
C. J. C. H. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, no. 3/4, pp. 279-292, 1992.
-
(1992)
Mach. Learn
, vol.8
, Issue.3-4
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
-
17
-
-
33947192713
-
Q-learning based market-driven multi-agent collaboration in robot soccer
-
Izmir, Turkey
-
H. Kose, U. Tatlidede, C. Mericli, K. Kaplan, and H. L. Akin, "Q-learning based market-driven multi-agent collaboration in robot soccer," in Proc. Turkish Symp. Artif. Intell. Neural Netw., Izmir, Turkey, 2004, pp. 219-228.
-
(2004)
Proc. Turkish Symp. Artif. Intell. Neural Netw
, pp. 219-228
-
-
Kose, H.1
Tatlidede, U.2
Mericli, C.3
Kaplan, K.4
Akin, H.L.5
-
18
-
-
17044414523
-
Gradient descent for symmetric and asymmetric multiagent reinforcement learning
-
V. Kononen, "Gradient descent for symmetric and asymmetric multiagent reinforcement learning," Web Intell. Agent Syst.: Int. J. (WIAS), vol. 3, no. 1, pp. 17-30, 2005.
-
(2005)
Web Intell. Agent Syst.: Int. J. (WIAS)
, vol.3
, Issue.1
, pp. 17-30
-
-
Kononen, V.1
-
19
-
-
36749089731
-
-
E. F. Yang and D. B. Gu, "Multiagent reinforcement learning for multi-robot systems: A survey," Dep. Comput. Sci., Univ. Essex, Colchester, U.K., Tech. Rep. CSM-404, 2004.
-
-
-
-
20
-
-
17444424596
-
Cooperative multiagent congestion control for high-speed networks
-
Apr
-
K. S. Hwang, S. W. Tan, M. C. Hsiao, and C. S. Wu, "Cooperative multiagent congestion control for high-speed networks," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 2, pp. 255-268, Apr. 2005.
-
(2005)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.35
, Issue.2
, pp. 255-268
-
-
Hwang, K.S.1
Tan, S.W.2
Hsiao, M.C.3
Wu, C.S.4
-
21
-
-
0005807053
-
Evolving cooperative communicating classifier systems
-
L. Bull and T. C. Fogarty, "Evolving cooperative communicating classifier systems," in Proc. 4th Annu. Conf. Evol. Program., 1994, pp. 308-315.
-
(1994)
Proc. 4th Annu. Conf. Evol. Program
, pp. 308-315
-
-
Bull, L.1
Fogarty, T.C.2
-
23
-
-
84962044659
-
The moving target function problem in multi-agent learning
-
Paris, France, Jul
-
J. M. Vidal and E. H. Durfee, "The moving target function problem in multi-agent learning," in Proc. 3rd Int. Conf. Multi-Agent Syst., Paris, France, Jul. 1998, pp. 317-324.
-
(1998)
Proc. 3rd Int. Conf. Multi-Agent Syst
, pp. 317-324
-
-
Vidal, J.M.1
Durfee, E.H.2
-
25
-
-
0001309161
-
Optimal payoff functions for members of collectives
-
D. H. Wolpert and K. Turner, "Optimal payoff functions for members of collectives," Adv. Complex Systems, vol. 4, no. 2/3, pp. 265-279, 2001.
-
(2001)
Adv. Complex Systems
, vol.4
, Issue.2-3
, pp. 265-279
-
-
Wolpert, D.H.1
Turner, K.2
-
26
-
-
36749006381
-
-
T. Balch, "Learning roles: Behavioural diversity in robot teams," Georgia Inst. Technol., Atlanta, GA, Tech. Rep. GIT-CC-97-12, 1997.
-
-
-
-
28
-
-
0031630561
-
The dynamics of reinforcement learning in cooperative multiagent systems
-
C. Claus and C. Boutilier, "The dynamics of reinforcement learning in cooperative multiagent systems," in Proc. Nat. Conf. Artif. Intell., 1998, pp. 746-752.
-
(1998)
Proc. Nat. Conf. Artif. Intell
, pp. 746-752
-
-
Claus, C.1
Boutilier, C.2
-
29
-
-
0003863106
-
-
Comput. Sci. Dept, Carnegie Mellon Univ, Pittsburgh, PA, Tech. Rep. CMU-CS-00-165
-
M. Bowling and M. Veloso, "An analysis of stochastic game theory for multiagent reinforcement learning," Comput. Sci. Dept., Carnegie Mellon Univ., Pittsburgh, PA, Tech. Rep. CMU-CS-00-165, 2000.
-
(2000)
An analysis of stochastic game theory for multiagent reinforcement learning
-
-
Bowling, M.1
Veloso, M.2
-
30
-
-
1142268794
-
Towards a Pareto-optimal solution in general-sum games
-
Melbourne, Australia
-
R. Mukherjee and S. Sen, "Towards a Pareto-optimal solution in general-sum games," in Proc. 2nd Int. Joint Conf. AAMAS, Melbourne, Australia, 2003, pp. 153-160.
-
(2003)
Proc. 2nd Int. Joint Conf. AAMAS
, pp. 153-160
-
-
Mukherjee, R.1
Sen, S.2
-
31
-
-
84962092523
-
Evaluating concurrent reinforcement learners
-
Boston, MA
-
M. Mundhe and S. Sen, "Evaluating concurrent reinforcement learners," in Proc. 4th ICMAS, Boston, MA, 2000, pp. 421-422.
-
(2000)
Proc. 4th ICMAS
, pp. 421-422
-
-
Mundhe, M.1
Sen, S.2
-
32
-
-
0032359707
-
Individual learning of coordination knowledge
-
Jul
-
S. Sen and M. Sekaran, "Individual learning of coordination knowledge," J. Exp. Theor. Artif. Intell., vol. 10, no. 3, pp. 333-356, Jul. 1998.
-
(1998)
J. Exp. Theor. Artif. Intell
, vol.10
, Issue.3
, pp. 333-356
-
-
Sen, S.1
Sekaran, M.2
-
33
-
-
0028555752
-
Learning to coordinate without sharing information
-
Seattle, WA
-
S. Sen, M. Sekaran, and J. Hale, "Learning to coordinate without sharing information," in Proc. 12th Nat. Conf. Artif. Intell., Seattle, WA, 1994, pp. 426-431.
-
(1994)
Proc. 12th Nat. Conf. Artif. Intell
, pp. 426-431
-
-
Sen, S.1
Sekaran, M.2
Hale, J.3
-
34
-
-
85149834820
-
Markov games as a framework for multiagent reinforcement learning
-
San Francisco, CA
-
M. L. Littman, "Markov games as a framework for multiagent reinforcement learning," in Proc. 11th Int. Conf. Machine Learning, San Francisco, CA, 1994, pp. 157-163.
-
(1994)
Proc. 11th Int. Conf. Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
35
-
-
85152198941
-
Multi-agent reinforcement learning: Independent vs. cooperative agents
-
M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in Proc. 10th Int. Conf. Machine Learning, 1993, pp. 330-337.
-
(1993)
Proc. 10th Int. Conf. Machine Learning
, pp. 330-337
-
-
Tan, M.1
-
36
-
-
0000929496
-
Multiagent reinforcement learning: Theoretical framework and an algorithm
-
J. Hu and M. Wellman, "Multiagent reinforcement learning: Theoretical framework and an algorithm," in Proc. 15th Int. Conf. Machine Learning, 1998, pp. 242-250.
-
(1998)
Proc. 15th Int. Conf. Machine Learning
, pp. 242-250
-
-
Hu, J.1
Wellman, M.2
-
38
-
-
0021776661
-
A massively parallel architecture for a self-organizing neural pattern recognition machine
-
Jan
-
G. A. Carpenter and S. Grossberg, "A massively parallel architecture for a self-organizing neural pattern recognition machine," Comput. Vis. Graph. Image Process., vol. 37, no. 1, pp. 54-115, Jan. 1987.
-
(1987)
Comput. Vis. Graph. Image Process
, vol.37
, Issue.1
, pp. 54-115
-
-
Carpenter, G.A.1
Grossberg, S.2
-
39
-
-
84973857317
-
ART 2: Self-organization of stable category recognition codes for analog input patterns
-
Dec
-
G. A. Carpenter and S. Grossberg, "ART 2: Self-organization of stable category recognition codes for analog input patterns," Appl. Opt., vol. 26, no. 23, pp. 4919-4930, Dec. 1987.
-
(1987)
Appl. Opt
, vol.26
, Issue.23
, pp. 4919-4930
-
-
Carpenter, G.A.1
Grossberg, S.2
-
41
-
-
0026408256
-
Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system
-
G. A. Carpenter, S. Grossberg, and D. B. Rosen, "Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system," Neural Netw., vol. 4, pp. 759-771, 1991.
-
(1991)
Neural Netw
, vol.4
, pp. 759-771
-
-
Carpenter, G.A.1
Grossberg, S.2
Rosen, D.B.3
-
42
-
-
18144423305
-
Structure-adaptable digital neural networks
-
Ph.D. dissertation, Swiss Federal Inst. Technol.-Lausanne, Lausanne, Switzerland
-
A. Pérez-Uribe, "Structure-adaptable digital neural networks," Ph.D. dissertation, Swiss Federal Inst. Technol.-Lausanne, Lausanne, Switzerland, 2002.
-
(2002)
-
-
Pérez-Uribe, A.1
-
43
-
-
0001842850
-
Bottom-up skill learning in reactive sequential decision tasks
-
R. Sun, T. Peterson, and E. Merrill, "Bottom-up skill learning in reactive sequential decision tasks," in Proc. 18th Cognitive Sci. Soc. Conf., 1996, pp. 684-690.
-
(1996)
Proc. 18th Cognitive Sci. Soc. Conf
, pp. 684-690
-
-
Sun, R.1
Peterson, T.2
Merrill, E.3
-
44
-
-
85007487673
-
A new learning rates adaptation strategy for the resilient propagation algorithm
-
A. D. Anastasiadis, G. D. Magoulas, and M. N. Vrahatis, "A new learning rates adaptation strategy for the resilient propagation algorithm," in Proc. ESANN, 2004, pp. 1-6.
-
(2004)
Proc. ESANN
, pp. 1-6
-
-
Anastasiadis, A.D.1
Magoulas, G.D.2
Vrahatis, M.N.3
-
45
-
-
0031363808
-
Layered learning in multiagent systems
-
P. Stone, "Layered learning in multiagent systems," in Proc. AAAI/IAAI, 1997, p. 819.
-
(1997)
Proc. AAAI/IAAI
, p. 819
-
-
Stone, P.1
-
46
-
-
0010276169
-
Experiments in learning prototypical situations for variants of the pursuit game
-
Kyoto, Japan
-
J. Denzinger and M. Fuchs, "Experiments in learning prototypical situations for variants of the pursuit game," in Proc. 2nd ICMAS, Kyoto, Japan, 1996, pp. 48-55.
-
(1996)
Proc. 2nd ICMAS
, pp. 48-55
-
-
Denzinger, J.1
Fuchs, M.2
-
47
-
-
7444263056
-
On customizing evolutionary learning of agent behavior
-
J. Denzinger and A. Schur, "On customizing evolutionary learning of agent behavior," in Proc. Can. Conf. AI, 2004, pp. 146-160.
-
(2004)
Proc. Can. Conf. AI
, pp. 146-160
-
-
Denzinger, J.1
Schur, A.2
-
48
-
-
0030050933
-
Multiagent reinforcement learning in the iterated prisoner's dilemma
-
T. W. Sandholm and R. H. Crites, "Multiagent reinforcement learning in the iterated prisoner's dilemma," Biosystems, vol. 37, no. 1, pp. 147-166, 1995.
-
(1995)
Biosystems
, vol.37
, Issue.1
, pp. 147-166
-
-
Sandholm, T.W.1
Crites, R.H.2
-
49
-
-
0002335248
-
Multi-agent reinforcement learning: A modular approach
-
N. Ono and K. Fukumoto, "Multi-agent reinforcement learning: A modular approach," in Proc. 2nd Int. Conf. Multi-Agent Syst., 1996, pp. 252-258.
-
(1996)
Proc. 2nd Int. Conf. Multi-Agent Syst
, pp. 252-258
-
-
Ono, N.1
Fukumoto, K.2