메뉴 건너뛰기




Volumn 38, Issue 7, 2011, Pages 8477-8487

A hybrid agent architecture integrating desire, intention and reinforcement learning

Author keywords

BDI architecture; Minefield navigation; Plan learning; Reinforcement learning; Self organizing neural networks

Indexed keywords

ACTION EXECUTION; BDI AGENT; BDI ARCHITECTURE; DELIBERATIVE PLANNING; EMPIRICAL RESULTS; FUSION ARCHITECTURE; HYBRID AGENT ARCHITECTURE; HYBRID ARCHITECTURES; LEARNING MODULES; MINEFIELD NAVIGATION; PLAN LEARNING; REAL-TIME ENVIRONMENT; REINFORCEMENT SIGNAL; ROBUST PERFORMANCE; SELF-ORGANIZING NEURAL NETWORKS; TEMPORAL DIFFERENCES;

EID: 79952438883     PISSN: 09574174     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.eswa.2011.01.045     Document Type: Article
Times cited : (27)

References (31)
  • 1
    • 0041386587 scopus 로고    scopus 로고
    • Embodied cognition: A field guide
    • M. Anderson Embodied cognition: A field guide Artificial Intelligence 149 2003 91 130
    • (2003) Artificial Intelligence , vol.149 , pp. 91-130
    • Anderson, M.1
  • 5
    • 0021776661 scopus 로고
    • A massively parallel architecture for a self-organizing neural pattern recognition machine
    • G.A. Carpenter, and S. Grossberg A massively parallel architecture for a self-organizing neural pattern recognition machine Computer Vision, Graphics, and Image Processing 37 1987 54 115
    • (1987) Computer Vision, Graphics, and Image Processing , vol.37 , pp. 54-115
    • Carpenter, G.A.1    Grossberg, S.2
  • 6
    • 0026408256 scopus 로고
    • Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system
    • G.A. Carpenter, S. Grossberg, and D.B. Rosen Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system Neural Networks 4 1991 759 771
    • (1991) Neural Networks , vol.4 , pp. 759-771
    • Carpenter, G.A.1    Grossberg, S.2    Rosen, D.B.3
  • 13
    • 5644238775 scopus 로고    scopus 로고
    • Sequence Learning in the ACT-R Cognitive Architecture: Empirical Analysis of a Hybrid Model
    • Sequence Learning Paradigms, Algorithms, and Applications
    • C. Lebiere, and D. Wallach Sequence learning in the act-r cognitive architecture: Empirical analysis of a hybrid model R. Sun, C. Giles, Sequence Learning LNAI Vol. 1828 2000 Springer-Verlag 188 212 (Pubitemid 33221930)
    • (2001) Lecture Notes in Computer Science , Issue.1828 , pp. 188-212
    • Lebiere, C.1    Wallach, D.2
  • 14
    • 4544286287 scopus 로고    scopus 로고
    • Folk psychology for human modelling: Extending the BDI paradigm
    • New York, USA
    • Norling, E. (2004). Folk psychology for human modelling: Extending the BDI paradigm. In Proceedings, AAMAS'04, New York, USA (pp. 202-209).
    • (2004) Proceedings, AAMAS'04 , pp. 202-209
    • Norling, E.1
  • 18
    • 0033725516 scopus 로고    scopus 로고
    • Beyond simple rule extraction: The extraction of planning knowledge from reinforcement learners
    • IEEE Press Piscataway, NJ
    • R. Sun Beyond simple rule extraction: The extraction of planning knowledge from reinforcement learners Proceedings of the international joint conference on neural networks, Como, Italy 2000 IEEE Press Piscataway, NJ 24 27
    • (2000) Proceedings of the International Joint Conference on Neural Networks, Como, Italy , pp. 24-27
    • Sun, R.1
  • 19
    • 1642457555 scopus 로고    scopus 로고
    • Sequence learning: Paradigms, algorithms, and applications
    • Springer-Verlag
    • R. Sun, C. Giles, Sequence learning: Paradigms, algorithms, and applications LNAI Vol. 1828 2000 Springer-Verlag
    • (2000) LNAI , vol.1828
    • Sun, R.1    Giles, C.2
  • 20
    • 24944558582 scopus 로고    scopus 로고
    • Learning Plans without a priori Knowledge
    • R. Sun, and C. Sessions Learning plans without a priori knowledge Adaptive Behavior 8 3/4 2000 225 254 (Pubitemid 34334586)
    • (2000) Adaptive Behavior , vol.8 , Issue.3-4 , pp. 225-254
    • Sun, R.1    Sessions, C.2
  • 22
    • 0029009071 scopus 로고
    • Adaptive resonance associative map
    • A.-H. Tan Adaptive resonance associative map Neural Networks 8 3 1995 437 446
    • (1995) Neural Networks , vol.8 , Issue.3 , pp. 437-446
    • Tan, A.-H.1
  • 23
    • 10944258804 scopus 로고    scopus 로고
    • FALCON: A fusion architecture for learning, cognition, and navigation
    • 2004 IEEE International Joint Conference on Neural Networks - Proceedings
    • Tan, A.-H. (2004). FALCON: A fusion architecture for learning, cognition, and navigation. In Proceedings of the international joint conference on neural networks (pp. 3297-3302). (Pubitemid 40011563)
    • (2004) IEEE International Conference on Neural Networks - Conference Proceedings , vol.4 , pp. 3297-3302
    • Tan, A.-H.1
  • 25
    • 33846312864 scopus 로고    scopus 로고
    • Self-organizing cognitive agents and reinforcement learning in multi-agent environment
    • DOI 10.1109/IAT.2005.125, 1565565, Proceedings - 2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT'05
    • Tan, A.-H., Xiao, D. (2005). Self-organizing cognitive agents and reinforcement learning in a multi-agent environment. In Proceedings of the IEEE/WIC/ACM international conference on intelligent agent technology (IAT'05), France (pp. 351-357). (Pubitemid 46116575)
    • (2005) Proceedings - 2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT'05 , vol.2005 , pp. 351-357
    • Tan, A.-H.1    Xiao, D.2
  • 26
    • 37249071335 scopus 로고    scopus 로고
    • Intelligence through interaction: Towards a unified theory for learning
    • Tan, A.-H., Carpenter, G. A., Grossberg, S. (2007). Intelligence through interaction: Towards a unified theory for learning. In Proceedings of ISNN, LNCS (Vol. 4491, pp. 1098-1107).
    • (2007) Proceedings of ISNN, LNCS , vol.4491 , pp. 1098-1107
    • Tan, A.-H.1    Carpenter, G.A.2    Grossberg, S.3
  • 27
    • 40549121994 scopus 로고    scopus 로고
    • Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback
    • DOI 10.1109/TNN.2007.905839
    • A.-H. Tan, N. Lu, and D. Xiao Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback IEEE Transactions on Neural Networks 9 2 2008 230 244 (Pubitemid 351359289)
    • (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.2 , pp. 230-244
    • Tan, A.-H.1    Lu, N.2    Xiao, D.3
  • 28
    • 19344363436 scopus 로고    scopus 로고
    • Predictive neural networks for gene expression data analysis
    • DOI 10.1016/j.neunet.2005.01.003, PII S0893608005000237
    • A.-H. Tan, and H. Pan Predictive neural networks for gene expression data analysis Neural Networks 18 3 2005 297 306 (Pubitemid 40719531)
    • (2005) Neural Networks , vol.18 , Issue.3 , pp. 297-306
    • Tan, A.-H.1    Pan, H.2
  • 29
  • 31
    • 36749092785 scopus 로고    scopus 로고
    • Self-organizing neural architectures and cooperative learning in a multiagent environment
    • DOI 10.1109/TSMCB.2007.907040
    • D. Xiao, and A.-H. Tan Self-organizing neural architectures and cooperative learning in multi-agent environment IEEE Transactions on Systems, Man, and Cybernetics - Part B 37 6 2007 1567 1580 (Pubitemid 350201163)
    • (2007) IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics , vol.37 , Issue.6 , pp. 1567-1580
    • Xiao, D.1    Tan, A.-H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.