메뉴 건너뛰기




Volumn , Issue , 2008, Pages 1806-1811

Reinforcement learning based on modular fuzzy model with gating unit

Author keywords

Modular fuzzy model; Modular neural network; Neural network; Q Learning; Reinforcement learning

Indexed keywords

AUTONOMOUS MOBILE ROBOT; BEHAVIOR CONTROL; CONVENTIONAL MODELS; CONVERGENCE PROPERTIES; CURSE OF DIMENSIONALITY; FUZZY MODELS; LEARNING PARAMETERS; LEARNING PROCESS; MODULAR FUZZY MODEL; MODULAR NEURAL NETWORK; NEURAL NETWORK MODEL; NUMERICAL EXAMPLE; Q-LEARNING; Q-LEARNING ALGORITHMS; ROBOT TASKS; SENSORY STATE; TASK DECOMPOSITION;

EID: 69949174783     PISSN: 1062922X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICSMC.2008.4811551     Document Type: Conference Paper
Times cited : (6)

References (23)
  • 2
    • 34249833101 scopus 로고
    • Technical Note: Q-Leaning
    • C. J. H. Watkins and P. Dayan, "Technical Note: Q-Leaning", Machine Learning, Vol. 8, pp. 58-68, 1992
    • (1992) Machine Learning , vol.8 , pp. 58-68
    • Watkins, C.J.H.1    Dayan, P.2
  • 3
    • 0000146518 scopus 로고
    • Credit Assignment in Rule Discovery Systems Based on Genetic Algorithms
    • J. J. Grefenstette, "Credit Assignment in Rule Discovery Systems Based on Genetic Algorithms," Machine Learning, Vol. 3, pp. 225-245, 1988.
    • (1988) Machine Learning , vol.3 , pp. 225-245
    • Grefenstette, J.J.1
  • 4
    • 0007987833 scopus 로고    scopus 로고
    • Theory and Application of Reinforcement Learning Based on Profit Sharing
    • K. Miyazaki, H. Kimura, and S. Kobayashi, "Theory and Application of Reinforcement Learning Based on Profit Sharing," J. of JSAI, Vol. 14, No. 5, pp. 800-807, 1999.
    • (1999) J. of JSAI , vol.14 , Issue.5 , pp. 800-807
    • Miyazaki, K.1    Kimura, H.2    Kobayashi, S.3
  • 5
    • 0007982016 scopus 로고    scopus 로고
    • A Theory of Profit Sharing in Multi-agent Reinforcement Learning
    • K. Miyazaki, S. Arai, and S. Kobayashi, "A Theory of Profit Sharing in Multi-agent Reinforcement Learning," J. of JSAI, Vol. 14, No. 6, pp. 1156-1164, 1999.
    • (1999) J. of JSAI , vol.14 , Issue.6 , pp. 1156-1164
    • Miyazaki, K.1    Arai, S.2    Kobayashi, S.3
  • 6
    • 0003529066 scopus 로고
    • On Optimal Cooperation of Knowledge Sources,
    • Technical Report, BCS-G2010-28, Boeing AI Center
    • M. Benda, V. Jagannathan, and R. Dodhiawalla, "On Optimal Cooperation of Knowledge Sources," Technical Report, BCS-G2010-28, Boeing AI Center, 1985.
    • (1985)
    • Benda, M.1    Jagannathan, V.2    Dodhiawalla, R.3
  • 7
    • 69949116392 scopus 로고    scopus 로고
    • A. Ito and M. Kanabuchi, Speeding up Multi-Agent Reinforcement Learning by Coarse-Graining of Perception-Hunter Game as an Example-, Trans. of IEICE, J84-D-1, No. 3, pp. 285-293, 2001.
    • A. Ito and M. Kanabuchi, "Speeding up Multi-Agent Reinforcement Learning by Coarse-Graining of Perception-Hunter Game as an Example-," Trans. of IEICE, Vol. J84-D-1, No. 3, pp. 285-293, 2001.
  • 9
    • 0001100659 scopus 로고    scopus 로고
    • Acquisition of Stand-up Behavior by a Real Robot using Hierarchical Reinforcement Learning
    • J. Morimoto and K Doya, "Acquisition of Stand-up Behavior by a Real Robot using Hierarchical Reinforcement Learning," Proc, of International Conference on Machine Learning, pp. 623-630, 2000.
    • (2000) Proc, of International Conference on Machine Learning , pp. 623-630
    • Morimoto, J.1    Doya, K.2
  • 11
    • 69949121296 scopus 로고    scopus 로고
    • K. Fujita and H. Matsuo, Multi-agent Reinforcement Learning with the Partly High-Dimensional State Space, Trans, of IEICE, J88-D-1, No. 4, pp. 864-872, 2005.
    • K. Fujita and H. Matsuo, "Multi-agent Reinforcement Learning with the Partly High-Dimensional State Space," Trans, of IEICE, Vol. J88-D-1, No. 4, pp. 864-872, 2005.
  • 13
    • 69949168432 scopus 로고    scopus 로고
    • T. Hamagami, S. Koakutsu, and H. Hirata, An Adjustment Method of the Number of States on Q-Learning Segmenting State Space Adaptively, Trans. of IEICE, J86-D1, No. 7, pp. 490-499, 2003.
    • T. Hamagami, S. Koakutsu, and H. Hirata, "An Adjustment Method of the Number of States on Q-Learning Segmenting State Space Adaptively," Trans. of IEICE, Vol. J86-D1, No. 7, pp. 490-499, 2003.
  • 14
    • 40949113901 scopus 로고    scopus 로고
    • On the Generalization of Single Input Rule Modules Connected Type Fuzzy Reasoning Method
    • H. Seki, H. Ishii, and M. Mizumoto: "On the Generalization of Single Input Rule Modules Connected Type Fuzzy Reasoning Method," Proc. of the SCIS&ISIS2006, pp. 30-34, 2006.
    • (2006) Proc. of the SCIS&ISIS2006 , pp. 30-34
    • Seki, H.1    Ishii, H.2    Mizumoto, M.3
  • 15
    • 0001568172 scopus 로고    scopus 로고
    • SIRMs Dynamically Connected Fuzzy Inference Model and Its Applications
    • 97, 3, pp
    • N. Yubazaki, J. Yi, M. Otani and K Hirota, "SIRMs Dynamically Connected Fuzzy Inference Model and Its Applications," Proc. IFSA '97, vol. 3, pp. 410-415, 1997.
    • (1997) Proc. IFSA , pp. 410-415
    • Yubazaki, N.1    Yi, J.2    Otani, M.3    Hirota, K.4
  • 16
    • 40949136350 scopus 로고    scopus 로고
    • Learning of Agent Behavior Based on Hierarchical Modular Reinforcement Learning
    • Y. Takahashi and T. Watanabe: "Learning of Agent Behavior Based on Hierarchical Modular Reinforcement Learning," Proc, of the SCIS&ISIS2006, pp. 90-94, 2006.
    • (2006) Proc, of the SCIS&ISIS2006 , pp. 90-94
    • Takahashi, Y.1    Watanabe, T.2
  • 17
    • 40949162812 scopus 로고    scopus 로고
    • Hierarchical Reinforcement Learning Using A Modular Fuzzy Model for Multi-Agent Problem
    • Man, and Cybernetics
    • T. Watanabe and Y. Takahashi, "Hierarchical Reinforcement Learning Using A Modular Fuzzy Model for Multi-Agent Problem," Proc, of the 2007 IEEE International Conference on Systems, Man, and Cybernetics, 2008.
    • (2008) Proc, of the 2007 IEEE International Conference on Systems
    • Watanabe, T.1    Takahashi, Y.2
  • 18
    • 0001940458 scopus 로고
    • Task Decomposition through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks
    • R. A. Jacobs, M. I. Jordan, and A. G. Barto, "Task Decomposition through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks," Neural Computation, Vol. 3, pp. 79-87, 1991.
    • (1991) Neural Computation , vol.3 , pp. 79-87
    • Jacobs, R.A.1    Jordan, M.I.2    Barto, A.G.3
  • 20
    • 33845529505 scopus 로고    scopus 로고
    • Reinforcement Learning: Overview
    • P. Y. Glorennec, "Reinforcement Learning: Overview," Proc. of ESIT, pp. 17-35, 2000.
    • (2000) Proc. of ESIT , pp. 17-35
    • Glorennec, P.Y.1
  • 21
    • 33947165245 scopus 로고    scopus 로고
    • Fuzzy Multi-Agent Cooperative Q-Learning
    • IEEE International Conferece on Information Acquisition, pp
    • D. Gu and H. Hu, "Fuzzy Multi-Agent Cooperative Q-Learning," Proc. of the 2005 IEEE International Conferece on Information Acquisition, pp. 193-197, 2005.
    • (2005) Proc. of the , pp. 193-197
    • Gu, D.1    Hu, H.2
  • 22
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in Reinforcement Learning: Successsful Examples Using Sparce Coarse Coding
    • MIT Press
    • R. S. Sutton, "Generalization in Reinforcement Learning: Successsful Examples Using Sparce Coarse Coding," Advances in Neural Information Processing Systems, Vol. 8, pp. 1038-1044, MIT Press, 1996.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.S.1
  • 23
    • 0021892282 scopus 로고
    • Fuzzy Identification of Systems and Its Applications to Modeling and Control
    • T. Takagi and M. Sugeno: "Fuzzy Identification of Systems and Its Applications to Modeling and Control," IEEE Transaction on Systems, Man, and Cybernetics, Vol. 15, pp. 116-132, 1985.
    • (1985) IEEE Transaction on Systems, Man, and Cybernetics , vol.15 , pp. 116-132
    • Takagi, T.1    Sugeno, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.