메뉴 건너뛰기




Volumn 37, Issue 9, 2006, Pages 77-86

State generalization method with support vector machines in reinforcement learning

Author keywords

Reinforcement learning; State generalization; Support vector machine

Indexed keywords

ALGORITHMS; CACHE MEMORY; COMPUTER SIMULATION; DISCRETE TIME CONTROL SYSTEMS; ROBOTICS;

EID: 33746131676     PISSN: 08821666     EISSN: 1520684X     Source Type: Journal    
DOI: 10.1002/scj.20140     Document Type: Article
Times cited : (6)

References (12)
  • 1
    • 85153940465 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • Tesauro G, Touretzky DS, Leen TK (editors). MIT Press
    • Boyan JA, Moore AW. Generalization in reinforcement learning: Safely approximating the value function. In Tesauro G, Touretzky DS, Leen TK (editors). Advances in Neural Information Processing Systems, Vol. 7, MIT Press; 1995. p 369-376.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 369-376
    • Boyan, J.A.1    Moore, A.W.2
  • 2
    • 14044267195 scopus 로고    scopus 로고
    • Serial motor learning in higher order continuous states using reinforcement learning: Learning to stand up
    • Morimoto A, Douya K. Serial motor learning in higher order continuous states using reinforcement learning: Learning to stand up. Trans IEICE 1999;J82-D-II:2118-2131.
    • (1999) Trans IEICE , vol.J82-D-II , pp. 2118-2131
    • Morimoto, A.1    Douya, K.2
  • 3
    • 33746132000 scopus 로고    scopus 로고
    • Tree based discretization for continuous state space reinforcement learning
    • Madison, WI
    • Uther WTB, Veloso MM. Tree based discretization for continuous state space reinforcement learning. Proc AAAI-98, Madison, WI.
    • Proc AAAI-98
    • Wtb, U.1    Veloso, M.M.2
  • 4
    • 33746123049 scopus 로고    scopus 로고
    • State generalization method based on best estimates in consideration of multiple action outcomes
    • Yairi K, Hon K, Nakasuka S. State generalization method based on best estimates in consideration of multiple action outcomes. Trans JSAI 2001;16:130-140.
    • (2001) Trans JSAI , vol.16 , pp. 130-140
    • Yairi, K.1    Hon, K.2    Nakasuka, S.3
  • 5
    • 33746153788 scopus 로고    scopus 로고
    • Autonomous construction of a state space for acquiring robot actions
    • Asada J, Noda A, Hosoda K. Autonomous construction of a state space for acquiring robot actions. JRSJ 1997;15:886-892.
    • (1997) JRSJ , vol.15 , pp. 886-892
    • Asada, J.1    Noda, A.2    Hosoda, K.3
  • 6
    • 33746105714 scopus 로고    scopus 로고
    • Concurrent learning of situational knowledge and rules of behavior for an autonomous agent
    • Ueno A, Hori K, Nakasuka S. Concurrent learning of situational knowledge and rules of behavior for an autonomous agent. 30th SIG-FAI, p 19-24, 1997.
    • (1997) 30th SIG-FAI , pp. 19-24
    • Ueno, A.1    Hori, K.2    Nakasuka, S.3
  • 7
    • 34249753618 scopus 로고
    • Support-vector networks
    • Cortes C, Vapnik V. Support-vector networks. Mach Learn 1995;20:273-297.
    • (1995) Mach Learn , vol.20 , pp. 273-297
    • Cortes, C.1    Vapnik, V.2
  • 9
    • 34249833101 scopus 로고
    • Technical note: Q-learning
    • Watkins CJCH, Dayan P. Technical note: Q-learning. Mach Learn 1992;8:279-292.
    • (1992) Mach Learn , vol.8 , pp. 279-292
    • Cjch, W.1    Dayan, P.2
  • 10
    • 0000672424 scopus 로고
    • Fast learning in networks of locally-tuned processing units
    • Moody J, Darken CJ. Fast learning in networks of locally-tuned processing units. Neural Comput 1989;1:281-294.
    • (1989) Neural Comput , vol.1 , pp. 281-294
    • Moody, J.1    Darken, C.J.2
  • 11
    • 0003120218 scopus 로고    scopus 로고
    • Fast training of support vector machines using sequential minimal optimization
    • Schölkopf B, Burges C, Smola A (editors). MIT Press
    • Platt J. Fast training of support vector machines using sequential minimal optimization. In Schölkopf B, Burges C, Smola A (editors). Advances in kernel methods - Support vector learning. MIT Press; 1999. p 185-208.
    • (1999) Advances in Kernel Methods - Support Vector Learning , pp. 185-208
    • Platt, J.1
  • 12
    • 0003425673 scopus 로고    scopus 로고
    • Multi-class support vector machines
    • Department of Computer Science, Royal Holloway, University of London, Egham, TW20 0EX, UK
    • Weston J, Watkins C. Multi-class support vector machines. Technical Report CSD-TR-98-04, Department of Computer Science, Royal Holloway, University of London, Egham, TW20 0EX, UK, 1998.
    • (1998) Technical Report , vol.CSD-TR-98-04
    • Weston, J.1    Watkins, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.