-
1
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
Tesauro G, Touretzky DS, Leen TK (editors). MIT Press
-
Boyan JA, Moore AW. Generalization in reinforcement learning: Safely approximating the value function. In Tesauro G, Touretzky DS, Leen TK (editors). Advances in Neural Information Processing Systems, Vol. 7, MIT Press; 1995. p 369-376.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 369-376
-
-
Boyan, J.A.1
Moore, A.W.2
-
2
-
-
14044267195
-
Serial motor learning in higher order continuous states using reinforcement learning: Learning to stand up
-
Morimoto A, Douya K. Serial motor learning in higher order continuous states using reinforcement learning: Learning to stand up. Trans IEICE 1999;J82-D-II:2118-2131.
-
(1999)
Trans IEICE
, vol.J82-D-II
, pp. 2118-2131
-
-
Morimoto, A.1
Douya, K.2
-
3
-
-
33746132000
-
Tree based discretization for continuous state space reinforcement learning
-
Madison, WI
-
Uther WTB, Veloso MM. Tree based discretization for continuous state space reinforcement learning. Proc AAAI-98, Madison, WI.
-
Proc AAAI-98
-
-
Wtb, U.1
Veloso, M.M.2
-
4
-
-
33746123049
-
State generalization method based on best estimates in consideration of multiple action outcomes
-
Yairi K, Hon K, Nakasuka S. State generalization method based on best estimates in consideration of multiple action outcomes. Trans JSAI 2001;16:130-140.
-
(2001)
Trans JSAI
, vol.16
, pp. 130-140
-
-
Yairi, K.1
Hon, K.2
Nakasuka, S.3
-
5
-
-
33746153788
-
Autonomous construction of a state space for acquiring robot actions
-
Asada J, Noda A, Hosoda K. Autonomous construction of a state space for acquiring robot actions. JRSJ 1997;15:886-892.
-
(1997)
JRSJ
, vol.15
, pp. 886-892
-
-
Asada, J.1
Noda, A.2
Hosoda, K.3
-
6
-
-
33746105714
-
Concurrent learning of situational knowledge and rules of behavior for an autonomous agent
-
Ueno A, Hori K, Nakasuka S. Concurrent learning of situational knowledge and rules of behavior for an autonomous agent. 30th SIG-FAI, p 19-24, 1997.
-
(1997)
30th SIG-FAI
, pp. 19-24
-
-
Ueno, A.1
Hori, K.2
Nakasuka, S.3
-
7
-
-
34249753618
-
Support-vector networks
-
Cortes C, Vapnik V. Support-vector networks. Mach Learn 1995;20:273-297.
-
(1995)
Mach Learn
, vol.20
, pp. 273-297
-
-
Cortes, C.1
Vapnik, V.2
-
9
-
-
34249833101
-
Technical note: Q-learning
-
Watkins CJCH, Dayan P. Technical note: Q-learning. Mach Learn 1992;8:279-292.
-
(1992)
Mach Learn
, vol.8
, pp. 279-292
-
-
Cjch, W.1
Dayan, P.2
-
10
-
-
0000672424
-
Fast learning in networks of locally-tuned processing units
-
Moody J, Darken CJ. Fast learning in networks of locally-tuned processing units. Neural Comput 1989;1:281-294.
-
(1989)
Neural Comput
, vol.1
, pp. 281-294
-
-
Moody, J.1
Darken, C.J.2
-
11
-
-
0003120218
-
Fast training of support vector machines using sequential minimal optimization
-
Schölkopf B, Burges C, Smola A (editors). MIT Press
-
Platt J. Fast training of support vector machines using sequential minimal optimization. In Schölkopf B, Burges C, Smola A (editors). Advances in kernel methods - Support vector learning. MIT Press; 1999. p 185-208.
-
(1999)
Advances in Kernel Methods - Support Vector Learning
, pp. 185-208
-
-
Platt, J.1
-
12
-
-
0003425673
-
Multi-class support vector machines
-
Department of Computer Science, Royal Holloway, University of London, Egham, TW20 0EX, UK
-
Weston J, Watkins C. Multi-class support vector machines. Technical Report CSD-TR-98-04, Department of Computer Science, Royal Holloway, University of London, Egham, TW20 0EX, UK, 1998.
-
(1998)
Technical Report
, vol.CSD-TR-98-04
-
-
Weston, J.1
Watkins, C.2
|