-
1
-
-
0000439891
-
Convergence of stochastic iterative dynamic programming algorithms
-
Jaakkola, T., Jordan, M.I. and Singh, S.P. (1994), “Convergence of stochastic iterative dynamic programming algorithms”, Neural Computation, Vol. 6, pp. 1185-201.
-
(1994)
Neural Computation
, vol.6
, pp. 1185-1201
-
-
Jaakkola, T.1
Jordan, M.I.2
Singh, S.P.3
-
2
-
-
0029679044
-
Reinforcement learning: a survey
-
Kaelbling, L.P., Littman, M.L. and Moore, A.W. (1996), “Reinforcement learning: a survey”, Journal of Artificial Intelligence Research, Vol. 4, pp. 237-48.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-248
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
3
-
-
84986127448
-
-
B. Knudsen Data, available at:
-
Knudsen, B. (2006), CC5X C Compiler, B. Knudsen Data, available at: www.bknd.com/cc5x/index.shtml.
-
(2006)
CC5X C Compiler
-
-
Knudsen, B.1
-
4
-
-
84986174661
-
-
available at: (accessed 1 February 2007).
-
Linux (2003), Linux RedHat 9.0 Manuals, available at: www.redhat.com/docs/manuals/linux/RHL-9-Manual/ (accessed 1 February 2007).
-
(2003)
Linux RedHat 9.0 Manuals
-
-
-
5
-
-
84898282426
-
Learning with ALiCE II
-
MSc thesis, Electrical and Computer Engineering, University of Manitoba, Manitoba.
-
Lockery, D. (2007), “Learning with ALiCE II”, MSc thesis, Electrical and Computer Engineering, University of Manitoba, Manitoba.
-
(2007)
-
-
Lockery, D.1
-
6
-
-
84986056466
-
-
Microchip Technology Inc., Chandler, AZ, available at:
-
Microchip Technology Inc. (2006), MPLAB Integrated Development Environment, Microchip Technology Inc., Chandler, AZ, available at: www.microchip.com.
-
(2006)
MPLAB Integrated Development Environment
-
-
-
7
-
-
33847754891
-
-
Report 469, Institute for Computer Science, Polish Academy of Sciences, Warszawa.
-
Orlowska, E. (1982), Semantics of Vague Concepts. Applications of Rough Sets, Report 469, Institute for Computer Science, Polish Academy of Sciences, Warszawa.
-
(1982)
Semantics of Vague Concepts. Applications of Rough Sets
-
-
Orlowska, E.1
-
8
-
-
33749669435
-
-
In: Dorn, G. and Weingartner, P. (Eds) Foundations of Logic and Linguistics. Problems and Solutions, Plenum Press, London
-
Orlowska, E. (1985), Semantics of Vague Concepts, In: Dorn, G. and Weingartner, P. (Eds) Foundations of Logic and Linguistics. Problems and Solutions, Plenum Press, London, pp. 465-82.
-
(1985)
Semantics of Vague Concepts
, pp. 465-482
-
-
Orlowska, E.1
-
9
-
-
33846104576
-
-
Report 429, Institute for Computer Science, Polish Academy of Sciences, Warszawa, March.
-
Pawlak, Z. (1981), Classification of Objects by Means of Attributes, Report 429, Institute for Computer Science, Polish Academy of Sciences, Warszawa, March.
-
(1981)
Classification of Objects by Means of Attributes
-
-
Pawlak, Z.1
-
10
-
-
36949021089
-
Rough ethology: toward a biologically-inspired study of collective behaviour in intelligent systems with approximation spaces
-
Springer LNCS 3400, Berlin
-
Peters, J.F. (2005), “Rough ethology: toward a biologically-inspired study of collective behaviour in intelligent systems with approximation spaces”, Transactions on Rough Sets III, Springer LNCS 3400, Berlin, pp. 153-74.
-
(2005)
Transactions on Rough Sets III
, pp. 153-174
-
-
Peters, J.F.1
-
11
-
-
46149105571
-
Biologically-inspired approximate adaptive learning control strategies: a rough set approach
-
Peters, J.F., Henry, C. and Gunderson, D.S. (2006a), “Biologically-inspired approximate adaptive learning control strategies: a rough set approach”, International Journal of Hybrid Intelligent Systems, Vol. 3, pp. 1-14.
-
(2006)
International Journal of Hybrid Intelligent Systems
, vol.3
, pp. 1-14
-
-
Peters, J.F.1
Henry, C.2
Gunderson, D.S.3
-
12
-
-
33645978549
-
Rough Ethograms: study of intelligent system behaviour
-
in K_lopotek, M.A., Wierzcho'n, S. and Trojanowski, K. (Eds)
-
Peters, J.F., Henry, C. and Ramanna, S. (2005), “Rough Ethograms: study of intelligent system behaviour”, in K_lopotek, M.A., Wierzcho'n, S. and Trojanowski, K. (Eds), New Trends in Intelligent Information Processing and Web Mining (IIS05), pp. 17-126.
-
(2005)
New Trends in Intelligent Information Processing and Web Mining (IIS05)
, pp. 17-126
-
-
Peters, J.F.1
Henry, C.2
Ramanna, S.3
-
13
-
-
38049060134
-
Line-crawling bots that inspect electric power transmission line equipment
-
Peters, J.F., Borkowski, M., Henry, C., Lockery, D., Gunderson, D. and Ramanna, S. (2006b), “Line-crawling bots that inspect electric power transmission line equipment”, Proc. 3rd Int. Conf. on Autonomous Robots and Agents (ICARA 2006), pp. 39-44.
-
(2006)
Proc. 3rd Int. Conf. on Autonomous Robots and Agents (ICARA 2006)
, pp. 39-44
-
-
Peters, J.F.1
Borkowski, M.2
Henry, C.3
Lockery, D.4
Gunderson, D.5
Ramanna, S.6
-
14
-
-
84966235265
-
-
Cambridge University Press, Cambridge
-
Press, W.H., Teukolsky, S.A., Vetterling, W.T. and Flannery, B.P. (2002), Numerical Recipes in C++: The Art of Scientific Computing, 2nd ed., Cambridge University Press, Cambridge, pp. 278-90.
-
(2002)
Numerical Recipes in C++: The Art of Scientific Computing, 2nd ed.
, pp. 278-290
-
-
Press, W.H.1
Teukolsky, S.A.2
Vetterling, W.T.3
Flannery, B.P.4
-
15
-
-
0003636089
-
On-line Q-learning using connectionist systems
-
Technical Report CUED/F-INFENG/TR 166, Engineering Department, Cambridge University, Cambridge, September.
-
Rummery, G.A. and Niranjan, M. (1994), “On-line Q-learning using connectionist systems”, Technical Report CUED/F-INFENG/TR 166, Engineering Department, Cambridge University, Cambridge, September.
-
(1994)
-
-
Rummery, G.A.1
Niranjan, M.2
-
16
-
-
84899219189
-
Some themes and primitives in ill-defined systems
-
in Selfridge, O.G., Rissland, E.L. and Arbib, M.A. (Eds), Plenum Press, London.
-
Selfridge, O.G. (1984), “Some themes and primitives in ill-defined systems”, in Selfridge, O.G., Rissland, E.L. and Arbib, M.A. (Eds), Adaptive Control of Ill-defined Systems, Plenum Press, London.
-
(1984)
Adaptive Control of Ill-defined Systems
-
-
Selfridge, O.G.1
-
17
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
Singh, S.P. and Sutton, R.S. (1996), “Reinforcement learning with replacing eligibility traces”, Machine Learning, Vol. 22, pp. 123-58.
-
(1996)
Machine Learning
, vol.22
, pp. 123-158
-
-
Singh, S.P.1
Sutton, R.S.2
-
18
-
-
0033901602
-
Convergence results for single-step on-policy reinforcement-learning algorithms
-
Singh, S.P., Jaakkola, T., Littman, M.L. and Szepesvari, C. (2000), “Convergence results for single-step on-policy reinforcement-learning algorithms”, Machine Learning, Vol. 38 No. 3, pp. 287-308.
-
(2000)
Machine Learning
, vol.38
, Issue.3
, pp. 287-308
-
-
Singh, S.P.1
Jaakkola, T.2
Littman, M.L.3
Szepesvari, C.4
-
19
-
-
85156221438
-
Generalization in reinforcement learning: successful examples using sparse coarse coding
-
The MIT Press, Cambridge, MA
-
Sutton, R.S. (1996), “Generalization in reinforcement learning: successful examples using sparse coarse coding”, Advances in Neural Information Processing Systems, Vol. 8, The MIT Press, Cambridge, MA, pp. 1038-44.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1038-1044
-
-
Sutton, R.S.1
-
20
-
-
0004102479
-
-
The MIT Press, Cambridge, MA.
-
Sutton, R.S. and Barto, A.G. (1998), Reinforcement Learning: An Introduction, The MIT Press, Cambridge, MA.
-
(1998)
Reinforcement Learning: An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
21
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning problems
-
SMC-13
-
Sutton, R.S., Barto, A.G. and Anderson, C.W. (1983), “Neuronlike adaptive elements that can solve difficult learning problems”, IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-13 No. 5, pp. 834-47.
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics
, Issue.5
, pp. 834-847
-
-
Sutton, R.S.1
Barto, A.G.2
Anderson, C.W.3
-
22
-
-
84989996303
-
On aims and methods of ethology
-
Tinbergen, N. (1963), “On aims and methods of ethology”, Zeitschrift f'ur Tierpsychologie, Vol. 20, pp. 410-33.
-
(1963)
Zeitschrift f'ur Tierpsychologie
, vol.20
, pp. 410-433
-
-
Tinbergen, N.1
-
23
-
-
0004049893
-
Learning from delayed rewards
-
PhD thesis, King's College, Cambridge University, Cambridge, supervisor: Richard Young.
-
Watkins, C.J.C.H. (1989), “Learning from delayed rewards”, PhD thesis, King's College, Cambridge University, Cambridge, supervisor: Richard Young.
-
(1989)
-
-
Watkins, C.J.C.H.1
-
24
-
-
34249833101
-
Technical note: Q-learning
-
Watkins, C.J.C.H. and Dayan, P. (1992), “Technical note: Q-learning”, Machine Learning, Vol. 8, pp. 279-92.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
|