-
1
-
-
84898967665
-
Sampling techniques for Kernel methods
-
MIT Press
-
D. Achlioptas, F. McSherry, and B. Scholkopff, "Sampling techniques for Kernel methods," in Proceedings of the 14th International Conference on Neural Information Processing Systems (NIPS), pp. 335-342, MIT Press, 2002.
-
(2002)
Proceedings of the 14th International Conference on Neural Information Processing Systems (NIPS)
, pp. 335-342
-
-
Achlioptas, D.1
McSherry, F.2
Scholkopff, B.3
-
2
-
-
14644405640
-
On the spectra of nonsymmetric Laplacian matrices
-
R. Agaev and P. Cheboratev, "On the spectra of nonsymmetric Laplacian matrices," Linear Algebra and Its Applications, vol.399, pp. 157-168, 2005.
-
(2005)
Linear Algebra and Its Applications
, vol.399
, pp. 157-168
-
-
Agaev, R.1
Cheboratev, P.2
-
3
-
-
0001119106
-
On representations of problems of reasoning about actions
-
(D. Michie, ed.) Elsevier/North-Holland
-
S. Amarei, "On representations of problems of reasoning about actions," in Machine Intelligence 3, (D. Michie, ed.), pp. 131-171, Elsevier/North-Holland, 1968.
-
(1968)
Machine Intelligence
, vol.3
, pp. 131-171
-
-
Amarei, S.1
-
7
-
-
0037288370
-
Recent advances in hierarchical reinforcement learning
-
A. Barto and S. Mahadevan, "Recent advances in hierarchical reinforcement learning," Discrete Event Systems Journal, vol.13, pp. 41-77, 2003.
-
(2003)
Discrete Event Systems Journal
, vol.13
, pp. 41-77
-
-
Barto, A.1
Mahadevan, S.2
-
8
-
-
3142725535
-
Semi-supervised learning on Riemannian manifolds
-
M. Belkin and P. Niyogi, "Semi-supervised learning on Riemannian manifolds," Machine Learning, vol.56, pp. 209-239, 2004.
-
(2004)
Machine Learning
, vol.56
, pp. 209-239
-
-
Belkin, M.1
Niyogi, P.2
-
9
-
-
84949998208
-
Spectral partitioning with indefinite Kernels using the Nyström extension
-
S. Belongie, C. Fowlkes, F. Chung, and J. Malik, "Spectral partitioning with indefinite Kernels using the Nyström extension," in Proceedings of the 7th European Conference on Computer Vision, pp. 531-542, 2002.
-
(2002)
Proceedings of the 7th European Conference on Computer Visio
, pp. 531-542
-
-
Belongie, S.1
Fowlkes, C.2
Chung, F.3
Malik, J.4
-
11
-
-
0024680419
-
Adaptive Aggregation Methods for infinite horizon dynamic programming
-
D. Bertsekas and D. Castanon, "Adaptive Aggregation Methods for infinite horizon dynamic programming," IEEE Transactions on Automatic Control, vol.34, pp. 589-598, 1989.
-
(1989)
IEEE Transactions on Automatic Control
, vol.34
, pp. 589-598
-
-
Bertsekas, D.1
Castanon, D.2
-
14
-
-
84990610636
-
Fast wavelet transforms and numerical algorithms
-
G. Beylkin, R. R. Coifman, and V. Rokhlin, "Fast wavelet transforms and numerical algorithms," Common Pure and Applied Mathematic, vol.44, pp. 141-183, 1991.
-
(1991)
Common Pure and Applied Mathematic
, vol.44
, pp. 141-183
-
-
Beylkin, G.1
Coifman, R.R.2
Rokhlin, V.3
-
15
-
-
0041386088
-
A geometric interpretation of the MetropolisHasting algorithm
-
L. Billera and P. Diaconis, "A geometric interpretation of the MetropolisHasting algorithm," Statistical Science, vol.16, pp. 335-339, 2001.
-
(2001)
Statistical Science
, vol.16
, pp. 335-339
-
-
Billera, L.1
Diaconis, P.2
-
18
-
-
0001771345
-
Linear least-squares algorithms for temporal difference learning
-
S. Bradtke and A. Barto, "Linear least-squares algorithms for temporal difference learning," Machine Learning, vol.22, pp. 33-57, 1996.
-
(1996)
Machine Learning
, vol.22
, pp. 33-57
-
-
Bradtke, S.1
Barto, A.2
-
19
-
-
33745416113
-
Diffusion wavelet packets
-
July
-
J. Bremer, R. Coifman, M. Maggioni, and A. Szlam, "Diffusion wavelet packets," Applied and Computational Harmonic Analysis, vol.21, no.1, pp. 95-112, July 2006.
-
(2006)
Applied and Computational Harmonic Analysis
, vol.21
, Issue.1
, pp. 95-112
-
-
Bremer, J.1
Coifman, R.2
Maggioni, M.3
Szlam, A.4
-
21
-
-
0032027940
-
The Relations among Potentials, Perturbation Analysis, and Markov Decision Processes
-
[21] X. Cao, "The relations among potentials, perturbation analysis, and Markov decision processes," Discrete-Event Dynamic Systems, vol.8, no.1, pp. 71-87, 1998. (Pubitemid 128512397)
-
(1998)
Discrete Event Dynamic Systems: Theory and Applications
, vol.8
, Issue.1
, pp. 71-87
-
-
Cao, X.-R.1
-
22
-
-
33645793931
-
Kernels of directed graph Laplacians
-
J. Caughman and J. Veerman, "Kernels of directed graph Laplacians," Electronic Journal of Combinatorics, vol.13, no.1, pp. 253-274, 2006.
-
(2006)
Electronic Journal of Combinatorics
, vol.13
, Issue.1
, pp. 253-274
-
-
Caughman, J.1
Veerman, J.2
-
24
-
-
84969165400
-
Forest matrices around the Laplacian matrix
-
P. Chebotarev and R. Agaev, "Forest matrices around the Laplacian matrix," Linear Algebra and Its Applications, vol.15, no.1, pp. 253-274, 2002.
-
(2002)
Linear Algebra and Its Applications
, vol.15
, Issue.1
, pp. 253-274
-
-
Chebotarev, P.1
Agaev, R.2
-
25
-
-
0028401429
-
Generalized matrix inversion and rank computation by repeated squaring
-
L. Chen, E. Krishnamurthy, and I. Macleod, "Generalized matrix inversion and rank computation by repeated squaring," Parallel Computing, vol.20, pp. 297-311, 1994.
-
(1994)
Parallel Computing
, vol.20
, pp. 297-311
-
-
Chen, L.1
Krishnamurthy, E.2
Macleod, I.3
-
27
-
-
17444366585
-
Laplacians and the Cheeger inequality for directed graphs
-
April
-
F. Chung, "Laplacians and the Cheeger inequality for directed graphs," Annals of Combinatorics, vol.9, no.1, pp. 1-19, April 2005.
-
(2005)
Annals of Combinatorics
, vol.9
, Issue.1
, pp. 1-19
-
-
Chung, F.1
-
28
-
-
19644394100
-
Geometric diffusions as a tool for harmonic analysis and structure definition of data. Part i: Diffusion maps
-
May
-
R. Coifman, S. Lafon, A. Lee, M. Maggioni, B. Nadler, F. Warner, and S. Zucker, "Geometric diffusions as a tool for harmonic analysis and structure definition of data. Part i: Diffusion maps," Proceedings of National Academy of Science, vol.102, no.21, pp. 7426-7431, May 2005.
-
(2005)
Proceedings of National Academy of Science
, vol.102
, Issue.21
, pp. 7426-7431
-
-
Coifman, R.1
Lafon, S.2
Lee, A.3
Maggioni, M.4
Nadler, B.5
Warner, F.6
Zucker, S.7
-
29
-
-
19644366699
-
Geometrie diffusions as a tool for harmonie analysis and structure definition of data. Part ii: Multiscale methods
-
May
-
R. Coifman, S. Lafon, A. Lee, M. Maggioni, B. Nadler, F. Warner, and S. Zucker, "Geometrie diffusions as a tool for harmonie analysis and structure definition of data. Part ii: Multiscale methods," Proceedings of the National Academy of Science, vol.102, no.21, pp. 7432-7437, May 2005.
-
(2005)
Proceedings of the National Academy of Science
, vol.102
, Issue.21
, pp. 7432-7437
-
-
Coifman, R.1
Lafon, S.2
Lee, A.3
Maggioni, M.4
Nadler, B.5
Warner, F.6
Zucker, S.7
-
30
-
-
33745332989
-
Diffusion wavelets
-
July
-
R. Coifman and M. Maggioni, "Diffusion wavelets," Applied and Computational Harmonic Analysis, vol.21, no.1, pp. 53-94, July 2006.
-
(2006)
Applied and Computational Harmonic Analysis
, vol.21
, Issue.1
, pp. 53-94
-
-
Coifman, R.1
Maggioni, M.2
-
31
-
-
25844521242
-
Geometric diffusions for the analysis of data from sensor networks
-
October
-
R. Coifman, M. Maggioni, S. Zucker, and I. Kevrekidis, "Geometric diffusions for the analysis of data from sensor networks," Curr Opin Neurobiol, vol.15, no.5, pp. 576-584, October 2005.
-
(2005)
Curr Opin Neurobiol
, vol.15
, Issue.5
, pp. 576-584
-
-
Coifman, R.1
Maggioni, M.2
Zucker, S.3
Kevrekidis, I.4
-
33
-
-
0032643313
-
Solving semi-Markov decision problems using average-reward reinforcement learning
-
T. Das, A. Gosavi, S. Mahadevan, and N. Marchalleck, "Solving semi-Markov decision problems using average-reward reinforcement learning," Management Science, vol.45, no.4, pp. 560-574, 1999.
-
(1999)
Management Science
, vol.45
, Issue.4
, pp. 560-574
-
-
Das, T.1
Gosavi, A.2
Mahadevan, S.3
Marchalleck, N.4
-
34
-
-
0003833285
-
-
Society for Industrial and Applied Mathematics.
-
I. Daubechies, Ten Lectures on Wavelets, Society for Industrial and Applied Mathematics. 1992.
-
(1992)
Ten Lectures on Wavelets
-
-
Daubechies, I.1
-
35
-
-
0001158047
-
Improving generalisation for temporal difference learning: The successor representation
-
P. Dayan, "Improving generalisation for temporal difference learning: The successor representation," Neural Computation, vol.5, pp. 613-624, 1993.
-
(1993)
Neural Computation
, vol.5
, pp. 613-624
-
-
Dayan, P.1
-
39
-
-
29244453931
-
On the Nyström method for approximating a Gram matrix for improved Kernel-based learning
-
P. Drineas and M. W. Mahoney, "On the Nyström method for approximating a Gram matrix for improved Kernel-based learning," Journal of Machine Learning Research, vol.6, pp. 2153-2175, 2005.
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 2153-2175
-
-
Drineas, P.1
Mahoney, M.W.2
-
40
-
-
0043247546
-
Accelerating reinforcement learning by composing solutions of automatically identified subtasks
-
C. Drummond, "Accelerating reinforcement learning by composing solutions of automatically identified subtasks," Journal of AI Research, vol.16, pp. 59-104, 2002.
-
(2002)
Journal of AI Research
, vol.16
, pp. 59-104
-
-
Drummond, C.1
-
41
-
-
85095808297
-
Geometric aspects of the theory of Krylov subspace methods
-
[41] M. Eiermann and O. Ernst, "Geometric aspects of the theory of Krylov subspace methods," Acta Numérica, pp. 251-312, 2001. (Pubitemid 33305812)
-
(2001)
ACTA NUMERICA
, pp. 251-312
-
-
Eiermann, M.1
Ernst, O.G.2
-
42
-
-
1942421151
-
Bayes meets Bellman: The Gaussian process approach to temporal difference learning
-
AAAI Press
-
Y. Engel, S. Mannor, and R. Meir, "Bayes meets Bellman: The Gaussian process approach to temporal difference learning," in Proceedings of the 20th International Conference on Machine Learning, pp. 154-161, AAAI Press, 2003.
-
(2003)
Proceedings of the 20th International Conference on Machine Learning
, pp. 154-161
-
-
Engel, Y.1
Mannor, S.2
Meir, R.3
-
44
-
-
0001350119
-
Algebraic connectivity of graphs
-
M. Fiedler, "Algebraic connectivity of graphs," Czechoslovak Mathematical Journal, vol.23, no.98, pp. 298-305, 1973.
-
(1973)
Czechoslovak Mathematical Journal
, vol.23
, Issue.98
, pp. 298-305
-
-
Fiedler, M.1
-
45
-
-
0036832959
-
Structure in the space of value functions
-
D. Foster and P. Dayan, "Structure in the space of value functions," Machine Learning, vol.49, pp. 325-346, 2002.
-
(2002)
Machine Learning
, vol.49
, pp. 325-346
-
-
Foster, D.1
Dayan, P.2
-
46
-
-
0032308232
-
Fast Monte Carlo algorithms for finding low-rank approximations
-
A. Frieze, R. Kannan, and S. Vempala, "Fast Monte Carlo algorithms for finding low-rank approximations," in Proceedings of the 39th Annual IEEE Symposium on Foundations of Computer Science, pp. 370-378, 1998.
-
(1998)
Proceedings of the 39th Annual IEEE Symposium on Foundations of Computer Science
, pp. 370-378
-
-
Frieze, A.1
Kannan, R.2
Vempala, S.3
-
48
-
-
70349352794
-
Model minimization in Markov decision processes
-
R. Givan and T. Dean, "Model minimization in Markov decision processes," AAAI, 1997.
-
(1997)
AAAI
-
-
Givan, R.1
Dean, T.2
-
50
-
-
0038595393
-
-
Technical Report, CMU-CS-95-103, Department of Computer Science, Carnegie Mellon University
-
G. Gordon, "Stable function approximation in dynamic programming," Technical Report, CMU-CS-95-103, Department of Computer Science, Carnegie Mellon University, 1995.
-
(1995)
Stable Function Approximation in Dynamic Programming
-
-
Gordon, G.1
-
51
-
-
4544318426
-
Efficient solution algorithms for factored MDPs
-
C. Guestrin, D. Koller, R. Parr, and S. Venkataraman, "Efficient solution algorithms for factored MDPs," Journal of AI Research, vol.19, pp. 399-468, 2003.
-
(2003)
Journal of AI Research
, vol.19
, pp. 399-468
-
-
Guestrin, C.1
Koller, D.2
Parr, R.3
Venkataraman, S.4
-
53
-
-
34547313657
-
Graph Laplacians and their convergence on random neighborhood graphs
-
M. Hein, J. Audibert, and U. von Luxburg, "Graph Laplacians and their convergence on random neighborhood graphs," Journal of Machine Learning Research, vol.8, pp. 1325-1368, 2007.
-
(2007)
Journal of Machine Learning Research
, vol.8
, pp. 1325-1368
-
-
Hein, M.1
Audibert, J.2
Von Luxburg, U.3
-
55
-
-
0002956570
-
SPUDD: Stochastic planning using decision diagrams
-
Morgan Kaufmann
-
J. Hoey, R. St-aubin, A. Hu, and C. Boutilier, "SPUDD: Stochastic planning using decision diagrams," in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, pp. 279-288, Morgan Kaufmann, 1999.
-
(1999)
Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence
, pp. 279-288
-
-
Hoey, J.1
St-aubin, R.2
Hu, A.3
Boutilier, C.4
-
62
-
-
0032131147
-
A fast and high quality multilevel scheme for partitioning irregular graphs
-
G. Karypis and V. Kumar, "A fast and high quality multilevel scheme for partitioning irregular graphs," SIAM Journal of Scientific Computing, vol.20, no.1, pp. 359-392, 1999.
-
(1999)
SIAM Journal of Scientific Computing
, vol.20
, Issue.1
, pp. 359-392
-
-
Karypis, G.1
Kumar, V.2
-
63
-
-
33846689581
-
Block diagonalization of Laplacian matrices of symmetric graphs using group theory
-
A. Kaveh and A. Nikbakht, "Block diagonalization of Laplacian matrices of symmetric graphs using group theory," International Journal for Numerical Methods in Engineering, vol.69, pp. 908-947, 2007.
-
(2007)
International Journal for Numerical Methods in Engineering
, vol.69
, pp. 908-947
-
-
Kaveh, A.1
Nikbakht, A.2
-
68
-
-
26444490324
-
-
PhD thesis, Yale University, Department of Mathematics and Applied Mathematics
-
S. Lafon, "Diffusion maps and geometric harmonics," PhD thesis, Yale University, Department of Mathematics and Applied Mathematics, 2004.
-
(2004)
Diffusion Maps and Geometric Harmonics
-
-
Lafon, S.1
-
70
-
-
33750184660
-
Updating the stationary vector of an irreducible Markov chain with an eye on google's pagerank
-
A. Langville and C. Meyer, "Updating the stationary vector of an irreducible Markov chain with an eye on google's pagerank," SIAM Journal on Matrix Analysis, vol.27, pp. 968-987, 2005.
-
(2005)
SIAM Journal on Matrix Analysis
, vol.27
, pp. 968-987
-
-
Langville, A.1
Meyer, C.2
-
75
-
-
84864535343
-
Towards a unified theory of state abstraction for MDPs
-
L. Li, T. Walsh, and M. Littman, "Towards a unified theory of state abstraction for MDPs," in Proceedings of the Ninth International Symposium on Artificial Intelligence and Mathematics, pp. 531-539, 2006.
-
(2006)
Proceedings of the Ninth International Symposium on Artificial Intelligence and Mathematics
, pp. 531-539
-
-
Li, L.1
Walsh, T.2
Littman, M.3
-
76
-
-
33749267463
-
Fast direct policy evaluation using multiscale analysis of Markov diffusion processes
-
New York, NY, USA: ACM Press
-
M. Maggioni and S. Mahadevan, "Fast direct policy evaluation using multiscale analysis of Markov diffusion processes," in Proceedings of the 23rd International Conference on Machine Learning, pp. 601-608, New York, NY, USA: ACM Press, 2006.
-
(2006)
Proceedings of the 23rd International Conference on Machine Learning
, pp. 601-608
-
-
Maggioni, M.1
Mahadevan, S.2
-
81
-
-
0026880130
-
Automatic programming of behavior-based robots using reinforcement learning
-
[81] S. Mahadevan and J. Connell, "Automatic programming of behaviorbased robots using reinforcement learning," Artificial Intelligence, vol.55, pp. 311-365, 1992. Appeared originally as IBM TR RC16359, December 1990. (Pubitemid 23565211)
-
(1992)
Artificial Intelligence
, vol.55
, Issue.2-3
, pp. 311-365
-
-
Mahadevan, S.1
Connell, J.2
-
83
-
-
35748957806
-
Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes
-
S. Mahadevan and M. Maggioni, "Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes," Journal of Machine Learning Research, vol.8, pp. 2169-2231, 2007.
-
(2007)
Journal of Machine Learning Research
, vol.8
, pp. 2169-2231
-
-
Mahadevan, S.1
Maggioni, M.2
-
84
-
-
33750591731
-
Learning representation and control in continuous Markov decision processes
-
S. Mahadevan, M. Maggioni, K. Ferguson, and S. Osentoski, "Learning representation and control in continuous Markov decision processes," in Proceedings of the National Conference on Artificial Intelligence (AAAI), 2006.
-
(2006)
Proceedings of the National Conference on Artificial Intelligence (AAAI)
-
-
Mahadevan, S.1
Maggioni, M.2
Ferguson, K.3
Osentoski, S.4
-
85
-
-
0001963197
-
Self-improving factory simulation using continuous-time average-reward reinforcement learning
-
Morgan Kaufmann
-
S. Mahadevan, N. Marchalleck, T. Das, and A. Gosavi, "Self-improving factory simulation using continuous-time average-reward reinforcement learning," in Proceedings of 14-th International Conference on Machine Learning, pp. 202-210, Morgan Kaufmann, 1997.
-
(1997)
Proceedings of 14-th International Conference on Machine Learning
, pp. 202-210
-
-
Mahadevan, S.1
Marchalleck, N.2
Das, T.3
Gosavi, A.4
-
86
-
-
0024700097
-
A theory for multiresolution signal decomposition: The wavelet representation
-
S. Mallat, "A theory for multiresolution signal decomposition: The wavelet representation," IEEE Transactions on Pattern Analysis of Machanical Intelligence, vol.11, no.7, pp. 674-693, 1989.
-
(1989)
IEEE Transactions on Pattern Analysis of Machanical Intelligence
, vol.11
, Issue.7
, pp. 674-693
-
-
Mallat, S.1
-
88
-
-
57749103516
-
Computing isotypic projections with the lanczos iteration
-
D. Malsen, M. Orrison, and D. Rockmore, "Computing isotypic projections with the lanczos iteration," SIAM, vol.2, nos. 60/61, pp. 601-628, 2003.
-
(2003)
SIAM
, vol.2
, Issue.60-61
, pp. 601-628
-
-
Malsen, D.1
Orrison, M.2
Rockmore, D.3
-
89
-
-
14344250635
-
Dynamic abstraction in reinforcement learning via clustering
-
S. Mannor, I. Menache, A. Hoze, and U. Klein, "Dynamic abstraction in reinforcement learning via clustering," International Conference on Machine Learning, 2004.
-
(2004)
International Conference on Machine Learning
-
-
Mannor, S.1
Menache, I.2
Hoze, A.3
Klein, U.4
-
91
-
-
84898985184
-
Learning segmentation by random walks
-
M. Meila and J. Shi, "Learning segmentation by random walks," NIPS, 2001.
-
(2001)
NIPS
-
-
Meila, M.1
Shi, J.2
-
92
-
-
0043256056
-
Sensitivity of the stationary distribution of a Markov chain
-
C. Meyer, "Sensitivity of the stationary distribution of a Markov chain," SIAM Journal of Matrix Analysis and Applications, vol.15, no.3, pp. 715-728, 1994.
-
(1994)
SIAM Journal of Matrix Analysis and Applications
, vol.15
, Issue.3
, pp. 715-728
-
-
Meyer, C.1
-
93
-
-
0008813538
-
Barycentric interpolators for continuous space and time reinforcement learning
-
MIT Press
-
A. Moore, "Barycentric interpolators for continuous space and time reinforcement learning," in Advances in Neural Information Processing Systems, MIT Press, 1998.
-
(1998)
Advances in Neural Information Processing Systems
-
-
Moore, A.1
-
95
-
-
0004027474
-
-
Princeton University Press
-
E. Nelson, Tensor Analysis. Princeton University Press, 1968.
-
(1968)
Tensor Analysis.
-
-
Nelson, E.1
-
96
-
-
84899013108
-
On spectral clustering: Analysis and an algorithm
-
A. Ng, M. Jordan, and Y. Weiss, "On spectral clustering: Analysis and an algorithm," NIPS, 2002.
-
(2002)
NIPS
-
-
Ng, A.1
Jordan, M.2
Weiss, Y.3
-
97
-
-
84898980684
-
Autonomous helicopter flight via Reinforcement Learning
-
A. Ng, H. Kim, M. Jordan, and S. Sastry, "Autonomous helicopter flight via Reinforcement Learning," in Proceedings of Neural Information Processing Systems, 2004.
-
(2004)
Proceedings of Neural Information Processing Systems
-
-
Ng, A.1
Kim, H.2
Jordan, M.3
Sastry, S.4
-
98
-
-
30844447280
-
-
Technical Report TR-2001-2030, University of Chicago, Computer Science Deparment, November
-
P. Niyogi and M. Belkin, "Semi-supervised learning on Riemannian manifolds," Technical Report TR-2001-2030, University of Chicago, Computer Science Deparment, November 2001.
-
(2001)
Semi-supervised Learning on Riemannian Manifolds
-
-
Niyogi, P.1
Belkin, M.2
-
99
-
-
34547995167
-
-
Techncial Report, University of Chicago, November
-
P. Niyogi, I. Matveeva, and M. Belkin, "Regression and regularization on large graphs," Techncial Report, University of Chicago, November 2003.
-
(2003)
Regression and Regularization on Large Graphs
-
-
Niyogi, P.1
Matveeva, I.2
Belkin, M.3
-
100
-
-
0036832956
-
Kernel-based reinforcement learning
-
D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol.49, nos. 2-3, pp. 161-178, 2002.
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 161-178
-
-
Ormoneit, D.1
Sen, S.2
-
103
-
-
34547982545
-
Analyzing feature generation for value-function approximation
-
ICML 2007 - Proceedings of the 24th International Conference on Machine Learning
-
[103] R. Parr, C. Painter-Wakefiled, L. Li, and M. Littman, "Analyzing feature generation for value function approximation," in Proceedings of the International Conference on Machine Learning (ICML), pp. 737-744, 2007. (Pubitemid 350094046)
-
(2007)
ICML 2007 - Proceedings of the 24th International Conference on Machine Learning
, pp. 737-744
-
-
Parr, R.1
Painter-Wakefield, C.2
Li, H.3
Littman, M.4
-
104
-
-
56449092660
-
An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning
-
R. Parr, C. Painter-Wakefiled, L. Li, and M. Littman, "An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning," in Proceedings of the International Conference on Machine Learning (ICML), 2008.
-
(2008)
Proceedings of the International Conference on Machine Learning (ICML)
-
-
Parr, R.1
Painter-Wakefiled, C.2
Li, L.3
Littman, M.4
-
108
-
-
0036927202
-
Piecewise linear value function approximation for factored Markov decision processes
-
P. Poupart, C. Boutilier, R. Patrascu, and D. Schuurmans, "Piecewise linear value function approximation for factored Markov decision processes," in Proceedings of the National Conference on Artificial Intelligence (AAAI), pp. 285-291, 2002.
-
(2002)
Proceedings of the National Conference on Artificial Intelligence (AAAI)
, pp. 285-291
-
-
Poupart, P.1
Boutilier, C.2
Patrascu, R.3
Schuurmans, D.4
-
116
-
-
0034704222
-
Nonlinear dimensionality reduction by locally linear embedding
-
DOI 10.1126/science.290.5500.2323
-
[116] S. Roweis and L. Saul, "Nonlinear dimensionality reduction by local linear embedding," Science, vol.290, pp. 2323-2326, 2000. (Pubitemid 32041578)
-
(2000)
Science
, vol.290
, Issue.5500
, pp. 2323-2326
-
-
Roweis, S.T.1
Saul, L.K.2
-
119
-
-
32844474095
-
Reinforcement learning with factored states and actions
-
B. Sallans and G. Hinton, "Reinforcement learning with factored states and actions," Journal of Machine Learning Research, vol.5, pp. 1063-1088, 2004.
-
(2004)
Journal of Machine Learning Research
, vol.5
, pp. 1063-1088
-
-
Sallans, B.1
Hinton, G.2
-
120
-
-
0003408420
-
-
MIT Press
-
B. Scholkopf and A. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, 2001.
-
(2001)
Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond.
-
-
Scholkopf, B.1
Smola, A.2
-
121
-
-
0001296683
-
Perturbation theory and finite Markov chains
-
P. Schweitzer, "Perturbation theory and finite Markov chains," Journal of Applied Probability, vol.5, no.2, pp. 410-413, 1968.
-
(1968)
Journal of Applied Probability
, vol.5
, Issue.2
, pp. 410-413
-
-
Schweitzer, P.1
-
124
-
-
26944499565
-
Approximate policy construction using decision diagrams
-
R. St-Aubin, J. Hoey, and C. Boutilier, "Approximate policy construction using decision diagrams," NIPS, 2000.
-
(2000)
NIPS
-
-
St-Aubin, R.1
Hoey, J.2
Boutilier, C.3
-
130
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol.3, pp. 9-44, 1988.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
131
-
-
0034704229
-
A global geometric framework for nonlinear dimensionality reduction
-
DOI 10.1126/science.290.5500.2319
-
[131] J. Tenenbaum, V. de Silva, and J. Langford, "A global geometric framework for nonlinear dimensionality reduction," Science, vol.290, pp. 2319-2323, 2000. (Pubitemid 32041577)
-
(2000)
Science
, vol.290
, Issue.5500
, pp. 2319-2323
-
-
Tenenbaum, J.B.1
De, S.2
Langford, J.C.3
-
132
-
-
0000985504
-
Td-gammon, a self-teaching backgammon program, achieves master-level play
-
G. Tesauro, "Td-gammon, a self-teaching backgammon program, achieves master-level play," Neural Computation, vol.6, pp. 215-219, 1994.
-
(1994)
Neural Computation
, vol.6
, pp. 215-219
-
-
Tesauro, G.1
-
133
-
-
84899951003
-
Graph Laplacian based transfer learning in reinforcement learning
-
Y. Tsao, K. Xiao, and V. Soo, "Graph Laplacian based transfer learning in reinforcement learning," in AAMAS '08: Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 1349-1352, 2008.
-
(2008)
AAMAS '08: Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems
, pp. 1349-1352
-
-
Tsao, Y.1
Xiao, K.2
Soo, V.3
-
140
-
-
0004049893
-
-
PhD thesis, King's College, Cambridge, England
-
C. Watkins, "Learning from delayed rewards," PhD thesis, King's College, Cambridge, England, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.1
-
141
-
-
0012841228
-
Successive matrix squaring algorithm for computing the Drazin inverse
-
Y. Wei, "Successive matrix squaring algorithm for computing the Drazin inverse," Applied Mathematics and Computation, vol.108, pp. 67-75, 2000.
-
(2000)
Applied Mathematics and Computation
, vol.108
, pp. 67-75
-
-
Wei, Y.1
|