SCOPUS 정보 검색 플랫폼

Foundations and Trends in Machine Learning

Volumn 1, Issue 4, 2008, Pages 403-565

Learning representation and control in markov decision processes: New frontiers

(1) Mahadevan, Sridhar a

a Biologically Inspired Neural and Dynamical Systems Laboratory (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DECISION PROBLEMS; DIAGONALIZATIONS; DIMENSIONALITY REDUCTION; DRAZIN INVERSE; EXACT SOLUTION; GENERIC ALGORITHM; LAPLACIAN OPERATOR; LAPLACIANS; LOW-DIMENSIONAL REPRESENTATION; MACHINE-LEARNING; MARKOV DECISION PROCESSES; MATHEMATICAL FRAMEWORKS; MATRIX REPRESENTATION; MODEL FREE; MODEL-BASED; OFF-DIAGONAL ELEMENTS; OPTIMAL CONTROLS; OPTIMAL POLICIES; POLICY ITERATION; ROW SUMS;

COMPUTER SCIENCE; LAPLACE EQUATION; LAPLACE TRANSFORMS; MARKOV PROCESSES; OPTIMIZATION; PROCESS CONTROL;

MATHEMATICAL OPERATORS;

EID: 70349322784 PISSN: 19358237 EISSN: 19358245 Source Type: Journal
DOI: 10.1561/2200000003 Document Type: Article

Times cited : (46)

References (144)

1
- 84898967665
- Sampling techniques for Kernel methods
- MIT Press
- D. Achlioptas, F. McSherry, and B. Scholkopff, "Sampling techniques for Kernel methods," in Proceedings of the 14th International Conference on Neural Information Processing Systems (NIPS), pp. 335-342, MIT Press, 2002.
- (2002) Proceedings of the 14th International Conference on Neural Information Processing Systems (NIPS) , pp. 335-342
- Achlioptas, D.¹ McSherry, F.² Scholkopff, B.³

2
- 14644405640
- On the spectra of nonsymmetric Laplacian matrices
- R. Agaev and P. Cheboratev, "On the spectra of nonsymmetric Laplacian matrices," Linear Algebra and Its Applications, vol.399, pp. 157-168, 2005.
- (2005) Linear Algebra and Its Applications , vol.399 , pp. 157-168
- Agaev, R.¹ Cheboratev, P.²

3
- 0001119106
- On representations of problems of reasoning about actions
- (D. Michie, ed.) Elsevier/North-Holland
- S. Amarei, "On representations of problems of reasoning about actions," in Machine Intelligence 3, (D. Michie, ed.), pp. 131-171, Elsevier/North-Holland, 1968.
- (1968) Machine Intelligence , vol.3 , pp. 131-171
- Amarei, S.¹

4
- 0004221604
- Springer
- S. Axler, P. Bourdon, and W. Ramey, Harmonic Function Theory. Springer, 2001.
- (2001) Harmonic Function Theory.
- Axler, S.¹ Bourdon, P.² Ramey, W.³

5
- 84858765598
- Covariant Policy Search
- J. Bagnell and J. Schneider, "Covariant Policy Search," in Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 1019-1024, 2003.
- (2003) Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pp. 1019-1024
- Bagnell, J.¹ Schneider, J.²

6
- 0003780432
- Oxford: Clarendon Press
- C. T. H. Baker, The Numerical Treatment of Integral Equations. Oxford: Clarendon Press, 1977.
- (1977) The Numerical Treatment of Integral Equations.
- Baker, C.T.H.¹

7
- 0037288370
- Recent advances in hierarchical reinforcement learning
- A. Barto and S. Mahadevan, "Recent advances in hierarchical reinforcement learning," Discrete Event Systems Journal, vol.13, pp. 41-77, 2003.
- (2003) Discrete Event Systems Journal , vol.13 , pp. 41-77
- Barto, A.¹ Mahadevan, S.²

8
- 3142725535
- Semi-supervised learning on Riemannian manifolds
- M. Belkin and P. Niyogi, "Semi-supervised learning on Riemannian manifolds," Machine Learning, vol.56, pp. 209-239, 2004.
- (2004) Machine Learning , vol.56 , pp. 209-239
- Belkin, M.¹ Niyogi, P.²

9
- 84949998208
- Spectral partitioning with indefinite Kernels using the Nyström extension
- S. Belongie, C. Fowlkes, F. Chung, and J. Malik, "Spectral partitioning with indefinite Kernels using the Nyström extension," in Proceedings of the 7th European Conference on Computer Vision, pp. 531-542, 2002.
- (2002) Proceedings of the 7th European Conference on Computer Visio , pp. 531-542
- Belongie, S.¹ Fowlkes, C.² Chung, F.³ Malik, J.⁴

10
- 0003658403
- SIAM Press
- A. Berman and R. Plemmons, Nonnegative Matrices in the Mathematical Sciences. SIAM Press, 1994.
- (1994) Nonnegative Matrices in the Mathematical Sciences.
- Berman, A.¹ Plemmons, R.²

11
- 0024680419
- Adaptive Aggregation Methods for infinite horizon dynamic programming
- D. Bertsekas and D. Castanon, "Adaptive Aggregation Methods for infinite horizon dynamic programming," IEEE Transactions on Automatic Control, vol.34, pp. 589-598, 1989.
- (1989) IEEE Transactions on Automatic Control , vol.34 , pp. 589-598
- Bertsekas, D.¹ Castanon, D.²

12
- 0003487482
- MA: Athena Scientific
- D. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming. Belmont
- Bertsekas, D.¹ Tsitsiklis, J.²

13
- 62949146458
- Approximate dynamic programming using support vector regression
- B. Bethke, J. How, and A. Ozdaglar, "Approximate dynamic programming using support vector regression," in Proceedings of the IEEE Conference on Decision and Control, 2008.
- (2008) Proceedings of the IEEE Conference on Decision and Control
- Bethke, B.¹ How, J.² Ozdaglar, A.³

14
- 84990610636
- Fast wavelet transforms and numerical algorithms
- G. Beylkin, R. R. Coifman, and V. Rokhlin, "Fast wavelet transforms and numerical algorithms," Common Pure and Applied Mathematic, vol.44, pp. 141-183, 1991.
- (1991) Common Pure and Applied Mathematic , vol.44 , pp. 141-183
- Beylkin, G.¹ Coifman, R.R.² Rokhlin, V.³

15
- 0041386088
- A geometric interpretation of the MetropolisHasting algorithm
- L. Billera and P. Diaconis, "A geometric interpretation of the MetropolisHasting algorithm," Statistical Science, vol.16, pp. 335-339, 2001.
- (2001) Statistical Science , vol.16 , pp. 335-339
- Billera, L.¹ Diaconis, P.²

16
- 0038595396
- Least-squares temporal difference learning
- San Francisco, CA: Morgan Kaufmann
- J. A. Boyan, "Least-squares temporal difference learning," in Proceedings of the 16th International Conference on Machine Learning, pp. 49-56, San Francisco, CA: Morgan Kaufmann, 1999.
- (1999) Proceedings of the 16th International Conference on Machine Learning , pp. 49-56
- Boyan, J.A.¹

17
- 0004055894
- Cambridge University Press
- S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge University Press, 2004.
- (2004) Convex Optimization.
- Boyd, S.¹ Vandenberghe, L.²

18
- 0001771345
- Linear least-squares algorithms for temporal difference learning
- S. Bradtke and A. Barto, "Linear least-squares algorithms for temporal difference learning," Machine Learning, vol.22, pp. 33-57, 1996.
- (1996) Machine Learning , vol.22 , pp. 33-57
- Bradtke, S.¹ Barto, A.²

19
- 33745416113
- Diffusion wavelet packets
- July
- J. Bremer, R. Coifman, M. Maggioni, and A. Szlam, "Diffusion wavelet packets," Applied and Computational Harmonic Analysis, vol.21, no.1, pp. 95-112, July 2006.
- (2006) Applied and Computational Harmonic Analysis , vol.21 , Issue.1 , pp. 95-112
- Bremer, J.¹ Coifman, R.² Maggioni, M.³ Szlam, A.⁴

20
- 0003598718
- Pitman
- S. Campbell and C. Meyer, Generalized Inverses of Linear Transformations. Pitman, 1979.
- (1979) Generalized Inverses of Linear Transformations.
- Campbell, S.¹ Meyer, C.²

21
- 0032027940
- The Relations among Potentials, Perturbation Analysis, and Markov Decision Processes
- [21] X. Cao, "The relations among potentials, perturbation analysis, and Markov decision processes," Discrete-Event Dynamic Systems, vol.8, no.1, pp. 71-87, 1998. (Pubitemid 128512397)
- (1998) Discrete Event Dynamic Systems: Theory and Applications , vol.8 , Issue.1 , pp. 71-87
- Cao, X.-R.¹

22
- 33645793931
- Kernels of directed graph Laplacians
- J. Caughman and J. Veerman, "Kernels of directed graph Laplacians," Electronic Journal of Combinatorics, vol.13, no.1, pp. 253-274, 2006.
- (2006) Electronic Journal of Combinatorics , vol.13 , Issue.1 , pp. 253-274
- Caughman, J.¹ Veerman, J.²

23
- 0004228846
- Academic Press
- I. Chavel, Eigenvalues in Riemannian Geometry: Pure and Applied Mathematics. Academic Press, 1984.
- (1984) Eigenvalues in Riemannian Geometry: Pure and Applied Mathematics.
- Chavel, I.¹

24
- 84969165400
- Forest matrices around the Laplacian matrix
- P. Chebotarev and R. Agaev, "Forest matrices around the Laplacian matrix," Linear Algebra and Its Applications, vol.15, no.1, pp. 253-274, 2002.
- (2002) Linear Algebra and Its Applications , vol.15 , Issue.1 , pp. 253-274
- Chebotarev, P.¹ Agaev, R.²

25
- 0028401429
- Generalized matrix inversion and rank computation by repeated squaring
- L. Chen, E. Krishnamurthy, and I. Macleod, "Generalized matrix inversion and rank computation by repeated squaring," Parallel Computing, vol.20, pp. 297-311, 1994.
- (1994) Parallel Computing , vol.20 , pp. 297-311
- Chen, L.¹ Krishnamurthy, E.² Macleod, I.³

26
- 70349365936
- American Mathematical Society
- F. Chung, Spectral Graph Theory, Number 92 in CBMS Regional Conference Series in Mathematics. American Mathematical Society, 1997.
- (1997) Spectral Graph Theory, Number 92 in CBMS Regional Conference Series in Mathematics.
- Chung, F.¹

27
- 17444366585
- Laplacians and the Cheeger inequality for directed graphs
- April
- F. Chung, "Laplacians and the Cheeger inequality for directed graphs," Annals of Combinatorics, vol.9, no.1, pp. 1-19, April 2005.
- (2005) Annals of Combinatorics , vol.9 , Issue.1 , pp. 1-19
- Chung, F.¹

28
- 19644394100
- Geometric diffusions as a tool for harmonic analysis and structure definition of data. Part i: Diffusion maps
- May
- R. Coifman, S. Lafon, A. Lee, M. Maggioni, B. Nadler, F. Warner, and S. Zucker, "Geometric diffusions as a tool for harmonic analysis and structure definition of data. Part i: Diffusion maps," Proceedings of National Academy of Science, vol.102, no.21, pp. 7426-7431, May 2005.
- (2005) Proceedings of National Academy of Science , vol.102 , Issue.21 , pp. 7426-7431
- Coifman, R.¹ Lafon, S.² Lee, A.³ Maggioni, M.⁴ Nadler, B.⁵ Warner, F.⁶ Zucker, S.⁷

29
- 19644366699
- Geometrie diffusions as a tool for harmonie analysis and structure definition of data. Part ii: Multiscale methods
- May
- R. Coifman, S. Lafon, A. Lee, M. Maggioni, B. Nadler, F. Warner, and S. Zucker, "Geometrie diffusions as a tool for harmonie analysis and structure definition of data. Part ii: Multiscale methods," Proceedings of the National Academy of Science, vol.102, no.21, pp. 7432-7437, May 2005.
- (2005) Proceedings of the National Academy of Science , vol.102 , Issue.21 , pp. 7432-7437
- Coifman, R.¹ Lafon, S.² Lee, A.³ Maggioni, M.⁴ Nadler, B.⁵ Warner, F.⁶ Zucker, S.⁷

30
- 33745332989
- Diffusion wavelets
- July
- R. Coifman and M. Maggioni, "Diffusion wavelets," Applied and Computational Harmonic Analysis, vol.21, no.1, pp. 53-94, July 2006.
- (2006) Applied and Computational Harmonic Analysis , vol.21 , Issue.1 , pp. 53-94
- Coifman, R.¹ Maggioni, M.²

31
- 25844521242
- Geometric diffusions for the analysis of data from sensor networks
- October
- R. Coifman, M. Maggioni, S. Zucker, and I. Kevrekidis, "Geometric diffusions for the analysis of data from sensor networks," Curr Opin Neurobiol, vol.15, no.5, pp. 576-584, October 2005.
- (2005) Curr Opin Neurobiol , vol.15 , Issue.5 , pp. 576-584
- Coifman, R.¹ Maggioni, M.² Zucker, S.³ Kevrekidis, I.⁴

32
- 0004167131
- Academic Press
- D. Cvetkovic, M. Doob, and H. Sachs, Spectra of Graphs: Theory and Application. Academic Press, 1980.
- (1980) Spectra of Graphs: Theory and Application.
- Cvetkovic, D.¹ Doob, M.² Sachs, H.³

33
- 0032643313
- Solving semi-Markov decision problems using average-reward reinforcement learning
- T. Das, A. Gosavi, S. Mahadevan, and N. Marchalleck, "Solving semi-Markov decision problems using average-reward reinforcement learning," Management Science, vol.45, no.4, pp. 560-574, 1999.
- (1999) Management Science , vol.45 , Issue.4 , pp. 560-574
- Das, T.¹ Gosavi, A.² Mahadevan, S.³ Marchalleck, N.⁴

34
- 0003833285
- Society for Industrial and Applied Mathematics.
- I. Daubechies, Ten Lectures on Wavelets, Society for Industrial and Applied Mathematics. 1992.
- (1992) Ten Lectures on Wavelets
- Daubechies, I.¹

35
- 0001158047
- Improving generalisation for temporal difference learning: The successor representation
- P. Dayan, "Improving generalisation for temporal difference learning: The successor representation," Neural Computation, vol.5, pp. 613-624, 1993.
- (1993) Neural Computation , vol.5 , pp. 613-624
- Dayan, P.¹

36
- 70349371371
- The linear programming approach to approximate dynamic programming
- John Wiley and Sons
- D. de Farias, "The linear programming approach to approximate dynamic programming," in Learning and Approximate Dynamic Programming: Scaling Up to the Real World, John Wiley and Sons, 2003.
- (2003) Learning and Approximate Dynamic Programming: Scaling Up to the Real World
- De Farias, D.¹

37
- 0003397420
- Canadian Mathematical Society
- F. Deutsch, Best Approximation in Inner Product Spaces. Canadian Mathematical Society, 2001.
- (2001) Best Approximation in Inner Product Spaces.
- Deutsch, F.¹

38
- 84899029004
- Batch value function approximation using support vectors
- MIT Press
- T. Dietterich and X. Wang, "Batch value function approximation using support vectors," in Proceedings of Neural Information Processing Systems, MIT Press, 2002.
- (2002) Proceedings of Neural Information Processing Systems
- Dietterich, T.¹ Wang, X.²

39
- 29244453931
- On the Nyström method for approximating a Gram matrix for improved Kernel-based learning
- P. Drineas and M. W. Mahoney, "On the Nyström method for approximating a Gram matrix for improved Kernel-based learning," Journal of Machine Learning Research, vol.6, pp. 2153-2175, 2005.
- (2005) Journal of Machine Learning Research , vol.6 , pp. 2153-2175
- Drineas, P.¹ Mahoney, M.W.²

40
- 0043247546
- Accelerating reinforcement learning by composing solutions of automatically identified subtasks
- C. Drummond, "Accelerating reinforcement learning by composing solutions of automatically identified subtasks," Journal of AI Research, vol.16, pp. 59-104, 2002.
- (2002) Journal of AI Research , vol.16 , pp. 59-104
- Drummond, C.¹

41
- 85095808297
- Geometric aspects of the theory of Krylov subspace methods
- [41] M. Eiermann and O. Ernst, "Geometric aspects of the theory of Krylov subspace methods," Acta Numérica, pp. 251-312, 2001. (Pubitemid 33305812)
- (2001) ACTA NUMERICA , pp. 251-312
- Eiermann, M.¹ Ernst, O.G.²

42
- 1942421151
- Bayes meets Bellman: The Gaussian process approach to temporal difference learning
- AAAI Press
- Y. Engel, S. Mannor, and R. Meir, "Bayes meets Bellman: The Gaussian process approach to temporal difference learning," in Proceedings of the 20th International Conference on Machine Learning, pp. 154-161, AAAI Press, 2003.
- (2003) Proceedings of the 20th International Conference on Machine Learning , pp. 154-161
- Engel, Y.¹ Mannor, S.² Meir, R.³

43
- 58349096666
- Proto-transfer learning in Markov decision processes using spectral methods
- K. Ferguson and S. Mahadevan, "Proto-transfer learning in Markov decision processes using spectral methods," in International Conference on Machine Learning (ICML) Workshop on Transfer Learning, 2006.
- (2006) International Conference on Machine Learning (ICML) Workshop on Transfer Learning
- Ferguson, K.¹ Mahadevan, S.²

44
- 0001350119
- Algebraic connectivity of graphs
- M. Fiedler, "Algebraic connectivity of graphs," Czechoslovak Mathematical Journal, vol.23, no.98, pp. 298-305, 1973.
- (1973) Czechoslovak Mathematical Journal , vol.23 , Issue.98 , pp. 298-305
- Fiedler, M.¹

45
- 0036832959
- Structure in the space of value functions
- D. Foster and P. Dayan, "Structure in the space of value functions," Machine Learning, vol.49, pp. 325-346, 2002.
- (2002) Machine Learning , vol.49 , pp. 325-346
- Foster, D.¹ Dayan, P.²

46
- 0032308232
- Fast Monte Carlo algorithms for finding low-rank approximations
- A. Frieze, R. Kannan, and S. Vempala, "Fast Monte Carlo algorithms for finding low-rank approximations," in Proceedings of the 39th Annual IEEE Symposium on Foundations of Computer Science, pp. 370-378, 1998.
- (1998) Proceedings of the 39th Annual IEEE Symposium on Foundations of Computer Science , pp. 370-378
- Frieze, A.¹ Kannan, R.² Vempala, S.³

47
- 36949027865
- Hierarchical average-reward reinforcement learning
- M. Ghavamzadeh and S. Mahadevan, "Hierarchical average-reward reinforcement learning," Journal of Machine Learning Research, vol.8, pp. 2629-2669, 2007.
- (2007) Journal of Machine Learning Research , vol.8 , pp. 2629-2669
- Ghavamzadeh, M.¹ Mahadevan, S.²

48
- 70349352794
- Model minimization in Markov decision processes
- R. Givan and T. Dean, "Model minimization in Markov decision processes," AAAI, 1997.
- (1997) AAAI
- Givan, R.¹ Dean, T.²

49
- 0004236492
- Johns Hopkins University Press
- G. Golub and C. V. Loan, Matrix Computations. Johns Hopkins University Press, 1989.
- (1989) Matrix Computations.
- Golub, G.¹ Loan, C.V.²

50
- 0038595393
- Technical Report, CMU-CS-95-103, Department of Computer Science, Carnegie Mellon University
- G. Gordon, "Stable function approximation in dynamic programming," Technical Report, CMU-CS-95-103, Department of Computer Science, Carnegie Mellon University, 1995.
- (1995) Stable Function Approximation in Dynamic Programming
- Gordon, G.¹

51
- 4544318426
- Efficient solution algorithms for factored MDPs
- C. Guestrin, D. Koller, R. Parr, and S. Venkataraman, "Efficient solution algorithms for factored MDPs," Journal of AI Research, vol.19, pp. 399-468, 2003.
- (2003) Journal of AI Research , vol.19 , pp. 399-468
- Guestrin, C.¹ Koller, D.² Parr, R.³ Venkataraman, S.⁴

52
- 0345235423
- Group Representations and Laplacians. North-Holland
- D. Gurarie, Symmetries and Laplacians: Introduction to Harmonic Analysis, Group Representations and Laplacians. North-Holland, 1992.
- (1992) Symmetries and Laplacians: Introduction to Harmonic Analysis
- Gurarie, D.¹

53
- 34547313657
- Graph Laplacians and their convergence on random neighborhood graphs
- M. Hein, J. Audibert, and U. von Luxburg, "Graph Laplacians and their convergence on random neighborhood graphs," Journal of Machine Learning Research, vol.8, pp. 1325-1368, 2007.
- (2007) Journal of Machine Learning Research , vol.8 , pp. 1325-1368
- Hein, M.¹ Audibert, J.² Von Luxburg, U.³

54
- 31844438487
- Online learning over graphs
- M. Herbster, M. Pontil, and L. Wainer, "Online learning over graphs," in Proceedings of the Twenty-Second International Conference on Machine Learning, 2005.
- (2005) Proceedings of the Twenty-Second International Conference on Machine Learning
- Herbster, M.¹ Pontil, M.² Wainer, L.³

55
- 0002956570
- SPUDD: Stochastic planning using decision diagrams
- Morgan Kaufmann
- J. Hoey, R. St-aubin, A. Hu, and C. Boutilier, "SPUDD: Stochastic planning using decision diagrams," in Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, pp. 279-288, Morgan Kaufmann, 1999.
- (1999) Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence , pp. 279-288
- Hoey, J.¹ St-aubin, R.² Hu, A.³ Boutilier, C.⁴

56
- 0003644124
- MIT Press
- R. Howard, Dynamic Programming and Markov Decision Processes. MIT Press, 1960.
- (1960) Dynamic Programming and Markov Decision Processes.
- Howard, R.¹

57
- 34547971381
- Constructing basis functions from directed graphs for value function approximation
- ACM Press
- J. Johns and S. Mahadevan, "Constructing basis functions from directed graphs for value function approximation," in Proceedings of the International Conference on Machine Learning (ICML), pp. 385-392, ACM Press, 2007.
- (2007) Proceedings of the International Conference on Machine Learning (ICML) , pp. 385-392
- Johns, J.¹ Mahadevan, S.²

58
- 34547980771
- Compact spectral bases for value function approximation using Kronecker factorization
- J. Johns, S. Mahadevan, and C. Wang, "Compact spectral bases for value function approximation using Kronecker factorization," in Proceedings of the National Conference on Artificial Intelligence (AAAI), 2007.
- (2007) Proceedings of the National Conference on Artificial Intelligence (AAAI)
- Johns, J.¹ Mahadevan, S.² Wang, C.³

59
- 0003946510
- Springer-Verlag
- T. Jolliffe, Principal Components Analysis. Springer-Verlag, 1986.
- (1986) Principal Components Analysis.
- Jolliffe, T.¹

60
- 35748975552
- Forthcoming
- P. Jones, M. Maggioni, and R. Schul, "Universal parametrizations via Eigenfunctions of the Laplacian and heat kernels," Forthcoming 2007.
- (2007) Universal Parametrizations Via Eigenfunctions of the Laplacian and Heat Kernels
- Jones, P.¹ Maggioni, M.² Schul, R.³

61
- 84898930479
- A natural policy gradient
- MIT Press
- S. Kakadě, "A natural policy gradient," in Proceedings of Neural Information Processing Systems, MIT Press, 2002.
- (2002) Proceedings of Neural Information Processing Systems
- Kakadě, S.¹

62
- 0032131147
- A fast and high quality multilevel scheme for partitioning irregular graphs
- G. Karypis and V. Kumar, "A fast and high quality multilevel scheme for partitioning irregular graphs," SIAM Journal of Scientific Computing, vol.20, no.1, pp. 359-392, 1999.
- (1999) SIAM Journal of Scientific Computing , vol.20 , Issue.1 , pp. 359-392
- Karypis, G.¹ Kumar, V.²

63
- 33846689581
- Block diagonalization of Laplacian matrices of symmetric graphs using group theory
- A. Kaveh and A. Nikbakht, "Block diagonalization of Laplacian matrices of symmetric graphs using group theory," International Journal for Numerical Methods in Engineering, vol.69, pp. 908-947, 2007.
- (2007) International Journal for Numerical Methods in Engineering , vol.69 , pp. 908-947
- Kaveh, A.¹ Nikbakht, A.²

64
- 84957661928
- Using temporal neighborhoods to adapt function approximators in reinforcement learning
- R. Kretchmar and C. Anderson, "Using temporal neighborhoods to adapt function approximators in reinforcement learning," in International Work Conference on Artificial and Natural Neural Networks, pp. 488-496, 1999.
- (1999) International Work Conference on Artificial and Natural Neural Networks , pp. 488-496
- Kretchmar, R.¹ Anderson, C.²

65
- 9944258743
- Springer
- H. Kushner and G. Yin, Stochastic Approximation and Recursive Algorithms and Applications. Springer, 2003.
- (2003) Stochastic Approximation and Recursive Algorithms and Applications
- Kushner, H.¹ Yin, G.²

66
- 33750595113
- Learning basis functions in hybrid domains
- B. Kveton, "Learning basis functions in hybrid domains," in Proceedings of the 21st National Conference on Artificial Intelligence, pp. 1161-1166, 2006.
- (2006) Proceedings of the 21st National Conference on Artificial Intelligence , pp. 1161-1166
- Kveton, B.¹

67
- 21844459752
- Diffusion Kernels on statistical manifolds
- J. Lafferty and G. Lebanon, "Diffusion Kernels on statistical manifolds," Journal of Machine Learning Research, vol.6, pp. 129-163, 2005.
- (2005) Journal of Machine Learning Research , vol.6 , pp. 129-163
- Lafferty, J.¹ Lebanon, G.²

68
- 26444490324
- PhD thesis, Yale University, Department of Mathematics and Applied Mathematics
- S. Lafon, "Diffusion maps and geometric harmonics," PhD thesis, Yale University, Department of Mathematics and Applied Mathematics, 2004.
- (2004) Diffusion Maps and Geometric Harmonics
- Lafon, S.¹

69
- 4644323293
- Least-squares policy iteration
- M. Lagoudakis and R. Parr, "Least-squares policy iteration," Journal of Machine Learning Research, vol.4, pp. 1107-1149, 2003.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.¹ Parr, R.²

70
- 33750184660
- Updating the stationary vector of an irreducible Markov chain with an eye on google's pagerank
- A. Langville and C. Meyer, "Updating the stationary vector of an irreducible Markov chain with an eye on google's pagerank," SIAM Journal on Matrix Analysis, vol.27, pp. 968-987, 2005.
- (2005) SIAM Journal on Matrix Analysis , vol.27 , pp. 968-987
- Langville, A.¹ Meyer, C.²

71
- 0003896351
- Kluwer Academic Press
- J. C. Latombe, Robot Motion Planning. Kluwer Academic Press, 1991.
- (1991) Robot Motion Planning.
- Latombe, J.C.¹

72
- 77952010176
- Cambridge University Press
- S. Lavalle, Planning Algorithms. Cambridge University Press, 2006.
- (2006) Planning Algorithms.
- Lavalle, S.¹

73
- 49749112240
- Springer
- J. Lee and M. Verleysen, Nonlinear Dimensionality Reduction. Springer, 2007.
- (2007) Nonlinear Dimensionality Reduction
- Lee, J.¹ Verleysen, M.²

74
- 15544382868
- Springer
- J. M. Lee, Introduction to Smooth Manifolds. Springer, 2003.
- (2003) Introduction to Smooth Manifolds
- Lee, J.M.¹

75
- 84864535343
- Towards a unified theory of state abstraction for MDPs
- L. Li, T. Walsh, and M. Littman, "Towards a unified theory of state abstraction for MDPs," in Proceedings of the Ninth International Symposium on Artificial Intelligence and Mathematics, pp. 531-539, 2006.
- (2006) Proceedings of the Ninth International Symposium on Artificial Intelligence and Mathematics , pp. 531-539
- Li, L.¹ Walsh, T.² Littman, M.³

76
- 33749267463
- Fast direct policy evaluation using multiscale analysis of Markov diffusion processes
- New York, NY, USA: ACM Press
- M. Maggioni and S. Mahadevan, "Fast direct policy evaluation using multiscale analysis of Markov diffusion processes," in Proceedings of the 23rd International Conference on Machine Learning, pp. 601-608, New York, NY, USA: ACM Press, 2006.
- (2006) Proceedings of the 23rd International Conference on Machine Learning , pp. 601-608
- Maggioni, M.¹ Mahadevan, S.²

77
- 31844433360
- Proto-value functions: Developmental reinforcement learning
- S. Mahadevan, "Proto-value functions: Developmental reinforcement learning," in Proceedings of the International Conference on Machine Learning, pp. 553-560, 2005.
- (2005) Proceedings of the International Conference on Machine Learning , pp. 553-560
- Mahadevan, S.¹

78
- 34547966269
- Representation policy iteration
- AUAI Press
- S. Mahadevan, "Representation policy iteration," in Proceedings of the 21st Annual Conference on Uncertainty in Artificial Intelligence (UAI-05), pp. 372-437, AUAI Press, 2005.
- (2005) Proceedings of the 21st Annual Conference on Uncertainty in Artificial Intelligence (UAI-05) , pp. 372-437
- Mahadevan, S.¹

79
- 57749114470
- Fast spectral learning using lanczos eigenspace projections
- S. Mahadevan, "Fast spectral learning using lanczos eigenspace projections," in Proceedings of the National Conference on Artificial Intelligence (AAAI), 2008.
- (2008) Proceedings of the National Conference on Artificial Intelligence (AAAI)
- Mahadevan, S.¹

80
- 67349090062
- Morgan and Claypool Publishers
- S. Mahadevan, Representation Discovery Using Harmonic Analysis. Morgan and Claypool Publishers, 2008.
- (2008) Representation Discovery Using Harmonic Analysis.
- Mahadevan, S.¹

81
- 0026880130
- Automatic programming of behavior-based robots using reinforcement learning
- [81] S. Mahadevan and J. Connell, "Automatic programming of behaviorbased robots using reinforcement learning," Artificial Intelligence, vol.55, pp. 311-365, 1992. Appeared originally as IBM TR RC16359, December 1990. (Pubitemid 23565211)
- (1992) Artificial Intelligence , vol.55 , Issue.2-3 , pp. 311-365
- Mahadevan, S.¹ Connell, J.²

82
- 33746067324
- Value function approximation with diffusion wavelets and Laplacian eigenfunctions
- MIT Press
- S. Mahadevan and M. Maggioni, "Value function approximation with diffusion wavelets and Laplacian eigenfunctions," in Proceedings of the Neural Information Processing Systems (NIPS), MIT Press, 2006.
- (2006) Proceedings of the Neural Information Processing Systems (NIPS)
- Mahadevan, S.¹ Maggioni, M.²

83
- 35748957806
- Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes
- S. Mahadevan and M. Maggioni, "Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes," Journal of Machine Learning Research, vol.8, pp. 2169-2231, 2007.
- (2007) Journal of Machine Learning Research , vol.8 , pp. 2169-2231
- Mahadevan, S.¹ Maggioni, M.²

84
- 33750591731
- Learning representation and control in continuous Markov decision processes
- S. Mahadevan, M. Maggioni, K. Ferguson, and S. Osentoski, "Learning representation and control in continuous Markov decision processes," in Proceedings of the National Conference on Artificial Intelligence (AAAI), 2006.
- (2006) Proceedings of the National Conference on Artificial Intelligence (AAAI)
- Mahadevan, S.¹ Maggioni, M.² Ferguson, K.³ Osentoski, S.⁴

85
- 0001963197
- Self-improving factory simulation using continuous-time average-reward reinforcement learning
- Morgan Kaufmann
- S. Mahadevan, N. Marchalleck, T. Das, and A. Gosavi, "Self-improving factory simulation using continuous-time average-reward reinforcement learning," in Proceedings of 14-th International Conference on Machine Learning, pp. 202-210, Morgan Kaufmann, 1997.
- (1997) Proceedings of 14-th International Conference on Machine Learning , pp. 202-210
- Mahadevan, S.¹ Marchalleck, N.² Das, T.³ Gosavi, A.⁴

86
- 0024700097
- A theory for multiresolution signal decomposition: The wavelet representation
- S. Mallat, "A theory for multiresolution signal decomposition: The wavelet representation," IEEE Transactions on Pattern Analysis of Machanical Intelligence, vol.11, no.7, pp. 674-693, 1989.
- (1989) IEEE Transactions on Pattern Analysis of Machanical Intelligence , vol.11 , Issue.7 , pp. 674-693
- Mallat, S.¹

87
- 0003456805
- Academic Press
- S. Mallat, A Wavelet Tour in Signal Processing. Academic Press, 1998.
- (1998) A Wavelet Tour in Signal Processing.
- Mallat, S.¹

88
- 57749103516
- Computing isotypic projections with the lanczos iteration
- D. Malsen, M. Orrison, and D. Rockmore, "Computing isotypic projections with the lanczos iteration," SIAM, vol.2, nos. 60/61, pp. 601-628, 2003.
- (2003) SIAM , vol.2 , Issue.60-61 , pp. 601-628
- Malsen, D.¹ Orrison, M.² Rockmore, D.³

89
- 14344250635
- Dynamic abstraction in reinforcement learning via clustering
- S. Mannor, I. Menache, A. Hoze, and U. Klein, "Dynamic abstraction in reinforcement learning via clustering," International Conference on Machine Learning, 2004.
- (2004) International Conference on Machine Learning
- Mannor, S.¹ Menache, I.² Hoze, A.³ Klein, U.⁴

90
- 0013527886
- PhD thesis, University of Massachusetts, Amherst
- A. McGovern, "Autonomous discovery of temporal abstractions from interactions with an environment," PhD thesis, University of Massachusetts, Amherst, 2002.
- (2002) Autonomous Discovery of Temporal Abstractions from Interactions with An Environment
- McGovern, A.¹

91
- 84898985184
- Learning segmentation by random walks
- M. Meila and J. Shi, "Learning segmentation by random walks," NIPS, 2001.
- (2001) NIPS
- Meila, M.¹ Shi, J.²

92
- 0043256056
- Sensitivity of the stationary distribution of a Markov chain
- C. Meyer, "Sensitivity of the stationary distribution of a Markov chain," SIAM Journal of Matrix Analysis and Applications, vol.15, no.3, pp. 715-728, 1994.
- (1994) SIAM Journal of Matrix Analysis and Applications , vol.15 , Issue.3 , pp. 715-728
- Meyer, C.¹

93
- 0008813538
- Barycentric interpolators for continuous space and time reinforcement learning
- MIT Press
- A. Moore, "Barycentric interpolators for continuous space and time reinforcement learning," in Advances in Neural Information Processing Systems, MIT Press, 1998.
- (1998) Advances in Neural Information Processing Systems
- Moore, A.¹

94
- 1942516880
- Error bounds for approximate policy iteration
- R. Munos, "Error bounds for approximate policy iteration," in Proceedings of the International Conference on Machine Learning (ICML), pp. 560-567, 2003.
- (2003) Proceedings of the International Conference on Machine Learning (ICML) , pp. 560-567
- Munos, R.¹

95
- 0004027474
- Princeton University Press
- E. Nelson, Tensor Analysis. Princeton University Press, 1968.
- (1968) Tensor Analysis.
- Nelson, E.¹

96
- 84899013108
- On spectral clustering: Analysis and an algorithm
- A. Ng, M. Jordan, and Y. Weiss, "On spectral clustering: Analysis and an algorithm," NIPS, 2002.
- (2002) NIPS
- Ng, A.¹ Jordan, M.² Weiss, Y.³

97
- 84898980684
- Autonomous helicopter flight via Reinforcement Learning
- A. Ng, H. Kim, M. Jordan, and S. Sastry, "Autonomous helicopter flight via Reinforcement Learning," in Proceedings of Neural Information Processing Systems, 2004.
- (2004) Proceedings of Neural Information Processing Systems
- Ng, A.¹ Kim, H.² Jordan, M.³ Sastry, S.⁴

98
- 30844447280
- Technical Report TR-2001-2030, University of Chicago, Computer Science Deparment, November
- P. Niyogi and M. Belkin, "Semi-supervised learning on Riemannian manifolds," Technical Report TR-2001-2030, University of Chicago, Computer Science Deparment, November 2001.
- (2001) Semi-supervised Learning on Riemannian Manifolds
- Niyogi, P.¹ Belkin, M.²

99
- 34547995167
- Techncial Report, University of Chicago, November
- P. Niyogi, I. Matveeva, and M. Belkin, "Regression and regularization on large graphs," Techncial Report, University of Chicago, November 2003.
- (2003) Regression and Regularization on Large Graphs
- Niyogi, P.¹ Matveeva, I.² Belkin, M.³

100
- 0036832956
- Kernel-based reinforcement learning
- D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol.49, nos. 2-3, pp. 161-178, 2002.
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

101
- 70349348017
- PhD thesis, University of Massachusetts, Amherst
- S. Osentoski, "Action-based representation discovery in Markov decision processes," PhD thesis, University of Massachusetts, Amherst, 2009.
- (2009) Action-based Representation Discovery in Markov Decision Processes
- Osentoski, S.¹

102
- 34547966587
- Learning state action basis functions for Hierarchical Markov decison processes
- S. Osentoski and S. Mahadevan, "Learning state action basis functions for Hierarchical Markov decison processes," in Proceedings of the International Conference on Machine Learning (ICML), pp. 705-712, 2007.
- (2007) Proceedings of the International Conference on Machine Learning (ICML) , pp. 705-712
- Osentoski, S.¹ Mahadevan, S.²

103
- 34547982545
- Analyzing feature generation for value-function approximation
- ICML 2007 - Proceedings of the 24th International Conference on Machine Learning
- [103] R. Parr, C. Painter-Wakefiled, L. Li, and M. Littman, "Analyzing feature generation for value function approximation," in Proceedings of the International Conference on Machine Learning (ICML), pp. 737-744, 2007. (Pubitemid 350094046)
- (2007) ICML 2007 - Proceedings of the 24th International Conference on Machine Learning , pp. 737-744
- Parr, R.¹ Painter-Wakefield, C.² Li, H.³ Littman, M.⁴

104
- 56449092660
- An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning
- R. Parr, C. Painter-Wakefiled, L. Li, and M. Littman, "An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning," in Proceedings of the International Conference on Machine Learning (ICML), 2008.
- (2008) Proceedings of the International Conference on Machine Learning (ICML)
- Parr, R.¹ Painter-Wakefiled, C.² Li, L.³ Littman, M.⁴

105
- 2942516828
- Reinforcement learning for humanoid robots
- J. Peters, S. Vijaykumar, and S. Schaal, "Reinforcement learning for humanoid robots," in Proceedings of the Third IEEE-RAS International Conference on Humanoid Robots, 2003.
- (2003) Proceedings of the Third IEEE-RAS International Conference on Humanoid Robots
- Peters, J.¹ Vijaykumar, S.² Schaal, S.³

106
- 84880899807
- An analysis of Laplacian methods for value function approximation in MDPs
- M. Petrik, "An analysis of Laplacian methods for value function approximation in MDPs," in Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 2574-2579, 2007.
- (2007) Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) , pp. 2574-2579
- Petrik, M.¹

107
- 33748561594
- Value Directed Compression of POMDPs
- P. Poupart and C. Boutilier, "Value Directed Compression of POMDPs," in Proceedings of the International Conference on Neural Information Processing Systems (NIPS), 2003.
- (2003) Proceedings of the International Conference on Neural Information Processing Systems (NIPS)
- Poupart, P.¹ Boutilier, C.²

108
- 0036927202
- Piecewise linear value function approximation for factored Markov decision processes
- P. Poupart, C. Boutilier, R. Patrascu, and D. Schuurmans, "Piecewise linear value function approximation for factored Markov decision processes," in Proceedings of the National Conference on Artificial Intelligence (AAAI), pp. 285-291, 2002.
- (2002) Proceedings of the National Conference on Artificial Intelligence (AAAI) , pp. 285-291
- Poupart, P.¹ Boutilier, C.² Patrascu, R.³ Schuurmans, D.⁴

109
- 47349092417
- Wiley
- W. Powell, Approximate Dynamic Programming: Solving the Curses of Dimensionality. Wiley, 2007.
- (2007) Approximate Dynamic Programming: Solving the Curses of Dimensionality.
- Powell, W.¹

110
- 0003998452
- New York, USA: Wiley Interscience
- M. L. Puterman, Markov Decision Processes. New York, USA: Wiley Interscience, 1994.
- (1994) Markov Decision Processes.
- Puterman, M.L.¹

111
- 84899026055
- Gaussian processes in reinforcement learning
- MIT Press
- C. Rasmussen and M. Kuss, "Gaussian processes in reinforcement learning," in Proceedings of the International Conference on Neural Information Processing Systems, pp. 751-759, MIT Press, 2004.
- (2004) Proceedings of the International Conference on Neural Information Processing Systems , pp. 751-759
- Rasmussen, C.¹ Kuss, M.²

112
- 84880771557
- SMDP homomorphisms: An algebraic approach to abstraction in semi-Markov decision processes
- B. Ravindran and A. Barto, "SMDP homomorphisms: An algebraic approach to abstraction in semi-Markov decision processes," in Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003.
- (2003) Proceedings of the 18th International Joint Conference on Artificial Intelligence
- Ravindran, B.¹ Barto, A.²

113
- 35448951870
- Springer
- C. Robert and G. Casella, Monte-Carlo Methods in Statistics. Springer, 2005.
- (2005) Monte-Carlo Methods in Statistics.
- Robert, C.¹ Casella, G.²

114
- 31844448949
- Coarticulation: An approach for generating concurrent plans in Markov decision processes
- ACM Press
- K. Rohanimanesh and S. Mahadevan, "Coarticulation: An approach for generating concurrent plans in Markov decision processes," in Proceedings of the International Conference on Machine Learning, ACM Press, 2005.
- (2005) Proceedings of the International Conference on Machine Learning
- Rohanimanesh, K.¹ Mahadevan, S.²

115
- 0003722214
- Cambridge University Press
- S. Rosenberg, The Laplacian on a Riemannian Manifold. Cambridge University Press, 1997.
- (1997) The Laplacian on A Riemannian Manifold.
- Rosenberg, S.¹

116
- 0034704222
- Nonlinear dimensionality reduction by locally linear embedding
- DOI 10.1126/science.290.5500.2323
- [116] S. Roweis and L. Saul, "Nonlinear dimensionality reduction by local linear embedding," Science, vol.290, pp. 2323-2326, 2000. (Pubitemid 32041578)
- (2000) Science , vol.290 , Issue.5500 , pp. 2323-2326
- Roweis, S.T.¹ Saul, L.K.²

117
- 0003584577
- PrenticeHall
- S. Rusell and P. Norvig, Artificial Intelligence: A Modern Approach. PrenticeHall, 2002.
- (2002) Artificial Intelligence: A Modern Approach.
- Rusell, S.¹ Norvig, P.²

118
- 1842829625
- SIAM Press
- Y. Saad, Iterative Methods for Sparse Linear Systems. SIAM Press, 2003.
- (2003) Iterative Methods for Sparse Linear Systems.
- Saad, Y.¹

119
- 32844474095
- Reinforcement learning with factored states and actions
- B. Sallans and G. Hinton, "Reinforcement learning with factored states and actions," Journal of Machine Learning Research, vol.5, pp. 1063-1088, 2004.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 1063-1088
- Sallans, B.¹ Hinton, G.²

120
- 0003408420
- MIT Press
- B. Scholkopf and A. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, 2001.
- (2001) Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond.
- Scholkopf, B.¹ Smola, A.²

121
- 0001296683
- Perturbation theory and finite Markov chains
- P. Schweitzer, "Perturbation theory and finite Markov chains," Journal of Applied Probability, vol.5, no.2, pp. 410-413, 1968.
- (1968) Journal of Applied Probability , vol.5 , Issue.2 , pp. 410-413
- Schweitzer, P.¹

122
- 0000273218
- Generalized polynomial approximations in Markov decision processes
- P. Schweitzer and A. Seidmann, "Generalized polynomial approximations in Markov decision processes," Journal of Mathematical Analysis and Applications, vol.110, pp. 568-582, 1985.
- (1985) Journal of Mathematical Analysis and Applications , vol.110 , pp. 568-582
- Schweitzer, P.¹ Seidmann, A.²

123
- 0003535571
- Springer
- J. Serre, Linear Representations of Finite Groups. Springer, 1977.
- (1977) Linear Representations of Finite Groups.
- Serre, J.¹

124
- 26944499565
- Approximate policy construction using decision diagrams
- R. St-Aubin, J. Hoey, and C. Boutilier, "Approximate policy construction using decision diagrams," NIPS, 2000.
- (2000) NIPS
- St-Aubin, R.¹ Hoey, J.² Boutilier, C.³

125
- 45349105828
- Princeton University Press
- E. M. Stein and R. Shakarchi, Fourier Analysis: An Introduction. Princeton University Press, 2003.
- (2003) Fourier Analysis: An Introduction.
- Stein, E.M.¹ Shakarchi, R.²

126
- 0004245243
- Academic Press
- G. Stewart and J. Sun, Matrix Perturbation Theory. Academic Press, 1990.
- (1990) Matrix Perturbation Theory.
- Stewart, G.¹ Sun, J.²

127
- 0004203940
- Wellesley-Cambridge Press
- G. Strang, Introduction to Linear Algebra. Wellesley-Cambridge Press, 2003.
- (2003) Introduction to Linear Algebra.
- Strang, G.¹

128
- 0003916311
- PhD thesis, Stanford University
- D. Subramanian, "A theory of justified reformulations," PhD thesis, Stanford University, 1989.
- (1989) A Theory of Justified Reformulations
- Subramanian, D.¹

129
- 0003420416
- MIT Press
- R. Sutton and A. G. Barto, An Introduction to Reinforcement Learning. MIT Press, 1998.
- (1998) An Introduction to Reinforcement Learning.
- Sutton, R.¹ Barto, A.G.²

130
- 33847202724
- Learning to predict by the methods of temporal differences
- R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol.3, pp. 9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

131
- 0034704229
- A global geometric framework for nonlinear dimensionality reduction
- DOI 10.1126/science.290.5500.2319
- [131] J. Tenenbaum, V. de Silva, and J. Langford, "A global geometric framework for nonlinear dimensionality reduction," Science, vol.290, pp. 2319-2323, 2000. (Pubitemid 32041577)
- (2000) Science , vol.290 , Issue.5500 , pp. 2319-2323
- Tenenbaum, J.B.¹ De, S.² Langford, J.C.³

132
- 0000985504
- Td-gammon, a self-teaching backgammon program, achieves master-level play
- G. Tesauro, "Td-gammon, a self-teaching backgammon program, achieves master-level play," Neural Computation, vol.6, pp. 215-219, 1994.
- (1994) Neural Computation , vol.6 , pp. 215-219
- Tesauro, G.¹

133
- 84899951003
- Graph Laplacian based transfer learning in reinforcement learning
- Y. Tsao, K. Xiao, and V. Soo, "Graph Laplacian based transfer learning in reinforcement learning," in AAMAS '08: Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 1349-1352, 2008.
- (2008) AAMAS '08: Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 1349-1352
- Tsao, Y.¹ Xiao, K.² Soo, V.³

134
- 70349346507
- Springer
- B. Turker, J. Leydold, and P. Stadler, Laplacian Eigenvectors of Graphs. Springer, 2007.
- (2007) Laplacian Eigenvectors of Graphs.
- Turker, B.¹ Leydold, J.² Stadler, P.³

135
- 0036782663
- Many-layered learning
- P. Utgoff and D. Stracuzzi, "Many-layered learning," Neural Computation, vol.14, pp. 2497-2529, 2002.
- (2002) Neural Computation , vol.14 , pp. 2497-2529
- Utgoff, P.¹ Stracuzzi, D.²

136
- 0003417587
- SIAM Press
- C. Van Loan, Computational Frameworks for the Fast Fourier Transform. SIAM Press, 1987.
- (1987) Computational Frameworks for the Fast Fourier Transform.
- Van Loan, C.¹

137
- 0002203498
- Approximation with Kronecker products
- Kluwer Publications
- C. Van Loan and N. Pitsianis, "Approximation with Kronecker products," in Linear Algebra for Large Scale and Real Time Applications, pp. 293-314, Kluwer Publications, 1993.
- (1993) Linear Algebra for Large Scale and Real Time Applications , pp. 293-314
- Van Loan, C.¹ Pitsianis, N.²

138
- 0003787427
- PhD thesis, MIT
- B. Van Roy, "Learning and value function approximation in complex decision processes," PhD thesis, MIT, 1998.
- (1998) Learning and Value Function Approximation in Complex Decision Processes
- Van Roy, B.¹

139
- 0003241883
- Spline models for observational data
- G. Wahba, "Spline models for observational data," Society for Industrial and Applied Mathematics, 1990.
- (1990) Society for Industrial and Applied Mathematics
- Wahba, G.¹

140
- 0004049893
- PhD thesis, King's College, Cambridge, England
- C. Watkins, "Learning from delayed rewards," PhD thesis, King's College, Cambridge, England, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

141
- 0012841228
- Successive matrix squaring algorithm for computing the Drazin inverse
- Y. Wei, "Successive matrix squaring algorithm for computing the Drazin inverse," Applied Mathematics and Computation, vol.108, pp. 67-75, 2000.
- (2000) Applied Mathematics and Computation , vol.108 , pp. 67-75
- Wei, Y.¹

142
- 0000350486
- Using the Nyström method to speed up Kernel machines
- C. Williams and M. Seeger, "Using the Nyström method to speed up Kernel machines," in Proceedings of the International Conference on Neural Information Processing Systems, pp. 682-688, 2000.
- (2000) Proceedings of the International Conference on Neural Information Processing Systems , pp. 682-688
- Williams, C.¹ Seeger, M.²

143
- 84918834208
- A reinforcement learning approach to job-shop scheduling
- W. Zhang and T. Dietterich, "A reinforcement learning approach to job-shop scheduling," in Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI), pp. 1114-1120, 1995.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI) , pp. 1114-1120
- Zhang, W.¹ Dietterich, T.²

144
- 26944473857
- PhD thesis, Carnegie Mellon University
- X. Zhou, "Semi-supervised learning with graphs," PhD thesis, Carnegie Mellon University, 2005.
- (2005) Semi-supervised Learning with Graphs
- Zhou, X.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.