-
2
-
-
0000913324
-
SVMTorch: Support vector machines for large-scale regression problems
-
Collobert, R., & Bengio, S. (2001). SVMTorch: Support vector machines for large-scale regression problems. Journal of Machine Learning Research (JMLR), 1, 143-160.
-
(2001)
Journal of Machine Learning Research (JMLR)
, vol.1
, pp. 143-160
-
-
Collobert, R.1
Bengio, S.2
-
5
-
-
84880898477
-
Max-norm projections for factored MDPs
-
Seattle, Washington: Morgan Kaufmann
-
Guestrin, C. E., Koller, D., & Parr, R. (2001). Max-norm projections for factored MDPs. Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01) (pp. 673-680). Seattle, Washington: Morgan Kaufmann.
-
(2001)
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01)
, pp. 673-680
-
-
Guestrin, C.E.1
Koller, D.2
Parr, R.3
-
6
-
-
0000439891
-
On the convergence of stochastic iterative dynamic programming algorithms
-
Jaakkola, T., Jordan, M., & Singh, S. (1994). On the convergence of stochastic iterative dynamic programming algorithms. Neural Computation, 6, 1185-1201.
-
(1994)
Neural Computation
, vol.6
, pp. 1185-1201
-
-
Jaakkola, T.1
Jordan, M.2
Singh, S.3
-
7
-
-
85153938292
-
Reinforcement learning algorithm for partially observable Markov decision problems
-
Cambridge, Massachusetts: MIT Press
-
Jaakkola, T., Singh, S. P., & Jordan, M. I. (1995). Reinforcement learning algorithm for partially observable Markov decision problems. Advances in Neural Information Processing Systems 7 (pp. 345-352). Cambridge, Massachusetts: MIT Press.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 345-352
-
-
Jaakkola, T.1
Singh, S.P.2
Jordan, M.I.3
-
10
-
-
84880649215
-
A sparse sampling algorithm for near-optimal planning in large Markov decision processes
-
Stockholm, Sweden: Morgan Kaufmann
-
Kearns, M., Mansour, Y., & Ng, A. Y. (1999). A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99) (pp. 1324-1331). Stockholm, Sweden: Morgan Kaufmann.
-
(1999)
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99)
, pp. 1324-1331
-
-
Kearns, M.1
Mansour, Y.2
Ng, A.Y.3
-
12
-
-
0141596576
-
Policy invariance under reward transformations: Theory and application to reward shaping
-
Morgan Kaufmann, San Francisco, CA
-
Ng, A. Y., Harada, D., & Russell, S. (1999). Policy invariance under reward transformations: Theory and application to reward shaping. Proc. 16th International Conf. on Machine Learning (pp. 278-287). Morgan Kaufmann, San Francisco, CA.
-
(1999)
Proc. 16th International Conf. on Machine Learning
, pp. 278-287
-
-
Ng, A.Y.1
Harada, D.2
Russell, S.3
-
16
-
-
0030082891
-
An approach to fuzzy control of nonlinear systems: Stability and design issues
-
Wang, H., Tanaka, K., & Griffin, M. (1996). An approach to fuzzy control of nonlinear systems: Stability and design issues. IEEE Transactions on Fuzzy Systems, 4, 14-23.
-
(1996)
IEEE Transactions on Fuzzy Systems
, vol.4
, pp. 14-23
-
-
Wang, H.1
Tanaka, K.2
Griffin, M.3
-
17
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Williams, R. J. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8, 229-256.
-
(1992)
Machine Learning
, vol.8
, pp. 229-256
-
-
Williams, R.J.1
-
18
-
-
13444310066
-
Inductive policy selection for first-order MDPs
-
Edmonton, Canada: Morgan Kaufmann
-
Yoon, S. W., Fern, A., & Givan, B. (2002). Inductive policy selection for first-order MDPs. Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI-02). Edmonton, Canada: Morgan Kaufmann.
-
(2002)
Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI-02)
-
-
Yoon, S.W.1
Fern, A.2
Givan, B.3