-
1
-
-
0027802035
-
"Scale-sensitive dimensions, uniform convergence, and learnability," in
-
1993.
-
N. Alon, S. Ben-David, N. Cesa-Bianchi, and D. Haussler, "Scale-sensitive dimensions, uniform convergence, and learnability," in Proc. Conf. Foundations of Computer Science (FOCS), 1993.
-
Proc. Conf. Foundations of Computer Science (FOCS)
-
-
Alon, N.1
Ben-David, S.2
Cesa-Bianchi, N.3
Haussler, D.4
-
2
-
-
0031176507
-
-
vol. 44, no. 4, pp. 615-631, 1997.
-
Also, J. Assoc. Comput. Mach., vol. 44, no. 4, pp. 615-631, 1997.
-
J. Assoc. Comput. Mach.
-
-
-
3
-
-
84955561944
-
"Function learning from interpolation," NeuroCOLT Tech. Rep. NC-TR-94-013 [Online] Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports, an extended abstract appeared in
-
(Lecture Notes in Artificial Intelligence, vol. 904, P. Vitányi, Ed.) Berlin, Germany: Springer-Verlag, 1995, pp. 211-221.
-
M. Anthony and P. Bartlett (1995), "Function learning from interpolation," NeuroCOLT Tech. Rep. NC-TR-94-013 [Online] Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports (an extended abstract appeared in Computational Learning Theory, Proc. 2nd European Conf., EuroCOLT'95 (Lecture Notes in Artificial Intelligence, vol. 904, P. Vitányi, Ed.) Berlin, Germany: Springer-Verlag, 1995, pp. 211-221).
-
(1995)
Computational Learning Theory, Proc. 2nd European Conf., EuroCOLT'95
-
-
Anthony, M.1
Bartlett, P.2
-
4
-
-
84967907292
-
"The learnability of formal concepts," in
-
Morgan Kaufmann, 1990, pp. 246-257.
-
M. Anthony, N. Biggs, and J. Shawe-Taylor, "The learnability of formal concepts," in Proc. 3rd Annu. Workshop Computational Learning Theory (Rochester). San Mateo, CA: Morgan Kaufmann, 1990, pp. 246-257.
-
Proc. 3rd Annu. Workshop Computational Learning Theory (Rochester). San Mateo, CA
-
-
Anthony, M.1
Biggs, N.2
Shawe-Taylor, J.3
-
5
-
-
0041805140
-
"A result of Vapnik with applications,"
-
vol. 47, pp. 207-217, 1993.
-
M. Anthony and J. Shawe-Taylor, "A result of Vapnik with applications," Discr. Appl. Math., vol. 47, pp. 207-217, 1993.
-
Discr. Appl. Math.
-
-
Anthony, M.1
Shawe-Taylor, J.2
-
6
-
-
0038911768
-
"A sufficient condition for polynomial distribution-dependent learnability,"
-
vol. 77, pp. 1-12, 1997.
-
_, "A sufficient condition for polynomial distribution-dependent learnability," Discr. Appl. Math., vol. 77, pp. 1-12, 1997.
-
Discr. Appl. Math.
-
-
-
7
-
-
0001325515
-
"Approximation and estimation bounds for artificial neural networks,"
-
vol. 14, pp. 115-133, 1994.
-
A. R. Barron, "Approximation and estimation bounds for artificial neural networks," Mach. Learning, vol. 14, pp. 115-133, 1994.
-
Mach. Learning
-
-
Barron, A.R.1
-
8
-
-
0001347323
-
"Complexity regularization with applications to artificial neural networks," in
-
1991, pp. 561-576.
-
_, "Complexity regularization with applications to artificial neural networks," in Nonparametric Functional Estimation and Related Topics, G. Roussas, Ed. Boston, MA: Kluwer, 1991, pp. 561-576.
-
Nonparametric Functional Estimation and Related Topics, G. Roussas, Ed. Boston, MA: Kluwer
-
-
-
9
-
-
0026190366
-
"Minimum complexity density estimation,"
-
vol. 37, pp. 1034-1054, 1738, 1991.
-
A. R. Barron and T. M. Cover, "Minimum complexity density estimation," IEEE Trans. Inform. Theory, vol. 37, pp. 1034-1054, 1738, 1991.
-
IEEE Trans. Inform. Theory
-
-
Barron, A.R.1
Cover, T.M.2
-
10
-
-
0032028728
-
"The sample complexity of pattern classification with neural networks: The size of the weights is more important than the size of the network,"
-
1998, to be published.
-
P. L. Bartlett, "The sample complexity of pattern classification with neural networks: The size of the weights is more important than the size of the network," IEEE Trans. Inform. Theory, 1998, to be published.
-
IEEE Trans. Inform. Theory
-
-
Bartlett, P.L.1
-
11
-
-
0032046897
-
"Prediction, learning, uniform convergence, and scale-sensitive dimensions,"
-
1998.
-
P. L. Bartlett and P. M. Long, "Prediction, learning, uniform convergence, and scale-sensitive dimensions," submitted to J. Comp. Syst. Sci., 1998.
-
J. Comp. Syst. Sci.
-
-
Bartlett, P.L.1
Long, P.M.2
-
12
-
-
0030165580
-
"Fat-shattering and the learnability of real-valued functions,"
-
vol. 52, no. 3, pp. 434-452, 1996.
-
P. L. Bartlett, P. M. Long, and R. C. Williamson, "Fat-shattering and the learnability of real-valued functions," J. Comp. Syst. Sci., vol. 52, no. 3, pp. 434-452, 1996.
-
J. Comp. Syst. Sci.
-
-
Bartlett, P.L.1
Long, P.M.2
Williamson, R.C.3
-
13
-
-
0028406575
-
"Nonuniform learnability,"
-
vol. 48, pp. 311-323, 1994.
-
G. M. Benedek and A. Itai, "Nonuniform learnability," J. Comp. Syst. Sci., vol. 48, pp. 311-323, 1994.
-
J. Comp. Syst. Sci.
-
-
Benedek, G.M.1
Itai, A.2
-
15
-
-
0026966646
-
"A training algorithm for optimal margin classifiers," in
-
B. E. Boser, I. M. Guyon, and V. N. Vapnik, "A training algorithm for optimal margin classifiers," in Proc. 5th Annu. Workshop Computational Learning Theory. Pittsburgh, PA: Assoc. Comput. Mach., 1992, pp. 144-152.
-
Proc. 5th Annu. Workshop Computational Learning Theory. Pittsburgh, PA: Assoc. Comput. Mach., 1992, pp. 144-152.
-
-
Boser, B.E.1
Guyon, I.M.2
Vapnik, V.N.3
-
16
-
-
0030128623
-
"Learning by canonical smooth estimation, Part I: Simultaneous estimation,"
-
vol. 41, no. 4, p. 545, 1996.
-
K. L. Buescher and P. R. Kumar, "Learning by canonical smooth estimation, Part I: Simultaneous estimation," IEEE Trans. Automat. Contr., vol. 41, no. 4, p. 545, 1996.
-
IEEE Trans. Automat. Contr.
-
-
Buescher, K.L.1
Kumar, P.R.2
-
17
-
-
34249753618
-
"Support-vector networks,"
-
vol. 20, pp. 273-297, 1995.
-
C. Cortes and V. Vapnik, "Support-vector networks," Mach. Learning, vol. 20, pp. 273-297, 1995.
-
Mach. Learning
-
-
Cortes, C.1
Vapnik, V.2
-
20
-
-
0029521676
-
"Sample compression, learnability, and the Vapnik-Chervonenkis Dimension,"
-
vol. 21, pp. 269-304, 1995.
-
S. Floyd and M. Warmuth, "Sample compression, learnability, and the Vapnik-Chervonenkis Dimension," Mach. Learning, vol. 21, pp. 269-304, 1995.
-
Mach. Learning
-
-
Floyd, S.1
Warmuth, M.2
-
21
-
-
0001219859
-
"Regularization theory and neural networks architecture,"
-
vol. 7, pp. 219-269, 1995.
-
F. Girosi, M. Jones, and T. Poggio, "Regularization theory and neural networks architecture," Neural Comp., vol. 7, pp. 219-269, 1995.
-
Neural Comp.
-
-
Girosi, F.1
Jones, M.2
Poggio, T.3
-
22
-
-
84955611223
-
"Approximation and learning of convex superpositions," in
-
2nd European Conf., EuroCOLT'95 (Lecture Notes in Artificial Intelligence, vol. 904, P. Vitányi, Ed.) Berlin, Germany: Springer-Verlag, 1995, pp. 222-236.
-
L. Gurvits and P. Koiran, "Approximation and learning of convex superpositions," in Computational Learning Theory, Proc. 2nd European Conf., EuroCOLT'95 (Lecture Notes in Artificial Intelligence, vol. 904, P. Vitányi, Ed.) Berlin, Germany: Springer-Verlag, 1995, pp. 222-236.
-
Computational Learning Theory, Proc.
-
-
Gurvits, L.1
Koiran, P.2
-
23
-
-
0001704198
-
"Structural risk minimization for character recognition,"
-
J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 471-479.
-
I. Guyon, V. N. Vapnik, B. E. Boser, L. Bottou, and S. A. Solla, "Structural risk minimization for character recognition," in Advances in Neural Information Processing Systems 4, J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 471-479.
-
Advances in Neural Information Processing Systems 4
-
-
Guyon, I.1
Vapnik, V.N.2
Boser, B.E.3
Bottou, L.4
Solla, S.A.5
-
25
-
-
0002192516
-
"Decision theoretic generalizations of the PAC model for neural net and other learning applications,"
-
vol. 100, pp. 78-150, 1992.
-
D. Haussler, "Decision theoretic generalizations of the PAC model for neural net and other learning applications," Inform. Comp., vol. 100, pp. 78-150, 1992.
-
Inform. Comp.
-
-
Haussler, D.1
-
27
-
-
0025794545
-
"Efficient distribution-free learning of probabilistic concepts," in
-
M. J. Kearns and R. E. Schapire, "Efficient distribution-free learning of probabilistic concepts," in Proc. 31st Symp. Foundations of Computer Science. Los Alamitos, CA: IEEE Computer Soc., 1990, pp. 382-391.
-
Proc. 31st Symp. Foundations of Computer Science. Los Alamitos, CA: IEEE Computer Soc., 1990, pp. 382-391.
-
-
Kearns, M.J.1
Schapire, R.E.2
-
28
-
-
33747213454
-
"Neural networks with quadratic VC dimension," in
-
7 (NIPS95) and to appear in J. Comp. Syst. Sci.; also available as a NeuroCOLT Tech. Rep. NC-TR-95-044 [Online]: Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports.
-
P. Koiran and E. D. Sontag, "Neural networks with quadratic VC dimension," in Neural Information Processing Systems 7 (NIPS95) and to appear in J. Comp. Syst. Sci.; also available as a NeuroCOLT Tech. Rep. NC-TR-95-044 [Online]: Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports.
-
Neural Information Processing Systems
-
-
Koiran, P.1
Sontag, E.D.2
-
29
-
-
0030128525
-
"Learning by canonical smooth estimation, Part 2: Learning and choice of model complexity,"
-
vol. 41, no. 4, p. 557, 1996.
-
P. R. Kumar and K. L. Buescher, "Learning by canonical smooth estimation, Part 2: Learning and choice of model complexity," IEEE Trans. Automat. Contr., vol. 41, no. 4, p. 557, 1996.
-
IEEE Trans. Automat. Contr.
-
-
Kumar, P.R.1
Buescher, K.L.2
-
30
-
-
0025742422
-
"Results on learnability and the Vapnik-Chervonenkis dimension,"
-
vol. 90, pp. 33-49, 1991.
-
N. Linial, Y. Mansour, and R. L. Rivest, "Results on learnability and the Vapnik-Chervonenkis dimension," Inform. Comp., vol. 90, pp. 33-49, 1991.
-
Inform. Comp.
-
-
Linial, N.1
Mansour, Y.2
Rivest, R.L.3
-
31
-
-
34250091945
-
"Learning quickly when irrelevant attributes abound: A new linear threshold algorithm,"
-
vol. 2, pp. 285-318, 1988.
-
N. Littlestone, "Learning quickly when irrelevant attributes abound: A new linear threshold algorithm," Mach. Learning, vol. 2, pp. 285-318, 1988.
-
Mach. Learning
-
-
Littlestone, N.1
-
34
-
-
0041995194
-
-
1986, unpublished manuscript.
-
N. Littlestone and M. Warmuth, "Relating data compression and learnability," Univ. Calif., Santa Cruz, 1986, unpublished manuscript.
-
"Relating data compression and learnability," Univ. Calif., Santa Cruz
-
-
Littlestone, N.1
Warmuth, M.2
-
36
-
-
33747220777
-
-
1996, preprint.
-
G. Lugosi and A. B. Nobel, "Adaptive model selection using empirical complexities," Dept. Math. Comp. Sci., Tech. Univ. Budapest, Budapest, Hungary, 1996, preprint.
-
"Adaptive model selection using empirical complexities," Dept. Math. Comp. Sci., Tech. Univ. Budapest, Budapest, Hungary
-
-
Lugosi, G.1
Nobel, A.B.2
-
38
-
-
0029307575
-
"Nonparametric estimation via empirical risk minimization,"
-
vol. 41, pp. 677-687, May 1995.
-
G. Lugosi and K. Zeger, "Nonparametric estimation via empirical risk minimization," IEEE Trans. Inform. Theory, vol. 41, pp. 677-687, May 1995.
-
IEEE Trans. Inform. Theory
-
-
Lugosi, G.1
Zeger, K.2
-
39
-
-
0029754587
-
"Concept learning using complexity regularization,"
-
vol. 42, pp. 48-54, Jan. 1996.
-
_, "Concept learning using complexity regularization," IEEE Trans. Inform. Theory, vol. 42, pp. 48-54, Jan. 1996.
-
IEEE Trans. Inform. Theory
-
-
-
40
-
-
0000372206
-
"Bayesian model comparison and backprop nets,"
-
J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 839-846.
-
D. J. C. MacKay, "Bayesian model comparison and backprop nets," in Adv. Neural Inform. Processing Syst. 4, J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 839-846.
-
Adv. Neural Inform. Processing Syst. 4
-
-
MacKay, D.J.C.1
-
41
-
-
33747313245
-
-
1996, preprint.
-
_, "Probable networks and plausible predictions-A review of practical Bayesian methods for supervised neural networks," Cavendish Laboratory, Cambridge Univ., Cambridge, U.K., 1996, preprint.
-
"Probable networks and plausible predictions-A review of practical Bayesian methods for supervised neural networks," Cavendish Laboratory, Cambridge Univ., Cambridge, U.K
-
-
-
43
-
-
38249005514
-
"Bounding sample size with the Vapnik-Chervonenkis dimension,"
-
vol. 42, pp. 65-73, 1993.
-
J. Shawe-Taylor, M. Anthony, and N. Biggs, "Bounding sample size with the Vapnik-Chervonenkis dimension," Discr. Appl. Math., vol. 42, pp. 65-73, 1993.
-
Discr. Appl. Math.
-
-
Shawe-Taylor, J.1
Anthony, M.2
Biggs, N.3
-
44
-
-
0030381996
-
"A framework for structural risk minimization," in
-
J. Shawe-Taylor, P. Bartlett, R. Williamson, and M. Anthony, "A framework for structural risk minimization," in Proc. 9th Annu. Conf. Computational Learning Theory. New York: Assoc. Comput. Mach., 1996, pp. 68-76.
-
Proc. 9th Annu. Conf. Computational Learning Theory. New York: Assoc. Comput. Mach., 1996, pp. 68-76.
-
-
Shawe-Taylor, J.1
Bartlett, P.2
Williamson, R.3
Anthony, M.4
-
45
-
-
33747230456
-
"Shattering all sets of
-
1/2 parameters," Rutgers Center for Systems and Control (SYCON), Rep. 96-01; also NeuroCOLT Tech. Rep. NC-TR-96-042. [Online] Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports.
-
E. D. Sontag, "Shattering all sets of k points in 'general position' requires (k-1)/2 parameters," Rutgers Center for Systems and Control (SYCON), Rep. 96-01; also NeuroCOLT Tech. Rep. NC-TR-96-042. [Online] Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports.
-
K Points in 'General Position' Requires
-
-
Sontag, E.D.1
-
48
-
-
0040864988
-
"Principles of risk minimization for learning theory," in
-
J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 831-838.
-
_, "Principles of risk minimization for learning theory," in Advances in Neural Information Processing Systems 4, J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 831-838.
-
Advances in Neural Information Processing Systems 4
-
-
-
50
-
-
0001024505
-
"On the uniform convergence of relative frequencies of events to their probabilities,"
-
vol. 16, pp. 264-280, 1971.
-
V. N. Vapnik and A. J. Chervonenkis, "On the uniform convergence of relative frequencies of events to their probabilities," Theory of Probability and Applications, vol. 16, pp. 264-280, 1971.
-
Theory of Probability and Applications
-
-
Vapnik, V.N.1
Chervonenkis, A.J.2
-
51
-
-
33747303882
-
"Ordered risk minimization (I and II),"
-
vol. 34, pp. 1226-1235 and 1403-1412, 1974.
-
_, "Ordered risk minimization (I and II)," Automat. Remote Contr., vol. 34, pp. 1226-1235 and 1403-1412, 1974.
-
Automat. Remote Contr.
-
-