SCOPUS 정보 검색 플랫폼

Volumn 44, Issue 5, 1998, Pages 1926-1940

Structural risk minimization over data-dependent hierarchies

(4) Shawe Taylor, John a,b Bartlett, Peter L a,b Williamson, Robert C a,b Anthony, Martin a,b

b AUSTRALIAN NATIONAL UNIVERSITY (Australia)

Author keywords

Computational learning theory, fat shattering dimension, learning machines, maximal margin, probable smooth luckiness, probably approximately correct learning, support vector machines, uniform convergence, vapnik chervonenkis dimension

Indexed keywords

COMPUTATIONAL METHODS; CONVERGENCE OF NUMERICAL METHODS; ERROR ANALYSIS; LEARNING ALGORITHMS;

STRUCTURAL RISK MINIMIZATION (SRM); VAPNIK-CHERVONENSKI DIMENSION;

ARTIFICIAL INTELLIGENCE;

EID: 0032166068 PISSN: 00189448 EISSN: None Source Type: Journal
DOI: 10.1109/18.705570 Document Type: Article

Times cited : (414)

References (51)

1
- 0027802035
- "Scalesensitive dimensions, uniform convergence, and learnability," in
- 1993.
- N. Alon, S. Ben-David, N. Cesa-Bianchi, and D. Haussler, "Scalesensitive dimensions, uniform convergence, and learnability," in Proc. Conf. Foundations of Computer Science (FOCS), 1993.
- Proc. Conf. Foundations of Computer Science (FOCS)
- Alon, N.¹ Ben-David, S.² Cesa-Bianchi, N.³ Haussler, D.⁴

2
- 0031176507
- vol. 44, no.4, pp. 615-631, 1997.
- Also, J. Assoc. Comput. Mach, vol. 44, no.4, pp. 615-631, 1997.
- J. Assoc. Comput. Mach

3
- 84955561944
- "Function learning from interpolation," NeuroCOLT Tech. Rep. NC-TR-94-013 [Online] Available FTP: Ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports, an extended abstract appeared in
- (Lecture Notes in Artificial Intelligence, vol. 904, P. Vitanyi, Ed.) Berlin, Germany: Springer-Verlag, 1995, pp. 211-221.
- M. Anthony and P. Bartlett (1995), "Function learning from interpolation," NeuroCOLT Tech. Rep. NC-TR-94-013 [Online] Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports, (an extended abstract appeared in Computational Learning Theory, Proc. 2nd European Conf., EuroCOLT'95 (Lecture Notes in Artificial Intelligence, vol. 904, P. Vitanyi, Ed.) Berlin, Germany: Springer-Verlag, 1995, pp. 211-221.
- (1995) Computational Learning Theory, Proc. 2nd European Conf., EuroCOLT'95
- Anthony, M.¹ Bartlett, P.²

4
- 84967907292
- "The learnability of formal concepts," in
- Morgan Kaufmann, 1990, pp. 246-257.
- M. Anthony, N. Biggs, and J. Shawe-Taylor, "The learnability of formal concepts," in Proc. 3rdAnnu. Workshop Computational Learning Theory (Rochester) San Mateo, CA: Morgan Kaufmann, 1990, pp. 246-257.
- Proc. 3rdAnnu. Workshop Computational Learning Theory (Rochester) San Mateo, CA
- Anthony, M.¹ Biggs, N.² Shawe-Taylor, J.³

5
- 0041805140
- "A result of Vapnik with applications,"
- vol. 47, pp. 207-217, 1993.
- M. Anthony and J. Shawe-Taylor, "A result of Vapnik with applications," Discr. Appl Math., vol. 47, pp. 207-217, 1993.
- Discr. Appl Math.
- Anthony, M.¹ Shawe-Taylor, J.²

6
- 0038911768
- "A sufficient condition for polynomial distribution-dependent learnability,"
- vol. 77, pp. 1-12, 1997.
- _, "A sufficient condition for polynomial distribution-dependent learnability," Discr. Appl. Math., vol. 77, pp. 1-12, 1997.
- Discr. Appl. Math.

7
- 0001325515
- "Approximation and estimation bounds for artificial neural networks,"
- vol. 14, pp. 115-133, 1994.
- A. R. Barron, "Approximation and estimation bounds for artificial neural networks," Mach. Learning, vol. 14, pp. 115-133, 1994.
- Mach. Learning
- Barron, A.R.¹

8
- 0001347323
- "Complexity regularization with applications to artificial neural networks," in
- 1991, pp. 561-576.
- _, "Complexity regularization with applications to artificial neural networks," in Nonparametric Functional Estimation and Related Topics, G. Roussas, Ed. Boston, MA: Kluwer, 1991, pp. 561-576.
- Nonparametric Functional Estimation and Related Topics, G. Roussas, Ed. Boston, MA: Kluwer

9
- 0026190366
- "Minimum complexity density estimation,"
- vol. 37, pp. 1034-1054, 1738, 1991.
- A. R. Barron and T. M. Cover, "Minimum complexity density estimation," IEEE Trans. Inform. Theory, vol. 37, pp. 1034-1054, 1738, 1991.
- IEEE Trans. Inform. Theory
- Barron, A.R.¹ Cover, T.M.²

10
- 0032028728
- "The sample complexity of pattern classification with neural networks: The size of the weights is more important than the size of the network,"
- 1998, to be published.
- P. L. Bartlett, "The sample complexity of pattern classification with neural networks: The size of the weights is more important than the size of the network," IEEE Trans. Inform. Theory, 1998, to be published.
- IEEE Trans. Inform. Theory
- Bartlett, P.L.¹

11
- 0032046897
- Prediction, learning, uniform convergence, and scale-sensitive dimensions,"
- 1998.
- P. L. Bartlett and P. M. Long, "Prediction, learning, uniform convergence, and scale-sensitive dimensions," submitted to J. Comp. Syst. Scie. 1998.
- J. Comp. Syst. Scie.
- Bartlett, P.L.¹ Long, P.M.²

12
- 0030165580
- "Fat-shattering and the learnability of real-valued functions,"
- vol. 52, no. 3, pp. 434-452, 1996.
- P. L. Bartlett, P. M. Long, and R. C. Williamson, "Fat-shattering and the learnability of real-valued functions," J. Comp. Syst. Sci., vol. 52, no. 3, pp. 434-452, 1996.
- J. Comp. Syst. Sci.
- Bartlett, P.L.¹ Long, P.M.² Williamson, R.C.³

13
- 0028406575
- "Nonuniform learnability,"
- vol. 48, pp. 311-323, 1994.
- G. M. Benedek and A. Itai, "Nonuniform learnability," J. Comp. Syst. Scie., vol. 48, pp. 311-323, 1994.
- J. Comp. Syst. Scie.
- Benedek, G.M.¹ Itai, A.²

14
- 33747220117
- "Perceptron learning: The largest version space," in
- M. Biehl and M. Opper, "Perceptron learning: The largest version space," in Neural Networks: The Statistical Mechanics Perspective, Proc. CTP-PBSRI Workshop on Theoretical /Vrysics.World Scientific, also available at: http://brain.postech.ac.kr/nnsmp/compressed/biehl.ps.Z-l-.
- Neural Networks: the Statistical Mechanics Perspective, Proc. CTP-PBSRI Workshop on Theoretical /Vrysics.World Scientific, Also Available At: Http://brain.postech.ac.kr/nnsmp/compressed/biehl.ps.Z-l-.
- Biehl, M.¹ Opper, M.²

15
- 0026966646
- "A training algorithm for optimal margin classifiers," in
- B. E. Böser, I. M. Guyon, and V. N. Vapnik, "A training algorithm for optimal margin classifiers," in Proc. 5th Anna. Workshop Computational Learning Theory. Pittsburgh, PA: Assoc. Comput. Mach., 1992, pp. 144-152.
- Proc. 5th Anna. Workshop Computational Learning Theory. Pittsburgh, PA: Assoc. Comput. Mach., 1992, Pp. 144-152.
- Böser, B.E.¹ Guyon, I.M.² Vapnik, V.N.³

16
- 0030128623
- "Learning by canonical smooth estimation, Part I: Simultaneous estimation,"
- vol. 41, no. 4, p. 545, 1996.
- K. L. Buescher and P. R. Kumar, "Learning by canonical smooth estimation, Part I: Simultaneous estimation," IEEE Trans. Automat. Contr., vol. 41, no. 4, p. 545, 1996.
- IEEE Trans. Automat. Contr.
- Buescher, K.L.¹ Kumar, P.R.²

17
- 34249753618
- "Support-vector networks,"
- vol. 20, pp. 273-297, 1995.
- C. Cortes and V. Vapnik, "Support-vector networks," Mach. Learning, vol. 20, pp. 273-297, 1995.
- Mach. Learning
- Cortes, C.¹ Vapnik, V.²

18
- 84889281816
- T. M. Cover and J. Thomas, Elements of Information Theory. New York: Wiley, 1994.
- Elements of Information Theory. New York: Wiley, 1994.
- Cover, T.M.¹ Thomas, J.²

19
- 33747324869
- R. O. Duda and P. E. Hart, Pattern Classification and Scene Analysis. New York: Wiley, 1973.
- Pattern Classification and Scene Analysis. New York: Wiley, 1973.
- Duda, R.O.¹ Hart, P.E.²

20
- 0029521676
- "Sample compression, learnability, and the Vapnik-Chervonenkis Dimension,"
- vol. 21, pp. 269-304, 1995.
- S. Floyd and M. Warmuth, "Sample compression, learnability, and the Vapnik-Chervonenkis Dimension," Mach. Learning, vol. 21, pp. 269-304, 1995.
- Mach. Learning
- Floyd, S.¹ Warmuth, M.²

21
- 0001219859
- "Regularization theory and neural networks architecture,"
- vol. 7, pp. 219-269, 1995.
- F. Girosi, M. Jones, and T. Poggio, "Regularization theory and neural networks architecture," Neural Comp., vol. 7, pp. 219-269, 1995.
- Neural Comp.
- Girosi, F.¹ Jones, M.² Poggio, T.³

22
- 84955611223
- "Approximation and learning of convex superpositions," in
- 2nd European Conf., EuroCOLT'95 (Lecture Notes in Artificial Intelligence, vol. 904, P. Vitânyi, Ed.) Berlin, Germany: Springer-Verlag, 1995, pp. 222-236.
- L. Gurvits and P. Koiran, "Approximation and learning of convex superpositions," in Computational Learning Theory, Proc. 2nd European Conf., EuroCOLT'95 (Lecture Notes in Artificial Intelligence, vol. 904, P. Vitânyi, Ed.) Berlin, Germany: Springer-Verlag, 1995, pp. 222-236.
- Computational Learning Theory, Proc.
- Gurvits, L.¹ Koiran, P.²

23
- 0001704198
- "Structural risk minimization for character recognition,"
- J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 471-479.
- I. Guyon, V. N. Vapnik, B. E. Böser, L. Bottou, and S. A. Sofia, "Structural risk minimization for character recognition," in Advances in Neural Information Processing Systems 4, J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 471-479.
- Advances in Neural Information Processing Systems 4
- Guyon, I.¹ Vapnik, V.N.² Böser, B.E.³ Bottou, L.⁴ Sofia, S.A.⁵

24
- 33747236153
- M. H. Hassoun, Fundamentals of Artificial Neural Networks. Cambridge, MA: MIT Press, 1995.
- Fundamentals of Artificial Neural Networks. Cambridge, MA: MIT Press, 1995.
- Hassoun, M.H.¹

25
- 0002192516
- "Decision theoretic generalizations of the PAC model for neural net and other learning applications,"
- vol. 100, pp. 78-150, 1992.
- D. Haussler, "Decision theoretic generalizations of the PAC model for neural net and other learning applications," Inform. Comp., vol. 100, pp. 78-150, 1992.
- Inform. Comp.
- Haussler, D.¹

26
- 33747297557
- D. Home, The Lucky Country: Australia in the Sixties. Victoria: Penguin Books, Ringwood, 1964.
- The Lucky Country: Australia in the Sixties. Victoria: Penguin Books, Ringwood, 1964.
- Home, D.¹

27
- 0025794545
- "Efficient distribution-free learning of probabilistic concepts," in
- M. J. Kearns and R. E. Schapire, "Efficient distribution-free learning of probabilistic concepts," in Proc. 31st Symp. Foundations of Computer Science. Los Alamitos, CA: IEEE Computer Soc., 1990, pp. 382-391.
- Proc. 31st Symp. Foundations of Computer Science. Los Alamitos, CA: IEEE Computer Soc., 1990, Pp. 382-391.
- Kearns, M.J.¹ Schapire, R.E.²

28
- 33747213454
- "Neural networks with quadratic VC dimension," in
- 7 (NIPS95) and to appear in J. Comp. Syst. Sci.; also available as a NeuroCOLT Tech. Rep. NC-TR-95-044 [Online]: Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports.
- P. Koiran and E. D. Sontag, "Neural networks with quadratic VC dimension," in Neural Information Processing Systems 7 (NIPS95) and to appear in J. Comp. Syst. Sci.; also available as a NeuroCOLT Tech. Rep. NC-TR-95-044 [Online]: Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports.
- Neural Information Processing Systems
- Koiran, P.¹ Sontag, E.D.²

29
- 0030128525
- "Learning by canonical smooth estimation, Part 2: Learning and choice of model complexity,"
- vol. 41, no. 4, p. 557, 1996.
- P. R. Kumar and K. L. Buescher, "Learning by canonical smooth estimation, Part 2: Learning and choice of model complexity," IEEE Trans. Automat. Contr., vol. 41, no. 4, p. 557, 1996.
- IEEE Trans. Automat. Contr.
- Kumar, P.R.¹ Buescher, K.L.²

30
- 0025742422
- "Results on learnability and the Vapnik-Chervonenkis dimension,"
- vol. 90, pp. 33-49, 1991.
- N. Linial, Y. Mansour, and R. L. Rivest, "Results on learnability and the Vapnik-Chervonenkis dimension," Inform. Comp., vol. 90, pp. 33-49, 1991.
- Inform. Comp.
- Linial, N.¹ Mansour, Y.² Rivest, R.L.³

31
- 34250091945
- "Learning quickly when irrelevant attributes abound: A new linear threshold algorithm,"
- vol. 2, pp. 285-318, 1988.
- N. Littlestone, "Learning quickly when irrelevant attributes abound: A new linear threshold algorithm," Mach. Learning, vol. 2, pp. 285-318, 1988.
- Mach. Learning
- Littlestone, N.¹

32
- 33747278443
- _, "Mistake-driven bayes sports: Bounds for symmetric Apobayesian learning algorithms," Tech. Rep., NEC Res. Ctr.,1996.
- "Mistake-driven bayes sports: Bounds for symmetric Apobayesian learning algorithms," Tech. Rep., NEC Res. Ctr.,1996

33
- 33747318263
- 1996, preprint.
- N. Littlestone and C. Mesterham, "An Apobayesian relative of winnow," NEC Res. Ctr., 1996, preprint.
- "An Apobayesian relative of winnow," NEC Res. Ctr
- Littlestone, N.¹ Mesterham, C.²

34
- 0041995194
- 1986, unpublished manuscript.
- N. Littlestone and M. Warmuth, "Relating data compression and learnability," Univ. Calif., Santa Cruz, 1986, unpublished manuscript.
- "Relating data compression and learnability," Univ. Calif., Santa Cruz
- Littlestone, N.¹ Warmuth, M.²

35
- 33646932607
- L. Ljung, System Identification: Theory for the User. Upper Saddle River, NJ: Prentice-Hall PTR, 1987.
- System Identification: Theory for the User. Upper Saddle River, NJ: Prentice-Hall PTR, 1987.
- Ljung, L.¹

36
- 33747220777
- 1996, preprint.
- G. Lugosi and A. B. Nobel, "Adaptive model selection using empirical complexities," Dept. Math. Comp. Sci., Tech. Univ. Budapest, Budapest, Hungary, 1996, preprint.
- "Adaptive model selection using empirical complexities," Dept. Math. Comp. Sci., Tech. Univ. Budapest, Budapest, Hungary
- Lugosi, G.¹ Nobel, A.B.²

37
- 0030398632
- "A data-dependent skeleton estimate for learning," in
- G. Lugosi and M. Pintér, "A data-dependent skeleton estimate for learning," in Proc. 9th Annu. Workshop Computational Learning Theory. New York: Assoc. Comput. Mach., 1996, pp. 51-56.
- Proc. 9th Annu. Workshop Computational Learning Theory. New York: Assoc. Comput. Mach., 1996, Pp. 51-56.
- Lugosi, G.¹ Pintér, M.²

38
- 0029307575
- "Nonparametric estimation via empirical risk minimization,"
- vol. 41, pp. 677-687, May 1995.
- G. Lugosi and K. Zeger, "Nonparametric estimation via empirical risk minimization," IEEE Trans. Inform. Theory, vol. 41, pp. 677-687, May 1995.
- IEEE Trans. Inform. Theory
- Lugosi, G.¹ Zeger, K.²

39
- 0029754587
- "Concept learning using complexity regularization,"
- vol. 42, pp. 48-54, Jan. 1996.
- _, "Concept learning using complexity regularization," IEEE Trans. Inform. Theory, vol. 42, pp. 48-54, Jan. 1996.
- IEEE Trans. Inform. Theory

40
- 0000372206
- "Bayesian model comparison and backprop nets,"
- J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 839-846.
- D. J. C. MacKay, "Bayesian model comparison and backprop nets," in Adv. Neural Inform. Processing Syst. 4, J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 839-846.
- Adv. Neural Inform. Processing Syst. 4
- MacKay, D.J.C.¹

41
- 33747313245
- 1996, preprint.
- _, "Probable networks and plausible predictions-A review of practical Bayesian methods for supervised neural networks," Cavendish Laboratory, Cambridge Univ., Cambridge, U.K., 1996, preprint.
- "Probable networks and plausible predictions-A review of practical Bayesian methods for supervised neural networks," Cavendish Laboratory, Cambridge Univ., Cambridge, U.K

42
- 33747251594
- D. Pollard, Convergence of Stochastic Processes. New York: Springer, 1984.
- Convergence of Stochastic Processes. New York: Springer, 1984.
- Pollard, D.¹

43
- 38249005514
- "Bounding sample size with the Vapnik-Chervonenkis dimension,"
- vol. 42, pp. 65-73, 1993.
- J. Shawe-Taylor, M. Anthony, and N. Biggs, "Bounding sample size with the Vapnik-Chervonenkis dimension," Discr. Appl. Math., vol. 42, pp. 65-73, 1993.
- Discr. Appl. Math.
- Shawe-Taylor, J.¹ Anthony, M.² Biggs, N.³

44
- 0030381996
- "A framework for structural risk minimization," in
- J. Shawe-Taylor, P. Bartlett, R. Williamson, and M. Anthony, "A framework for structural risk minimization," in Proc. 9th Annu. Conf. Computational Learning Theory. New York: Assoc. Comput. Mach., 1996, pp. 68-76.
- Proc. 9th Annu. Conf. Computational Learning Theory. New York: Assoc. Comput. Mach., 1996, Pp. 68-76.
- Shawe-Taylor, J.¹ Bartlett, P.² Williamson, R.³ Anthony, M.⁴

45
- 33747230456
- "Shattering all sets of
- 1/2 parameters," Rutgers Center for Systems and Control (SYCON), Rep. 96-01; also NeuroCOLT Tech. Rep. NC-TR-96-042. [Online] Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports.
- E. D. Sontag, "Shattering all sets of k points in 'general position' requires (k -1)/2 parameters," Rutgers Center for Systems and Control (SYCON), Rep. 96-01; also NeuroCOLT Tech. Rep. NC-TR-96-042. [Online] Available FTP: ftp.dcs.rhbnc.ac.uk/pub/neurocolt/tech_reports).
- K Points in 'General Position' Requires
- Sontag, E.D.¹

46
- 0003679019
- A. W. van der Vaart and J. A. Wellner, Weak Convergence and Empirical Processes. New York: Springer-Verlag, 1996.
- Weak Convergence and Empirical Processes. New York: Springer-Verlag, 1996.
- Van Der Vaart, A.W.¹ Wellner, J.A.²

47
- 0003969585
- V. N. Vapnik, Estimation of Dependences Based on Empirical Data. New York: Springer-Verlag, 1982.
- Estimation of Dependences Based on Empirical Data. New York: Springer-Verlag, 1982.
- Vapnik, V.N.¹

48
- 0040864988
- "Principles of risk minimization for learning theory
- J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 831-838.
- _, "Principles of risk minimization for learning theory," in Advances in Neural Information Processing Systems 4, J. E. Moody et al., Eds. San Mateo, CA: Morgan Kaufmann, 1992, pp. 831-838.
- Advances in Neural Information Processing Systems 4

49
- 33747301780
- _, The Nature of Statistical Learning Theory. New York: Springer-Verlag, 1995.
- The Nature of Statistical Learning Theory. New York: Springer-Verlag, 1995.

50
- 0001024505
- "On the uniform convergence of relative frequencies of events to their probabilities,"
- vol. 16, pp. 264-280, 1971.
- V. N. Vapnik and A. J. Chervonenkis, "On the uniform convergence of relative frequencies of events to their probabilities," Theory of Probability and Applications, vol. 16, pp. 264-280, 1971.
- Theory of Probability and Applications
- Vapnik, V.N.¹ Chervonenkis, A.J.²

51
- 33747303882
- "Ordered risk minimization (I and II),"
- vol. 34, pp. 1226-1235 and 1403-1412, 1974.
- _, "Ordered risk minimization (I and II)," Automat. Remote Contr., vol. 34, pp. 1226-1235 and 1403-1412, 1974.
- Automat. Remote Contr.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.