메뉴 건너뛰기




Volumn 13, Issue 3, 1991, Pages 252-264

Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners

Author keywords

Classification error; classifier design; curse of dimensionality; feature selection; statistical pattern recognition; test samples; training samples

Indexed keywords

IMAGE PROCESSING - IMAGE ANALYSIS; MATHEMATICAL STATISTICS; PROBABILITY;

EID: 0026120032     PISSN: 01628828     EISSN: None     Source Type: Journal    
DOI: 10.1109/34.75512     Document Type: Article
Times cited : (1088)

References (58)
  • 1
    • 18244374302 scopus 로고
    • Unbiased estimators and classification problems for multivariate normal populations
    • (in Russian).
    • R.A. Abusev and Y.P. Lumelskij, “Unbiased estimators and classification problems for multivariate normal populations”, Theor. Prob. and Appl., vol. 25, pp. 381–389, 1980 (in Russian).
    • (1980) Theor. Prob. and Appl. , vol.25 , pp. 381-389
    • Abusev, R.A.1    Lumelskij, Y.P.2
  • 4
    • 70350346892 scopus 로고
    • Use of distance measures, information measures and error bounds in feature evaluation
    • P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland
    • M. Ben-Bassat, “Use of distance measures, information measures and error bounds in feature evaluation”, in Handbook of Statistics, vol. 2, P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland, 1982, pp. 773–791.
    • (1982) Handbook of Statistics , vol.2 , pp. 773-791
    • Ben-Bassat, M.1
  • 6
    • 3042547134 scopus 로고
    • Nonparametric Classification
    • P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland
    • Y. D. Broffitt, “Nonparametric Classification”, in Handbook of Statistics, vol. 2, P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland, 1982, pp. 139–168.
    • (1982) Handbook of Statistics , vol.2 , pp. 139-168
    • Broffitt, Y.D.1
  • 8
    • 0003556272 scopus 로고
    • Nearest neighbor methods in discrimination
    • P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland
    • L. Devroye and T. J. Wagner, “Nearest neighbor methods in discrimination”, in Handbook of Statistics, vol. 2, P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland, 1982, pp. 193–198.
    • (1982) Handbook of Statistics , vol.2 , pp. 193-198
    • Devroye, L.1    Wagner, T.J.2
  • 10
    • 84886193637 scopus 로고
    • The efficiency of logistic regression compared to normal discriminant analysis
    • B. Efron, “The efficiency of logistic regression compared to normal discriminant analysis”, J. Amer. Statist. Assoc., vol. 70, pp. 892–898, 1975.
    • (1975) J. Amer. Statist. Assoc. , vol.70 , pp. 892-898
    • Efron, B.1
  • 11
    • 84939696855 scopus 로고
    • A choice of a set of measurements with maximal discriminating power in the case of limited learning sample size
    • Moscow, USSR: Nauka, (in Russian).
    • I. S. Enukov, “A choice of a set of measurements with maximal discriminating power in the case of limited learning sample size”, in Multivariate Statistical Analysis in Social-Economic Research. Moscow, USSR: Nauka, 1974, pp. 394–397 (in Russian).
    • (1974) Multivariate Statistical Analysis in Social-Economic Research , pp. 394-397
    • Enukov, I.S.1
  • 12
    • 0015395891 scopus 로고
    • Considerations of sample and feature size
    • D. M. Foley, “Considerations of sample and feature size”, IEEE Trans. Inform. Theory, vol. IT-18, pp. 618–626, 1972.
    • (1972) IEEE Trans. Inform. Theory , vol.IT-18 , pp. 618-626
    • Foley, D.M.1
  • 13
    • 0022831637 scopus 로고
    • Statistical pattern recognition
    • T. Y. Young and K. S. Fu, Eds. New York: Academic
    • K. Fukunaga, “Statistical pattern recognition”, in Handbook of Pattern Recognition and Image Processing, T. Y. Young and K. S. Fu, Eds. New York: Academic, 1986, pp. 3–32.
    • (1986) Handbook of Pattern Recognition and Image Processing , pp. 3-32
    • Fukunaga, K.1
  • 14
    • 27644572056 scopus 로고
    • Optimization of K-nearest-neighbor density estimates
    • K. Fukunaga and L. D. Hostetler, “Optimization of K-nearest-neighbor density estimates”, IEEE Trans. Inform. Theory, vol. IT-19, pp. 320–326, 1973.
    • (1973) IEEE Trans. Inform. Theory , vol.IT-19 , pp. 320-326
    • Fukunaga, K.1    Hostetler, L.D.2
  • 15
    • 0042167947 scopus 로고
    • Posterior odds for multivariate normal classifications
    • S. Geiser, “Posterior odds for multivariate normal classifications”, J. Roy. Statist. Soc. B, vol. 21, no. 1, pp. 69 -76, 1964.
    • (1964) J. Roy. Statist. Soc. B , vol.21 , Issue.1 , pp. 69-76
    • Geiser, S.1
  • 16
    • 0018179125 scopus 로고
    • Additive estimators for probabilities of correct classification
    • N. Glick, “Additive estimators for probabilities of correct classification”, Pattern Recog., vol. 10, no. 3, pp. 211–222, 1978.
    • (1978) Pattern Recog. , vol.10 , Issue.3 , pp. 211-222
    • Glick, N.1
  • 18
    • 84942217817 scopus 로고
    • Acad. Sci., Lithuania, personal communication
    • V. Grabauskas, Inst. Math. Cybern., Acad. Sci., Lithuania, personal communication, 1983.
    • (1983) Inst. Math. Cybern.
    • Grabauskas, V.1
  • 19
    • 0006858967 scopus 로고
    • On the expected probability of the classification error of the classifier for discrete variables
    • Ed. Vilnius, USSR: Inst. Math. Cybern. Press, (in Russian).
    • D. Griškevičius and Š. Raudys, “On the expected probability of the classification error of the classifier for discrete variables”, in Statistical Problems of Control, issue 38, Š. Raudys, Ed. Vilnius, USSR: Inst. Math. Cybern. Press, 1979, pp. 95–112 (in Russian).
    • (1979) Statistical Problems of Control, issue , Issue.38 , pp. 95-112
    • Griškevičius, D.1    Raudys, Š.2
  • 20
    • 0001020401 scopus 로고
    • Recent advances in error rate estimation
    • D.J. Hand, “Recent advances in error rate estimation”, Pattern Recog. Lett., vol. 5, pp. 335–346, 1986.
    • (1986) Pattern Recog. Lett. , vol.5 , pp. 335-346
    • Hand, D.J.1
  • 22
    • 63249112814 scopus 로고
    • Dimensionality and sample size considerations in pattern recognition practice
    • P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands, North-Holland
    • A. K. Jain and B. Chandrasekaran, “Dimensionality and sample size considerations in pattern recognition practice”, in Handbook of Statistics, vol. 2, P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands, North-Holland, 1982, pp. 835–855.
    • (1982) Handbook of Statistics , vol.2 , pp. 835-855
    • Jain, A.K.1    Chandrasekaran, B.2
  • 23
    • 85012675241 scopus 로고
    • Classifier design with Parzen windows
    • E. S. Gelsema and L. N. Kanal, Eds. Amsterdam, The Netherlands: Elsevier
    • A. K. Jain and M. D. Ramaswami, “Classifier design with Parzen windows”, in Pattern Recognition and Artificial Intelligence, E. S. Gelsema and L. N. Kanal, Eds. Amsterdam, The Netherlands: Elsevier, 1988, pp. 211–228.
    • (1988) Pattern Recognition and Artificial Intelligence , pp. 211-228
    • Jain, A.K.1    Ramaswami, M.D.2
  • 24
    • 0018253340 scopus 로고
    • On the optimal number of features in the classification of multivariate Gaussian data
    • A. K. Jain and W. G. Waller, “On the optimal number of features in the classification of multivariate Gaussian data”, Pattern Recog., vol. 10, pp. 365–374, 1978.
    • (1978) Pattern Recog. , vol.10 , pp. 365-374
    • Jain, A.K.1    Waller, W.G.2
  • 25
    • 0016125209 scopus 로고
    • Patterns in pattern recognition 1968-1974
    • L. Kanal, “Patterns in pattern recognition 1968-1974”, Trans. Inform. Theory, vol. IT-20, pp. 697–722, 1974.
    • (1974) Trans. Inform. Theory , vol.IT-20 , pp. 697-722
    • Kanal, L.1
  • 26
    • 0001050283 scopus 로고
    • On dimensionality and sample size in statistical pattern classification
    • L. Kanal and B. Chandrasekaran, “On dimensionality and sample size in statistical pattern classification”, Pattern Recog., vol. 3, pp. 238–255, 1971.
    • (1971) Pattern Recog. , vol.3 , pp. 238-255
    • Kanal, L.1    Chandrasekaran, B.2
  • 27
    • 0009820365 scopus 로고
    • A note on learning for Gaussian properties
    • D. G. Keehn, “A note on learning for Gaussian properties”, IEEE Trans. Inform. Theory, vol. IT-11, no. 1, pp. 126–131, 1965.
    • (1965) IEEE Trans. Inform. Theory , vol.IT-11 , Issue.1 , pp. 126-131
    • Keehn, D.G.1
  • 28
    • 0022848955 scopus 로고
    • Feature selection and extraction
    • T. Y. Young and K. S. Fu, Eds. New York: Academic
    • J. Kittler, “Feature selection and extraction”, in Handbook of Pattern Recognition and Image Processing, T. Y. Young and K. S. Fu, Eds. New York: Academic, 1986, pp. 60–83.
    • (1986) Handbook of Pattern Recognition and Image Processing , pp. 60-83
    • Kittler, J.1
  • 29
    • 84879799188 scopus 로고
    • Estimation of error rates in discriminant analysis
    • P.A. Lachenbruch and R. M. Mickey, “Estimation of error rates in discriminant analysis”, Technometrics, vol. 10, no. 1, pp. 1–11, 1968.
    • (1968) Technometrics , vol.10 , Issue.1 , pp. 1-11
    • Lachenbruch, P.A.1    Mickey, R.M.2
  • 30
    • 84972948264 scopus 로고
    • Robustness the linear and quadratic discriminant functions to certain types of non-normality
    • P. A. Lachenbruch, C. Sneeringer, and L. T. Revo, “Robustness the linear and quadratic discriminant functions to certain types of non-normality”, Commun. Statist., vol. 1, no. 1, pp. 39–56, 1972.
    • (1972) Commun. Statist. , vol.1 , Issue.1 , pp. 39-56
    • Lachenbruch, P.A.1    Sneeringer, C.2    Revo, L.T.3
  • 31
    • 13444287457 scopus 로고
    • Logical functions in the problems of empirical prediction
    • P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland
    • G. S. Lbov, “Logical functions in the problems of empirical prediction”, in Handbook of Statistics, vol. 2, P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland, 1982, pp. 479–491.
    • (1982) Handbook of Statistics , vol.2 , pp. 479-491
    • Lbov, G.S.1
  • 32
    • 0016898159 scopus 로고
    • Error estimation in pattern via L-distance between posterior density functions
    • T. Lissack and K. S. Fu, “Error estimation in pattern via L-distance between posterior density functions”, IEEE Trans. Inform. Theory, vol. IT-22, pp. 34–45, 1976.
    • (1976) IEEE Trans. Inform. Theory , vol.IT-22 , pp. 34-45
    • Lissack, T.1    Fu, K.S.2
  • 33
    • 85042336383 scopus 로고
    • The bias of the apparent error rate in discriminant analysis
    • G.J. McLachlan, “The bias of the apparent error rate in discriminant analysis”, Biometrika, vol. 63, pp. 239 - 244, 1976.
    • (1976) Biometrika , vol.63 , pp. 239-244
    • McLachlan, G.J.1
  • 34
    • 0004818998 scopus 로고
    • Assessing the performance of an allocation rule
    • G.J. McLachlan, “Assessing the performance of an allocation rule”, Comput. Math. Applicat., vol. 12A, pp. 261–272, 1976.
    • (1976) Comput. Math. Applicat. , vol.12A , pp. 261-272
    • McLachlan, G.J.1
  • 35
    • 0042140768 scopus 로고
    • The efficiency of Efron's ‘bootstrap’ to error estimation in discriminant analysis
    • G.J. McLachlan, “The efficiency of Efron's ‘bootstrap’ to error estimation in discriminant analysis”, J. Stat. Comput. Simulation, vol. 11, pp. 273–279, 1980.
    • (1980) J. Stat. Comput. Simulation , vol.11 , pp. 273-279
    • McLachlan, G.J.1
  • 36
    • 0002809091 scopus 로고
    • Error rate advances
    • A. K. Gupta, Ed. Dordrect, The Netherlands: Reidel
    • G.J. McLachlan, “Error rate advances”, in Advances in Multivariate Statistical Analysis, A. K. Gupta, Ed. Dordrect, The Netherlands: Reidel, 1987, pp. 233–252.
    • (1987) Advances in Multivariate Statistical Analysis , pp. 233-252
    • McLachlan, G.J.1
  • 37
    • 44449125148 scopus 로고
    • Comparison of algorithms for selecting the best feature set in pattern recognition
    • Vilnius, USSR: Inst. Math. Cybern. Press, (in Russian)
    • L. Miroshnichenko, “Comparison of algorithms for selecting the best feature set in pattern recognition”, in Statistical Problems of Control, issue 93. Vilnius, USSR: Inst. Math. Cybern. Press, 1990, pp. 78–91 (in Russian).
    • (1990) Statistical Problems of Control, issue , Issue.93 , pp. 78-91
    • Miroshnichenko, L.1
  • 38
    • 0001115280 scopus 로고
    • The error rate a classification procedure with application to logistic regression discrimination
    • T. Y. O'Neill, “The error rate a classification procedure with application to logistic regression discrimination”, J. Amer. Statist. Assoc., vol. 75, pp. 154–160, 1980.
    • (1980) J. Amer. Statist. Assoc. , vol.75 , pp. 154-160
    • O'Neill, T.Y.1
  • 40
    • 0345285314 scopus 로고
    • Analysis of learning speed of three linear classifiers
    • Ph.D. Dissertation, Inst. Phys. Math., Vilnius. (in Russian)
    • V. Pikelis, “Analysis of learning speed of three linear classifiers”, Ph.D. Dissertation, Inst. Phys. Math., Vilnius, pp. 1–136, 1974 (in Russian).
    • (1974) , pp. 1-136
    • Pikelis, V.1
  • 41
    • 0039596305 scopus 로고
    • On the problems of sample size in pattern recognition
    • Moscow, USSR: Nauka (in Russian)
    • Š. Raudys, “On the problems of sample size in pattern recognition”, in Proc. 2nd All-Union Conf. Statistical Methods in Control Theory, Moscow, USSR: Nauka, 1970, pp. 64–67 (in Russian).
    • (1970) Proc. 2nd All-Union Conf. Statistical Methods in Control Theory , pp. 64-67
    • Raudys, Š.1
  • 42
    • 1442282676 scopus 로고
    • Experimental comparison of thirteen classification algorithms
    • Vilnius, USSR: Inst. Phys. Math. Press, (in Russian).
    • Š. Raudys, V. Pikelis, and K. Juškevičius, “Experimental comparison of thirteen classification algorithms”, in Statistical Problems of Control, issue 11, Vilnius, USSR: Inst. Phys. Math. Press, 1975, pp. 35–80 (in Russian).
    • (1975) Statistical Problems of Control , Issue.11 , pp. 35-80
    • Raudys, Š.1    Pikelis, V.2    Juškevičius, K.3
  • 43
    • 67649396277 scopus 로고
    • Comparison of the estimates of the probability of misclassification
    • Kyoto, Japan, Nov.
    • Š. Raudys, “Comparison of the estimates of the probability of misclassification”, in Proc. 4th Int. Conf. Pattern Recognition, Kyoto, Japan, Nov. 1978, pp. 280–282.
    • (1978) Proc. 4th Int. Conf. Pattern Recognition , pp. 280-282
    • Raudys, Š.1
  • 44
    • 0018655616 scopus 로고
    • Determination of optimal dimensionality in pattern classification
    • Š. Raudys, “Determination of optimal dimensionality in pattern classification”, Pattern Recog., vol. 11, pp. 263 -270, 1979.
    • (1979) Pattern Recog. , vol.11 , pp. 263-270
    • Raudys, Š.1
  • 45
    • 0019020917 scopus 로고
    • On dimensionality, sample size, classification error, and complexity of classification algorithm in pattern recognition
    • Š. Raudys and V. Pikelis, “On dimensionality, sample size, classification error, and complexity of classification algorithm in pattern recognition”, IEEE Trans. Pattern Anal. Machine Intell, vol. PAMI-2, no. 3, pp. 242–252, 1980.
    • (1980) IEEE Trans. Pattern Anal. Machine Intell , vol.PAMI-2 , Issue.3 , pp. 242-252
    • Raudys, Š.1    Pikelis, V.2
  • 46
    • 0342974667 scopus 로고
    • The influence of sample size on classification performance
    • Issue Vilnius, USSR, Inst. Math. Cybern. Press, (in Russian).
    • Š. Raudys, “The influence of sample size on classification performance”, in Statistical Problems of Control, issue 66. Vilnius, USSR, Inst. Math. Cybern. Press, 1984, pp. 9–42 (in Russian).
    • (1984) Statistical Problems of Control , Issue.66 , pp. 9-42
    • Raudys, Š.1
  • 47
    • 1442282682 scopus 로고
    • Methods to estimate the probability of misclassification
    • Issue Vilnius, USSR: Inst. Math. Cybern. Press, (in Russian).
    • Š. Raudys, and V. Vaitukaitis, “Methods to estimate the probability of misclassification”, in Statistical Problems of Control, issue 66. Vilnius, USSR: Inst. Math. Cybern. Press, 1984, pp. 43–65 (in Russian).
    • (1984) Statistical Problems of Control , Issue.66 , pp. 43-65
    • Raudys, Š.1    Vaitukaitis, V.2
  • 48
    • 0024172308 scopus 로고
    • On the accuracy of a bootstrap estimate of the classification error
    • Rome, Italy, Nov.
    • Š. Raudys, “On the accuracy of a bootstrap estimate of the classification error”, in Proc. 9th Int. Conf Pattern Recognition, Rome, Italy, Nov. 1988, pp. 1230–1232.
    • (1988) Proc. 9th Int. Conf Pattern Recognition , pp. 1230-1232
    • Raudys, Š.1
  • 49
    • 84942217818 scopus 로고
    • The effects of the number of initial and final features, the dependence between the features and the type of a classification rule on the accuracy of feature selection
    • Submitted for publication.
    • Š. Raudys, V. Pikelis, and D. Stasaitis, “The effects of the number of initial and final features, the dependence between the features and the type of a classification rule on the accuracy of feature selection”, Pattern Recog. Artificial Intell., 1990, submitted for publication.
    • (1990) Pattern Recog. Artificial Intell.
    • Raudys, Š.1    Pikelis, V.2    Stasaitis, D.3
  • 50
    • 1442282625 scopus 로고
    • The distribution of actual error rates in linear discriminant analysis
    • J. W. Sayre, “The distribution of actual error rates in linear discriminant analysis”, J. Amer. Statist. Assoc., vol. 75, pp. 201–205, 1980.
    • (1980) J. Amer. Statist. Assoc. , vol.75 , pp. 201-205
    • Sayre, J.W.1
  • 52
    • 0006586801 scopus 로고
    • Large sample approximations and asymptotic expansions of classification statistics
    • P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland
    • M. Siotani, “Large sample approximations and asymptotic expansions of classification statistics”, in Handbook of Statistics, vol. 2, P. R. Krishnaiah and L. N. Kanal, Eds. Amsterdam, The Netherlands: North-Holland, 1982, pp. 61–100.
    • (1982) Handbook of Statistics , vol.2 , pp. 61-100
    • Siotani, M.1
  • 53
    • 46249114558 scopus 로고
    • Effect of the kernel form on the quality of nonparametric Parzen window classifier
    • Issue Vilnius, USSR: Inst. Math. Cybern. Press, (in Russian).
    • M. Skurikhina, “Effect of the kernel form on the quality of nonparametric Parzen window classifier”, in Statistical Problems of Control, issue 93. Vilnius, USSR: Inst. Math. Cybern. Press, 1990 (in Russian).
    • (1990) Statistical Problems of Control , Issue.93
    • Skurikhina, M.1
  • 54
    • 0016082639 scopus 로고
    • Bibliography on estimation of misclassification
    • G.T. Toussaint, “Bibliography on estimation of misclassification”, IEEE Trans. Inform. Theory, vol. 20, pp. 472–479, 1974.
    • (1974) IEEE Trans. Inform. Theory , vol.20 , pp. 472-479
    • Toussaint, G.T.1
  • 55
    • 0009583908 scopus 로고
    • Tree structured classification via recursive discriminant analysis
    • Ph.D. Dissertation, Univ. Wisconsin
    • N. Vanichsetakul, “Tree structured classification via recursive discriminant analysis”, Ph.D. Dissertation, Univ. Wisconsin, 1986.
    • (1986)
    • Vanichsetakul, N.1
  • 57
    • 0000614009 scopus 로고
    • Asymptotically optimal discriminant functions for pattern classification
    • C.T. Wolverton and T.J. Wagner, “Asymptotically optimal discriminant functions for pattern classification”, IEEE Trans. Inform. Theory, vol. IT-15, no. 2, pp. 258–265, 1969.
    • (1969) IEEE Trans. Inform. Theory , vol.IT-15 , Issue.2 , pp. 258-265
    • Wolverton, C.T.1    Wagner, T.J.2
  • 58
    • 84942217819 scopus 로고
    • Criteria for selecting the informative features in pattern recognition
    • Issue, Vilnius, USSR: Inst. Math. Cybern. Press, (in Russian).
    • D. Zvirenaite, “Criteria for selecting the informative features in pattern recognition”, in Statistical Problems of Control, issue 74. Vilnius, USSR: Inst. Math. Cybern. Press, 1986, pp. 76–103 (in Russian).
    • (1986) Statistical Problems of Control , Issue.74 , pp. 76-103
    • *Zvirenaite, D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.