



Volume 97, Issue 4, 2010, Pages 893-904

Consistent selection of the number of clusters via crossvalidation

Author keywords

Cluster analysis; Crossvalidation; k means; Selection consistency; Spectral clustering; Stability


EID: 78651282047     PISSN: 00063444     EISSN: 14643510     Source Type: Journal    
DOI: 10.1093/biomet/asq061     Document Type: Article
Times cited : (153)

References (23)
  • 2
    • 0036359730
    • A stability based method for discovering structure in clustered data
    • BEN-HUR, A., ELISSEEFF, A. & GUYON, I. (2002). A stability based method for discovering structure in clustered data. In Pac. Symp. Biocomp. 2002, 6-17.
    • (2002) Pac. Symp. Biocomp , vol.2002 , pp. 6-17
    • Ben-Hur, A.1    Elisseeff, A.2    Guyon, I.3
  • 3
    • 0000354976
    • A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods
    • BURMAN, P. (1989). A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods. Biometrika 76, 503-14.
    • (1989) Biometrika , vol.76 , pp. 503-514
    • Burman, P.1
  • 4
    • 84972893020
    • A dendrite method for cluster analysis
    • CALINSKI, T. & HARABASZ, J. (1974). A dendrite method for cluster analysis. Commun. Statist. 3, 1-27.
    • (1974) Commun. Statist. , vol.3 , pp. 1-27
    • Calinski, T.1    Harabasz, J.2
  • 5
    • 84950645271
    • The predictive sample reuse method with applications
    • GEISSER, S. (1975). The predictive sample reuse method with applications. J. Am. Statist. Assoc. 70, 320-8.
    • (1975) J. Am. Statist. Assoc. , vol.70 , pp. 320-328
    • Geisser, S.1
  • 7
    • 11144268948
    • Large deviations for sums of partly dependent random variables
    • JANSON, S. (2004). Large deviations for sums of partly dependent random variables. Random Struct. Algor. 24, 234-48.
    • (2004) Random Struct. Algor , vol.24 , pp. 234-248
    • Janson, S.1
  • 9
    • 0033196672
    • A cautionary note on using internal crossvalidation to select the number of clusters
    • KRIEGER, A. M. & GREEN, P. E. (1999). A cautionary note on using internal crossvalidation to select the number of clusters. Psychometrika 64, 341-53.
    • (1999) Psychometrika , vol.64 , pp. 341-353
    • Krieger, A.M.1    Green, P.E.2
  • 10
    • 33646597836
    • A criterion for determining the number of clusters in a data set
    • KRZANOWSKI, W. J. & LAI, Y. T. (1988). A criterion for determining the number of clusters in a data set. Biometrics 44, 23-34.
    • (1988) Biometrics , vol.44 , pp. 23-34
    • Krzanowski, W.J.1    Lai, Y.T.2
  • 11
    • 2442611856
    • Stability-based validation of clustering solutions
    • LANGE, T., ROTH, V., BRAUN, M. & BUHMANN, J. (2004). Stability-based validation of clustering solutions. Neural Comp. 16, 1299-323.
    • (2004) Neural Comp. , vol.16 , pp. 1299-1323
    • Lange, T.1    Roth, V.2    Braun, M.3    Buhmann, J.4
  • 13
    • 84899013108
    • On spectral clustering: Analysis and an algorithm
    • Ed. T. Dietterich, S. Becker and Z. Ghahramani, Cambridge: MIT Press
    • NG, A., JORDAN, M. & WEISS, Y. (2001). On spectral clustering: analysis and an algorithm. In Adv. Neural. Info. Processing Sys. (NIPS2001), Ed. T. Dietterich, S. Becker and Z. Ghahramani, pp. 849-56. Cambridge: MIT Press.
    • (2001) Adv. Neural. Info. Processing Sys. (NIPS2001) , pp. 849-856
    • Ng, A.1    Jordan, M.2    Weiss, Y.3
  • 14
    • 25844520574
    • Interpretation of clusters in the framework of shadowed sets
    • PEDRYCZ, W. (2005). Interpretation of clusters in the framework of shadowed sets. Pat. Recog. Lett. 26, 2439-49.
    • (2005) Pat. Recog. Lett. , vol.26 , pp. 2439-2449
    • Pedrycz, W.1
  • 15
    • 85048667824
    • Cluster stability for finite samples
    • Ed. J. Platt, D. Koller, Y. Singer and S. Roweis, Cambridge: MIT Press
    • SHAMIR, O. & TISHBY, N. (2007). Cluster stability for finite samples. In Adv. Neural Info. Processing Sys. (NIPS2007), Ed. J. Platt, D. Koller, Y. Singer and S. Roweis, pp. 1297-304. Cambridge: MIT Press.
    • (2007) Adv. Neural. Info. Processing Sys. (NIPS2007) , pp. 1297-1304
    • Shamir, O.1    Tishby, N.2
  • 16
    • 21144474350
    • Linear model selection by cross-validation
    • SHAO, J. (1993). Linear model selection by cross-validation. J. Am. Statist. Assoc. 88, 486-94.
    • (1993) J. Am. Statist. Assoc. , vol.88 , pp. 486-494
    • Shao, J.1
  • 17
    • 0242679438
    • Finding the number of clusters in a data set: An information theoretic approach
    • SUGAR, C. & JAMES, G. (2003). Finding the number of clusters in a data set: an information theoretic approach. J. Am. Statist. Assoc. 98, 750-63.
    • (2003) J. Am. Statist. Assoc. , vol.98 , pp. 750-763
    • Sugar, C.1    James, G.2
  • 19
    • 0035532141
    • Estimating the number of clusters in a data set via the gap statistic
    • TIBSHIRANI, R., WALTHER, G. & HASTIE, T. (2001b). Estimating the number of clusters in a data set via the gap statistic. J. R. Statist. Soc. B 63, 411-23.
    • (2001) J. R. Statist. Soc. B , vol.63 , pp. 411-423
    • Tibshirani, R.1    Walther, G.2    Hastie, T.3
  • 21
    • 33746167868
    • Comparing learning methods for classification
    • YANG, Y. (2006). Comparing learning methods for classification. Statist. Sinica 16, 635-57.
    • (2006) Statist. Sinica , vol.16 , pp. 635-657
    • Yang, Y.1
  • 22
    • 39649100346
    • Consistency of crossvalidation for comparing regression procedures
    • YANG, Y. (2007). Consistency of crossvalidation for comparing regression procedures. Ann. Statist. 35, 2450-73.
    • (2007) Ann. Statist. , vol.35 , pp. 2450-2473
    • Yang, Y.1
  • 23
    • 21144472438
    • Model selection via multifold crossvalidation
    • ZHANG, P. (1993). Model selection via multifold crossvalidation. Ann. Statist. 21, 299-313.
    • (1993) Ann. Statist. , vol.21 , pp. 299-313
    • Zhang, P.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.