



Volume 880, 2015, Pages 32-41

A new strategy to prevent over-fitting in partial least squares models based on model population analysis

Author keywords

Cross validation; Model population analysis; Model selection; Model stability; Over fitting; Partial least squares

Indexed keywords

CHEMICAL ANALYSIS; FORECASTING; STABILITY

EID: 84931562522     PISSN: 00032670     EISSN: 18734324     Source Type: Journal    
DOI: 10.1016/j.aca.2015.04.045     Document Type: Article
Times cited: 65

References (54)
  • 4
    • EID: 34548275795
    • Candes E., Tao T. The Dantzig selector: statistical estimation when p is much larger than n. Ann. Stat. 2007, 35:2313-2351.
  • 6
    • EID: 34250813108
    • Faber N.M., Rajko R. How to avoid over-fitting in multivariate calibration - the conventional validation approach and an alternative. Anal. Chim. Acta 2007, 595:98-106.
  • 7
    • EID: 0037191154
    • Li B.B., Morris J., Martin E.B. Model selection for partial least squares regression. Chemometr. Intell. Lab. 2002, 64:79-89.
  • 8
    • EID: 84906691299
    • Deng B.C., Yun Y.H., Liang Y.Z., Yi L.Z. A novel variable selection approach that iteratively optimizes variable space using weighted binary matrix sampling. Analyst 2014, 139:4836-4845.
  • 9
    • EID: 84890439287
    • Yun Y.H., Wang W.T., Tan M.L., Liang Y.Z., Li H.D., Cao D.S., Lu H.M., Xu Q.S. A strategy that iteratively retains informative variables for selecting optimal variable subset in multivariate calibration. Anal. Chim. Acta 2014, 807:36-43.
  • 10
    • EID: 84923974832
    • Deng B.C., Yun Y.H., Ma P., Lin C.C., Ren D.B., Liang Y.Z. A new method for wavelength interval selection that intelligently optimizes the locations, widths and combinations of the intervals. Analyst 2015, 140:1876-1885.
  • 13
    • EID: 0016355478
    • Akaike H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19:716-723.
  • 14
    • EID: 0000120766
    • Schwarz G. Estimating the dimension of a model. Ann. Stat. 1978, 6:461-464.
  • 15
    • EID: 0000629975
    • Stone M. Cross-validatory choice and assessment of statistical predictions. J. R. Stat. Soc. 1974, 36:111-147.
  • 16
    • EID: 84951601886
    • Wold S. Cross-validatory estimation of the number of components in factor and principal components models. Technometrics 1978, 20:397-405.
  • 17
    • EID: 21144474350
    • Shao J. Linear model selection by cross-validation. J. Am. Stat. Assoc. 1993, 88:486-494.
  • 18
    • EID: 0000025871
    • Box G.E. Science and statistics. J. Am. Stat. Assoc. 1976, 71:791-799.
  • 19
    • EID: 0000245743
    • Breiman L. Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author). 2001, 199-231.
  • 21
    • EID: 0023526237
    • Krzanowski W. Cross-validation in principal component analysis. Biometrics 1987, 43:575-584.
  • 22
    • EID: 0002656714
    • Osten D.W. Selection of optimal regression models via cross-validation. J. Chemometr. 1988, 2:39-48.
  • 24
    • EID: 0000507891
    • Van der Voet H. Pseudo-degrees of freedom for complex predictive models: the example of partial least squares. J. Chemometr. 1999, 13:195-208.
  • 25
    • EID: 64649097388
    • Tarumi T., Wu Y., Small G.W. Multivariate calibration with basis functions derived from optical filters. Anal. Chem. 2009, 81:2199-2207.
  • 26
    • EID: 84900311457
    • Kalivas J.H., Palmer J. Characterizing multivariate calibration tradeoffs (bias, variance, selectivity, and sensitivity) to select model tuning parameters. J. Chemometr. 2014, 28:347-357.
  • 28
    • EID: 79952134208
    • Gowen A.A., Downey G., Esquerre C., O'Donnell C.P. Preventing over-fitting in PLS calibration models of near-infrared (NIR) spectroscopy data using regression coefficients. J. Chemometr. 2011, 25:375-381.
  • 29
    • EID: 34249328882
    • Stout F., Baines M.R., Kalivas J.H. Impartial graphical comparison of multivariate calibration methods and the harmony/parsimony tradeoff. J. Chemometr. 2006, 20:464-475.
  • 30
    • EID: 0037060974
    • Rutledge D.N., Barros A.S. Durbin-Watson statistic as a morphological estimator of information content. Anal. Chim. Acta 2002, 454:277-295.
  • 31
    • EID: 84865148650
    • Li H.D., Liang Y.Z., Xu Q.S., Cao D.S. Model-population analysis and its applications in chemical and biological modeling. TrAC, Trends Anal. Chem. 2012, 38:154-162.
  • 33
    • EID: 0039079123
    • Herzberg A.M., Tsukanov A. A note on modifications of the jackknife criterion for model selection. Utilitas Math. 1986, 29:209-216.
  • 34
    • EID: 84950645271
    • Geisser S. The predictive sample reuse method with applications. J. Am. Stat. Assoc. 1975, 70:320-328.
  • 37
    • EID: 0000079353
    • Faber K., Kowalski B.R. Propagation of measurement errors for the validation of predictions obtained by principal component regression and partial least squares. J. Chemometr. 1997, 11:181-238.
  • 38
    • EID: 0035242236
    • Kalivas J.H. Basis sets for multivariate regression. Anal. Chim. Acta 2001, 428:31-40.
  • 39
    • EID: 0035701969
    • Kalivas J.H., Green R.L. Pareto optimal multivariate calibration for spectroscopic data. Appl. Spectrosc. 2001, 55:1645-1652.
  • 40
    • EID: 0000141851
    • Faber N.M. A closer look at the bias-variance trade-off in multivariate calibration. J. Chemometr. 1999, 13:185-192.
  • 42
    • EID: 1942438016
    • Ransohoff D.F. Rules of evidence for cancer molecular-marker discovery and validation. Nat. Rev. Cancer 2004, 4:309-314.
  • 43
    • EID: 84885001692
    • Yun Y.H., Liang Y.Z., Xie G.X., Li H.D., Cao D.S., Xu Q.S. A perspective demonstration on the importance of variable selection in inverse calibration for complex analytical systems. Analyst 2013, 138:6412-6421.
  • 46
    • EID: 0027413806
    • Liang Y.-Z., Kvalheim O.M., Manne R. White, grey and black multicomponent systems: a classification of mixture problems and methods for their quantitative analysis. Chemometr. Intell. Lab. 1993, 18:235-250.
  • 47
    • EID: 0022723652
    • Lorber A. Error propagation and figures of merit for quantification by solving matrix equations. Anal. Chem. 1986, 58:1167-1172.
  • 48
    • EID: 0000479247
    • Lorber A., Faber K., Kowalski B.R. Net analyte signal calculation in multivariate calibration. Anal. Chem. 1997, 69:1620-1626.
  • 49
    • EID: 84873277702
    • Li H.-D., Liang Y.-Z., Long X.-X., Yun Y.-H., Xu Q.-S. The continuity of sample complexity and its relationship to multivariate calibration: a general perspective on first-order calibration of spectral data in analytical chemistry. Chemometr. Intell. Lab. 2013, 122:23-30.
  • 50
    • EID: 63049100737
    • Brown C.D., Green R.L. Critical factors limiting the interpretation of regression vectors in multivariate calibration. TrAC, Trends Anal. Chem. 2009, 28:506-514.
  • 51
    • EID: 3242726813
    • Xu Q.S., Liang Y.Z., Du Y.P. Monte Carlo cross-validation for selecting a model and estimating the prediction error in multivariate calibration. J. Chemometr. 2004, 18:112-120.
  • 54
    • EID: 78651234224
    • Cao D., Liang Y., Xu Q., Yun Y., Li H. Toward better QSAR/QSPR modeling: simultaneous outlier detection and variable selection using distribution of model features. J. Comput. Aid. Mol. Des. 2011, 25:67-80.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.