메뉴 건너뛰기




Volumn 19, Issue 4, 2013, Pages 1176-1211

The potential and perils of preprocessing: Building new foundations

Author keywords

Data compression; Data repositories; Measurement error; Multiphase inference; Multiple imputation; Statistical principles

Indexed keywords


EID: 84885073996     PISSN: 13507265     EISSN: None     Source Type: Journal    
DOI: 10.3150/13-BEJSP16     Document Type: Article
Times cited : (28)

References (55)
  • 1
    • 0141966130 scopus 로고    scopus 로고
    • Affymetrix, Inc., Santa Clara, CA (Accessed April, 2013.)
    • Affymetrix, I. (2002). Statistical algorithms description document. Affymetrix, Inc., Santa Clara, CA. Available at http://media.affymetrix.com/ support/technical/whitepapers/sadd-whitepaper.pdf (Accessed April, 2013.).
    • (2002) Statistical Algorithms Description Document
    • Affymetrix, I.1
  • 3
    • 0001677717 scopus 로고
    • Controlling the false discovery rate: A practical and powerful approach to multiple testing
    • MR1325392
    • Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B Stat. Methodol. 57 289-300. MR1325392
    • (1995) J. R. Stat. Soc. Ser. B Stat. Methodol. , vol.57 , pp. 289-300
    • Benjamini, Y.1    Hochberg, Y.2
  • 5
    • 0001141391 scopus 로고
    • On the development of reference priors
    • PeñíScola New York: Oxford Univ. Press. MR1380269
    • Berger, J.O. and Bernardo, J.M. (1992). On the development of reference priors. In Bayesian Statistics, 4 (PeñíScola, 1991) 35-60. New York: Oxford Univ. Press. MR1380269
    • (1991) Bayesian Statistics , vol.4 , pp. 35-60
    • Berger, J.O.1    Bernardo, J.M.2
  • 7
    • 0000129333 scopus 로고
    • Equivalent comparisons of experiments
    • MR0056251
    • Blackwell, D. (1953). Equivalent comparisons of experiments. Ann. Math. Statist. 24 265-272. MR0056251
    • (1953) Ann. Math. Statist. , vol.24 , pp. 265-272
    • Blackwell, D.1
  • 8
    • 84896598225 scopus 로고    scopus 로고
    • Semi-parametric robust event detection for massive time-domain databases
    • (E.D. Feigelson and G.J. Babu, eds.). Lecture Notes in Statistics New York, NY: Springer
    • Blocker, A.W. and Protopapas, P. (2012). Semi-parametric robust event detection for massive time-domain databases. In Statistical Challenges in Modern Astronomy V (E.D. Feigelson and G.J. Babu, eds.). Lecture Notes in Statistics 902 177-187. New York, NY: Springer.
    • (2012) Statistical Challenges in Modern Astronomy V , vol.902 , pp. 177-187
    • Blocker, A.W.1    Protopapas, P.2
  • 9
    • 0037316303 scopus 로고    scopus 로고
    • A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
    • Bolstad, B.M.M., Irizarry, R.A.A., Astrand, M. and Speed, T.P.P. (2003). A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19 185-193.
    • (2003) Bioinformatics , vol.19 , pp. 185-193
    • Bolstad, B.M.M.1    Irizarry, R.A.A.2    Astrand, M.3    Speed, T.P.P.4
  • 12
    • 0000336139 scopus 로고
    • Regression models and life-tables
    • MR0341758
    • Cox, D.R. (1972). Regression models and life-tables. J. R. Stat. Soc. Ser. B Stat. Methodol. 34 187-220. MR0341758
    • (1972) J. R. Stat. Soc. Ser. B Stat. Methodol. , vol.34 , pp. 187-220
    • Cox, D.R.1
  • 13
    • 0016794660 scopus 로고
    • Partial likelihood
    • MR0400509
    • Cox, D.R. (1975). Partial likelihood. Biometrika 62 269-276. MR0400509
    • (1975) Biometrika , vol.62 , pp. 269-276
    • Cox, D.R.1
  • 14
    • 84885066604 scopus 로고    scopus 로고
    • Massive data streams
    • Presented at
    • Davey, A. (2012). Massive data streams. Presented at SolarStat 2012.
    • (2012) SolarStat 2012
    • Davey, A.1
  • 18
    • 84885030216 scopus 로고
    • On a necessary and sufficient condition for admissibility of estimators when strictly convex loss is used
    • MR0231484
    • Farrell, R.H. (1968). On a necessary and sufficient condition for admissibility of estimators when strictly convex loss is used. Ann. Math. Statist. 39 23-28. MR0231484
    • (1968) Ann. Math. Statist. , vol.39 , pp. 23-28
    • Farrell, R.H.1
  • 19
    • 0001526195 scopus 로고
    • A predictive approach to model selection
    • MR0529531
    • Geisser, S. and Eddy, W.F. (1979). A predictive approach to model selection. J. Amer. Statist. Assoc. 74 153-160. MR0529531
    • (1979) J. Amer. Statist. Assoc. , vol.74 , pp. 153-160
    • Geisser, S.1    Eddy, W.F.2
  • 21
    • 18544368641 scopus 로고    scopus 로고
    • Classifying gene expression profiles from pairwise mRNA comparisons
    • (electronic). MR2101468
    • Geman, D., d'Avignon, C., Naiman, D.Q. and Winslow, R.L. (2004). Classifying gene expression profiles from pairwise mRNA comparisons. Stat. Appl. Genet. Mol. Biol. 3 21 pp. (electronic). MR2101468
    • (2004) Stat. Appl. Genet. Mol. Biol. , vol.3
    • Geman, D.1    D'Avignon, C.2    Naiman, D.Q.3    Winslow, R.L.4
  • 22
    • 0042003916 scopus 로고
    • Comparison of experiments and information measures
    • MR0536509
    • Goel, P.K. and DeGroot, M.H. (1979). Comparison of experiments and information measures. Ann. Statist. 7 1066-1077. MR0536509
    • (1979) Ann. Statist. , vol.7 , pp. 1066-1077
    • Goel, P.K.1    DeGroot, M.H.2
  • 24
    • 0013058848 scopus 로고
    • Invariant prior distributions
    • MR0161406
    • Hartigan, J. (1964). Invariant prior distributions. Ann. Math. Statist. 35 836-845. MR0161406
    • (1964) Ann. Math. Statist. , vol.35 , pp. 836-845
    • Hartigan, J.1
  • 25
    • 82755192534 scopus 로고    scopus 로고
    • Improving validation practices in "omics" research
    • Ioannidis, J.P.A. and Khoury, M.J. (2011). Improving validation practices in "omics" research. Science 334 1230-1232.
    • (2011) Science , vol.334 , pp. 1230-1232
    • Ioannidis, J.P.A.1    Khoury, M.J.2
  • 26
    • 33645326509 scopus 로고    scopus 로고
    • Comparison of affymetrix GeneChip expression measures
    • Irizarry, R.A., Wu, Z. and Jaffee, H.A. (2006). Comparison of Affymetrix GeneChip expression measures. Bioinformatics 22 789-794.
    • (2006) Bioinformatics , vol.22 , pp. 789-794
    • Irizarry, R.A.1    Wu, Z.2    Jaffee, H.A.3
  • 28
    • 84873751778 scopus 로고
    • An invariant form for the prior probability in estimation problems
    • MR0017504
    • Jeffreys, H. (1946). An invariant form for the prior probability in estimation problems. Proc. Roy. Soc. London. Ser. A. 186 453-461. MR0017504
    • (1946) Proc. Roy. Soc. London. Ser. A. , vol.186 , pp. 453-461
    • Jeffreys, H.1
  • 29
    • 51249163831 scopus 로고
    • Several Bayesians: A review
    • MR1265483
    • Kadane, J.B. (1993). Several Bayesians: A review. TEST 2 1-32. MR1265483
    • (1993) TEST , vol.2 , pp. 1-32
    • Kadane, J.B.1
  • 30
    • 0030327756 scopus 로고    scopus 로고
    • The selection of prior distributions by formal rules
    • Kass, R.E. and Wasserman, L. (1996). The selection of prior distributions by formal rules. J. Amer. Statist. Assoc. 91 1343-1370.
    • (1996) J. Amer. Statist. Assoc. , vol.91 , pp. 1343-1370
    • Kass, R.E.1    Wasserman, L.2
  • 31
    • 84861491090 scopus 로고    scopus 로고
    • Dust spectral energy distributions in the era of herschel and planck: A hierarchical Bayesian-fitting technique
    • Kelly, B.C., Shetty, R., Stutz, A.M., Kauffmann, J., Goodman, A.A. and Launhardt, R. (2012). Dust spectral energy distributions in the era of Herschel and Planck: A hierarchical Bayesian-fitting technique. The Astrophysical Journal 752 55.
    • (2012) The Astrophysical Journal , vol.752 , pp. 55
    • Kelly, B.C.1    Shetty, R.2    Stutz, A.M.3    Kauffmann, J.4    Goodman, A.A.5    Launhardt, R.6
  • 32
    • 0000457130 scopus 로고
    • Sufficiency and approximate sufficiency
    • MR0207093
    • Le Cam, L. (1964). Sufficiency and approximate sufficiency. Ann. Math. Statist. 35 1419-1455. MR0207093
    • (1964) Ann. Math. Statist. , vol.35 , pp. 1419-1455
    • Le Cam, L.1
  • 33
    • 0004029130 scopus 로고    scopus 로고
    • 2nd ed. Springer Texts in Statistics. New York: Springer. MR1639875
    • Lehmann, E.L. and Casella, G. (1998). Theory of Point Estimation, 2nd ed. Springer Texts in Statistics. New York: Springer. MR1639875
    • (1998) Theory of Point Estimation
    • Lehmann, E.L.1    Casella, G.2
  • 34
    • 0000138970 scopus 로고
    • On the reconciliation of probability assessments
    • MR0547236
    • Lindley, D.V., Tversky, A. and Brown, R.V. (1979). On the reconciliation of probability assessments. J. Roy. Statist. Soc. Ser. A 142 146-180. MR0547236
    • (1979) J. Roy. Statist. Soc. Ser. A , vol.142 , pp. 146-180
    • Lindley, D.V.1    Tversky, A.2    Brown, R.V.3
  • 35
    • 33749027459 scopus 로고    scopus 로고
    • Parameter estimation for the exponential-normal convolution model for background correction of affymetrix GeneChip data
    • (electronic). MR2306487
    • McGee, M. and Chen, Z. (2006). Parameter estimation for the exponential-normal convolution model for background correction of Affymetrix GeneChip data. Stat. Appl. Genet. Mol. Biol. 5 27 pp. (electronic). MR2306487
    • (2006) Stat. Appl. Genet. Mol. Biol. , vol.5
    • McGee, M.1    Chen, Z.2
  • 36
    • 84972537494 scopus 로고
    • Multiple-imputation inferences with uncongenial sources of input (with discussion)
    • Meng, X.L. (1994). Multiple-imputation inferences with uncongenial sources of input (with discussion). Statist. Sci. 9 538-558.
    • (1994) Statist. Sci. , vol.9 , pp. 538-558
    • Meng, X.L.1
  • 37
    • 0344379018 scopus 로고    scopus 로고
    • Discussion: Efficiency and self-efficiency with multiple imputation inference
    • Meng, X.L. and Romero, M. (2003). Discussion: Efficiency and self-efficiency with multiple imputation inference. International Statistical Review 71 607-618.
    • (2003) International Statistical Review , vol.71 , pp. 607-618
    • Meng, X.L.1    Romero, M.2
  • 38
    • 84864615423 scopus 로고
    • Using EM to obtain asymptotic variance-covariance matrices: The SEM algorithm
    • Meng, X.L. and Rubin, D.B. (1991). Using EM to obtain asymptotic variance-covariance matrices: The SEM algorithm. J. Amer. Statist. Assoc. 86 899-909.
    • (1991) J. Amer. Statist. Assoc. , vol.86 , pp. 899-909
    • Meng, X.L.1    Rubin, D.B.2
  • 40
    • 0001831031 scopus 로고
    • Consistent estimates based on partially consistent observations
    • MR0025113
    • Neyman, J. and Scott, E.L. (1948). Consistent estimates based on partially consistent observations. Econometrica 16 1-32. MR0025113
    • (1948) Econometrica , vol.16 , pp. 1-32
    • Neyman, J.1    Scott, E.L.2
  • 41
    • 65349190055 scopus 로고    scopus 로고
    • On surrogate loss functions and f-divergences
    • MR2502654
    • Nguyen, X., Wainwright, M.J. and Jordan, M.I. (2009). On surrogate loss functions and f-divergences. Ann. Statist. 37 876-904. MR2502654
    • (2009) Ann. Statist. , vol.37 , pp. 876-904
    • Nguyen, X.1    Wainwright, M.J.2    Jordan, M.I.3
  • 42
    • 0345659236 scopus 로고    scopus 로고
    • Proper and improper multiple imputation
    • Nielsen, S.F. (2003). Proper and improper multiple imputation. International Statistical Review 71 593-607.
    • (2003) International Statistical Review , vol.71 , pp. 593-607
    • Nielsen, S.F.1
  • 44
    • 0036898577 scopus 로고    scopus 로고
    • Microarray data normalization and transformation
    • Quackenbush, J. (2002). Microarray data normalization and transformation. Nat. Genet. 32 Suppl 496-501.
    • (2002) Nat. Genet. , vol.32 , Issue.SUPPL. , pp. 496-501
    • Quackenbush, J.1
  • 46
    • 0017133178 scopus 로고
    • Inference and missing data
    • MR0455196
    • Rubin, D.B. (1976). Inference and missing data. Biometrika 63 581-592. MR0455196
    • (1976) Biometrika , vol.63 , pp. 581-592
    • Rubin, D.B.1
  • 47
    • 0003738155 scopus 로고
    • Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics. New York: Wiley. MR0899519
    • Rubin, D.B. (1987). Multiple Imputation for Nonresponse in Surveys. Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics. New York: Wiley. MR0899519
    • (1987) Multiple Imputation for Nonresponse in Surveys.
    • Rubin, D.B.1
  • 48
    • 0030539070 scopus 로고    scopus 로고
    • Multiple imputation after 18+ years
    • Rubin, D.B. (1996). Multiple imputation after 18+ years. J. Amer. Statist. Assoc. 91 473-489.
    • (1996) J. Amer. Statist. Assoc. , vol.91 , pp. 473-489
    • Rubin, D.B.1
  • 49
    • 0007280722 scopus 로고
    • On rereading R. A. Fisher
    • MR0403889
    • Savage, L.J. (1976). On rereading R. A. Fisher. Ann. Statist. 4 441-500. MR0403889
    • (1976) Ann. Statist. , vol.4 , pp. 441-500
    • Savage, L.J.1
  • 50
    • 73349128000 scopus 로고    scopus 로고
    • The effect of line-of-sight temperature variation and noise on dust continuum observations
    • Shetty, R., Kauffmann, J., Schnee, S., Goodman, A.A. and Ercolano, B. (2009). The effect of line-of-sight temperature variation and noise on dust continuum observations. The Astrophysical Journal 696 2234-2251.
    • (2009) The Astrophysical Journal , vol.696 , pp. 2234-2251
    • Shetty, R.1    Kauffmann, J.2    Schnee, S.3    Goodman, A.A.4    Ercolano, B.5
  • 51
    • 33644872577 scopus 로고    scopus 로고
    • Limma: Linear models for microarray data
    • (R. Gentelman, V. Carey, S. Dudoit, R. Irizarry and W. Huber, eds.) 2005 Berlin: Springer
    • Smyth, G.K. (2005). Limma: Linear models for microarray data. In Bioinformatics and Computational Biology Solutions Using R and Bioconductor (R. Gentelman, V. Carey, S. Dudoit, R. Irizarry and W. Huber, eds.) 2005 397-420. Berlin: Springer.
    • (2005) Bioinformatics and Computational Biology Solutions Using R and Bioconductor , pp. 397-420
    • Smyth, G.K.1
  • 52
    • 27544451127 scopus 로고    scopus 로고
    • Simple decision rules for classifying human cancers from gene expression profiles
    • Tan, A.C., Naiman, D.Q., Xu, L., Winslow, R.L. and Geman, D. (2005). Simple decision rules for classifying human cancers from gene expression profiles. Bioinformatics 21 3896-3904.
    • (2005) Bioinformatics , vol.21 , pp. 3896-3904
    • Tan, A.C.1    Naiman, D.Q.2    Xu, L.3    Winslow, R.L.4    Geman, D.5
  • 53
    • 0035942271 scopus 로고    scopus 로고
    • Significance analysis of microarrays applied to the ionizing radiation response
    • Tusher, V.G., Tibshirani, R. and Chu, G. (2001). Significance analysis of microarrays applied to the ionizing radiation response. Proc. Natl. Acad. Sci. USA 98 5116-5121.
    • (2001) Proc. Natl. Acad. Sci. USA , vol.98 , pp. 5116-5121
    • Tusher, V.G.1    Tibshirani, R.2    Chu, G.3
  • 55
    • 62549119576 scopus 로고    scopus 로고
    • Statistical methods of background correction for illumina bead array data
    • Xie, Y., Wang, X. and Story, M. (2009). Statistical methods of background correction for Illumina Bead Array data. Bioinformatics 25 751-757.
    • (2009) Bioinformatics , vol.25 , pp. 751-757
    • Xie, Y.1    Wang, X.2    Story, M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.