



Volume 49, Issue 3, 2007, Pages 291-304

Large-scale Bayesian logistic regression for text categorization

Author keywords

Information retrieval; Lasso; Penalization; Ridge regression; Support vector classifier; Variable selection

Indexed keywords

CLASSIFICATION (OF INFORMATION); FEATURE EXTRACTION; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; NATURAL LANGUAGE PROCESSING SYSTEMS; REGRESSION ANALYSIS; SUPPORT VECTOR MACHINES;

EID: 34548105186     PISSN: 00401706     EISSN: None     Source Type: Journal    
DOI: 10.1198/004017007000000245     Document Type: Article
Times cited: 614

References (65)
  • 1
    • 34548059625
    • Acklam, P. J. (2004), "An Algorithm for Computing the Inverse Normal Cumulative Distribution Function," available at http://home.online.no/~pjacklam/notes/invnorm/.
  • 4
    • 84973057463
    • A View of Unconstrained Optimization
    • Dennis, J. E., Jr., and Schnabel, R. B. (1989), "A View of Unconstrained Optimization," in Optimization, eds. G. L. Nemhauser, A. H. G. Rinnooy Kan, and M. J. Todd, Amsterdam: Elsevier, pp. 1-72.
    • (1989) Optimization , pp. 1-72
    • Dennis Jr., J.E.1    Schnabel, R.B.2
  • 8
    • 2942731012
    • An Extensive Empirical Study of Feature Selection Metrics for Text Classification
    • Forman, G. (2003), "An Extensive Empirical Study of Feature Selection Metrics for Text Classification," Journal of Machine Learning Research, 3, 1289-1305.
    • (2003) Journal of Machine Learning Research , vol.3 , pp. 1289-1305
    • Forman, G.1
  • 9
    • 0000249788
    • An Equivalence Between Sparse Approximation and Support Vector Machines
    • Girosi, F. (1998), "An Equivalence Between Sparse Approximation and Support Vector Machines," Neural Computation, 10, 1455-1480.
    • (1998) Neural Computation , vol.10 , pp. 1455-1480
    • Girosi, F.1
  • 10
    • 0034159815
    • Problems From Small Samples and Sparse Data in Conditional Logistic Regression Analysis
    • Greenland, S., Schwartzbaum, J. A., and Finkle, W. D. (2000), "Problems From Small Samples and Sparse Data in Conditional Logistic Regression Analysis," American Journal of Epidemiology, 151, 531-539.
    • (2000) American Journal of Epidemiology , vol.151 , pp. 531-539
    • Greenland, S.1    Schwartzbaum, J.A.2    Finkle, W.D.3
  • 11
    • 0037445869
    • Consistency of Logistic Regression Coefficient Estimates Calculated From a Training Sample
    • Hadjicostas, P. (2003), "Consistency of Logistic Regression Coefficient Estimates Calculated From a Training Sample," Statistics and Probability Letters, 62, 293-303.
    • (2003) Statistics and Probability Letters , vol.62 , pp. 293-303
    • Hadjicostas, P.1
  • 12
    • 0002585592
    • Generalized Linear Models
    • Hastie, T. J., and Pregibon, D. (1992), "Generalized Linear Models," in Statistical Models in S, eds. J. M. Chambers and T. J. Hastie, Pacific Grove, CA: Wadsworth & Brooks, pp. 377-420.
    • (1992) Statistical Models in S , pp. 377-420
    • Hastie, T.J.1    Pregibon, D.2
  • 16
    • 84942484786
    • Ridge Regression: Biased Estimation for Nonorthogonal Problems
    • Hoerl, A. E., and Kennard, R. W. (1970), "Ridge Regression: Biased Estimation for Nonorthogonal Problems," Technometrics, 12, 55-67.
    • (1970) Technometrics , vol.12 , pp. 55-67
    • Hoerl, A.E.1    Kennard, R.W.2
  • 18
    • 84957069814
    • Text Categorization With Support Vector Machines: Learning With Many Relevant Features
    • Joachims, T. (1998), "Text Categorization With Support Vector Machines: Learning With Many Relevant Features," in Machine Learning: ECML '98, 10th European Conference on Machine Learning, Heidelberg: Springer, pp. 137-142.
    • (1998) Machine Learning: ECML '98, 10th European Conference on Machine Learning , pp. 137-142
    • Joachims, T.1
  • 21
    • 0022848955
    • Feature Selection and Extraction
    • Kittler, J. (1986), "Feature Selection and Extraction," in Handbook of Pattern Recognition and Image Processing, eds. T. Y. Young and K.-S. Fu, Orlando, FL: Academic Press, pp. 59-83.
    • (1986) Handbook of Pattern Recognition and Image Processing , pp. 59-83
    • Kittler, J.1
  • 22
    • 0035575628
    • Relative Loss Bounds for Multidimensional Regression Problems
    • Kivinen, J., and Warmuth, M. K. (2001), "Relative Loss Bounds for Multidimensional Regression Problems," Machine Learning, 45, 301-329.
    • (2001) Machine Learning , vol.45 , pp. 301-329
    • Kivinen, J.1    Warmuth, M.K.2
  • 26
    • 33644681560
    • Recovering Genetic Regulatory Networks From Micro-Array Data and Location Analysis Data
    • Li, F., and Yang, Y. (2004), "Recovering Genetic Regulatory Networks From Micro-Array Data and Location Analysis Data," Genome Informatics, 15, 131-140.
    • (2004) Genome Informatics , vol.15 , pp. 131-140
    • Li, F.1    Yang, Y.2
  • 28
  • 31
    • 84876811202
    • Lewis, D. D., Yang, Y., Rose, T., and Li, F. (2004), "RCV1: A New Benchmark Collection for Text Categorization Research," Journal of Machine Learning Research, 5, 361-397. Data set available at http://jmlr.csail.mit.edu/papers/volume5/lewis04a/lyrl2004_rcv1v2_README.htm.
  • 32
    • 0028230073
    • Understanding and Using the Medical Subject Headings (MeSH) Vocabulary to Perform Literature Searches
    • Lowe, H. J., and Barnett, G. O. (1994), "Understanding and Using the Medical Subject Headings (MeSH) Vocabulary to Perform Literature Searches," Journal of the American Medical Association, 271, 1103-1108.
    • (1994) Journal of the American Medical Association , vol.271 , pp. 1103-1108
    • Lowe, H.J.1    Barnett, G.O.2
  • 34
    • 33748521081
    • Statistics and the War on Spam
    • Madigan, D. (2005), "Statistics and the War on Spam," in Statistics: A Guide to the Unknown, eds. R. Peck, G. Casella, G. Cobb, R. Hoerl, D. Nolan, R. Starbuck, and H. Stern, Duxbury Press, pp. 135-147.
    • (2005) Statistics: A Guide to the Unknown , pp. 135-147
    • Madigan, D.1
  • 35
    • 34548104236
    • Madigan, D., and Ridgeway, G. (2004), Discussion of "Least Angle Regression," by B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani, The Annals of Statistics, 32, 465-469.
  • 37
    • 33751240089
    • Madigan, D., Genkin, A., Lewis, D. D., and Fradkin, D. (2005b), "Bayesian Multinomial Logistic Regression for Author Identification," in Bayesian Analysis and Maximum Entropy Methods in Science and Engineering: 25th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, AIP Conference Proceedings, Vol. 803, Melville, NY: AIP, pp. 509-516.
  • 41
    • 0000943086
    • Maron, M. E. (1961), "Automatic Indexing: An Experimental Inquiry," Journal of the ACM, 8, 404-417.
    • Meinshausen, N. (2005), "Lasso With Relaxation," technical report, University of California Berkeley, Dept. of Statistics.
  • 44
    • 34548097408
    • Park, M.-Y., and Hastie, T. (2006), "An L1 Regularization-Path Algorithm for Generalized Linear Models," available at http://www-stat.stanford.edu/~hastie/Papers/glmpath.pdf.
  • 45
    • 0018824137
    • Bias and Efficiency in Logistic Analysis of Stratified Case-Control Studies
    • Pike, M. C., Hill, A. P., and Smith, P. G. (1980), "Bias and Efficiency in Logistic Analysis of Stratified Case-Control Studies," American Journal of Epidemiology, 9, 89-95.
    • (1980) American Journal of Epidemiology , vol.9 , pp. 89-95
    • Pike, M.C.1    Hill, A.P.2    Smith, P.G.3
  • 46
    • 84948481845
    • An Algorithm for Suffix Stripping
    • Porter, M. F. (1980), "An Algorithm for Suffix Stripping," Program, 14 (3), 130-137.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 47
    • 34548105804
    • Porter, M. F. (2003), "The Porter Stemming Algorithm," available at http://www.tartarus.org/~martin/PorterStemmer/index.html.
  • 50
    • 45549117987
    • Term-Weighting Approaches in Automatic Text Retrieval
    • Salton, G., and Buckley, C. (1988), "Term-Weighting Approaches in Automatic Text Retrieval," Information Processing and Management, 24, 513-523.
    • (1988) Information Processing and Management , vol.24 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 52
    • 0033905095
    • BoosTexter: A Boosting-Based System for Text Categorization
    • Schapire, R. E., and Singer, Y. (2000), "BoosTexter: A Boosting-Based System for Text Categorization," Machine Learning, 39, 135-168.
    • (2000) Machine Learning , vol.39 , pp. 135-168
    • Schapire, R.E.1    Singer, Y.2
  • 53
    • 0002442796
    • Machine Learning in Automated Text Categorization
    • Sebastiani, F. (2002), "Machine Learning in Automated Text Categorization," ACM Computing Surveys, 34, 1-47.
    • (2002) ACM Computing Surveys , vol.34 , pp. 1-47
    • Sebastiani, F.1
  • 54
    • 0345327592
    • A Simple and Efficient Algorithm for Gene Selection Using Sparse Logistic Regression
    • Shevade, S. K., and Keerthi, S. S. (2003), "A Simple and Efficient Algorithm for Gene Selection Using Sparse Logistic Regression," Bioinformatics, 19, 2246-2253.
    • (2003) Bioinformatics , vol.19 , pp. 2246-2253
    • Shevade, S.K.1    Keerthi, S.S.2
  • 55
    • 0012950799
    • Bayesian and Frequentist Approaches to Parametric Predictive Inference (with discussion)
    • Smith, R. L. (1999), "Bayesian and Frequentist Approaches to Parametric Predictive Inference" (with discussion), in Bayesian Statistics 6, eds. J. M. Bernardo, J. O. Berger, A. P. Dawid, and A. F. M. Smith, Oxford, U.K.: Oxford University Press, pp. 589-612.
    • (1999) Bayesian Statistics 6 , pp. 589-612
    • Smith, R.L.1
  • 57
    • 0001224048
    • Sparse Bayesian Learning and the Relevance Vector Machine
    • Tipping, M. E. (2001), "Sparse Bayesian Learning and the Relevance Vector Machine," Journal of Machine Learning Research, 1, 211-244.
    • (2001) Journal of Machine Learning Research , vol.1 , pp. 211-244
    • Tipping, M.E.1
  • 58
    • 34548090902
    • Automatic Information Structuring and Retrieval
    • van Rijsbergen, C. J. (1972), "Automatic Information Structuring and Retrieval," unpublished doctoral thesis, King's College, Cambridge.
    • (1972)
    • van Rijsbergen, C.J.1
  • 63
    • 0001868572
    • Text Categorization Based on Regularized Linear Classifiers
    • Zhang, T., and Oles, F. (2001), "Text Categorization Based on Regularized Linear Classifiers," Information Retrieval, 4, 5-31.
    • (2001) Information Retrieval , vol.4 , pp. 5-31
    • Zhang, T.1    Oles, F.2
  • 64
    • 33845263263
    • On Model Selection Consistency of Lasso
    • Zhao, P., and Yu, B. (2006), "On Model Selection Consistency of Lasso," Journal of Machine Learning Research, 7, 2541-2567.
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 2541-2567
    • Zhao, P.1    Yu, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.