메뉴 건너뛰기




Volumn 8, Issue 1, 2014, Pages 499-529

Concise comparative summaries (CCS) of large text corpora with a human experiment

Author keywords

Co occurrence; High dimensional analysis; L1 regularized logistic regression; L2 normalization; Lasso; Sparse modeling; Text summarization; Tf idf

Indexed keywords


EID: 84898020235     PISSN: 19326157     EISSN: 19417330     Source Type: Journal    
DOI: 10.1214/13-AOAS698     Document Type: Article
Times cited : (13)

References (44)
  • 2
    • 85161981998 scopus 로고    scopus 로고
    • Supervised topic models
    • J. C. Platt, D. Koller, Y. Singer and S. Roweis, eds.), MIT Press, Cambridge, MA
    • BLEI, D. and MCAULIFFE, J. (2008). Supervised topic models. In Advances in Neural Information Processing Systems 20 (J. C. Platt, D. Koller, Y. Singer and S. Roweis, eds.) 121-128. MIT Press, Cambridge, MA.
    • (2008) Advances In Neural Information Processing Systems 20 , pp. 121-128
    • Blei, D.1    McAuliffe, J.2
  • 4
    • 84863381525 scopus 로고    scopus 로고
    • Reading tea leaves: How humans interpret topic models
    • Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I.Williams and A. Culotta, eds.), Vancouver, BC, Canada
    • CHANG, J., Boyd-Graber, J., GERRISH, S., WANG, C. and BLEI, D. (2009). Reading tea leaves: How humans interpret topic models. In Advances in Neural Information Processing Systems 22 (Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I.Williams and A. Culotta, eds.) 288-296. Vancouver, BC, Canada.
    • (2009) Advances In Neural Information Processing Systems , vol.22 , pp. 288-296
    • Chang, J.1    Boyd-Graber, J.2    Gerrish, S.3    Wang, C.4    Blei, D.5
  • 10
    • 84985070018 scopus 로고
    • Framing: Toward clarification of a fractured paradigm
    • ENTMAN, R. M. (1993). Framing: Toward clarification of a fractured paradigm. Journal of Communication 43 52-57.
    • (1993) Journal of Communication , vol.43 , pp. 52-57
    • Entman, R.M.1
  • 12
    • 2942731012 scopus 로고    scopus 로고
    • An extensive empirical study of feature selection metrics for text classification
    • FORMAN, G. (2003). An extensive empirical study of feature selection metrics for text classification. J. Mach. Learn. Res. 3 1289-1305.
    • (2003) J. Mach. Learn. Res , vol.3 , pp. 1289-1305
    • Forman, G.1
  • 15
    • 34548105186 scopus 로고    scopus 로고
    • Large-scale Bayesian logistic regression for text categorization
    • MR2408634
    • GENKIN, A., LEWIS, D. D. and MADIGAN, D. (2007). Large-scale Bayesian logistic regression for text categorization. Technometrics 49 291-304. MR2408634
    • (2007) Technometrics , vol.49 , pp. 291-304
    • Genkin, A.1    Lewis, D.D.2    Madigan, D.3
  • 20
    • 84857292872 scopus 로고    scopus 로고
    • Topic-based multi-document summarization with probabilistic latent semantic analysis
    • Association for Computational Linguistics, Borovets, Bulgaria
    • HENNIG, L. (2009). Topic-based multi-document summarization with probabilistic latent semantic analysis. In Recent Advances in Natural Language Processing (RANLP) 144-149. Association for Computational Linguistics, Borovets, Bulgaria.
    • (2009) Recent Advances In Natural Language Processing (RANLP) , pp. 144-149
    • Hennig, L.1
  • 21
    • 73649109957 scopus 로고    scopus 로고
    • A method of automated nonparametric content analysis for social science
    • HOPKINS, D. and KING, G. (2010). A method of automated nonparametric content analysis for social science. American Journal of Political Science 54 229-247.
    • (2010) American Journal of Political Science , vol.54 , pp. 229-247
    • Hopkins, D.1    King, G.2
  • 24
    • 37249049983 scopus 로고    scopus 로고
    • International agenda-building and agenda-setting: Exploring the influence of public relations counsel on US news media and public perceptions of foreign nations
    • KIOUSIS, S. and WU, X. (2008). International agenda-building and agenda-setting: Exploring the influence of public relations counsel on US news media and public perceptions of foreign nations. The International Communications Gazette 70 58-75.
    • (2008) The International Communications Gazette , vol.70 , pp. 58-75
    • Kiousis, S.1    Wu, X.2
  • 25
    • 74449092515 scopus 로고    scopus 로고
    • Globalisation: News media, images of nations and the flow of international capital with special reference to the role of rating agencies
    • KUNCZIK, M. (2000). Globalisation: News media, images of nations and the flow of international capital with special reference to the role of rating agencies. J. International Communication 8 39-79.
    • (2000) J. International Communication , vol.8 , pp. 39-79
    • Kunczik, M.1
  • 27
    • 33746259694 scopus 로고    scopus 로고
    • New methods for text categorization based on a new feature selection method and a new similarity measure between documents
    • LEE, L. and CHEN, S. (2006). New methods for text categorization based on a new feature selection method and a new similarity measure between documents. Lecture Notes in Comput. Sci. 4031 1280.
    • (2006) Lecture Notes In Comput. Sci , vol.4031 , pp. 1280
    • Lee, L.1    Chen, S.2
  • 29
    • 62249190300 scopus 로고    scopus 로고
    • Fightin' words: Lexical feature selection and evaluation for identifying the content of political conflict
    • MONROE, B. L., COLARESI, M. P. and QUINN, K. M. (2008). Fightin' words: Lexical feature selection and evaluation for identifying the content of political conflict. Political Analysis 16 372-403.
    • (2008) Political Analysis , vol.16 , pp. 372-403
    • Monroe, B.L.1    Colaresi, M.P.2    Quinn, K.M.3
  • 33
    • 71149096419 scopus 로고    scopus 로고
    • News and its communicative quality: The inverted pyramid when and why did it appear?
    • POTTKER, H. (2003). News and its communicative quality: The inverted pyramid when and why did it appear? Journalism Studies 4 501-511.
    • (2003) Journalism Studies , vol.4 , pp. 501-511
    • Pottker, H.1
  • 34
    • 84868695730 scopus 로고    scopus 로고
    • Automatic keyword extraction from individual documents
    • (M.W. Berry and J. Kogan, eds.). Wiley, Chichester
    • ROSE, S., ENGEL, D., CRAMER, N. and COWLEY, W. (2010). Automatic keyword extraction from individual documents. In Text Mining: Applications and Theory (M.W. Berry and J. Kogan, eds.). Wiley, Chichester.
    • (2010) Text Mining: Applications and Theory
    • Rose, S.1    Engel, D.2    Cramer, N.3    Cowley, W.4
  • 35
    • 0000417994 scopus 로고
    • Developments in automatic text retrieval
    • SALTON, G. (1991). Developments in automatic text retrieval. Science 253 974-980.
    • (1991) Science , vol.253 , pp. 974-980
    • Salton, G.1
  • 36
    • 45549117987 scopus 로고
    • Term-weighting approaches in automatic text retrieval
    • SALTON, G. and BUCKLEY, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing and Management 24 513-523.
    • (1988) Information Processing and Management , vol.24 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 39
    • 85194972808 scopus 로고    scopus 로고
    • Regression shrinkage and selection via the lasso
    • MR1379242
    • TIBSHIRANI, R. (1996). Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Stat. Methodol. 58 267-288. MR1379242
    • (1996) J. R. Stat. Soc. Ser. B Stat. Methodol , vol.58 , pp. 267-288
    • Tibshirani, R.1
  • 42
    • 0001868572 scopus 로고    scopus 로고
    • Text categorization based on regularized linear classification methods
    • ZHANG, T. and OLES, F. J. (2001). Text categorization based on regularized linear classification methods. Information Retrieval 4 5-31.
    • (2001) Information Retrieval , vol.4 , pp. 5-31
    • Zhang, T.1    Oles, F.J.2
  • 43
    • 37749006178 scopus 로고    scopus 로고
    • Stagewise lasso
    • MR2383572
    • ZHAO, P. and YU, B. (2007). Stagewise lasso. J. Mach. Learn. Res. 8 2701-2726. MR2383572
    • (2007) J. Mach. Learn. Res , vol.8 , pp. 2701-2726
    • Zhao, P.1    Yu, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.