메뉴 건너뛰기




Volumn 30, Issue 2, 2012, Pages

Authorship attribution based on specific vocabulary

Author keywords

Authorship attribution; Lexical statistics; Text classifiaction

Indexed keywords

AUTHORSHIP ATTRIBUTION; BAYES APPROACH; BINOMIAL DISTRIBUTION; CLASSIFICATION SCHEME; DISTANCE MEASURE; DISTANCE-BASED; PUNCTUATION MARKS; TERM OCCURRENCES; TEXT CLASSIFIACTION; TEXT REPRESENTATION; WORD PROFILES; Z-SCORES;

EID: 84863735149     PISSN: 10468188     EISSN: 15582868     Source Type: Journal    
DOI: 10.1145/2180868.2180874     Document Type: Article
Times cited : (41)

References (93)
  • 1
    • 33748457123 scopus 로고    scopus 로고
    • Introduction to the special topic selection on the computational analysis of style
    • ARGAMON, S. 2006. Introduction to the special topic selection on the computational analysis of style. J. Amer. Soc. Inf. Sci. Technol. 57, 11, 1503-1505.
    • (2006) J. Amer. Soc. Inf. Sci. Technol. , vol.57 , Issue.11 , pp. 1503-1505
    • Argamon, S.1
  • 2
    • 44449177126 scopus 로고    scopus 로고
    • Interpreting Burrows's delta: Geometric and probabilistic foundations
    • ARGAMON, S. 2008. Interpreting Burrows's delta: Geometric and probabilistic foundations. Liter. Linguist. Comput. 23, 2, 131-147.
    • (2008) Liter. Linguist. Comput. , vol.23 , Issue.2 , pp. 131-147
    • Argamon, S.1
  • 3
    • 58849089737 scopus 로고    scopus 로고
    • Automatically profiling the author of an anonymous text
    • ARGAMON, S., KOPPEL, M., PENNEBAKER, J. W., AND SCHLER, J. 2009. Automatically profiling the author of an anonymous text. Comm. ACM 52, 2, 119-123.
    • (2009) Comm. ACM , vol.52 , Issue.2 , pp. 119-123
    • Argamon, S.1    Koppel, M.2    Pennebaker, J.W.3    Schler, J.4
  • 8
    • 84937183956 scopus 로고    scopus 로고
    • The application of principal component analysis to stylometry
    • BINONGA, J. N. G. AND SMITH, M. W. 1999. The application of principal component analysis to stylometry. Liter. Linguist. Comput. 14, 4, 445-465.
    • (1999) Liter. Linguist. Comput. , vol.14 , Issue.4 , pp. 445-465
    • Binonga, J.N.G.1    Smith, M.W.2
  • 10
    • 84867919822 scopus 로고
    • Transformation-Based error driven learning and natural language processing: A case study in part-of-speech tagging
    • BRILL, E. 1995. Transformation-Based error driven learning and natural language processing: A case study in part-of-speech tagging. Comput. Linguist. 21, 4, 543-565.
    • (1995) Comput. Linguist. , vol.21 , Issue.4 , pp. 543-565
    • Brill, E.1
  • 11
    • 0040233170 scopus 로고
    • Not unless you ask nicely: The interpretative nexus between analysis and information
    • BURROWS, J. F. 1992. Not unless you ask nicely: The interpretative nexus between analysis and information. Liter. Linguist. Comput. 7, 1, 91-109.
    • (1992) Liter. Linguist. Comput. , vol.7 , Issue.1 , pp. 91-109
    • Burrows, J.F.1
  • 12
    • 85006107664 scopus 로고    scopus 로고
    • Delta: A measure of stylistic difference and a guide to likely authorship
    • BURROWS, J. F. 2002. Delta: A measure of stylistic difference and a guide to likely authorship. Liter. Linguist. Comput. 17, 3, 267-287.
    • (2002) Liter. Linguist. Comput. , vol.17 , Issue.3 , pp. 267-287
    • Burrows, J.F.1
  • 16
    • 84889398472 scopus 로고    scopus 로고
    • John Wiley and Sons, Chichester
    • CRAWLEY, M. J. 2007. The R Book. John Wiley and Sons, Chichester.
    • (2007) The R Book
    • Crawley, M.J.1
  • 18
    • 0346816045 scopus 로고
    • Goldsmith's periodical essays: A statistical analysis
    • DIXON, P. AND MANNION, D. 1993. Goldsmith's periodical essays: A statistical analysis. Liter. Linguist. Comput. 8, 1, 1-19.
    • (1993) Liter. Linguist. Comput. , vol.8 , Issue.1 , pp. 1-19
    • Dixon, P.1    Mannion, D.2
  • 19
    • 72849149278 scopus 로고    scopus 로고
    • When stopword lists make the difference
    • DOLAMIC, L. AND SAVOY, J. 2010. When stopword lists make the difference. J. Amer. Soc. Inf. Sci. Technol. 61, 1, 200-203.
    • (2010) J. Amer. Soc. Inf. Sci. Technol. , vol.61 , Issue.1 , pp. 200-203
    • Dolamic, L.1    Savoy, J.2
  • 21
    • 0017139943 scopus 로고
    • Estimating the number of unseen species: How many words did Shakespeare know?
    • EFRON, B. AND THISTED, R. 1976. Estimating the number of unseen species: How many words did Shakespeare know? Biomerika 63, 3, 435-447.
    • (1976) Biomerika , vol.63 , Issue.3 , pp. 435-447
    • Efron, B.1    Thisted, R.2
  • 22
    • 67650860199 scopus 로고    scopus 로고
    • Algorithmic stemmers or morphological analysis: An evaluation
    • FAUTSCH, C. AND SAVOY, J. 2009. Algorithmic stemmers or morphological analysis: An evaluation. J. A m e r. Soc. Inf. Sci. Technol. 60, 8, 1616-1624.
    • (2009) J. A M e R. Soc. Inf. Sci. Technol. , vol.60 , Issue.8 , pp. 1616-1624
    • Fautsch, C.1    Savoy, J.2
  • 23
    • 33748471520 scopus 로고    scopus 로고
    • Learning to classify documents according to genre
    • FINN, A. AND KUSHMERICK, N. 2005. Learning to classify documents according to genre. J. Amer. Soc. Inf. Sci. Technol. 57, 11, 1506-1518.
    • (2005) J. Amer. Soc. Inf. Sci. Technol. , vol.57 , Issue.11 , pp. 1506-1518
    • Finn, A.1    Kushmerick, N.2
  • 24
    • 84976804655 scopus 로고
    • A stop list for general text
    • FOX, C. 1990. A stop list for general text. ACM SIGIR Forum 24, 19-35.
    • (1990) ACM SIGIR Forum , vol.24 , pp. 19-35
    • Fox, C.1
  • 26
    • 0142045521 scopus 로고
    • What is wrong with adding one?
    • N. Oostdijk and P. de Hann Eds., Harcourt Brace
    • GALE, W. A. AND CHURCH, K. W. 1994. What is wrong with adding one? In Corpus-Based Research into Language, N. Oostdijk and P. de Hann Eds., Harcourt Brace.
    • (1994) Corpus-Based Research into Language
    • Gale, W.A.1    Church, K.W.2
  • 29
    • 34548288405 scopus 로고    scopus 로고
    • Quantitative authorship attribution: An evaluation of techniques
    • GRIEVE, J. 2007. Quantitative authorship attribution: An evaluation of techniques. Liter. Linguist. Comput. 22, 3, 251-270.
    • (2007) Liter. Linguist. Comput. , vol.22 , Issue.3 , pp. 251-270
    • Grieve, J.1
  • 30
    • 84989569822 scopus 로고
    • How effective is suffixing?
    • HARMAN, D. 1991. How effective is suffixing? J. Amer. Soc. Inf. Sci. 42, 1, 7-15.
    • (1991) J. Amer. Soc. Inf. Sci. , vol.42 , Issue.1 , pp. 7-15
    • Harman, D.1
  • 32
    • 0005542307 scopus 로고
    • A stylometric analysis of Mormon scripture and related texts
    • HOLMES, D. I. 1992. A stylometric analysis of Mormon scripture and related texts. J Roy. Statist. Soc. A155, 1, 91-120.
    • (1992) J Roy. Statist. Soc. A , vol.155 , Issue.1 , pp. 91-120
    • Holmes, D.I.1
  • 33
    • 84967627259 scopus 로고    scopus 로고
    • The evolution of stylometry in humanities scholarship
    • HOLMES, D. I. 1998. The evolution of stylometry in humanities scholarship. Liter. Linguist. Comput. 13, 3, 111-117.
    • (1998) Liter. Linguist. Comput. , vol.13 , Issue.3 , pp. 111-117
    • Holmes, D.I.1
  • 34
    • 0011358177 scopus 로고
    • The Federalist revisited: New directions in authorship attribution
    • HOLMES, D. I. AND FORSYTH, R. S. 1995. The Federalist revisited: New directions in authorship attribution. Liter. Linguist. Comput. 10, 2, 111-127.
    • (1995) Liter. Linguist. Comput. , vol.10 , Issue.2 , pp. 111-127
    • Holmes, D.I.1    Forsyth, R.S.2
  • 35
    • 77953893283 scopus 로고    scopus 로고
    • The Diary of a Public Man: A Case Study in Traditional and Non-Traditional Authorship Attribution
    • HOLMES, D. I. AND CROFTS, D. W. 2010. The Diary of a Public Man: A Case Study in Traditional and Non-Traditional Authorship Attribution. Liter. Linguist. Comput. 25, 2, 179-197.
    • (2010) Liter. Linguist. Comput. , vol.25 , Issue.2 , pp. 179-197
    • Holmes, D.I.1    Crofts, D.W.2
  • 36
    • 0027580356 scopus 로고
    • Very simple classification rules perform well on most commonly used datasets
    • HOLTE, R. C. 1993. Very simple classification rules perform well on most commonly used datasets. Mach. Learn. 11, 1, 63-90.
    • (1993) Mach. Learn. , vol.11 , Issue.1 , pp. 63-90
    • Holte, R.C.1
  • 37
    • 33746068301 scopus 로고    scopus 로고
    • Another perspective on vocabulary richness
    • HOOVER, D. L. 2003. Another perspective on vocabulary richness. Comput. Humanit. 37, 151-178.
    • (2003) Comput. Humanit. , vol.37 , pp. 151-178
    • Hoover, D.L.1
  • 39
    • 33947386292 scopus 로고    scopus 로고
    • Testing Burrows's delta
    • HOOVER, D. L. 2004b. Testing Burrows's delta. Liter. Linguist. Comput. 19, 4, 453-475.
    • (2004) Liter. Linguist. Comput. , vol.19 , Issue.4 , pp. 453-475
    • Hoover, D.L.1
  • 42
    • 71049172777 scopus 로고    scopus 로고
    • An exercise in non-ideal authorship attribution: The mysterious Maria Ward
    • HOOVER, D. L. AND HESS, S. 2009. An exercise in non-ideal authorship attribution: The mysterious Maria Ward. Liter. Linguist. Comput. 24, 4, 467-489.
    • (2009) Liter. Linguist. Comput. , vol.24 , Issue.4 , pp. 467-489
    • Hoover, D.L.1    Hess, S.2
  • 44
    • 2542498498 scopus 로고    scopus 로고
    • Learning to Classify Text Using Support Vector Machines
    • Kluwer, Boston
    • JOACHIMS, T. 2002. Learning to Classify Text Using Support Vector Machines. Methods, Theory, and Algorithms. Kluwer, Boston.
    • (2002) Methods, Theory, and Algorithms
    • Joachims, T.1
  • 45
    • 77953886062 scopus 로고    scopus 로고
    • A comparative study of machine learning methods for authorship attribution
    • JOCKERS, M. L. AND WITTEN, D. M. 2010. A comparative study of machine learning methods for authorship attribution. Liter. Linguist. Comput. 25, 2, 215-223.
    • (2010) Liter. Linguist. Comput. , vol.25 , Issue.2 , pp. 215-223
    • Jockers, M.L.1    Witten, D.M.2
  • 46
    • 62549165250 scopus 로고    scopus 로고
    • Reassessing authorship of the Book of Mormon using delta and nearest shrunken centroid classification
    • JOCKERS, M. L., WITTEN, D. M., AND CRIDDLE, C. S. 2008. Reassessing authorship of the Book of Mormon using delta and nearest shrunken centroid classification. Liter. Linguist. Comput. 23, 4, 465-491.
    • (2008) Liter. Linguist. Comput. , vol.23 , Issue.4 , pp. 465-491
    • Jockers, M.L.1    Witten, D.M.2    Criddle, C.S.3
  • 50
    • 0003340059 scopus 로고
    • The Art of Computer Programming
    • Addison-Wesley, Reading, MA
    • KNUTH, D. E. 1981. The Art of Computer Programming, Vol. 2 Seminumerical Algorithms. Addison-Wesley, Reading, MA.
    • (1981) Seminumerical Algorithms , vol.2
    • Knuth, D.E.1
  • 52
    • 84859220050 scopus 로고    scopus 로고
    • Normalisation et lemmatisation d'une question ouverte
    • LABBÉ, D. 2001. Normalisation et lemmatisation d'une question ouverte. J. Soc. Franc. Statist. 142, 4, 37-57.
    • (2001) J. Soc. Franc. Statist. , vol.142 , Issue.4 , pp. 37-57
    • Labbé, D.1
  • 53
    • 43249145437 scopus 로고    scopus 로고
    • Experiments on authorship attribution by intertextual distance in English
    • LABBÉ, D. 2007. Experiments on authorship attribution by intertextual distance in English. J. Quant. Linguist. 14, 1, 33-80.
    • (2007) J. Quant. Linguist. , vol.14 , Issue.1 , pp. 33-80
    • Labbé, D.1
  • 54
    • 24944449527 scopus 로고
    • Shakespeare, Fletcher, and the Two Noble Kinsmen
    • LEDGER, G. AND MERRIAM, R. 1994. Shakespeare, Fletcher, and The Two Noble Kinsmen. Liter. Linguist. Comput. 9, 3, 235-248.
    • (1994) Liter. Linguist. Comput. , vol.9 , Issue.3 , pp. 235-248
    • Ledger, G.1    Merriam, R.2
  • 55
    • 0011077440 scopus 로고
    • Note on the general case of the Bayes-Laplace formula for inductive or a posteriori probabilities
    • LIDSTONE, G. J. 1920. Note on the general case of the Bayes-Laplace formula for inductive or a posteriori probabilities. Trans. Faculty Actuar. 8, 182-192.
    • (1920) Trans. Faculty Actuar. , vol.8 , pp. 182-192
    • Lidstone, G.J.1
  • 59
    • 34249852033 scopus 로고
    • Building a large annotated corpus of english: The penn treebank
    • MARCUS, M. P., SANTORINI, B., AND MARCINKIEWICZ, M. A. 1993. Building a large annotated corpus of english: The penn treebank. Comput. Linguist. 19, 2, 313-330.
    • (1993) Comput. Linguist. , vol.19 , Issue.2 , pp. 313-330
    • Marcus, M.P.1    Santorini, B.2    Marcinkiewicz, M.A.3
  • 60
    • 3843127500 scopus 로고    scopus 로고
    • Character n-gram tokenization for European language text retrieval
    • MCNAMEE, P. AND MAYFIELD, J. 2004. Character n-gram tokenization for European language text retrieval. Inform. Retrieval 7, 1-2, 73-97.
    • (2004) Inform. Retrieval , vol.7 , Issue.1-2 , pp. 73-97
    • McNamee, P.1    Mayfield, J.2
  • 61
    • 84937261711 scopus 로고    scopus 로고
    • Heterogeneous authorship in early Shakespeare and the problem of Henry v
    • MERRIAM, T. 1998. Heterogeneous authorship in early Shakespeare and the problem of Henry V. Liter. Linguist. Comput. 13, 15-28.
    • (1998) Liter. Linguist. Comput. , vol.13 , pp. 15-28
    • Merriam, T.1
  • 63
  • 65
    • 33746130618 scopus 로고
    • Once. A test of authorship based on words which are not repeated in the sample
    • MORTON, A. Q. 1986. Once. A test of authorship based on words which are not repeated in the sample. Liter. Linguist. Comput. 1, 1, 1-8.
    • (1986) Liter. Linguist. Comput. , vol.1 , Issue.1 , pp. 1-8
    • Morton, A.Q.1
  • 70
    • 0012026401 scopus 로고    scopus 로고
    • Cross-Language Information Retrieval and Evaluation
    • Springer
    • PETERS, C. 2001. Cross-Language Information Retrieval and Evaluation. Lectures Notes in Computer Science, vol. 2069, Springer.
    • (2001) Lectures Notes in Computer Science , vol.2069
    • Peters, C.1
  • 72
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • PORTER, M. F. 1980. An algorithm for suffix stripping. Program 14, 3, 130-137.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 74
    • 3843065912 scopus 로고    scopus 로고
    • Report on CLEF-2001 experiments
    • C. Peters, M. Braschler, J. Gonzalo, and M. Kluck Eds., Lectures Notes in Computer Science Springer
    • SAVOY, J. 2001. Report on CLEF-2001 experiments. In Cross-Language Information Retrieval and Evaluation, C. Peters, M. Braschler, J. Gonzalo, and M. Kluck Eds., Lectures Notes in Computer Science, vol. 2069, Springer, 27-43.
    • (2001) Cross-Language Information Retrieval and Evaluation , vol.2069 , pp. 27-43
    • Savoy, J.1
  • 75
    • 77952324552 scopus 로고    scopus 로고
    • Lexical analysis of US political speeches
    • SAVOY, J. 2010. Lexical analysis of US political speeches. J. Quant. Linguist. 17, 2, 123-141.
    • (2010) J. Quant. Linguist. , vol.17 , Issue.2 , pp. 123-141
    • Savoy, J.1
  • 76
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automatic text categorization
    • SEBASTIANI, F. 2002. Machine learning in automatic text categorization. ACM Comput. Surv. 14, 1, 1-27.
    • (2002) ACM Comput. Surv. , vol.14 , Issue.1 , pp. 1-27
    • Sebastiani, F.1
  • 77
    • 0000672396 scopus 로고
    • On a distribution law for word frequencies
    • SICHEL, H. S. 1975. On a distribution law for word frequencies. J. Amer. Statist. Assoc. 70, 351, 542-547.
    • (1975) J. Amer. Statist. Assoc. , vol.70 , Issue.351 , pp. 542-547
    • Sichel, H.S.1
  • 78
    • 62549150881 scopus 로고    scopus 로고
    • A survey of modern authorship attribution methods
    • STAMATATOS, E. 2009. A survey of modern authorship attribution methods. J. Amer. Soc. Inf. Sci. Technol. 60, 3.
    • (2009) J. Amer. Soc. Inf. Sci. Technol. , vol.60 , pp. 3
    • Stamatatos, E.1
  • 79
    • 17444445377 scopus 로고    scopus 로고
    • Automatic text categorization in terms of genre and author
    • STAMATATOS, E., FAKOTAKIS, N., AND KOKKINAKIS, G. 2001. Automatic text categorization in terms of genre and author. Comput. Linguist. 26, 4, 471-495.
    • (2001) Comput. Linguist. , vol.26 , Issue.4 , pp. 471-495
    • Stamatatos, E.1    Fakotakis, N.2    Kokkinakis, G.3
  • 81
    • 0005647839 scopus 로고
    • Did Shakespeare write a newly-discovered poem?
    • THISTED, R. AND EFRON, B. 1987. Did Shakespeare write a newly-discovered poem? Biomerika 74, 3, 445-455.
    • (1987) Biomerika , vol.74 , Issue.3 , pp. 445-455
    • Thisted, R.1    Efron, B.2
  • 82
    • 34248708122 scopus 로고    scopus 로고
    • The development of statistical stylistics a survey
    • TULDAVA, J. 2004. The development of statistical stylistics a survey. J. Quant. Linguist. 11, 1-2, 141-151.
    • (2004) J. Quant. Linguist. , vol.11 , Issue.1-2 , pp. 141-151
    • Tuldava, J.1
  • 87
    • 0142031148 scopus 로고    scopus 로고
    • Information categorization approach to literary authorship disputes
    • YANG, A. C.-C., PENG, C.-K., YIEN, H.-W. AND GOLDBERGER, A. L. 2003. Information categorization approach to literary authorship disputes. Physica A, 329, 473-483.
    • (2003) Physica A , vol.329 , pp. 473-483
    • Yang, A.C.-C.1    Peng, C.-K.2    Yien, H.-W.3    Goldberger, A.L.4
  • 88
    • 3042824043 scopus 로고    scopus 로고
    • A study of smoothing methods for language models applied to information retrieval
    • ZHAI, C. X. AND LAFFERTY, J. 2004. A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst. 22, 2, 179-214.
    • (2004) ACM Trans. Inf. Syst. , vol.22 , Issue.2 , pp. 179-214
    • Zhai, C.X.1    Lafferty, J.2
  • 93
    • 33644552803 scopus 로고    scopus 로고
    • A framework for authorship identification of online messages: Writing-Style features and classification techniques
    • ZHENG, R., LI, J., CHEN, H., AND HUANG, Z. 2006. A framework for authorship identification of online messages: Writing-Style features and classification techniques. J. Amer. Soc. Inf. Sci. Technol. 57, 3, 378-393.
    • (2006) J. Amer. Soc. Inf. Sci. Technol. , vol.57 , Issue.3 , pp. 378-393
    • Zheng, R.1    Li, J.2    Chen, H.3    Huang, Z.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.