메뉴 건너뛰기




Volumn 60, Issue 3, 2009, Pages 538-556

A survey of modern authorship attribution methods

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL LINGUISTICS; INDUSTRIAL RESEARCH; INFORMATION SERVICES; INTERNET; LEARNING SYSTEMS; NATURAL LANGUAGE PROCESSING SYSTEMS; SURVEYS;

EID: 62549150881     PISSN: 15322882     EISSN: 15322890     Source Type: Journal    
DOI: 10.1002/asi.21001     Document Type: Article
Times cited : (1264)

References (102)
  • 1
    • 27344450109 scopus 로고    scopus 로고
    • Applying authorship analysis to extremist-group web forum messages
    • Abbasi, A., & Chen, H. (2005). Applying authorship analysis to extremist-group web forum messages. IEEE Intelligent Systems, 20(5), 67-75.
    • (2005) IEEE Intelligent Systems , vol.20 , Issue.5 , pp. 67-75
    • Abbasi, A.1    Chen, H.2
  • 2
    • 44449177126 scopus 로고    scopus 로고
    • Interpreting Burrows' Delta: Geometric and probabilistic foundations
    • Argamon, S. (2008). Interpreting Burrows' Delta: Geometric and probabilistic foundations. Literary and Linguistic Computing, 23(2), 131-147.
    • (2008) Literary and Linguistic Computing , vol.23 , Issue.2 , pp. 131-147
    • Argamon, S.1
  • 4
    • 18744405825 scopus 로고    scopus 로고
    • Style mining of electronic messages for multiple authorship discrimination: First results
    • New York: ACM Press
    • Argamon, S., Saric, M., & Stein, S. (2003). Style mining of electronic messages for multiple authorship discrimination: First results. In Proceedings of the 9th ACM S1GKDD (pp. 475-480). New York: ACM Press.
    • (2003) Proceedings of the 9th ACM S1GKDD , pp. 475-480
    • Argamon, S.1    Saric, M.2    Stein, S.3
  • 8
    • 85040385892 scopus 로고    scopus 로고
    • Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution
    • Baayen, R., van Halteren, H., & Tweedie, F. (1996). Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution. Literary and Linguistic Computing, 11(3), 121-131.
    • (1996) Literary and Linguistic Computing , vol.11 , Issue.3 , pp. 121-131
    • Baayen, R.1    van Halteren, H.2    Tweedie, F.3
  • 10
    • 10344248121 scopus 로고    scopus 로고
    • Who wrote the 15th book of Oz? An application of multivariate analysis to authorship attribution
    • Binongo, J. (2003). Who wrote the 15th book of Oz? An application of multivariate analysis to authorship attribution. Chance, 16(2), 9-17.
    • (2003) Chance , vol.16 , Issue.2 , pp. 9-17
    • Binongo, J.1
  • 12
    • 84960603650 scopus 로고
    • Word patterns and story shapes: The statistical analysis of narrative style
    • Burrows, J.F. (1987). Word patterns and story shapes: The statistical analysis of narrative style. Literary and Linguistic Computing, 2, 61-70.
    • (1987) Literary and Linguistic Computing , vol.2 , pp. 61-70
    • Burrows, J.F.1
  • 13
    • 0040233170 scopus 로고
    • Not unless you ask nicely: The interpretative nexus between analysis and information
    • Burrows, J.F. (1992). Not unless you ask nicely: The interpretative nexus between analysis and information. Literary and Linguistic Computing, 7(2), 91-109.
    • (1992) Literary and Linguistic Computing , vol.7 , Issue.2 , pp. 91-109
    • Burrows, J.F.1
  • 14
    • 85006107664 scopus 로고    scopus 로고
    • Delta: A measure of stylistic difference and a guide to likely authorship
    • Burrows, J.F. (2002). "Delta:" A measure of stylistic difference and a guide to likely authorship. Literary and Linguistic Computing, 17(3), 267-287.
    • (2002) Literary and Linguistic Computing , vol.17 , Issue.3 , pp. 267-287
    • Burrows, J.F.1
  • 16
    • 56249097939 scopus 로고    scopus 로고
    • Empirical evaluations of language-based author identification techniques
    • Chaski, C.E. (2001). Empirical evaluations of language-based author identification techniques. Forensic Linguistics, 8(1), 1-65.
    • (2001) Forensic Linguistics , vol.8 , Issue.1 , pp. 1-65
    • Chaski, C.E.1
  • 17
    • 34047230751 scopus 로고    scopus 로고
    • Who's at the key board? Authorship attribution in digital evidence investigations
    • Chaski, C.E. (2005). Who's at the key board? Authorship attribution in digital evidence investigations. International Journal of Digital Evidence, 4(1).
    • (2005) International Journal of Digital Evidence , vol.4 , Issue.1
    • Chaski, C.E.1
  • 19
    • 33646470157 scopus 로고    scopus 로고
    • Ngram and Bayesian classification of documents for topic and authorship
    • Clement, R., & Sharp, D. (2003). Ngram and Bayesian classification of documents for topic and authorship. Literary and Linguistic Computing, 18(4), 423-447.
    • (2003) Literary and Linguistic Computing , vol.18 , Issue.4 , pp. 423-447
    • Clement, R.1    Sharp, D.2
  • 20
    • 28344440777 scopus 로고    scopus 로고
    • Detecting collaborations in text: Comparing the authors' rhetorical language choices in the Federalist Papers
    • Collins, J., Kaufer, D., Vlachos, P., Butler, B., & Ishizaki, S. (2004). Detecting collaborations in text: Comparing the authors' rhetorical language choices in the Federalist Papers. Computers and the Humanities, 38, 15-36.
    • (2004) Computers and the Humanities , vol.38 , pp. 15-36
    • Collins, J.1    Kaufer, D.2    Vlachos, P.3    Butler, B.4    Ishizaki, S.5
  • 23
    • 0042367634 scopus 로고    scopus 로고
    • Mining e-mail content for author identification forensics
    • de Vel, O., Anderson, A., Corney, M., & Mohay, G. (2001). Mining e-mail content for author identification forensics. SIGMOD Record, 30(4), 55-64.
    • (2001) SIGMOD Record , vol.30 , Issue.4 , pp. 55-64
    • de Vel, O.1    Anderson, A.2    Corney, M.3    Mohay, G.4
  • 26
    • 2942731012 scopus 로고    scopus 로고
    • An extensive empirical study of feature selection metrics for text classification
    • Forman, G. (2003). An extensive empirical study of feature selection metrics for text classification. Journal of Machine Learning Research. 3, 1289-1305.
    • (2003) Journal of Machine Learning Research , vol.3 , pp. 1289-1305
    • Forman, G.1
  • 27
  • 29
    • 85119093321 scopus 로고    scopus 로고
    • Linguistic correlates of style: Authorship classification with deep linguistic analysis features
    • Morristown. NJ: Association for Computational Linguistics
    • Gamon, M. (2004). Linguistic correlates of style: Authorship classification with deep linguistic analysis features. In Proceedings of the 20th International Conference on Computational Linguistics (pp. 611-617). Morristown. NJ: Association for Computational Linguistics.
    • (2004) Proceedings of the 20th International Conference on Computational Linguistics , pp. 611-617
    • Gamon, M.1
  • 30
    • 62549083187 scopus 로고    scopus 로고
    • comment on language trees and zipping. Retrieved December 2, 2008, from
    • Goodman, J. (2002). Extended comment on language trees and zipping. Retrieved December 2, 2008, from http://arxiv.org/abs/cond-mat/0202383
    • (2002) Extended
    • Goodman, J.1
  • 33
    • 34548288405 scopus 로고    scopus 로고
    • Quantitative authorship attribution: An evaluation of techniques
    • Grieve, J. (2007). Quantitative authorship attribution: An evaluation of techniques. Literary and Linguistic Computing, 22(3), 251-270.
    • (2007) Literary and Linguistic Computing , vol.22 , Issue.3 , pp. 251-270
    • Grieve, J.1
  • 35
    • 35648941630 scopus 로고    scopus 로고
    • Bigrams of syntactic labels for authorship discrimination of short texts
    • Hirst, G. & Feiguina, O. (2007). Bigrams of syntactic labels for authorship discrimination of short texts. Literary and Linguistic Computing, 22(4), 405-417.
    • (2007) Literary and Linguistic Computing , vol.22 , Issue.4 , pp. 405-417
    • Hirst, G.1    Feiguina, O.2
  • 37
    • 84967627259 scopus 로고    scopus 로고
    • The evolution of stylometry in humanities scholarship
    • Holmes, D.l. (1998). The evolution of stylometry in humanities scholarship. Literary and Linguistic Computing, 13(3), 111-117.
    • (1998) Literary and Linguistic Computing , vol.13 , Issue.3 , pp. 111-117
    • Holmes, D.L.1
  • 38
    • 0011358177 scopus 로고
    • The Federalist revisited: New directions in authorship attribution
    • Holmes, D.I, & Forsyth, R. (1995). The Federalist revisited: New directions in authorship attribution. Literary and Linguistic Computing, 10(2), 111-127.
    • (1995) Literary and Linguistic Computing , vol.10 , Issue.2 , pp. 111-127
    • Holmes, D.I.1    Forsyth, R.2
  • 44
    • 84957069814 scopus 로고    scopus 로고
    • Text categorization with support vector machines: Learning with many relevant features
    • Berlin, Germany: Springer
    • Joachims, T. (1998). Text categorization with support vector machines: Learning with many relevant features. In Proceedings of the 10th European Conference on Machine Learning (pp. 137-142). Berlin, Germany: Springer.
    • (1998) Proceedings of the 10th European Conference on Machine Learning , pp. 137-142
    • Joachims, T.1
  • 46
    • 34047224691 scopus 로고    scopus 로고
    • Authorship attribution for electronic documents
    • M. Olivier & S. Shenoi Eds, Boston: Springer
    • Juola, P. (2006). Authorship attribution for electronic documents. In M. Olivier & S. Shenoi (Eds.), Advances in digital forensics II (pp. 119-130). Boston: Springer.
    • (2006) Advances in digital forensics II , pp. 119-130
    • Juola, P.1
  • 47
    • 36448935354 scopus 로고    scopus 로고
    • Future trends in authorship attribution
    • P. Craiger & S. Shenoi Eds, Boston: Springer
    • Juola, P. (2007). Future trends in authorship attribution. In P. Craiger & S. Shenoi (Eds.), Advances in digital forensics III (pp. 119-132). Boston: Springer.
    • (2007) Advances in digital forensics III , pp. 119-132
    • Juola, P.1
  • 48
    • 31044454127 scopus 로고    scopus 로고
    • A controlled-corpus experiment in audiorship attribution by cross-entropy
    • Juola, P., & Baayen, R. (2005). A controlled-corpus experiment in audiorship attribution by cross-entropy. Literary and Linguistic Computing, 20, 59-67.
    • (2005) Literary and Linguistic Computing , vol.20 , pp. 59-67
    • Juola, P.1    Baayen, R.2
  • 50
    • 33745868242 scopus 로고    scopus 로고
    • N-gram-based author profiles for authorship attribution
    • Association for Computational Linguistics pp
    • Keselj, V., Peng. F., Cercone, N., & Thomas, C. (2003). N-gram-based author profiles for authorship attribution. In Proceedings of the Pacific Association for Computational Linguistics (pp. 255-264).
    • (2003) Proceedings of the Pacific , pp. 255-264
    • Keselj, V.1    Peng, F.2    Cercone, N.3    Thomas, C.4
  • 51
    • 1542377547 scopus 로고    scopus 로고
    • A repetition based measure for verification of text collections and for text categorization
    • New York: ACM Press
    • Khmelev, D.V., & Teahan, W.J. (2003a). A repetition based measure for verification of text collections and for text categorization. In Proceedings of the 26th ACM SIGIR (pp. 104-110). New York: ACM Press.
    • (2003) Proceedings of the 26th ACM SIGIR , pp. 104-110
    • Khmelev, D.V.1    Teahan, W.J.2
  • 54
    • 12244278769 scopus 로고
    • Discrimination of authorship using visualization
    • Kjell, B. (1994). Discrimination of authorship using visualization. Information Processing and Management, 30(1), 141-150.
    • (1994) Information Processing and Management , vol.30 , Issue.1 , pp. 141-150
    • Kjell, B.1
  • 55
    • 0031381525 scopus 로고    scopus 로고
    • Wrappers for feature subset selection
    • Kohavi, R., & John, G. (1997). Wrappers for feature subset selection. Artificial Intelligence, 97(1-2), 273-324.
    • (1997) Artificial Intelligence , vol.97 , Issue.1-2 , pp. 273-324
    • Kohavi, R.1    John, G.2
  • 57
    • 84985033441 scopus 로고    scopus 로고
    • Automatically categorizing written texts by author gender
    • Koppel, M., Argamon, S., & Shimoni, A.R. (2002). Automatically categorizing written texts by author gender. Literary and Linguistic Computing, 17(4), 401-412.
    • (2002) Literary and Linguistic Computing , vol.17 , Issue.4 , pp. 401-412
    • Koppel, M.1    Argamon, S.2    Shimoni, A.R.3
  • 60
    • 33750364891 scopus 로고    scopus 로고
    • Authorship attribution with thousands of candidate authors
    • New York: ACM Press
    • Koppel, M., Schler, J., Argamon, S., & Messeri, E. (2006). Authorship attribution with thousands of candidate authors. In Proceedings of the 29th ACM SIGIR (pp. 659-660). New York: ACM Press.
    • (2006) Proceedings of the 29th ACM SIGIR , pp. 659-660
    • Koppel, M.1    Schler, J.2    Argamon, S.3    Messeri, E.4
  • 63
  • 67
    • 84974721781 scopus 로고    scopus 로고
    • Extraction of authors' characteristics from Japanese modern sentences via n-gram distribution
    • Berlin, Germany: Springer
    • Matsuura, T, & Kanada, Y. (2000). Extraction of authors' characteristics from Japanese modern sentences via n-gram distribution. In Proceedings of the 3rd International Conference on Discovery Science (pp. 315-319). Berlin, Germany: Springer.
    • (2000) Proceedings of the 3rd International Conference on Discovery Science , pp. 315-319
    • Matsuura, T.1    Kanada, Y.2
  • 68
    • 14344252197 scopus 로고
    • Neural computation in stylometry: An application to the works of Shakespeare and Fletcher
    • Matthews, R., & Merriam, T. (1993). Neural computation in stylometry: An application to the works of Shakespeare and Fletcher. Literary and Linguistic Computing, 8(4), 203-209.
    • (1993) Literary and Linguistic Computing , vol.8 , Issue.4 , pp. 203-209
    • Matthews, R.1    Merriam, T.2
  • 70
    • 0038468599 scopus 로고
    • The characteristic curves of composition
    • Mendenhall, T.C. (1887). The characteristic curves of composition. Science, IX, 237-249.
    • (1887) Science , vol.9 , pp. 237-249
    • Mendenhall, T.C.1
  • 71
    • 33644514895 scopus 로고
    • Neural computation in stylometry II: An application to the works of Shakespeare and Marlowe
    • Merriam, T, & Matthews, R. (1994). Neural computation in stylometry II: An application to the works of Shakespeare and Marlowe. Literary and Linguistic Computing, 9(1), 1-6.
    • (1994) Literary and Linguistic Computing , vol.9 , Issue.1 , pp. 1-6
    • Merriam, T.1    Matthews, R.2
  • 72
    • 84879567529 scopus 로고    scopus 로고
    • Plagiarism detection without reference collections
    • Berlin, Germany: Springer
    • Meyer zu Eissen, S., Stein, B., & Kulig, M. (2007). Plagiarism detection without reference collections. Advances in data analysis (pp. 359-366). Berlin, Germany: Springer.
    • (2007) Advances in data analysis , pp. 359-366
    • Meyer zu Eissen, S.1    Stein, B.2    Kulig, M.3
  • 75
    • 8244245494 scopus 로고
    • The qsum plot
    • Technical Report CSR-3-90, University of Edinburgh, UK
    • Morton, A.Q., & Michaelson, S. (1990). The qsum plot. Technical Report CSR-3-90, University of Edinburgh, UK.
    • (1990)
    • Morton, A.Q.1    Michaelson, S.2
  • 78
    • 3843083955 scopus 로고    scopus 로고
    • Augmenting naive Bayes classifiers with statistical language models
    • Peng, F, Shuurmans, D., & Wang, S. (2004). Augmenting naive Bayes classifiers with statistical language models. Information Retrieval Journal, 7(1), 317-345.
    • (2004) Information Retrieval Journal , vol.7 , Issue.1 , pp. 317-345
    • Peng, F.1    Shuurmans, D.2    Wang, S.3
  • 79
    • 0001318320 scopus 로고    scopus 로고
    • The state of authorship attribution studies: Some problems and solutions
    • Rudman, J. (1998). The state of authorship attribution studies: Some problems and solutions. Computers and the Humanities, 31, 351-365.
    • (1998) Computers and the Humanities , vol.31 , pp. 351-365
    • Rudman, J.1
  • 80
    • 58449114905 scopus 로고    scopus 로고
    • Short text authorship attribution via sequence kernels, Markov chains and author unmasking: An investigation
    • Morristown, NJ: Association for Computational Linguistics
    • Sanderson, C, & Guenter, S. (2006). Short text authorship attribution via sequence kernels, Markov chains and author unmasking: An investigation. In Proceedings of the International Conference on Empirical Methods in Natural Language Engineering (pp. 482-491). Morristown, NJ: Association for Computational Linguistics.
    • (2006) Proceedings of the International Conference on Empirical Methods in Natural Language Engineering , pp. 482-491
    • Sanderson, C.1    Guenter, S.2
  • 81
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • Sebastiani, F. (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34(1), 1-47.
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 82
    • 33750149835 scopus 로고    scopus 로고
    • Authorship attribution based on feature set sub-spacing ensembles
    • Stamatatos, E. (2006a). Authorship attribution based on feature set sub-spacing ensembles. International Journal on Artificial Intelligence Tools, 15(5), 823-838.
    • (2006) International Journal on Artificial Intelligence Tools , vol.15 , Issue.5 , pp. 823-838
    • Stamatatos, E.1
  • 85
    • 39649105441 scopus 로고    scopus 로고
    • Author identification: Using text sampling to handle the class imbalance problem
    • Stamatatos, E. (2008). Author identification: Using text sampling to handle the class imbalance problem. Information Processing and Management, 44(2), 790-799.
    • (2008) Information Processing and Management , vol.44 , Issue.2 , pp. 790-799
    • Stamatatos, E.1
  • 86
    • 17444445377 scopus 로고    scopus 로고
    • Automatic text categorization in terms of genre and author
    • Stamatatos, E., Fakotakis, N., & Kokkinakis, G. (2000). Automatic text categorization in terms of genre and author. Computational Linguistics, 26(4), 471-495.
    • (2000) Computational Linguistics , vol.26 , Issue.4 , pp. 471-495
    • Stamatatos, E.1    Fakotakis, N.2    Kokkinakis, G.3
  • 87
    • 3843140883 scopus 로고    scopus 로고
    • Computer-based authorship attribution without lexical measures
    • Stamatatos, E., Fakotakis, N., & Kokkinakis, G. (2001). Computer-based authorship attribution without lexical measures. Computers and the Humanities, 35(2), 193-214.
    • (2001) Computers and the Humanities , vol.35 , Issue.2 , pp. 193-214
    • Stamatatos, E.1    Fakotakis, N.2    Kokkinakis, G.3
  • 89
    • 33646013753 scopus 로고    scopus 로고
    • Discriminating the registers and styles in the Modern Greek language-Part 2: Extending the feature vector to optimize author discrimination
    • Tambouratzis, G., Markantonatou. S., Hairetakis, N., Vassiliou, M., Carayannis, G., & Tambouratzis, D. (2004). Discriminating the registers and styles in the Modern Greek language-Part 2: Extending the feature vector to optimize author discrimination. Literary and Linguistic Computing, 19(2), 221-242.
    • (2004) Literary and Linguistic Computing , vol.19 , Issue.2 , pp. 221-242
    • Tambouratzis, G.1    Markantonatou, S.2    Hairetakis, N.3    Vassiliou, M.4    Carayannis, G.5    Tambouratzis, D.6
  • 90
    • 8644241028 scopus 로고    scopus 로고
    • W.B. Croft & J. Lafferty (Eds, Language modeling and information retrieval pp, Berlin, Germany: Springer
    • Teahan, W., & Harper, D. (2003). Using compression-based language models for text categorization. In W.B. Croft & J. Lafferty (Eds.), Language modeling and information retrieval (pp. 141-165). Berlin, Germany: Springer.
    • (2003) Using compression-based language models for text categorization , pp. 141-165
    • Teahan, W.1    Harper, D.2
  • 92
    • 54749139664 scopus 로고    scopus 로고
    • How variable may a constant be? Measures of lexical richness in perspective
    • Tweedie, F., & Baayen, R. (1998). How variable may a constant be? Measures of lexical richness in perspective. Computers and the Humanities, 32(5), 323-352.
    • (1998) Computers and the Humanities , vol.32 , Issue.5 , pp. 323-352
    • Tweedie, F.1    Baayen, R.2
  • 93
    • 53349096656 scopus 로고    scopus 로고
    • Neural network applications in stylometry: The Federalist Papers
    • Tweedie, F., Singh, S., & Holmes, D. (1996). Neural network applications in stylometry: The Federalist Papers. Computers and the Humanities, 30(1), 1-10.
    • (1996) Computers and the Humanities , vol.30 , Issue.1 , pp. 1-10
    • Tweedie, F.1    Singh, S.2    Holmes, D.3
  • 95
    • 33846949415 scopus 로고    scopus 로고
    • Author verification by linguistic profiling: An exploration of the parameter space
    • van Halteren, H. (2007). Author verification by linguistic profiling: An exploration of the parameter space. ACM Transactions on Speech and Language Processing, 4(1), 1-17.
    • (2007) ACM Transactions on Speech and Language Processing , vol.4 , Issue.1 , pp. 1-17
    • van Halteren, H.1
  • 96
    • 0038468602 scopus 로고
    • On sentence-length as a statistical characteristic of style in prose, with application to two cases of disputed audiorship
    • Yule, G.U. (1938). On sentence-length as a statistical characteristic of style in prose, with application to two cases of disputed audiorship. Biometrika, 30, 363-390.
    • (1938) Biometrika , vol.30 , pp. 363-390
    • Yule, G.U.1
  • 101
    • 33644552803 scopus 로고    scopus 로고
    • A framework for authorship identification of online messages: Writing style features and classification techniques
    • Zheng, R., Li, J., Chen, H., & Huang, Z. (2006). A framework for authorship identification of online messages: Writing style features and classification techniques. Journal of the American Society of Information Science and Technology, 57(3), 378-393.
    • (2006) Journal of the American Society of Information Science and Technology , vol.57 , Issue.3 , pp. 378-393
    • Zheng, R.1    Li, J.2    Chen, H.3    Huang, Z.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.