메뉴 건너뛰기




Volumn 57, Issue 3, 2006, Pages 378-393

A framework for authorship identification of online messages: Writing-style features and classification techniques

Author keywords

[No Author keywords available]

Indexed keywords

DECISION TREES; ONLINE MESSAGES; SUPPORT VECTOR MACHINES;

EID: 33644552803     PISSN: 15322882     EISSN: 15322890     Source Type: Journal    
DOI: 10.1002/asi.20316     Document Type: Article
Times cited : (560)

References (59)
  • 3
    • 0037791890 scopus 로고    scopus 로고
    • Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution
    • Baayen, R.H., Van Halteren, H., & Tweedie, F.J. (1996). Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution. Literary and Linguistic Computing, 2, 110-120.
    • (1996) Literary and Linguistic Computing , vol.2 , pp. 110-120
    • Baayen, R.H.1    Van Halteren, H.2    Tweedie, F.J.3
  • 5
    • 84937183956 scopus 로고    scopus 로고
    • The application of principal component analysis to stylometry
    • Binongo, J.N.G., & Smith, M.W.A. (1999). The application of principal component analysis to stylometry. Literary and Linguistic Computing, 14(4), 445-466.
    • (1999) Literary and Linguistic Computing , vol.14 , Issue.4 , pp. 445-466
    • Binongo, J.N.G.1    Smith, M.W.A.2
  • 6
    • 84960603650 scopus 로고
    • Word patterns and story shapes: The statistical analysis of narrative style
    • Burrows, J.F. (1987). Word patterns and story shapes: The statistical analysis of narrative style. Literary and Linguistic Computing, 2, 61-67.
    • (1987) Literary and Linguistic Computing , vol.2 , pp. 61-67
    • Burrows, J.F.1
  • 7
    • 33644543698 scopus 로고
    • "An ocean where each kind.": Statistical analysis and some major determinants of literary style
    • Burrows, J.F. (1989). "An ocean where each kind.": Statistical analysis and some major determinants of literary style. Computers and the Humanities, 23, 309-321.
    • (1989) Computers and the Humanities , vol.23 , pp. 309-321
    • Burrows, J.F.1
  • 8
    • 0032098609 scopus 로고    scopus 로고
    • A machine learning approach to inductive query by examples: An experiment using relevance feedback, ID3, genetic algorithms, and simulated annealing
    • Chen, H., Shankaranarayanan, G., Iyer, A., & She, L. (1998). A machine learning approach to inductive query by examples: An experiment using relevance feedback, ID3, genetic algorithms, and simulated annealing. Journal of the American Society for Information Science, 49(8), 693-705.
    • (1998) Journal of the American Society for Information Science , vol.49 , Issue.8 , pp. 693-705
    • Chen, H.1    Shankaranarayanan, G.2    Iyer, A.3    She, L.4
  • 11
    • 84937187144 scopus 로고    scopus 로고
    • Authorial attribution and computational stylistics: If you can tell authors apart, have you learned anything about them?
    • Craig, H. (1999). Authorial attribution and computational stylistics: If you can tell authors apart, have you learned anything about them? Literary and Linguistic Computing, 14(1), 103-113.
    • (1999) Literary and Linguistic Computing , vol.14 , Issue.1 , pp. 103-113
    • Craig, H.1
  • 14
    • 0042367634 scopus 로고    scopus 로고
    • Mining e-mail content for author identification forensics
    • de Vel, O., Anderson, A., Corney, M., & Mohay, G. (2001). Mining e-mail content for author identification forensics. SIGMOD Record, 30(4), 55-64.
    • (2001) SIGMOD Record , vol.30 , Issue.4 , pp. 55-64
    • De Vel, O.1    Anderson, A.2    Corney, M.3    Mohay, G.4
  • 19
    • 85008035943 scopus 로고    scopus 로고
    • Neural networks and hybrid intelligent models - Foundations, theory, and applications
    • Giles, C.L., Sun, R., & Zurada, J.M. (1998). Neural networks and hybrid intelligent models - Foundations, theory, and applications. IEEE Transactions on Neural Networks, 9(5), 721-723.
    • (1998) IEEE Transactions on Neural Networks , vol.9 , Issue.5 , pp. 721-723
    • Giles, C.L.1    Sun, R.2    Zurada, J.M.3
  • 21
    • 0005542307 scopus 로고
    • A stylometric analysis of Mormon scripture and related texts
    • Holmes, D.I. (1992). A stylometric analysis of Mormon scripture and related texts. Journal of Royal Statistical Society, 155, 91-120.
    • (1992) Journal of Royal Statistical Society , vol.155 , pp. 91-120
    • Holmes, D.I.1
  • 22
    • 84967627259 scopus 로고    scopus 로고
    • The evolution of stylometry in humanities
    • Holmes, D.I. (1998). The evolution of stylometry in humanities. Literary and Linguistic Computing, 13(3), 111-117.
    • (1998) Literary and Linguistic Computing , vol.13 , Issue.3 , pp. 111-117
    • Holmes, D.I.1
  • 23
    • 0011358177 scopus 로고
    • The Federalist revisited: New directions in authorship attribution
    • Holmes, D.I., & Forsyth, R.S. (1995). The Federalist revisited: New directions in authorship attribution. Literary and Linguistic Computing, 10, 111-127.
    • (1995) Literary and Linguistic Computing , vol.10 , pp. 111-127
    • Holmes, D.I.1    Forsyth, R.S.2
  • 25
    • 0036505670 scopus 로고    scopus 로고
    • A comparison on methods for multi-class support vector machines
    • Hsu, C.W., & Lin, C.J. (2002). A comparison on methods for multi-class support vector machines. IEEE Transactions on Neural Networks, 13, 415-425.
    • (2002) IEEE Transactions on Neural Networks , vol.13 , pp. 415-425
    • Hsu, C.W.1    Lin, C.J.2
  • 26
    • 84957069814 scopus 로고    scopus 로고
    • Text categorization with Support Vector Machines: Learning with many relevant features
    • Springer-Verlag
    • Joachims, T. (1998). Text categorization with Support Vector Machines: Learning with many relevant features. In Proceedings of the 10th European Conference on Machine learning (ECML) (pp. 137-142). Springer-Verlag.
    • (1998) Proceedings of the 10th European Conference on Machine Learning (ECML) , pp. 137-142
    • Joachims, T.1
  • 29
    • 33646138699 scopus 로고    scopus 로고
    • Using Markov chains for identification of writers
    • Khmelev, D.V., & Tweedie, F.J. (2001). Using Markov chains for identification of writers. Literary and Linguistic Computing, 16(4), 299-307.
    • (2001) Literary and Linguistic Computing , vol.16 , Issue.4 , pp. 299-307
    • Khmelev, D.V.1    Tweedie, F.J.2
  • 31
    • 33644557982 scopus 로고
    • Authorship determination using letter-pair frequency features with Neural Network classifiers
    • Kjell, B. (1994). Authorship determination using letter-pair frequency features with Neural Network classifiers. Literary and Linguistic Computing, 9, 119-124.
    • (1994) Literary and Linguistic Computing , vol.9 , pp. 119-124
    • Kjell, B.1
  • 32
    • 84985033441 scopus 로고    scopus 로고
    • Automatically categorizing written texts by author gender
    • Koppel, M., Argamon, S., & Shimoni, A.R. (2002). Automatically categorizing written texts by author gender. Literary and Linguistic Computing, 17(4), 401-412.
    • (2002) Literary and Linguistic Computing , vol.17 , Issue.4 , pp. 401-412
    • Koppel, M.1    Argamon, S.2    Shimoni, A.R.3
  • 35
    • 0037791903 scopus 로고
    • Shakespeare vs. Fletcher: A stylometric analysis by radial basis functions
    • Lowe, D., & Matthews, R. (1995). Shakespeare vs. Fletcher: A stylometric analysis by radial basis functions. Computers and the Humanities, 29, 449-461.
    • (1995) Computers and the Humanities , vol.29 , pp. 449-461
    • Lowe, D.1    Matthews, R.2
  • 36
    • 24944521601 scopus 로고
    • On the utility of content analysis in author attribution: The Federalist
    • Martindale, C., & McKenzie, D. (1995). On the utility of content analysis in author attribution: The Federalist. Computer and the Humanities, 29, 259-270.
    • (1995) Computer and the Humanities , vol.29 , pp. 259-270
    • Martindale, C.1    McKenzie, D.2
  • 38
    • 0038468599 scopus 로고
    • The characteristic curves of composition
    • Mendenhall, T.C. (1887). The characteristic curves of composition. Science, 11(11), 237-249.
    • (1887) Science , vol.11 , Issue.11 , pp. 237-249
    • Mendenhall, T.C.1
  • 39
    • 33644514895 scopus 로고
    • Neural computation in stylometry: II. An application to the works of Shakespeare and Marlowe
    • Merriam, T.V.N., & Matthews, R.A.J. (1994). Neural computation in stylometry: II. An application to the works of Shakespeare and Marlowe. Literary and Linguistic Computing, 9, 1-6.
    • (1994) Literary and Linguistic Computing , vol.9 , pp. 1-6
    • Merriam, T.V.N.1    Matthews, R.A.J.2
  • 40
    • 0023491747 scopus 로고
    • Spelling checkers, spelling correctors and the misspellings of poor spellers
    • Mitton, R. (1987). Spelling checkers, spelling correctors and the misspellings of poor spellers. Information Processing and Management, 23(5), 495-505.
    • (1987) Information Processing and Management , vol.23 , Issue.5 , pp. 495-505
    • Mitton, R.1
  • 44
  • 47
    • 33744584654 scopus 로고
    • Induction of decision trees
    • Quinlan, J.R. (1986). Induction of decision trees. Machine Learning, 1(1), 81-106.
    • (1986) Machine Learning , vol.1 , Issue.1 , pp. 81-106
    • Quinlan, J.R.1
  • 48
    • 0001318320 scopus 로고    scopus 로고
    • The state of authorship attribution studies: Some problems and solutions
    • Rudman, J. (1998). The state of authorship attribution studies: Some problems and solutions. Computers and the Humanities, 31, 351-365.
    • (1998) Computers and the Humanities , vol.31 , pp. 351-365
    • Rudman, J.1
  • 49
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • Sebastiani, F. (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34, 1-47.
    • (2002) ACM Computing Surveys , vol.34 , pp. 1-47
    • Sebastiani, F.1
  • 50
    • 3843140883 scopus 로고    scopus 로고
    • Computer-based authorship attribution without lexical measures
    • Stamatatos, E., Fakotakis, N., & Kokkinakis, G. (2001). Computer-based authorship attribution without lexical measures. Computers and the Humanities, 35(2), 193-214.
    • (2001) Computers and the Humanities , vol.35 , Issue.2 , pp. 193-214
    • Stamatatos, E.1    Fakotakis, N.2    Kokkinakis, G.3
  • 52
    • 0034547894 scopus 로고    scopus 로고
    • Estimating drug/plasma concentration levels by applying Neural Networks to pharmacokinetic data sets
    • Tolle, K.M., Chen, H., & Chow, H. (2000). Estimating drug/plasma concentration levels by applying Neural Networks to pharmacokinetic data sets. Decision Support Systems, 30(2), 139-152.
    • (2000) Decision Support Systems , vol.30 , Issue.2 , pp. 139-152
    • Tolle, K.M.1    Chen, H.2    Chow, H.3
  • 53
    • 0013132586 scopus 로고    scopus 로고
    • How variable may a constant be? Measures of lexical richness in perspective
    • Tweedie, F.J., & Baayen, R.H. (1998). How variable may a constant be? Measures of lexical richness in perspective. Computers and the Humanities, 32, 323-352.
    • (1998) Computers and the Humanities , vol.32 , pp. 323-352
    • Tweedie, F.J.1    Baayen, R.H.2
  • 54
    • 0009302395 scopus 로고    scopus 로고
    • Neural Network applications in stylometry: The Federalist Papers
    • Tweedie, F.J., Singh, S., & Holmes, D.I. (1996). Neural Network applications in stylometry: The Federalist Papers. Computers and the Humanities, 30(1), 1-10.
    • (1996) Computers and the Humanities , vol.30 , Issue.1 , pp. 1-10
    • Tweedie, F.J.1    Singh, S.2    Holmes, D.I.3
  • 56
    • 0028387642 scopus 로고
    • Neural Networks: Applications in industry, business, and science
    • Widrow, B., Rumelhart, D.E., & Lehr, M.A. (1994). Neural Networks: Applications in industry, business, and science. Communications of the ACM, 37, 93-105.
    • (1994) Communications of the ACM , vol.37 , pp. 93-105
    • Widrow, B.1    Rumelhart, D.E.2    Lehr, M.A.3
  • 57
    • 0038468602 scopus 로고
    • On sentence length as a statistical characteristic of style in prose
    • Yule, G.U. (1938). On sentence length as a statistical characteristic of style in prose. Biometrika, 30, 363-390.
    • (1938) Biometrika , vol.30 , pp. 363-390
    • Yule, G.U.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.