메뉴 건너뛰기




Volumn 26, Issue 2, 2008, Pages

Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace

Author keywords

Discourse; Online text; Style classification; Stylometry; Text mining

Indexed keywords

ACCOUNTABILITY; ONLINE TEXT; STYLE CLASSIFICATION; STYLOMETRY;

EID: 42049084142     PISSN: 10468188     EISSN: 15582868     Source Type: Journal    
DOI: 10.1145/1344411.1344413     Document Type: Article
Times cited : (335)

References (65)
  • 1
    • 27344450109 scopus 로고    scopus 로고
    • ABBASI, A. AND CHEN, H. 2005. Identification and comparison of extremist-group Web forum messages using authorship analysis. IEEE Intel. Syst. 20, 5, 67-75.
    • ABBASI, A. AND CHEN, H. 2005. Identification and comparison of extremist-group Web forum messages using authorship analysis. IEEE Intel. Syst. 20, 5, 67-75.
  • 4
    • 18744405825 scopus 로고    scopus 로고
    • ARGAMON, S., SARIC, M., AND STEIN, S. S. 2003. Style mining of electronic messages for multiple authorship discrimination: First results In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
    • ARGAMON, S., SARIC, M., AND STEIN, S. S. 2003. Style mining of electronic messages for multiple authorship discrimination: First results In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
  • 7
    • 0037791890 scopus 로고    scopus 로고
    • Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution
    • BAYYEN, R. H., HALTEREN, H. V., AND TWEEDIE, F. J. 1996. Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution. Liter. Linguist. Comput. 2, 110-120.
    • (1996) Liter. Linguist. Comput , vol.2 , pp. 110-120
    • BAYYEN, R.H.1    HALTEREN, H.V.2    TWEEDIE, F.J.3
  • 8
    • 0021859150 scopus 로고
    • A style analysis of C programs
    • BERRY, R. E. AND MEEKINGS, B. A. E. 1985. A style analysis of C programs. Commun. ACM 28, 1, 80-88.
    • (1985) Commun. ACM , vol.28 , Issue.1 , pp. 80-88
    • BERRY, R.E.1    MEEKINGS, B.A.E.2
  • 9
    • 84937183956 scopus 로고    scopus 로고
    • The application of principal component analysis to stylometry
    • BINONGO, J. N. G. AND SMITH, M. W. A. 1999. The application of principal component analysis to stylometry. Liter. Linguist. Compu. 14, 4, 445-466.
    • (1999) Liter. Linguist. Compu , vol.14 , Issue.4 , pp. 445-466
    • BINONGO, J.N.G.1    SMITH, M.W.A.2
  • 10
    • 84960603650 scopus 로고
    • Word patterns and story shapes: The statistical analysis of narrative style
    • BURROWS, J. F. 1987. Word patterns and story shapes: The statistical analysis of narrative style. Liter. Linguist. Comput. 2, 61-67.
    • (1987) Liter. Linguist. Comput , vol.2 , pp. 61-67
    • BURROWS, J.F.1
  • 11
    • 34047230751 scopus 로고    scopus 로고
    • Who's at the keyboard? Authorship attribution in digital evidence investigation
    • CHASKI, C. E. 2005. Who's at the keyboard? Authorship attribution in digital evidence investigation. Int. J. Digit. Evidence 4, 1, 1-13.
    • (2005) Int. J. Digit. Evidence , vol.4 , Issue.1 , pp. 1-13
    • CHASKI, C.E.1
  • 12
    • 56249097939 scopus 로고    scopus 로고
    • Empirical evaluation of language-based author identification techniques
    • CHASKI, C. E. 2001. Empirical evaluation of language-based author identification techniques. Forensic Linguist. 8, 1, 1-65.
    • (2001) Forensic Linguist , vol.8 , Issue.1 , pp. 1-65
    • CHASKI, C.E.1
  • 13
    • 0001920992 scopus 로고    scopus 로고
    • Human expert-level performance on a scientific image analysis task by a system using combined artificial neural networks
    • P. Chan, ed
    • CHERKAUER, K. J. 1996. Human expert-level performance on a scientific image analysis task by a system using combined artificial neural networks, In Working Notes of the AAAI Workshop on Integrating Multiple Learned Models, P. Chan, ed., 15-21.
    • (1996) Working Notes of the AAAI Workshop on Integrating Multiple Learned Models , pp. 15-21
    • CHERKAUER, K.J.1
  • 15
    • 0013326060 scopus 로고    scopus 로고
    • Feature selection for classification
    • DASH, M. AND LIU, H. 1997. Feature selection for classification. Intell. Data, Anal. 1, 131-156.
    • (1997) Intell. Data, Anal , vol.1 , pp. 131-156
    • DASH, M.1    LIU, H.2
  • 16
    • 0042367634 scopus 로고    scopus 로고
    • Mining e-mail content for author identification forensics
    • DE VEL, O., ANDERSON, A., CORNEY, M., AND MOHAY, G. 2001. Mining e-mail content for author identification forensics. ACM SIGMOD Rec. 30, 4, 55-64.
    • (2001) ACM SIGMOD Rec , vol.30 , Issue.4 , pp. 55-64
    • DE VEL, O.1    ANDERSON, A.2    CORNEY, M.3    MOHAY, G.4
  • 18
    • 0038604019 scopus 로고    scopus 로고
    • Authorship attribution with support vector machines
    • DIEDERICH, J., KINDERMANN, J., LEOPOLD, E., AND PAASS, G. 2003. Authorship attribution with support vector machines. Appl. Intell. 19, 109-123.
    • (2003) Appl. Intell , vol.19 , pp. 109-123
    • DIEDERICH, J.1    KINDERMANN, J.2    LEOPOLD, E.3    PAASS, G.4
  • 19
    • 1842583320 scopus 로고    scopus 로고
    • Extraction of Java program fingerprints for software authorship identification
    • DING, H. AND SAMADZAHEH, H. M. 2004. Extraction of Java program fingerprints for software authorship identification. J. Syst. Softw. 72, 49-57.
    • (2004) J. Syst. Softw , vol.72 , pp. 49-57
    • DING, H.1    SAMADZAHEH, H.M.2
  • 20
    • 33747192724 scopus 로고    scopus 로고
    • Implications of the recursive representation problem for automatic concept identification in on-line government information
    • EFRON, M., MARCHIONINI, G., AND ZHIANG, J. 2004. Implications of the recursive representation problem for automatic concept identification in on-line government information. In Proceedings of the ASIST SIG-CR Workshop.
    • (2004) Proceedings of the ASIST SIG-CR Workshop
    • EFRON, M.1    MARCHIONINI, G.2    ZHIANG, J.3
  • 21
    • 84991061352 scopus 로고    scopus 로고
    • Social translucence: An approach to designing systems that support social processes
    • ERICKSON, T. AND KELLOGG, W. A. 2000. Social translucence: An approach to designing systems that support social processes. ACM Trans. Comput. Hum. Interact. 7, 1, 59-83.
    • (2000) ACM Trans. Comput. Hum. Interact. 7 , vol.1 , pp. 59-83
    • ERICKSON, T.1    KELLOGG, W.A.2
  • 23
    • 2942731012 scopus 로고    scopus 로고
    • An extensive empirical study of feature selection metrics for text classification
    • FORMAN, G. 2003. An extensive empirical study of feature selection metrics for text classification, Journal of Machine Learning Research 3, 1289-1305.
    • (2003) Journal of Machine Learning Research , vol.3 , pp. 1289-1305
    • FORMAN, G.1
  • 24
    • 84937279393 scopus 로고    scopus 로고
    • Feature finding for text classification
    • FORSYTH, R. S. AND HOLMES, D. I. 1996. Feature finding for text classification. Litera. Linguist. Comput. 11, 4, 163-174.
    • (1996) Litera. Linguist. Comput , vol.11 , Issue.4 , pp. 163-174
    • FORSYTH, R.S.1    HOLMES, D.I.2
  • 27
    • 33745561205 scopus 로고    scopus 로고
    • An introduction to variable and feature selection
    • GUYON, I., AND ELISSEEFF, A. 2003. An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157-1182.
    • (2003) J. Mach. Learn. Res , vol.3 , pp. 1157-1182
    • GUYON, I.1    ELISSEEFF, A.2
  • 28
    • 0031236568 scopus 로고    scopus 로고
    • Attribution accuracy when using anonymity in group support systems
    • HAYNE, C. S. AND RICE, E. R. 1997. Attribution accuracy when using anonymity in group support systems. Int. J. Hum. Comput. Studies 47, 429-452.
    • (1997) Int. J. Hum. Comput. Studies , vol.47 , pp. 429-452
    • HAYNE, C.S.1    RICE, E.R.2
  • 29
    • 0041928091 scopus 로고    scopus 로고
    • Identification of comment authorship in anonymous group support systems
    • HAYNE, C. S., POLLARD, E. C., AND RICE, E. R. 2003. Identification of comment authorship in anonymous group support systems. J. Manage. Inf. Syst. 20, 1, 301-329.
    • (2003) J. Manage. Inf. Syst , vol.20 , Issue.1 , pp. 301-329
    • HAYNE, C.S.1    POLLARD, E.C.2    RICE, E.R.3
  • 30
    • 0036963722 scopus 로고    scopus 로고
    • Computer-Mediated communication on the Internet
    • HERRING, S. C. 2002. Computer-Mediated communication on the Internet. Ann. Rev. Inf. Sci. Technol. 36, 1, 109-168.
    • (2002) Ann. Rev. Inf. Sci. Technol , vol.36 , Issue.1 , pp. 109-168
    • HERRING, S.C.1
  • 31
    • 0005542307 scopus 로고
    • A stylometric analysis of Mormon scripture and related texts
    • HOLMES, D. I. 1992. A stylometric analysis of Mormon scripture and related texts. J. Royal Statis. Soci. 155, 91-120.
    • (1992) J. Royal Statis. Soci , vol.155 , pp. 91-120
    • HOLMES, D.I.1
  • 32
    • 0027332773 scopus 로고
    • Stopping rules in principal component analysis: A comparison of heuristical and statistical approaches
    • JACKSON, D. 1993. Stopping rules in principal component analysis: A comparison of heuristical and statistical approaches. Ecol. 74, 8, 2204-2214.
    • (1993) Ecol , vol.74 , Issue.8 , pp. 2204-2214
    • JACKSON, D.1
  • 33
    • 33846834126 scopus 로고    scopus 로고
    • A survey of trust and reputation systems for online service provision
    • JOSANG, A., ISMAIL, R., AND BOYD, C. 2007. A survey of trust and reputation systems for online service provision. Decis. Support Syst. 43, 2, 618-644.
    • (2007) Decis. Support Syst , vol.43 , Issue.2 , pp. 618-644
    • JOSANG, A.1    ISMAIL, R.2    BOYD, C.3
  • 34
    • 31044454127 scopus 로고    scopus 로고
    • A controlled-corpus experiment in authorship identification by cross-entropy
    • JUOLA, P. AND BAAYEN, H. 2005. A controlled-corpus experiment in authorship identification by cross-entropy. Liter. Linguist. Comput. 20, 59-67.
    • (2005) Liter. Linguist. Comput , vol.20 , pp. 59-67
    • JUOLA, P.1    BAAYEN, H.2
  • 35
    • 0025236073 scopus 로고
    • Application of the Karhunen-Loeve procedure for the characterization of human faces
    • KIRBY, M. AND SIROVICH, L. 1990. Application of the Karhunen-Loeve procedure for the characterization of human faces. IEEE Trans. Pattern Anal. Mach. Intell. 12, 1, 103-108.
    • (1990) IEEE Trans. Pattern Anal. Mach. Intell , vol.12 , Issue.1 , pp. 103-108
    • KIRBY, M.1    SIROVICH, L.2
  • 36
    • 12244278769 scopus 로고
    • Discrimination of authorship using visualization
    • KJELL, B. WOODS, W. A., AND FRIEDER, O. 1994. Discrimination of authorship using visualization. Inf. Process. Manage. 30, 1, 141-150.
    • (1994) Inf. Process. Manage , vol.30 , Issue.1 , pp. 141-150
    • KJELL, B.1    WOODS, W.A.2    FRIEDER, O.3
  • 38
    • 33748455668 scopus 로고    scopus 로고
    • Feature instability as a criterion for selecting potential style markers
    • KOPPEL, M. AKIVA, N., AND DAGAN, I. 2006. Feature instability as a criterion for selecting potential style markers. J. Amer. Soc. Inf. Sci. Technol. 57, 11, 1519-1525.
    • (2006) J. Amer. Soc. Inf. Sci. Technol , vol.57 , Issue.11 , pp. 1519-1525
    • KOPPEL, M.1    AKIVA, N.2    DAGAN, I.3
  • 39
    • 0030653064 scopus 로고    scopus 로고
    • Authorship analysis: Identifying the author of a program
    • KRSUL, I. AND SPAFFORD, H. E. 1997. Authorship analysis: Identifying the author of a program. Comput. Secur. 16, 3, 233-257.
    • (1997) Comput. Secur , vol.16 , Issue.3 , pp. 233-257
    • KRSUL, I.1    SPAFFORD, H.E.2
  • 40
    • 33745213523 scopus 로고    scopus 로고
    • From fingerprint to writeprint
    • LI, J., ZHENG, R., AND CHEN, H. 2006. From fingerprint to writeprint. Commun. ACM49, 4, 76-82.
    • (2006) Commun. ACM49 , vol.4 , pp. 76-82
    • LI, J.1    ZHENG, R.2    CHEN, H.3
  • 41
    • 24944521601 scopus 로고
    • On the utility of content analysis in author attribution: The federalist
    • MARTINDALE, C. AND MCKENZIE, D. 1995. On the utility of content analysis in author attribution: The federalist. Comput. Humanit. 29, 259-270.
    • (1995) Comput. Humanit , vol.29 , pp. 259-270
    • MARTINDALE, C.1    MCKENZIE, D.2
  • 42
    • 12344328483 scopus 로고    scopus 로고
    • Extracting gene pathway relations using a hybrid grammar: The Arizona relation parser
    • MCDONALD, D., CHEN, H., HUA, S., AND MARSHALL, B. 2004. Extracting gene pathway relations using a hybrid grammar: The Arizona relation parser. Bioinf. 20, 18, 3370-3378.
    • (2004) Bioinf , vol.20 , Issue.18 , pp. 3370-3378
    • MCDONALD, D.1    CHEN, H.2    HUA, S.3    MARSHALL, B.4
  • 43
    • 33644514895 scopus 로고
    • Neural computation in stylometry II: An application to the works of Shakespeare and Marlowe
    • MERRIAM, T. V. N. AND MATTHEWS, R. A. J. 1994. Neural computation in stylometry II: An application to the works of Shakespeare and Marlowe. Liter. Linguist. Comput. 9, 1-6.
    • (1994) Liter. Linguist. Comput , vol.9 , pp. 1-6
    • MERRIAM, T.V.N.1    MATTHEWS, R.A.J.2
  • 44
    • 0042223455 scopus 로고    scopus 로고
    • Software piracy: A view from Hong Kong
    • MOORES, T. AND DHILLON, G. 2000. Software piracy: A view from Hong Kong. Commun. ACM 43, 12, 88-93.
    • (2000) Commun. ACM , vol.43 , Issue.12 , pp. 88-93
    • MOORES, T.1    DHILLON, G.2
  • 50
    • 0003120218 scopus 로고    scopus 로고
    • Fast training on SVMs using sequential minimal optimization
    • B. Scholkopf et al, eds. MIT Press, Cambridge, MA
    • PLATT, J. 1999. Fast training on SVMs using sequential minimal optimization. In Advances in Kernel Methods: Support Vector Learning, B. Scholkopf et al., eds. MIT Press, Cambridge, MA, 185-208.
    • (1999) Advances in Kernel Methods: Support Vector Learning , pp. 185-208
    • PLATT, J.1
  • 51
    • 0001318320 scopus 로고    scopus 로고
    • The state of authorship attribution studies: Some problems and solutions
    • RUDMAN, J. 1997. The state of authorship attribution studies: Some problems and solutions. Comput. Humanit. 31, 351-365.
    • (1997) Comput. Humanit , vol.31 , pp. 351-365
    • RUDMAN, J.1
  • 52
    • 0034435015 scopus 로고    scopus 로고
    • Conversation Map: An interface for very large-scale conversations
    • SACK, W. 2000. Conversation Map: An interface for very large-scale conversations. J. Manage. Inf. Syst. 17, 3, 73-92.
    • (2000) J. Manage. Inf. Syst , vol.17 , Issue.3 , pp. 73-92
    • SACK, W.1
  • 54
    • 17444445377 scopus 로고    scopus 로고
    • Automatic text categorization in terms of genre and author
    • STAMATATOS, E., FAKOTAKIS, N., AND KOKKINAKIS, G. 2000. Automatic text categorization in terms of genre and author. Comput. Linguist 26, 4, 471-495.
    • (2000) Comput. Linguist , vol.26 , Issue.4 , pp. 471-495
    • STAMATATOS, E.1    FAKOTAKIS, N.2    KOKKINAKIS, G.3
  • 55
    • 42049108391 scopus 로고    scopus 로고
    • Seduced into scams: Online lovers often duped
    • July 28
    • SULLIVAN, B. 2005. Seduced into scams: Online lovers often duped. MSNBC, July 28.
    • (2005) MSNBC
    • SULLIVAN, B.1
  • 56
    • 0009302395 scopus 로고    scopus 로고
    • Neural network applications in stylometry: The Federalist papers
    • TWEEDIE, F. J., SINGH, S., AND HOLMES, D. I. 1996. Neural network applications in stylometry: The Federalist papers. Comput. Humanit. 30, 1, 1-10.
    • (1996) Comput. Humanit , vol.30 , Issue.1 , pp. 1-10
    • TWEEDIE, F.J.1    SINGH, S.2    HOLMES, D.I.3
  • 57
    • 0031209892 scopus 로고    scopus 로고
    • Use of the Fourier and Karhunen-Loeve decomposition for fast pattern matching with a large set of features
    • UENOHARA, M. AND KANADE, T. 1997. Use of the Fourier and Karhunen-Loeve decomposition for fast pattern matching with a large set of features. IEEE Trans. Pattern Analy. Mach. Intell. 19, 8, 891-897.
    • (1997) IEEE Trans. Pattern Analy. Mach. Intell , vol.19 , Issue.8 , pp. 891-897
    • UENOHARA, M.1    KANADE, T.2
  • 64
    • 0038468602 scopus 로고
    • On sentence length as a statistical characteristic on style prose
    • YULE, G. U. 1938. On sentence length as a statistical characteristic on style prose. Biometrika 30.
    • (1938) Biometrika , vol.30
    • YULE, G.U.1
  • 65
    • 33644552803 scopus 로고    scopus 로고
    • A framework for authorship analysis of online messages: Writing-style features and techniques
    • ZHENG, R., LI, J., HUANG, Z., AND CHEN, H. 2006. A framework for authorship analysis of online messages: Writing-style features and techniques. J. Amer. Soc. Inf. Sci. Technol. 57, 3, 378-393.
    • (2006) J. Amer. Soc. Inf. Sci. Technol , vol.57 , Issue.3 , pp. 378-393
    • ZHENG, R.1    LI, J.2    HUANG, Z.3    CHEN, H.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.