-
2
-
-
84871101442
-
A scalable system for identifying co-derivative documents
-
A. Apostolico & M. Melucci (Eds.), Padova, Italy: Springer. Published as LNCS 3246
-
Bernstein, Y., & Zobel, J. (2004). A scalable system for identifying co-derivative documents. In A. Apostolico & M. Melucci (Eds.), Proceedings of the string processing and information retrieval symposium (SPIRE) (pp. 55-67). Padova, Italy: Springer. Published as LNCS 3246.
-
(2004)
Proceedings of the string processing and information retrieval symposium (SPIRE)
, pp. 55-67
-
-
Bernstein, Y.1
Zobel, J.2
-
3
-
-
84976810280
-
Copy detection mechanisms for digital documents
-
New York, NY, USA: ACM Press. ISBN 0-89791-731-6
-
Brin, S., Davis, J., & Garcia-Molina, H. (1995). Copy detection mechanisms for digital documents. In SIGMOD '95 (pp. 398-409). New York, NY, USA: ACM Press. ISBN 0-89791-731-6.
-
(1995)
SIGMOD '95
, pp. 398-409
-
-
Brin, S.1
Davis, J.2
Garcia-Molina, H.3
-
4
-
-
33745590124
-
Indexing shared content in information retrieval systems
-
Broder, A. Z., Eiron, N., Fontoura, M., Herscovici, M., Lempel, R., McPherson, J., et al. (2006). Indexing shared content in information retrieval systems. In EDBT '06 (pp. 313-330).
-
(2006)
EDBT '06
, pp. 313-330
-
-
Broder, A.Z.1
Eiron, N.2
Fontoura, M.3
Herscovici, M.4
Lempel, R.5
McPherson, J.6
-
6
-
-
34047230751
-
Who's at the keyboard? authorship attribution in digital evidence investigations
-
Chaski, C. E. (2005). Who's at the keyboard? authorship attribution in digital evidence investigations. IJDE, 4(1), 1-14.
-
(2005)
Ijde
, vol.4
, Issue.1
, pp. 1-14
-
-
Chaski, C.E.1
-
7
-
-
0346586663
-
Smote: Synthetic minority over-sampling technique
-
Chawla, N. V., Bowyer, K. W., Kegelmeyer, P. W. (2002). Smote: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 16, 321-357.
-
(2002)
Journal of Artificial Intelligence Research
, vol.16
, pp. 321-357
-
-
Chawla, N.V.1
Bowyer, K.W.2
Kegelmeyer, P.W.3
-
10
-
-
16644399456
-
Signature extraction for overlap detection in documents
-
Australian Computer Society, Inc. ISBN 0-909925-82-8
-
Finkel, R. A., Zaslavsky, A., Monostori, K., & Schmidt, H. (2002). Signature extraction for overlap detection in documents. In Proceedings of the 25th Australian conference on Computer science (pp. 59-64). Australian Computer Society, Inc. ISBN 0-909925-82-8.
-
(2002)
Proceedings of the 25th Australian conference on Computer science
, pp. 59-64
-
-
Finkel, R.A.1
Zaslavsky, A.2
Monostori, K.3
Schmidt, H.4
-
11
-
-
0344229953
-
A new readability yardstick
-
Flesch, R. (1948). A new readability yardstick. Journal of Applied Psychology, 32, 221-233.
-
(1948)
Journal of Applied Psychology
, vol.32
, pp. 221-233
-
-
Flesch, R.1
-
12
-
-
0001944742
-
Similarity search in high dimensions via hashing
-
Scotland
-
Gionis, A., Indyk, P., & Motwani, R. (1999). Similarity search in high dimensions via hashing. In Proceedings of the 25th VLDB conference Edinburgh, Scotland (pp. 518-529).
-
(1999)
Proceedings of the 25th VLDB conference Edinburgh
, pp. 518-529
-
-
Gionis, A.1
Indyk, P.2
Motwani, R.3
-
13
-
-
28344449091
-
Segmenting a document by stylistic character
-
Supersedes August 2003 workshop version
-
Graham, N., Hirst, G., & Marthi, B. (2005). Segmenting a document by stylistic character. Natural Language Engineering, 11(4), 397-415. Supersedes August 2003 workshop version.
-
(2005)
Natural Language Engineering
, vol.11
, Issue.4
, pp. 397-415
-
-
Graham, N.1
Hirst, G.2
Marthi, B.3
-
16
-
-
8244230710
-
An assessment of cumulative sum charts for authorship attribution
-
Hilton, M. L., & Holmes, D. I. (1993). An assessment of cumulative sum charts for authorship attribution. Literary and Linguistic Computing, 8(2), 73-80.
-
(1993)
Literary and Linguistic Computing
, vol.6
, Issue.2
, pp. 73-80
-
-
Hilton, M.L.1
Holmes, D.I.2
-
17
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
Hinton, G. E., & Salakhutdinov, R. R. (2006). Reducing the dimensionality of data with neural networks. Science, 313, 504-507.
-
(2006)
Science
, vol.313
, pp. 504-507
-
-
Hinton, G.E.1
Salakhutdinov, R.R.2
-
18
-
-
0037319544
-
Methods for identifying versioned and plagiarised documents
-
Hoad, T. C., & Zobel, J. (2003). Methods for identifying versioned and plagiarised documents. American Society for Information Science and Technology, 54(3), 203-215.
-
(2003)
American Society for Information Science and Technology
, vol.54
, Issue.3
, pp. 203-215
-
-
Hoad, T.C.1
Zobel, J.2
-
19
-
-
84967627259
-
The evolution of stylometry in humanities scholarship
-
doi: 10. 1093/llc/13. 3. 111
-
Holmes, D. I. (1998). The evolution of stylometry in humanities scholarship. Literary and Linguistic, 13(3), 111-117. doi: 10. 1093/llc/13. 3. 111.
-
(1998)
Literary and Linguistic
, vol.13
, Issue.3
, pp. 111-117
-
-
Holmes, D.I.1
-
22
-
-
43149126601
-
Authorship attribution
-
ISSN 1554-0669. doi: 10. 1561/1500000005
-
Juola, P. (2006). Authorship attribution. Foundation Trends Information Retrieval 1(3), 233-334, ISSN 1554-0669. doi: 10. 1561/1500000005.
-
(2006)
Foundation Trends Information Retrieval
, vol.1
, Issue.3
, pp. 233-334
-
-
Juola, P.1
-
23
-
-
49049084540
-
Obfuscating document stylometry to preserve author anonymity
-
Morristown, NJ, USA: Association for Computational Linguistics
-
Kacmarcik, G., & Gamon, M. (2006). Obfuscating document stylometry to preserve author anonymity. In Proceedings of the COLING/ACL on main conference poster sessions (pp. 444-451). Morristown, NJ, USA: Association for Computational Linguistics.
-
(2006)
Proceedings of the COLING/ACL on main conference poster sessions
, pp. 444-451
-
-
Kacmarcik, G.1
Gamon, M.2
-
24
-
-
0005579279
-
Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel
-
Millington TN: Naval Technical Training US Naval Air Station
-
Kincaid, J., Fishburne, R. P., Rogers, R. L., & Chissom, B. S. (1975). Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel. Research branch report 8-75. Millington TN: Naval Technical Training US Naval Air Station.
-
(1975)
Research branch report
, pp. 8-75
-
-
Kincaid, J.1
Fishburne, R.P.2
Rogers, R.L.3
Chissom, B.S.4
-
25
-
-
12244278769
-
Discrimination of authorship using visualization
-
ISSN 0306-4573. doi: 10. 1016/0306-4573(94)90029-9
-
Kjell, B., Woods Addison, W., & Frieder, O. (1994). Discrimination of authorship using visualization. Information Processing and Management, 30(1), 141-150. ISSN 0306-4573. doi: 10. 1016/0306-4573(94)90029-9.
-
(1994)
Information Processing and Management
, vol.30
, Issue.1
, pp. 141-150
-
-
Kjell, B.1
Woods Addison, W.2
Frieder, O.3
-
28
-
-
33745822115
-
Authorship verification as a one-class classification problem
-
New York, NY, USA: ACM. ISBN 1-58113-828-5. doi: 10. 1145/1015330. 1015448
-
Koppel, M., & Schler, J. (2004a). Authorship verification as a one-class classification problem. In ICML '04: Proceedings of the twenty-first international conference on Machine learning (pp. 62). New York, NY, USA: ACM. ISBN 1-58113-828-5. doi:10.1145/1015330.1015448.
-
(2004)
ICML '04: Proceedings of the twenty-first international conference on Machine learning
, pp. 62
-
-
Koppel, M.1
Schler, J.2
-
30
-
-
33750364891
-
Authorship attribution with thousands of candidate authors
-
New York, NY, USA: ACM. ISBN 1-59593-369-7. doi: 10. 1145/1148170. 1148304
-
Koppel, M., Schler, J., Argamon, S., & Messeri, E. (2006). Authorship attribution with thousands of candidate authors. In Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval (pp. 659-660). New York, NY, USA: ACM. ISBN 1-59593-369-7. doi: 10. 1145/1148170. 1148304.
-
(2006)
Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval
, pp. 659-660
-
-
Koppel, M.1
Schler, J.2
Argamon, S.3
Messeri, E.4
-
31
-
-
34250648062
-
Measuring differentiability: Unmasking pseudonymous authors
-
ISSN 1533-7928
-
Koppel, M., Schler, J., & Bonchek-Dokow, E. (2007). Measuring differentiability: Unmasking pseudonymous authors. Journal of Machine Learning Research, 8, 1261-1276. ISSN 1533-7928.
-
(2007)
Journal of Machine Learning Research
, Issue.8
, pp. 1261-1276
-
-
Koppel, M.1
Schler, J.2
Bonchek-Dokow, E.3
-
32
-
-
58449112832
-
Computational methods in authorship attribution
-
Koppel, M., Schler, J., & Argamon, S. (2009). Computational methods in authorship attribution. Journal of the American Society for Information Science and Technology, 60(1), 9-26.
-
(2009)
Journal of the American Society for Information Science and Technology
, vol.60
, Issue.1
, pp. 9-26
-
-
Koppel, M.1
Schler, J.2
Argamon, S.3
-
33
-
-
77952507676
-
Authorship attribution of texts: A review
-
Malyutov, M. B. (2006). Authorship attribution of texts: A review. Lecture Notes in Computer Science, 2063, 362-380.
-
(2006)
Lecture Notes in Computer Science
, vol.2063
, pp. 362-380
-
-
Malyutov, M.B.1
-
36
-
-
77049104346
-
Genre classification of web pages: User study and feasibility analysis
-
S. Biundo, T. Frühwirth, & G. Palm (Eds.),ISBN 0302-9743, Berlin Heidelberg New York: Springer
-
Meyer zu Eissen, S., & Stein, B. (2004). Genre classification of web pages: User study and feasibility analysis. In S. Biundo, T. Frühwirth, & G. Palm (Eds.), KI 2004: Advances in artificial intelligence, vol. 3228 LNAI of Lecture Notes in artificial intelligence (pp. 256-269). Berlin Heidelberg New York: Springer. ISBN 0302-9743.
-
(2004)
KI 2004: Advances in artificial intelligence, vol. 3228 LNAI of Lecture Notes in artificial intelligence
, pp. 256-269
-
-
Meyer zu Eissen, S.1
Stein, B.2
-
37
-
-
33745814686
-
Intrinsic plagiarism detection
-
M. Lalmas, A. MacFarlane, S. M. Rüger, A. Tombros, T. Tsikrika, & A. Yavlinsky (Eds.), New York: Springer. ISBN 3-540-33347-9
-
Meyer zu Eissen, S., & Stein, B. (2006). Intrinsic plagiarism detection. In M. Lalmas, A. MacFarlane, S. M. Rüger, A. Tombros, T. Tsikrika, & A. Yavlinsky (Eds.), Proceedings of the European conference on information retrieval (ECIR 2006), vol. 3936 of Lecture Notes in Computer Science (pp. 565-569). New York: Springer. ISBN 3-540-33347-9.
-
(2006)
Proceedings of the European conference on information retrieval (ECIR 2006), vol. 3936 of Lecture Notes in Computer Science
, pp. 565-569
-
-
Meyer zu Eissen, S.1
Stein, B.2
-
38
-
-
84879567529
-
Plagiarism detection without reference collections
-
R. Decker & H. J. Lenz (Eds.), New York: Springer. ISBN 978-3-540-70980-0
-
Meyer zu Eissen, S., Stein, B., & Kulig, M. (2007). Plagiarism detection without reference collections. In R. Decker & H. J. Lenz (Eds.), Advances in data analysis (pp. 359-366). New York: Springer. ISBN 978-3-540-70980-0.
-
(2007)
Advances in data analysis
, pp. 359-366
-
-
Meyer zu Eissen, S.1
Stein, B.2
Kulig, M.3
-
41
-
-
61849181444
-
Using conjunctions and adverbs for author verification
-
Pavelec, D., Oliveira, L. S., Justino, E. J. R., & Batista, L. V. (2008). Using conjunctions and adverbs for author verification. Journal of UCS, 14(18), 2967-2981.
-
(2008)
Journal of UCS
, vol.14
, Issue.18
, pp. 2967-2981
-
-
Pavelec, D.1
Oliveira, L.S.2
Justino, E.J.R.3
Batista, L.V.4
-
42
-
-
84857501781
-
Webis at Bauhaus-Universität Weimar and NLEL at Universidad Polytécnica de Valencia
-
Potthast, M., Eiselt, A., Stein, B., Barròn Cedeño, A., & Rosso, P. (Eds.). (2009). Webis at Bauhaus-Universität Weimar and NLEL at Universidad Polytécnica de Valencia. PAN Plagiarism Corpus 2009 (PAN-PC-09). http://www. webis. de/research/corpora.
-
(2009)
PAN Plagiarism Corpus 2009 (PAN-PC-09)
-
-
Potthast, M.1
Eiselt, A.2
Stein, B.3
Barròn Cedeño, A.4
Rosso, P.5
-
43
-
-
0036709275
-
Constructing boosting algorithms from SVMs: An application to one-class classification
-
ISSN 0162-8828. doi: 10. 1109/TPAMI. 2002. 1033211
-
Rätsch, G., Mika, S., Schölkopf, B., & Müller, K.-R. (2002). Constructing boosting algorithms from SVMs: An application to one-class classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(9), 1184-1199. ISSN 0162-8828. doi: 10. 1109/TPAMI. 2002. 1033211.
-
(2002)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.24
, Issue.9
, pp. 1184-1199
-
-
Rätsch, G.1
Mika, S.2
Schölkopf, B.3
Müller, K.-R.4
-
45
-
-
0001318320
-
The state of authorship attribution studies: Some problems and solutions
-
Rudman, J. (1997). The state of authorship attribution studies: Some problems and solutions. Computers and the Humanities, 31, 351-365.
-
(1997)
Computers and the Humanities
, vol.31
, pp. 351-365
-
-
Rudman, J.1
-
47
-
-
34147123127
-
On authorship attribution via markov chains and sequence kernels
-
doi: 10. 1109/ICPR. 2006. 899
-
Sanderson, C., & Guenter, S. (2006a). On authorship attribution via markov chains and sequence kernels. In Pattern recognition, 2006. ICPR 2006. 18th international conference on (vol. 3, pp. 437-440). doi: 10. 1109/ICPR. 2006. 899.
-
(2006)
Pattern recognition, 2006. ICPR 2006. 18th international conference on
, vol.3
, pp. 437-440
-
-
Sanderson, C.1
Guenter, S.2
-
48
-
-
58449114905
-
Short text authorship attribution via sequence kernels, markov chains and author unmasking: An investigation
-
Sanderson, C., & Guenter, S. (2006b). Short text authorship attribution via sequence kernels, markov chains and author unmasking: An investigation. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (pp. 482-491). URL http://acl. ldc. upenn. edu/W/W06/W06-1657. pdf.
-
(2006)
Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
, pp. 482-491
-
-
Sanderson, C.1
Guenter, S.2
-
49
-
-
47849113906
-
Author identification using imbalanced and limited training texts
-
A. M. Tjoa & R. R. Wagner (Eds.), IEEE, September 2007. ISBN 0-7695-2932-1. doi: 10. 1109/DEXA. 2007. 37
-
Stamatatos, E. (2007). Author identification using imbalanced and limited training texts. In A. M. Tjoa & R. R. Wagner (Eds.), 18th international conference on database and expert systems applications (DEXA 07) (pp. 237-241). IEEE, September 2007. ISBN 0-7695-2932-1. doi: 10. 1109/DEXA. 2007. 37.
-
(2007)
18th international conference on database and expert systems applications (DEXA 07)
, pp. 237-241
-
-
Stamatatos, E.1
-
50
-
-
62549150881
-
A survey of modern authorship attribution methods
-
ISSN 1532-2882. doi: 10. 1002/asi. v60: 3
-
Stamatatos, E. (2009). A survey of modern authorship attribution methods. Journal of American Society for Information Science & Technology, 60(3), 538-556. ISSN 1532-2882. doi: 10. 1002/asi. v60: 3.
-
(2009)
Journal of American Society for Information Science & Technology
, vol.60
, Issue.3
, pp. 538-556
-
-
Stamatatos, E.1
-
51
-
-
52549096781
-
Computer-based authorship attribution without lexical measures
-
Stamatatos, E., Fakotakis, N., & Kokkinakis, G. (2001). Computer-based authorship attribution without lexical measures. Computers and the Humanities, 35, 193-214.
-
(2001)
Computers and the Humanities
, vol.35
, pp. 193-214
-
-
Stamatatos, E.1
Fakotakis, N.2
Kokkinakis, G.3
-
54
-
-
36448954599
-
Principles of hash-based text retrieval
-
C. Clarke, N. Fuhr, N. Kando, W. Kraaij, & A. de Vries (Eds.),ACM, July 2007. ISBN 987-1-59593-597-7
-
Stein, B. (2007). Principles of hash-based text retrieval. In C. Clarke, N. Fuhr, N. Kando, W. Kraaij, & A. de Vries (Eds.), 30th annual international ACM SIGIR conference (pp. 527-534). ACM, July 2007. ISBN 987-1-59593-597-7.
-
(2007)
30th annual international ACM SIGIR conference
, pp. 527-534
-
-
Stein, B.1
-
55
-
-
78650103270
-
Intrinsic plagiarism analysis with meta learning
-
B. Stein, M. Koppel, & E. Stamatatos (Eds.), CEUR-WS. org, July 2007
-
Stein, B., & Meyer zu Eissen, S. (2007). Intrinsic plagiarism analysis with meta learning. In B. Stein, M. Koppel, & E. Stamatatos (Eds.), SIGIR workshop workshop on plagiarism analysis, authorship identification, and near-duplicate detection (PAN 07) (pp. 45-50). CEUR-WS. org, July 2007. URL http://ceur-ws. org/Vol-276.
-
(2007)
SIGIR workshop workshop on plagiarism analysis, authorship identification, and near-duplicate detection (PAN 07)
, pp. 45-50
-
-
Stein, B.1
Meyer zu Eissen, S.2
-
56
-
-
79952311416
-
Topic-identifikation: Formalisierung, analyse und neue Verfahren
-
ISSN 0933-1875
-
Stein, B., & Meyer zu Eissen, S. (2007). Topic-identifikation: Formalisierung, analyse und neue Verfahren. KI-Künstliche Intelligenz, 3, 16-22. ISSN 0933-1875. URL http://www. kuenstliche-intelligenz. de/index. php?id=7758.
-
(2007)
KI-Künstliche Intelligenz
, vol.3
, pp. 16-22
-
-
Stein, B.1
Meyer zu Eissen, S.2
-
57
-
-
57849104012
-
Meta analysis within authorship verification
-
A. M. Tjoa & R. R. Wagner (Eds.), IEEE, September 2008. ISBN 978-0-7695-3299-8. doi: 10. 1109/DEXA. 2008. 20
-
Stein, B., Lipka, N., & Meyer zu Eissen, S. (2008). Meta analysis within authorship verification. In A. M. Tjoa & R. R. Wagner (Eds.), 19th international conference on database and expert systems applications (DEXA 08) (pp. 34-39). IEEE, September 2008. ISBN 978-0-7695-3299-8. doi: 10. 1109/DEXA. 2008. 20.
-
(2008)
19th international conference on database and expert systems applications (DEXA 08)
, pp. 34-39
-
-
Stein, B.1
Lipka, N.2
Meyer zu Eissen, S.3
-
58
-
-
79952314728
-
-
Final project report CS391L, University of Texas at Austin
-
Surdulescu R. (2004). Verifying authorship. Final project report CS391L, University of Texas at Austin.
-
(2004)
Verifying authorship
-
-
Surdulescu, R.1
-
59
-
-
0037753593
-
-
Ph. D. thesis, Technische Universiteit Delft
-
Tax, D. M. J. (2001). One-class classification. Ph. D. thesis, Technische Universiteit Delft.
-
(2001)
One-class classification
-
-
Tax, D.M.J.1
-
61
-
-
0013132586
-
How variable may a constant be? measures of lexical richness in perspective
-
doi:10.1023/A:1001749303137
-
Tweedie, F. J., & Baayen, H. R. (1998). How variable may a constant be? measures of lexical richness in perspective. Computers and the Humanities 32(5): 323-352. doi: 10. 1023/A: 1001749303137.
-
(1998)
Computers and the Humanities
, vol.32
, Issue.5
, pp. 323-352
-
-
Tweedie, F.J.1
Baayen, H.R.2
-
62
-
-
85149113847
-
Linguistic profiling for author recognition and verification
-
Morristown, NJ, USA: Association for Computational Linguistics. doi: 10. 3115/1218955. 1218981
-
van Halteren, H. (2004). Linguistic profiling for author recognition and verification. In ACL '04: Proceedings of the 42nd annual meeting on association for computational linguistics (pp. 199). Morristown, NJ, USA: Association for Computational Linguistics. doi: 10. 3115/1218955. 1218981.
-
(2004)
ACL '04: Proceedings of the 42nd annual meeting on association for computational linguistics
, pp. 199
-
-
van Halteren, H.1
-
63
-
-
33846949415
-
Author verification by linguistic profiling: An exploration of the parameter space
-
1. ISSN 1550-4875. doi: 10. 1145/1187415. 1187416
-
van Halteren, H. (2007). Author verification by linguistic profiling: An exploration of the parameter space. ACM Transactions on Speech and Language Processing, 4(1), 1. ISSN 1550-4875. doi: 10. 1145/1187415. 1187416.
-
(2007)
ACM Transactions on Speech and Language Processing
, vol.4
, Issue.1
-
-
van Halteren, H.1
-
64
-
-
33750311279
-
Near-duplicate detection by instance-level constrained clustering
-
E. N. Efthimiadis, S. Dumais, D. Hawking, & K. Järvelin (Eds.), ISBN 1-59593-369-7
-
Yang, H., & Callan, J. P. (2006). Near-duplicate detection by instance-level constrained clustering. In E. N. Efthimiadis, S. Dumais, D. Hawking, & K. Järvelin (Eds.), SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval (pp. 421-428). ISBN 1-59593-369-7.
-
(2006)
SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval
, pp. 421-428
-
-
Yang, H.1
Callan, J.P.2
-
66
-
-
33644552803
-
A framework for authorship identification of online messages: Writing-style features and classification techniques
-
doi: 10. 1002/asi. 20316
-
Zheng, R., Li, J., Chen, H., & Huang, Z. (2006). A framework for authorship identification of online messages: Writing-style features and classification techniques. Journal of the American Society for Information Science and Technology, 57(3), 378-393. doi: 10. 1002/asi. 20316.
-
(2006)
Journal of the American Society for Information Science and Technology
, vol.57
, Issue.3
, pp. 378-393
-
-
Zheng, R.1
Li, J.2
Chen, H.3
Huang, Z.4
|