메뉴 건너뛰기




Volumn 42, Issue 2, 2012, Pages 133-149

Understanding plagiarism linguistic patterns, textual features, and detection methods

Author keywords

Linguistic patterns; plagiarism; plagiarism detection; taxonomy; textual features

Indexed keywords

CONCEPT GENERALIZATION; CROSS-LINGUAL; DETECTION METHODS; EXISTING SYSTEMS; LINGUISTIC PATTERNS; PLAGIARISM; PLAGIARISM DETECTION; SYSTEMATIC FRAMEWORK; TEXTUAL FEATURES;

EID: 84857505386     PISSN: 10946977     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCC.2011.2134847     Document Type: Review
Times cited : (195)

References (142)
  • 1
    • 77958149912 scopus 로고    scopus 로고
    • ENCOPLOT: Pairwise sequence matching in linear time applied to plagiarism detection
    • Donostia, Spain
    • C. Grozea, C. Gehl, and M. Popescu, "ENCOPLOT: Pairwise sequence matching in linear time applied to plagiarism detection," in Proc. SEPLN, Donostia, Spain, 2012, pp. 10-18.
    • (2012) Proc. SEPLN , pp. 10-18
    • Grozea, C.1    Gehl, C.2    Popescu, M.3
  • 2
    • 84872045317 scopus 로고    scopus 로고
    • A plagiarism detection procedure in three steps: Selection, matches and squares
    • Donostia, Spain
    • C. Basile, D. Benedetto, E. Caglioti, G. Cristadoro, and M. D. Esposti, "A plagiarism detection procedure in three steps: Selection, matches and "squares," in Proc. SEPLN, Donostia, Spain, pp. 19-23.
    • Proc. SEPLN , pp. 19-23
    • Basile, C.1    Benedetto, D.2    Caglioti, E.3    Cristadoro, G.4    Esposti, M.D.5
  • 3
    • 77958136011 scopus 로고    scopus 로고
    • Finding plagiarism by evaluating document similarities
    • Donostia, Spain
    • J.Kasprzak,M.Brandejs, and M.K?ripač, "Finding plagiarism by evaluating document similarities," in Proc. SEPLN, Donostia, Spain, pp. 24-28.
    • Proc. SEPLN , pp. 24-28
    • Kasprzak, J.1    Brandejs, M.2    Kripač, M.3
  • 4
    • 84857506709 scopus 로고    scopus 로고
    • Using Microsoft SQL server platform for plagiarism detection
    • Donostia, Spain
    • V. Scherbinin and S. Butakov, "Using Microsoft SQL server platform for plagiarism detection," in Proc. SEPLN, Donostia, Spain, pp. 36-37.
    • Proc. SEPLN , pp. 36-37
    • Scherbinin, V.1    Butakov, S.2
  • 6
    • 77749301855 scopus 로고    scopus 로고
    • Duplicate detection in documents and webpages using improved longest common subsequence and documents syntactical structures
    • Seoul, Korea, Nov.
    • M. Elhadi and A. Al-Tobi, "Duplicate detection in documents and webpages using improved longest common subsequence and documents syntactical structures," in Proc. 4th Int. Conf. Comput. Sci. Converg. Inf. Technol., Seoul, Korea, Nov. 2009, pp. 679-684.
    • (2009) Proc. 4th Int. Conf. Comput. Sci. Converg. Inf. Technol. , pp. 679-684
    • Elhadi, M.1    Al-Tobi, A.2
  • 9
    • 77955176662 scopus 로고    scopus 로고
    • Efficient privacy-preserving similar document detection
    • M. Murugesan, W. Jiang, C. Clifton, L. Si, and J. Vaidya, "Efficient privacy-preserving similar document detection," VLDB J., vol. 19, no. 4, pp. 457-475, 2010.
    • (2010) VLDB J. , vol.19 , Issue.4 , pp. 457-475
    • Murugesan, M.1    Jiang, W.2    Clifton, C.3    Si, L.4    Vaidya, J.5
  • 11
    • 27944463057 scopus 로고    scopus 로고
    • Sentence-based natural language plagiarism detection
    • R.W. Daniel and S. J.Mike, "Sentence-based natural language plagiarism detection," ACM J. Edu. Resour. Comput., vol. 4, p. 2, 2004.
    • (2004) ACM J. Edu. Resour. Comput. , vol.4 , pp. 2
    • Daniel, R.W.1    Mike, S.J.2
  • 12
    • 62949125921 scopus 로고    scopus 로고
    • Use of text syntactical structures in detection of document duplicates
    • London, U.K.
    • M. Elhadi and A. Al-Tobi, "Use of text syntactical structures in detection of document duplicates," in Proc. 3rd Int. Conf. Digital Inf. Manage., London, U.K., 2008, pp. 520-525.
    • (2008) Proc. 3rd Int. Conf. Digital Inf. Manage. , pp. 520-525
    • Elhadi, M.1    Al-Tobi, A.2
  • 13
    • 42749109029 scopus 로고    scopus 로고
    • Detecting translations of the same text and data with common source
    • K. Koroutchev and M. Cebrián, "Detecting translations of the same text and data with common source," J. Stat. Mech.: Theor. Exp., p. P10009, 2006.
    • (2006) J. Stat. Mech.: Theor. Exp.
    • Koroutchev, K.1    Cebrián, M.2
  • 14
    • 33746369754 scopus 로고    scopus 로고
    • Sentence similarity based on semantic nets and corpus statistics
    • Aug.
    • Y. Li,D.McLean, Z.A. Bandar, J. D. O'Shea, andK. Crockett, "Sentence similarity based on semantic nets and corpus statistics," IEEE Trans. Knowl. Data Eng., vol. 18, no. 8, pp. 1138-1150, Aug. 2006.
    • (2006) IEEE Trans. Knowl. Data Eng. , vol.18 , Issue.8 , pp. 1138-1150
    • Li, Y.1    McLean, D.2    Bandar, Z.A.3    O'shea, J.D.4    Crockett, K.5
  • 16
    • 26944498639 scopus 로고    scopus 로고
    • A sentence-based copy detection approach for web documents
    • R. Yerra and Y.-K. Ng, "A sentence-based copy detection approach for web documents," in Fuzzy System and Knowledge Discovery, 2005, pp. 557-570.
    • (2005) Fuzzy System and Knowledge Discovery , pp. 557-570
    • Yerra, R.1    Ng, Y.-K.2
  • 18
    • 71449101585 scopus 로고    scopus 로고
    • On the use of fuzzy information retrieval for gauging similarity of arabic documents
    • S. Alzahrani and N. Salim, "On the use of fuzzy information retrieval for gauging similarity of arabic documents," in Proc. 2nd Int. Conf. Appl. Digital Inf. Web Technol., 2009, pp. 539-544.
    • (2009) Proc. 2nd Int. Conf. Appl. Digital Inf. Web Technol. , pp. 539-544
    • Alzahrani, S.1    Salim, N.2
  • 19
    • 84857499765 scopus 로고    scopus 로고
    • Statement-based fuzzy-set IR versus fingerprints matching for plagiarism detection in arabic documents
    • Johor Bahru, Malaysia
    • S. Alzahrani and N. Salim, "Statement-based fuzzy-set IR versus fingerprints matching for plagiarism detection in arabic documents," in Proc. 5th Postgraduate Annu. Res. Seminar, Johor Bahru, Malaysia, 2009, pp. 267-268.
    • (2009) Proc. 5th Postgraduate Annu. Res. Seminar , pp. 267-268
    • Alzahrani, S.1    Salim, N.2
  • 21
    • 77957997341 scopus 로고    scopus 로고
    • A coarse-to-fine framework to efficiently thwart plagiarism
    • H. Zhang and T. W. S. Chow, "A coarse-to-fine framework to efficiently thwart plagiarism," Pattern Recog., vol. 44, pp. 471-487, 2011.
    • (2011) Pattern Recog. , vol.44 , pp. 471-487
    • Zhang, H.1    Chow, T.W.S.2
  • 22
    • 62549150881 scopus 로고    scopus 로고
    • A survey of modern authorship attribution methods
    • E. Stamatatos, "A survey of modern authorship attribution methods," J. Amer. Soc. Inf. Sci. Technol., vol. 60, pp. 538-556, 2009.
    • (2009) J. Amer. Soc. Inf. Sci. Technol. , vol.60 , pp. 538-556
    • Stamatatos, E.1
  • 24
    • 36448995739 scopus 로고    scopus 로고
    • Strategies for retrieving plagiarized documents
    • Amsterdam, The Netherlands
    • B. Stein, S. M. z. Eissen, and M. Potthast, "Strategies for retrieving plagiarized documents," in Proc. 30th Annu. Int. ACM SIGIR, Amsterdam, The Netherlands, 2007, pp. 825-826.
    • (2007) Proc. 30th Annu. Int. ACM SIGIR , pp. 825-826
    • Stein, B.1    Eissen, S.M.Z.2    Potthast, M.3
  • 25
    • 77958131800 scopus 로고    scopus 로고
    • Overview of the 1st international competition on plagiarism detection
    • Donostia, Spain
    • M. Potthast, B. Stein, A. Eiselt, A. Barrón-Cedeño, and P. Rosso, "Overview of the 1st international competition on plagiarism detection," in Proc. SEPLN, Donostia, Spain, pp. 1-9.
    • Proc. SEPLN , pp. 1-9
    • Potthast, M.1    Stein, B.2    Eiselt, A.3    Barrón-Cedeño, A.4    Rosso, P.5
  • 26
    • 84857506708 scopus 로고    scopus 로고
    • External and intrinsic plagiarism detection using vector space models
    • Donostia, Spain
    • M. Zechner, M. Muhr, R. Kern, and M. Granitzer, "External and intrinsic plagiarism detection using vector space models," in Proc. SEPLN, Donostia, Spain, pp. 47-55.
    • Proc. SEPLN , pp. 47-55
    • Zechner, M.1    Muhr, M.2    Kern, R.3    Granitzer, M.4
  • 27
    • 84885224337 scopus 로고    scopus 로고
    • On cross-lingual plagiarism analysis using a statistical model
    • Patras, Greece
    • A. Barrón-Cedeño, P. Rosso, D. Pinto, and A. Juan, "On cross-lingual plagiarism analysis using a statistical model," in Proc. ECAI PAN Workshop, Patras, Greece, pp. 9-13.
    • Proc. ECAI PAN Workshop , pp. 9-13
    • Barrón-Cedeño, A.1    Rosso, P.2    Pinto, D.3    Juan, A.4
  • 28
    • 34250902471 scopus 로고
    • Computer algorithms for plagiarism detection
    • May
    • A. Parker and J. O. Hamblen, "Computer algorithms for plagiarism detection," IEEE Trans. Educ., vol. 32, no. 2, pp. 94-99, May 1989.
    • (1989) IEEE Trans. Educ. , vol.32 , Issue.2 , pp. 94-99
    • Parker, A.1    Hamblen, J.O.2
  • 29
    • 70349246347 scopus 로고    scopus 로고
    • Multilayer SOM with treestructured data for efficient document retrieval and plagiarism detection
    • Sep.
    • T. W. S. Chow and M. K. M. Rahman, "Multilayer SOM with treestructured data for efficient document retrieval and plagiarism detection," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1385-1402, Sep. 2009.
    • (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1385-1402
    • Chow, T.W.S.1    Rahman, M.K.M.2
  • 30
    • 84922032105 scopus 로고    scopus 로고
    • Fuzzy semantic-based string similarity for extrinsic plagiarism detection: Lab report for PAN at CLEF'10
    • presented at the, Padua, Italy
    • S. Alzahrani and N. Salim, "Fuzzy semantic-based string similarity for extrinsic plagiarism detection: Lab report for PAN at CLEF'10," presented at the 4th Int. Workshop PAN-10, Padua, Italy, 2010.
    • (2010) 4th Int. Workshop PAN-10
    • Alzahrani, S.1    Salim, N.2
  • 32
    • 33644501762 scopus 로고    scopus 로고
    • Tool support for plagiarism detection in text documents
    • Santa Fe, NM
    • G. Stefan and N. Stuart, "Tool support for plagiarism detection in text documents," in Proc. ACM Symp. Appl. Comput., Santa Fe, NM, 2005, pp. 776-781.
    • (2005) Proc. ACM Symp. Appl. Comput. , pp. 776-781
    • Stefan, G.1    Stuart, N.2
  • 33
    • 84879567529 scopus 로고    scopus 로고
    • Plagiarism detection without reference collections
    • M. zu Eissen, B. Stein, and M. Kulig, "Plagiarism detection without reference collections," in Advances in Data Analysis, 2007, pp. 359-366.
    • (2007) Advances in Data Analysis , pp. 359-366
    • Zu Eissen, M.1    Stein, B.2    Kulig, M.3
  • 35
    • 38549175583 scopus 로고    scopus 로고
    • An application of detecting plagiarism using dynamic incremental comparison method
    • Guangzhou, China
    • A. Byung-Ryul, K. Heon, and K.Moon-Hyun, "An application of detecting plagiarism using dynamic incremental comparison method," in Proc. Int. Conf. Comput. Intell. Security, Guangzhou, China, 2006, pp. 864-867.
    • (2006) Proc. Int. Conf. Comput. Intell. Security , pp. 864-867
    • Byung-Ryul, A.1    Heon, K.2    Moon-Hyun, K.3
  • 36
    • 84940363657 scopus 로고    scopus 로고
    • A statistical approach to crosslingual natural language tasks
    • D. Pinto, J. Civera, A. Barrón-Cedeño, A. Juan, and P. Rosso, "A statistical approach to crosslingual natural language tasks," J. Algorithms, vol. 64, pp. 51-60, 2009.
    • (2009) J. Algorithms , vol.64 , pp. 51-60
    • Pinto, D.1    Civera, J.2    Barrón-Cedeño, A.3    Juan, A.4    Rosso, P.5
  • 38
    • 78049356024 scopus 로고    scopus 로고
    • A new approach for cross-language plagiarism analysis
    • M. Agosti, N. Ferro, C. Peters, M. de Rijke, and A. Smeaton, Eds. Berlin, Germany: Springer
    • R. Corezola Pereira, V. Moreira, and R. Galante, "A new approach for cross-language plagiarism analysis," in Multilingual and Multimodal Information Access Evaluation, vol. 6360, M. Agosti, N. Ferro, C. Peters, M. de Rijke, and A. Smeaton, Eds. Berlin, Germany: Springer, 2010, pp. 15-26.
    • (2010) Multilingual and Multimodal Information Access Evaluation , vol.6360 , pp. 15-26
    • Corezola Pereira, R.1    Moreira, V.2    Galante, R.3
  • 39
    • 33644552803 scopus 로고    scopus 로고
    • A framework for authorship identification of online messages: Writing-style features and classification techniques
    • R. Zheng, J. Li, H. Chen, and Z. Huang, "A framework for authorship identification of online messages: Writing-style features and classification techniques," J. Amer. Soc. Inf. Sci. Technol., vol. 57, pp. 378-393, 2006.
    • (2006) J. Amer. Soc. Inf. Sci. Technol. , vol.57 , pp. 378-393
    • Zheng, R.1    Li, J.2    Chen, H.3    Huang, Z.4
  • 41
    • 33846949415 scopus 로고    scopus 로고
    • Author verification by linguistic profiling: An exploration of the parameter space
    • H. V. Halteren, "Author verification by linguistic profiling: An exploration of the parameter space," ACM Trans. Speech Lang. Process., vol. 4, pp. 1-17, 2007.
    • (2007) ACM Trans. Speech Lang. Process. , vol.4 , pp. 1-17
    • Halteren, H.V.1
  • 43
    • 33749397963 scopus 로고
    • SCAM: A copy detection mechanism for digital documents
    • N. Shivakumar and H. Garcia-Molina, "SCAM: A copy detection mechanism for digital documents," in D-Lib Mag., 1995.
    • (1995) D-Lib Mag.
    • Shivakumar, N.1    Garcia-Molina, H.2
  • 44
    • 84976799118 scopus 로고
    • An algorithmic approach to the detection and prevention of plagiarism
    • K. J. Ottenstein, "An algorithmic approach to the detection and prevention of plagiarism," SIGCSE Bull., vol. 8, no. 4, pp. 30-41, 1977.
    • (1977) SIGCSE Bull. , vol.8 , Issue.4 , pp. 30-41
    • Ottenstein, K.J.1
  • 45
    • 84976764707 scopus 로고
    • A plagiarism detection system
    • L. D. John, L. Ann-Marie, and H. S. Paula, "A plagiarism detection system," SIGCSE Bull., vol. 13, no. 1, pp. 21-25, 1981.
    • (1981) SIGCSE Bull. , vol.13 , Issue.1 , pp. 21-25
    • John, L.D.1    Ann-Marie, L.2    Paula, H.S.3
  • 46
    • 84976757541 scopus 로고
    • A tool that detects plagiarism in Pascal programs
    • G. Sam, "A tool that detects plagiarism in Pascal programs," SIGCSE Bull., vol. 13, no. 1, pp. 15-20, 1981.
    • (1981) SIGCSE Bull. , vol.13 , Issue.1 , pp. 15-20
    • Sam, G.1
  • 48
    • 78449260282 scopus 로고    scopus 로고
    • The future of copy detection techniques
    • Pilsen, Czech Republic
    • Z. Ceska, "The future of copy detection techniques," in Proc. YRCAS, Pilsen, Czech Republic, pp. 5-10.
    • Proc. YRCAS , pp. 5-10
    • Ceska, Z.1
  • 49
    • 84967627259 scopus 로고    scopus 로고
    • The evolution of stylometry in humanities scholarship
    • D. I. Holmes, "The evolution of stylometry in humanities scholarship," Lit Linguist Comput., vol. 13, pp. 111-117, 1998.
    • (1998) Lit Linguist Comput. , vol.13 , pp. 111-117
    • Holmes, D.I.1
  • 50
    • 0042367634 scopus 로고    scopus 로고
    • Mining e-mail content for author identification forensics
    • O. deVel, A. Anderson, M. Corney, and G. Mohay, "Mining e-mail content for author identification forensics," SIGMOD Rec., vol. 30, pp. 55-64, 2001.
    • (2001) SIGMOD Rec. , vol.30 , pp. 55-64
    • Devel, O.1    Anderson, A.2    Corney, M.3    Mohay, G.4
  • 51
    • 54749139664 scopus 로고    scopus 로고
    • How variable may a constant be? Measures of lexical richness in perspective
    • F. J. Tweedie and R. H. Baayen, "How variable may a constant be? Measures of lexical richness in perspective," Comput. Humanities, vol. 32, pp. 323-352, 1998.
    • (1998) Comput. Humanities , vol.32 , pp. 323-352
    • Tweedie, F.J.1    Baayen, R.H.2
  • 53
    • 62949095390 scopus 로고    scopus 로고
    • Old and new challenges in automatic plagiarism detection
    • [Online]
    • P. Clough, (2003) Old and new challenges in automatic plagiarism detection. National UK Plagiarism Advisory Service. [Online]. Available: http://ir.shef.ac.uk/cloughie/papers/pas-plagiarism.pdf
    • (2003) National UK Plagiarism Advisory Service
    • Clough, P.1
  • 55
    • 70449604984 scopus 로고    scopus 로고
    • Computer-based plagiarism detection methods and tools: An overview
    • presented at the, Rousse, Bulgaria
    • L. Romans, G. Vita, and G. Janis, "Computer-based plagiarism detection methods and tools: An overview," presented at the Int. Conf. Comput. Syst. Technol., Rousse, Bulgaria, 2007.
    • (2007) Int. Conf. Comput. Syst. Technol.
    • Romans, L.1    Vita, G.2    Janis, G.3
  • 56
    • 18744405825 scopus 로고    scopus 로고
    • Style mining of electronic messages for multiple authorship discrimination: First results
    • presented at the, Washington, DC
    • S. Argamon, A. Marin, and S. S. Stein, "Style mining of electronic messages for multiple authorship discrimination: First results," presented at the 9th ACM SIGKDD Int. Conf. Know. Discovery Data Mining, Washington, DC, 2003.
    • (2003) 9th ACM SIGKDD Int. Conf. Know. Discovery Data Mining
    • Argamon, S.1    Marin, A.2    Stein, S.S.3
  • 59
    • 34147123127 scopus 로고    scopus 로고
    • On authorship attribution via Markov chains and sequence kernels
    • Hong Kong
    • C. Sanderson and S. Guenter, "On authorship attribution via Markov chains and sequence kernels," in Proc. 18th Int. Conf. Pattern Recog., Hong Kong, 2006, pp. 437-440.
    • (2006) Proc. 18th Int. Conf. Pattern Recog. , pp. 437-440
    • Sanderson, C.1    Guenter, S.2
  • 63
    • 85050835142 scopus 로고    scopus 로고
    • Academic writing and plagiarism: A linguistic analysis
    • J. Bloch, "Academic writing and plagiarism: A linguistic analysis," English for Specific Purposes, vol. 28, pp. 282-285, 2009.
    • (2009) English for Specific Purposes , vol.28 , pp. 282-285
    • Bloch, J.1
  • 64
    • 62449255391 scopus 로고    scopus 로고
    • Avoiding plagiarism in academic writing
    • I. Anderson, "Avoiding plagiarism in academic writing," Nurs. Standard, vol. 23, no. 18, pp. 35-37, 2009.
    • (2009) Nurs. Standard , vol.23 , Issue.18 , pp. 35-37
    • Anderson, I.1
  • 65
    • 41549103836 scopus 로고    scopus 로고
    • Plagiarism, a scourge
    • K. R. Rao, "Plagiarism, a scourge," Current Sci., vol. 94, pp. 581-586, 2008.
    • (2008) Current Sci. , vol.94 , pp. 581-586
    • Rao, K.R.1
  • 66
    • 84857499905 scopus 로고    scopus 로고
    • Intrinsic plagiarism detection using character n-gram profiles
    • Donostia, Spain
    • E. Stamatatos, "Intrinsic plagiarism detection using character n-gram profiles," in Proc. SEPLN, Donostia, Spain, pp. 38-46.
    • Proc. SEPLN , pp. 38-46
    • Stamatatos, E.1
  • 67
    • 84857502951 scopus 로고    scopus 로고
    • Authors, genre, and linguistic convention
    • presented at the, Amsterdam, The Netherlands, to be published
    • J. Karlgren and G. Eriksson, "Authors, genre, and linguistic convention," presented at the SIGIR Forum PAN, Amsterdam, The Netherlands, to be published.
    • SIGIR Forum PAN
    • Karlgren, J.1    Eriksson, G.2
  • 68
    • 85040385892 scopus 로고    scopus 로고
    • Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution
    • Sep.
    • H. Baayen, H. van Halteren, and F. Tweedie, "Outside the cave of shadows: using syntactic annotation to enhance authorship attribution," Lit. Linguist. Comput., vol. 11, pp. 121-132, Sep. 1996.
    • (1996) Lit. Linguist. Comput. , vol.11 , pp. 121-132
    • Baayen, H.1    Van Halteren, H.2    Tweedie, F.3
  • 69
    • 85119093321 scopus 로고    scopus 로고
    • Linguistic correlates of style: Authorship classification with deep linguistic analysis features
    • presented at the, Geneva, Switzerland
    • M. Gamon, "Linguistic correlates of style: Authorship classification with deep linguistic analysis features," presented at the 20th Int. Conf. Comput. Linguist., Geneva, Switzerland, 2004.
    • (2004) 20th Int. Conf. Comput. Linguist.
    • Gamon, M.1
  • 71
    • 84857374459 scopus 로고    scopus 로고
    • Back-translation: The latest form of plagiarism
    • presented at the, Wollongong, Australia
    • M. Jones, "Back-translation: The latest form of plagiarism," presented at the 4th Asia Pacific Conf. Edu Integr., Wollongong, Australia, 2009.
    • (2009) 4th Asia Pacific Conf. Edu Integr.
    • Jones, M.1
  • 73
    • 1542378981 scopus 로고    scopus 로고
    • Intelligent plagiarists are the most dangerous
    • L. Stenflo, "Intelligent plagiarists are the most dangerous," Nature, vol. 427, p. 777, 2004.
    • (2004) Nature , vol.427 , pp. 777
    • Stenflo, L.1
  • 74
    • 49749089724 scopus 로고    scopus 로고
    • Plagiarism: Words and ideas
    • M. Bouville, "Plagiarism: Words and ideas," Sci. Eng. Ethics, vol. 14, pp. 311-322, 2008.
    • (2008) Sci. Eng. Ethics , vol.14 , pp. 311-322
    • Bouville, M.1
  • 76
    • 33646013753 scopus 로고    scopus 로고
    • Discriminating the registers and styles in the modern Greek language-Part 2: Extending the feature vector to optimise author discrimination
    • G. Tambouratzis, S. Markantonatou, N. Hairetakis, M. Vassiliou, G. Carayannis, and D. Tambouratzis, "Discriminating the registers and styles in the modern Greek language-Part 2: Extending the feature vector to optimise author discrimination," Lit. Linguist. Comput., vol. 19, pp. 221-242, 2004.
    • (2004) Lit. Linguist. Comput. , vol.19 , pp. 221-242
    • Tambouratzis, G.1    Markantonatou, S.2    Hairetakis, N.3    Vassiliou, M.4    Carayannis, G.5    Tambouratzis, D.6
  • 77
    • 51849157438 scopus 로고    scopus 로고
    • Detecting and tracing plagiarized documents by reconstruction plagiarism-evolution tree
    • Sydney, N.S.W.
    • C. K. Ryu, H. J. Kim, S. H. Ji, G. Woo, and H. G. Cho, "Detecting and tracing plagiarized documents by reconstruction plagiarism-evolution tree," in Proc. 8th Int. Conf. Comput. Inf. Technol., Sydney, N.S.W., 2008, pp. 119-124.
    • (2008) Proc. 8th Int. Conf. Comput. Inf. Technol. , pp. 119-124
    • Ryu, C.K.1    Kim, H.J.2    Ji, S.H.3    Woo, G.4    Cho, H.G.5
  • 78
    • 84857506319 scopus 로고    scopus 로고
    • "Counter plagiarism detection software" and "counter counter plagiarism detection" methods
    • Donostia, Spain
    • Y. Palkovskii, ""Counter plagiarism detection software" and "Counter counter plagiarism detection" methods," in Proc. SEPLN, Donostia, Spain, pp. 67-68.
    • Proc. SEPLN , pp. 67-68
    • Palkovskii, Y.1
  • 79
    • 84857506320 scopus 로고    scopus 로고
    • Tackling the PAN'09 external plagiarism detection corpus with a desktop plagiarism detector
    • Donostia, Spain
    • J. A. Malcolm and P. C. R. Lane, "Tackling the PAN'09 external plagiarism detection corpus with a desktop plagiarism detector," in Proc. SEPLN, Donostia, Spain, pp. 29-33.
    • Proc. SEPLN , pp. 29-33
    • Malcolm, J.A.1    Lane, P.C.R.2
  • 80
    • 70449417857 scopus 로고    scopus 로고
    • A word-frequency based method for detecting plagiarism in documents
    • Las Vegas, NV
    • R. Lackes, J. Bartels, E. Berndt, and E. Frank, "A word-frequency based method for detecting plagiarism in documents," in Proc. Int. Conf. Inf. Reuse Integr., Las Vegas, NV, 2009, pp. 163-166.
    • (2009) Proc. Int. Conf. Inf. Reuse Integr. , pp. 163-166
    • Lackes, R.1    Bartels, J.2    Berndt, E.3    Frank, E.4
  • 81
    • 70449433577 scopus 로고    scopus 로고
    • On the number of search queries required for Internet plagiarism detection
    • Riga, Latvia
    • S. Butakov and V. Shcherbinin, "On the number of search queries required for Internet plagiarism detection," in Proc. 9th IEEE Int. Conf. Adv. Learn. Technol., Riga, Latvia, 2009, pp. 482-483.
    • (2009) Proc. 9th IEEE Int. Conf. Adv. Learn. Technol. , pp. 482-483
    • Butakov, S.1    Shcherbinin, V.2
  • 82
    • 60749099356 scopus 로고    scopus 로고
    • The toolbox for local and global plagiarism detection
    • S. Butakov and V. Scherbinin, "The toolbox for local and global plagiarism detection," Comput. Educ., vol. 52, pp. 781-788, 2009.
    • (2009) Comput. Educ. , vol.52 , pp. 781-788
    • Butakov, S.1    Scherbinin, V.2
  • 84
    • 78649264331 scopus 로고    scopus 로고
    • Putting ourselves in SME's shoes: Automatic detection of plagiarism by the WCopyFind tool
    • Donostia, Spain
    • E. V. Balaguer, "Putting ourselves in SME's shoes: Automatic detection of plagiarism by the WCopyFind tool," in Proc. SEPLN,Donostia, Spain, pp. 34-35.
    • Proc. SEPLN , pp. 34-35
    • Balaguer, E.V.1
  • 85
    • 57849106779 scopus 로고    scopus 로고
    • Plagiarism detection in Chinese based on chunk and paragraph weight
    • Kunming, Beijing, China
    • T. Wang, X. Z. Fan, and J. Liu, "Plagiarism detection in Chinese based on chunk and paragraph weight," in Proc. 7th Int. Conf. Mach. Learn. Cybern., Kunming, Beijing, China, 2008, pp. 2574-2579.
    • (2008) Proc. 7th Int. Conf. Mach. Learn. Cybern. , pp. 2574-2579
    • Wang, T.1    Fan, X.Z.2    Liu, J.3
  • 86
    • 62949153064 scopus 로고    scopus 로고
    • Algorithm of the longest commonly consecutive word for plagiarism detection in text based document
    • London, U.K.
    • A. Sediyono, K. Ruhana, and K. Mahamud, "Algorithm of the longest commonly consecutive word for plagiarism detection in text based document," in Proc. 3rd Int. Conf. Dig. Inf. Manage., London, U.K., 2008, pp. 253-259.
    • (2008) Proc. 3rd Int. Conf. Dig. Inf. Manage. , pp. 253-259
    • Sediyono, A.1    Ruhana, K.2    Mahamud, K.3
  • 87
    • 84897097498 scopus 로고    scopus 로고
    • Plagiarism detection through vector space models applied to a digital library
    • Karlova Studánka, Czech Republic
    • R. Řehurek, "Plagiarism detection through vector space models applied to a digital library," in Proc. RASLAN, Karlova Studánka, Czech Republic, pp. 75-83.
    • Proc. RASLAN , pp. 75-83
    • Řehurek, R.1
  • 88
    • 52149094802 scopus 로고    scopus 로고
    • Plagiarism detection based on singular value decomposition
    • Lecture Notes in Artificial Intelligence
    • Z. Ceska, "Plagiarism detection based on singular value decomposition," in Lecture Notes in Computer Science, vol. 5221, Lecture Notes in Artificial Intelligence, pp. 108-119, 2008.
    • (2008) Lecture Notes in Computer Science , vol.5221 , pp. 108-119
    • Ceska, Z.1
  • 89
    • 62949198590 scopus 로고    scopus 로고
    • A natural language processing approach to automatic plagiarism detection
    • New York
    • C. H. Leung and Y. Y. Chan, "A natural language processing approach to automatic plagiarism detection," in Proc. ACM Inf. Technol. Educ. Conf., New York, 2007, pp. 213-218.
    • (2007) Proc. ACM Inf. Technol. Educ. Conf. , pp. 213-218
    • Leung, C.H.1    Chan, Y.Y.2
  • 91
    • 50049121116 scopus 로고    scopus 로고
    • Fast and reliable plagiarism detection system
    • presented at the, Milwaukee, WI
    • M. Mozgovoy, S. Karakovskiy, and V. Klyuev, "Fast and reliable plagiarism detection system," presented at the Frontiers Educ. Conf., Milwaukee, WI, 2007.
    • (2007) Frontiers Educ. Conf.
    • Mozgovoy, M.1    Karakovskiy, S.2    Klyuev, V.3
  • 94
    • 37849028081 scopus 로고    scopus 로고
    • SNITCH: A software tool for detecting cut and paste plagiarism
    • New York
    • N. Sebastian and P.W. Thomas, "SNITCH: A software tool for detecting cut and paste plagiarism," in Proc. 37th SIGCSE Symp. Comput. Sci. Educ., New York, 2006, pp. 51-55.
    • (2006) Proc. 37th SIGCSE Symp. Comput. Sci. Educ. , pp. 51-55
    • Sebastian, N.1    Thomas, P.W.2
  • 97
    • 33750236086 scopus 로고    scopus 로고
    • PPChecker: Plagiarism pattern checker in document copy detection
    • N. Kang, A. Gelbukh, and S. Han, "PPChecker: Plagiarism pattern checker in document copy detection," in Text, Speech and Dialogue, 2006, pp. 661-667.
    • (2006) Text, Speech and Dialogue , pp. 661-667
    • Kang, N.1    Gelbukh, A.2    Han, S.3
  • 99
    • 35548982223 scopus 로고    scopus 로고
    • Copy detection inChinese documents using Ferret
    • J. Bao, C.Lyon, and P.Lane, "Copy detection inChinese documents using Ferret," in Language Resources & Evaluation, 2006, vol. 40, pp. 357-365.
    • (2006) Language Resources & Evaluation , vol.40 , pp. 357-365
    • Bao, J.1    Lyon, C.2    Lane, P.3
  • 101
    • 85044002196 scopus 로고    scopus 로고
    • Plagiarism detection in Arabic scripts using fuzzy information retrieval
    • presented at the, Johor Bahru, Malaysia
    • S. M. Alzahrani and N. Salim, "Plagiarism detection in Arabic scripts using fuzzy information retrieval," presented at the Student Conf. Res. Develop., Johor Bahru, Malaysia, 2008.
    • (2008) Student Conf. Res. Develop.
    • Alzahrani, S.M.1    Salim, N.2
  • 103
    • 39649105441 scopus 로고    scopus 로고
    • Author identification: Using text sampling to handle the class imbalance problem
    • E. Stamatatos, "Author identification: Using text sampling to handle the class imbalance problem," Inf. Process. Manage., vol. 44, pp. 790-799, 2008.
    • (2008) Inf. Process. Manage. , vol.44 , pp. 790-799
    • Stamatatos, E.1
  • 105
    • 80053148605 scopus 로고    scopus 로고
    • Intrinsic plagiarism detection using complexity analysis
    • Donostia, Spain
    • L. Seaward and S. Matwin, "Intrinsic plagiarism detection using complexity analysis," in Proc. SEPLN, Donostia, Spain, pp. 56-61.
    • Proc. SEPLN , pp. 56-61
    • Seaward, L.1    Matwin, S.2
  • 106
    • 84857499902 scopus 로고    scopus 로고
    • Ordinal measures in authorship identification
    • Donostia, Spain
    • L. P. Dinu and M. Popescu, "Ordinal measures in authorship identification," in Proc. SEPLN, Donostia, Spain, pp. 62-66.
    • Proc. SEPLN , pp. 62-66
    • Dinu, L.P.1    Popescu, M.2
  • 107
    • 77950586006 scopus 로고    scopus 로고
    • Plagiarism analysis, authorship identification, and near-duplicate detection
    • New York
    • S. Benno, K. Moshe, and S. Efstathios, "Plagiarism analysis, authorship identification, and near-duplicate detection," in Proc. ACM SIGIR Forum PAN'07, New York, pp. 68-71.
    • Proc. ACM SIGIR Forum PAN'07 , pp. 68-71
    • Benno, S.1    Moshe, K.2    Efstathios, S.3
  • 108
    • 52449127415 scopus 로고    scopus 로고
    • A platform framework for crosslingual text relatedness evaluation and plagiarism detection
    • presented at the, Dalian, Liaoning, China
    • C. H. Lee, C. H. Wu, and H. C. Yang, "A platform framework for crosslingual text relatedness evaluation and plagiarism detection," presented at the 3rd Int. Conf. Innov. Comput. Inf. Control, Dalian, Liaoning, China, 2008.
    • (2008) 3rd Int. Conf. Innov. Comput. Inf. Control
    • Lee, C.H.1    Wu, C.H.2    Yang, H.C.3
  • 112
    • 85040683480 scopus 로고    scopus 로고
    • A taxonomy of information retrieval models and tools
    • G. Canfora and L. Cerulo, "A taxonomy of information retrieval models and tools," J. Comput. Inf. Technol., vol. 12, pp. 175-194, 2004.
    • (2004) J. Comput. Inf. Technol. , vol.12 , pp. 175-194
    • Canfora, G.1    Cerulo, L.2
  • 115
    • 34548080780 scopus 로고    scopus 로고
    • Web search basics: Near-duplicates and shingling
    • Cambridge, U.K.: Cambridge Univ. Press
    • C. D. Manning, P. Raghavan, and H. Schütze, "Web search basics: Near-duplicates and shingling," in Introduction to Information Retrieval. Cambridge, U.K.: Cambridge Univ. Press, 2008, pp. 437-442.
    • (2008) Introduction to Information Retrieval , pp. 437-442
    • Manning, C.D.1    Raghavan, P.2    Schütze, H.3
  • 116
    • 0013207911 scopus 로고    scopus 로고
    • Scalable document fingerprinting
    • Commerce
    • N. Heintze, "Scalable document fingerprinting," in Proc. 2nd USENIX Workshop Electron. Commerce, 1996, pp. 191-200.
    • (1996) Proc. 2nd USENIX Workshop Electron , pp. 191-200
    • Heintze, N.1
  • 117
    • 36448954599 scopus 로고    scopus 로고
    • Principles of hash-based text retrieval
    • Amsterdam, The Netherlands
    • B. Stein, "Principles of hash-based text retrieval," in Proc. 30th Annu. Int. ACM SIGIR, Amsterdam, The Netherlands, 2007, pp. 527-534.
    • (2007) Proc. 30th Annu. Int. ACM SIGIR , pp. 527-534
    • Stein, B.1
  • 119
    • 78650343482 scopus 로고    scopus 로고
    • Scoring, term weighting and the vector space model
    • Cambridge, U.K.: Cambridge Univ. Press
    • C. D. Manning, P. Raghavan, and H. Schütze, "Scoring, term weighting and the vector space model," in Introduction to Information Retrieval. Cambridge, U.K.: Cambridge Univ. Press, 2009, pp. 109-133.
    • (2009) Introduction to Information Retrieval , pp. 109-133
    • Manning, C.D.1    Raghavan, P.2    Schütze, H.3
  • 120
    • 34548080780 scopus 로고    scopus 로고
    • Matrix decompositions and latent semantic indexing
    • Cambridge, U.K.: Cambridge Univ. Press
    • C. D. Manning, P. Raghavan, and H. Schütze, "Matrix decompositions and latent semantic indexing," in Introduction to Information Retrieval. Cambridge, U.K.: Cambridge Univ. Press, 2009, pp. 403-417.
    • (2009) Introduction to Information Retrieval. , pp. 403-417
    • Manning, C.D.1    Raghavan, P.2    Schütze, H.3
  • 121
    • 85013386603 scopus 로고    scopus 로고
    • A similarity-based probability model for latent semantic indexing
    • Berkeley, CA
    • H. Q. D. Chris, "A similarity-based probability model for latent semantic indexing," in Proc. 22nd Annu. Int. ACM SIGIR, Berkeley, CA, 1999, pp. 58-65.
    • (1999) Proc. 22nd Annu. Int. ACM SIGIR , pp. 58-65
    • Chris, H.Q.D.1
  • 122
    • 0002723321 scopus 로고
    • Generating, integrating, and activating thesauri for concept-based document retrieval
    • Apr.
    • H. Chen, K. J. Lynch, K. Basu, and T. D. Ng, "Generating, integrating, and activating thesauri for concept-based document retrieval," IEEE Expert, Intelli. Syst. Appl., vol. 8, no. 2, pp. 25-34, Apr. 1993.
    • (1993) IEEE Expert, Intelli. Syst. Appl. , vol.8 , Issue.2 , pp. 25-34
    • Chen, H.1    Lynch, K.J.2    Basu, K.3    Ng, T.D.4
  • 124
    • 0000465965 scopus 로고
    • A fuzzy document retrieval system using the keyword connection matrix and a learning method
    • Y. Ogawa, T. Morita, and K. Kobayashi, "A fuzzy document retrieval system using the keyword connection matrix and a learning method," Fuzzy Sets Syst., vol. 39, pp. 163-179, 1991.
    • (1991) Fuzzy Sets Syst. , vol.39 , pp. 163-179
    • Ogawa, Y.1    Morita, T.2    Kobayashi, K.3
  • 125
    • 0001300961 scopus 로고
    • Fuzzy information retrieval
    • V. Cross, "Fuzzy information retrieval," J. Intell. Syst., vol. 3, pp. 29-56, 1994.
    • (1994) J. Intell. Syst. , vol.3 , pp. 29-56
    • Cross, V.1
  • 126
    • 67349233682 scopus 로고    scopus 로고
    • Fuzzy information retrieval model revisited
    • S. Zadrozny and K. Nowacka, "Fuzzy information retrieval model revisited," Fuzzy Sets Syst., vol. 160, pp. 2173-2191, 2009.
    • (2009) Fuzzy Sets Syst. , vol.160 , pp. 2173-2191
    • Zadrozny, S.1    Nowacka, K.2
  • 127
    • 0032498339 scopus 로고    scopus 로고
    • An introduction to fuzzy systems
    • D. Dubois and H. Prade, "An introduction to fuzzy systems," Clinica Chimica Acta, vol. 270, pp. 3-29, 1998.
    • (1998) Clinica Chimica Acta , vol.270 , pp. 3-29
    • Dubois, D.1    Prade, H.2
  • 128
    • 84857501380 scopus 로고    scopus 로고
    • Language models for information retrieval
    • Cambridge, U.K.: Cambridge Univ. Press
    • C. D. Manning, P. Raghavan, and H. Schütze, "Language models for information retrieval," in Introduction to Information Retrieval. Cambridge, U.K.: Cambridge Univ. Press, 2009, pp. 237-252.
    • (2009) Introduction to Information Retrieval. , pp. 237-252
    • Manning, C.D.1    Raghavan, P.2    Schütze, H.3
  • 132
    • 0032099961 scopus 로고    scopus 로고
    • Conceptual clustering in information retrieval
    • Jun.
    • S. K. Bhatia and J. S. Deogun, "Conceptual clustering in information retrieval," IEEE Trans. Systems Man Cybern. B, Cybern., vol. 28, no. 3, pp. 427-436, Jun. 1998.
    • (1998) IEEE Trans. Systems Man Cybern. B, Cybern. , vol.28 , Issue.3 , pp. 427-436
    • Bhatia, S.K.1    Deogun, J.S.2
  • 134
    • 33846267243 scopus 로고    scopus 로고
    • A flexible multi-layer self-organizing map for generic processing of tree-structured data
    • M. K. M. Rahman, W. Pi Yang, T. W. S. Chow, and S. Wu, "A flexible multi-layer self-organizing map for generic processing of tree-structured data," Pattern Recog., vol. 40, pp. 1406-1424, 2007.
    • (2007) Pattern Recog. , vol.40 , pp. 1406-1424
    • Rahman, M.K.M.1    Pi Yang, W.2    Chow, T.W.S.3    Wu, S.4
  • 135
    • 84857506703 scopus 로고    scopus 로고
    • Soft computing pattern recognition, data mining and web intelligence
    • N. Zhong and J. Lie, Eds. Berlin, Germany: Springer-Verlag
    • S. K. Pal, S. Mitra, and P. Mitra, "Soft computing pattern recognition, data mining and web intelligence," in Intelligent Technologies for Information Analysis, N. Zhong and J. Lie, Eds. Berlin, Germany: Springer-Verlag, 2004.
    • (2004) Intelligent Technologies for Information Analysis
    • Pal, S.K.1    Mitra, S.2    Mitra, P.3
  • 136
    • 0344972929 scopus 로고    scopus 로고
    • WEBSOM: Selforganizing maps of document collections
    • S. Kaski, T. Honkela, K. Lagus, and T. Kohonen, "WEBSOM: Selforganizing maps of document collections," Neurocomput., vol. 21, pp. 101-117, 1998.
    • (1998) Neurocomput. , vol.21 , pp. 101-117
    • Kaski, S.1    Honkela, T.2    Lagus, K.3    Kohonen, T.4
  • 137
    • 3543087974 scopus 로고    scopus 로고
    • LSISOM: A latent semantic indexing approach to self-organizing maps of document collections
    • N. Ampazis and S. Perantonis, "LSISOM: A latent semantic indexing approach to self-organizing maps of document collections," Neural Process. Lett., vol. 19, pp. 157-173, 2004.
    • (2004) Neural Process. Lett. , vol.19 , pp. 157-173
    • Ampazis, N.1    Perantonis, S.2
  • 139
    • 0345566149 scopus 로고    scopus 로고
    • A guided tour to approximate string matching
    • G. Navarro, "A guided tour to approximate string matching," ACM Comput. Surveys, vol. 33, pp. 31-88, 2001.
    • (2001) ACM Comput. Surveys , vol.33 , pp. 31-88
    • Navarro, G.1
  • 140
  • 141
    • 84976702763 scopus 로고
    • WordNet: A lexical database for English
    • G. A. Miller, "WordNet: A lexical database for English," Commun. ACM, vol. 38, pp. 39-41, 1995.
    • (1995) Commun. ACM , vol.38 , pp. 39-41
    • Miller, G.A.1
  • 142
    • 71349083400 scopus 로고    scopus 로고
    • Content-based hierarchical document organization using multi-layer hybrid network and tree-structured features
    • M. K. M. Rahman and T. W. S. Chow, "Content-based hierarchical document organization using multi-layer hybrid network and tree-structured features," Expert Syst. Appl., vol. 37, pp. 2874-2881, 2010.
    • (2010) Expert Syst. Appl. , vol.37 , pp. 2874-2881
    • Rahman, M.K.M.1    Chow, T.W.S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.