메뉴 건너뛰기




Volumn 27, Issue 3, 2015, Pages 248-268

Uncovering highly obfuscated plagiarism cases using fuzzy semantic-based similarity model

Author keywords

Feature extraction; Fuzzy similarity; Obfuscation; Plagiarism detection; Semantic similarity

Indexed keywords


EID: 84945497641     PISSN: 13191578     EISSN: 22131248     Source Type: Journal    
DOI: 10.1016/j.jksuci.2014.12.001     Document Type: Article
Times cited : (21)

References (60)
  • 4
    • 84857505386 scopus 로고    scopus 로고
    • Understanding plagiarism linguistic patterns, textual features and detection methods
    • Alzahrani, S.M., Salim, N., Abraham, A., 2012. Understanding plagiarism linguistic patterns, textual features and detection methods. IEEE Trans. Syst. Man Cybernet. C Appl. Rev. 42, 133-149.
    • (2012) IEEE Trans. Syst. Man Cybernet. C Appl. Rev , vol.42 , pp. 133-149
    • Alzahrani, S.M.1    Salim, N.2    Abraham, A.3
  • 5
    • 84857361390 scopus 로고    scopus 로고
    • Using structural information and citation evidence to detect significant plagiarism cases
    • Alzahrani, S., Palade, V., Salim, N., Abraham, A., 2012. Using structural information and citation evidence to detect significant plagiarism cases. J. Am. Soc. Inf. Sci. Technol. 63, 286-312.
    • (2012) J. Am. Soc. Inf. Sci. Technol , vol.63 , pp. 286-312
    • Alzahrani, S.1    Palade, V.2    Salim, N.3    Abraham, A.4
  • 7
    • 67650705687 scopus 로고    scopus 로고
    • On automatic plagiarism detection based on n-grams comparison
    • Barrón-Cedeño, A., Rosso, P., 2009. On automatic plagiarism detection based on n-grams comparison. In: Advances in Information Retrieval. pp. 696-700.
    • (2009) Advances in Information Retrieval , pp. 696-700
    • Barrón-Cedeño, A.1    Rosso, P.2
  • 9
    • 84872045317 scopus 로고    scopus 로고
    • A plagiarism detection procedure in three steps: Selection, Matches and ''Squares''
    • Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), SEPLN'09. Donostia, Spain
    • Basile, C., Benedetto, D., Caglioti, E., Cristadoro, G., Esposti, M.D., 2009. A plagiarism detection procedure in three steps: Selection, Matches and ''Squares''. In: Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), 25th Conference of the Spanish Society for Natural Language Processing, SEPLN'09. Donostia, Spain, pp. 19-23.
    • (2009) 25th Conference of the Spanish Society for Natural Language Processing , pp. 19-23
    • Basile, C.1    Benedetto, D.2    Caglioti, E.3    Cristadoro, G.4    Esposti, M.D.5
  • 11
    • 84989585135 scopus 로고
    • A fuzzy linguistic approach generalizing boolean information retrieval: a model and its evaluation
    • Bordogna, G., Pasi, G., 1993. A fuzzy linguistic approach generalizing boolean information retrieval: a model and its evaluation. J. Am. Soc. Inf. Sci. Technol. 44, 70-82.
    • (1993) J. Am. Soc. Inf. Sci. Technol , vol.44 , pp. 70-82
    • Bordogna, G.1    Pasi, G.2
  • 12
    • 49749089724 scopus 로고    scopus 로고
    • Plagiarism: words and ideas
    • Bouville, M., 2008. Plagiarism: words and ideas. Sci. Eng. Ethics 14, 311-322.
    • (2008) Sci. Eng. Ethics , vol.14 , pp. 311-322
    • Bouville, M.1
  • 13
    • 33646760990 scopus 로고    scopus 로고
    • Evaluating WordNet-based measures of lexical semantic relatedness
    • Budanitsky, A., Hirst, G., 2006. Evaluating WordNet-based measures of lexical semantic relatedness. Computat. Linguist. 32, 13-47.
    • (2006) Computat. Linguist , vol.32 , pp. 13-47
    • Budanitsky, A.1    Hirst, G.2
  • 15
    • 52149094802 scopus 로고    scopus 로고
    • Plagiarism detection based on singular value decomposition
    • Ceska, Z., 2008. Plagiarism detection based on singular value decomposition. In: Lecture Notes in Computer Science. pp. 108-119.
    • (2008) Lecture Notes in Computer Science , pp. 108-119
    • Ceska, Z.1
  • 16
  • 17
    • 79952245827 scopus 로고    scopus 로고
    • Developing a corpus of plagiarised short answers
    • Special Issue on Plagiarism and Authorship Analysis
    • Clough, P., Stevenson, M., 2011. Developing a corpus of plagiarised short answers. Lang. Resour. Evaluat. 45, 5-24, Special Issue on Plagiarism and Authorship Analysis.
    • (2011) Lang. Resour. Evaluat , vol.45 , pp. 5-24
    • Clough, P.1    Stevenson, M.2
  • 19
    • 0001300961 scopus 로고
    • Fuzzy information retrieval
    • Cross, V., 1994. Fuzzy information retrieval. J. Intell. Inf. Syst. 3, 29-56.
    • (1994) J. Intell. Inf. Syst , vol.3 , pp. 29-56
    • Cross, V.1
  • 20
    • 85105937549 scopus 로고    scopus 로고
    • Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources
    • Association for Computational Linguistics, Geneva, Switzerland
    • Dolan, B., Quirk, C., Brockett, C., 2004. Unsupervised construction of large paraphrase corpora: exploiting massively parallel news sources. In: 20th International Conference on Computational Linguistics. Association for Computational Linguistics, Geneva, Switzerland, p. 350.
    • (2004) 20th International Conference on Computational Linguistics , pp. 350
    • Dolan, B.1    Quirk, C.2    Brockett, C.3
  • 23
    • 77749301855 scopus 로고    scopus 로고
    • Duplicate detection in documents and webpages using improved longest common subsequence and documents syntactical structures
    • Seoul, Korea
    • Elhadi, M., Al-Tobi, A., 2009. Duplicate detection in documents and webpages using improved longest common subsequence and documents syntactical structures. In: 4th International Conference on Computer Sciences and Convergence Information Technology, Seoul, Korea. pp. 679-684.
    • (2009) 4th International Conference on Computer Sciences and Convergence Information Technology , pp. 679-684
    • Elhadi, M.1    Al-Tobi, A.2
  • 26
    • 84870919167 scopus 로고    scopus 로고
    • ENCOPLOT: pairwise sequence matching in linear time applied to plagiarism detection
    • Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), SEPLN'09. Donostia, Spain
    • Grozea, C., Gehl, C., Popescu, M., 2009. ENCOPLOT: pairwise sequence matching in linear time applied to plagiarism detection. In: Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), 25th Conference of the Spanish Society for Natural Language Processing, SEPLN'09. Donostia, Spain, pp. 10-18.
    • (2009) 25th Conference of the Spanish Society for Natural Language Processing , pp. 10-18
    • Grozea, C.1    Gehl, C.2    Popescu, M.3
  • 27
    • 0012992939 scopus 로고    scopus 로고
    • Lexical chains as representation of context for the detection and correction malapropisms
    • Fellbaum (Ed.). (Language, Speech, and Communication). The MIT Press
    • Hirst, G., St Onge, D., 1998. Lexical chains as representation of context for the detection and correction malapropisms. In: Fellbaum (Ed.), WordNet: An Electronic Lexical Database (Language, Speech, and Communication). The MIT Press, pp. 305-332.
    • (1998) WordNet: An Electronic Lexical Database , pp. 305-332
    • Hirst, G.1    St Onge, D.2
  • 29
    • 84887479157 scopus 로고    scopus 로고
    • Finding plagiarism by evaluating document similarities
    • Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), SEPLN'09. Donostia, Spain
    • Kasprzak, J., Brandejs, M., Křipač, M., 2009. Finding plagiarism by evaluating document similarities. In: Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), 25th Conference of the Spanish Society for Natural Language Processing, SEPLN'09. Donostia, Spain, pp. 24-28.
    • (2009) 25th Conference of the Spanish Society for Natural Language Processing , pp. 24-28
    • Kasprzak, J.1    Brandejs, M.2    Křipač, M.3
  • 31
    • 42749109029 scopus 로고    scopus 로고
    • Detecting translations of the same text and data with common source
    • Koroutchev, K., Cebrian, M., 2006. Detecting translations of the same text and data with common source. J. Statist. Mech. Theory Experiment 2006, P10009.
    • (2006) J. Statist. Mech. Theory Experiment , vol.2006
    • Koroutchev, K.1    Cebrian, M.2
  • 32
    • 0002542095 scopus 로고    scopus 로고
    • Combining local context with WordNet similarity for word sense identification
    • Fellbaum, C. (Ed.). MIT Press, Cambridge, MA
    • Leacock, C., Chodorow, M., 1998. Combining local context with WordNet similarity for word sense identification. In: Fellbaum, C. (Ed.), WordNet: A Lexical Reference System and its Application. MIT Press, Cambridge, MA, pp. 265-283.
    • (1998) WordNet: A Lexical Reference System and its Application , pp. 265-283
    • Leacock, C.1    Chodorow, M.2
  • 33
    • 79151473873 scopus 로고    scopus 로고
    • A novel sentence similarity measure for semanticbased expert systems
    • Lee, M.C., 2011. A novel sentence similarity measure for semanticbased expert systems. Expert Syst. Appl. 38, 6392-6399.
    • (2011) Expert Syst. Appl , vol.38 , pp. 6392-6399
    • Lee, M.C.1
  • 35
    • 0042850512 scopus 로고    scopus 로고
    • An approach for measuring semantic similarity between words using multiple information sources
    • Li, Y., Bandar, Z.A., McLean, D., 2003. An approach for measuring semantic similarity between words using multiple information sources. IEEE Trans. Knowledge Data Eng. 15, 871-882.
    • (2003) IEEE Trans. Knowledge Data Eng , vol.15 , pp. 871-882
    • Li, Y.1    Bandar, Z.A.2    McLean, D.3
  • 37
    • 0005180705 scopus 로고    scopus 로고
    • An information-theoretic definition of similarity
    • Shavlik, J.W. (Ed.) Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
    • Lin, D., 1998. An information-theoretic definition of similarity. In: Shavlik, J.W. (Ed.), Fifteenth International Conference on Machine Learning (ICML '98). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp. 296-304.
    • (1998) Fifteenth International Conference on Machine Learning (ICML '98) , pp. 296-304
    • Lin, D.1
  • 38
    • 0005180705 scopus 로고    scopus 로고
    • An information-theoretic definition of similarity
    • Madison, Wisconsin, USA
    • Lin, D., 1998. An information-theoretic definition of similarity. In: 15th International Conference on Machine Learning, ICML '98, Madison, Wisconsin, USA. pp. 296-304.
    • (1998) 15th International Conference on Machine Learning, ICML '98 , pp. 296-304
    • Lin, D.1
  • 39
    • 79958001937 scopus 로고    scopus 로고
    • A semantic term weighting scheme for text categorization
    • Luo, Q., Chen, E., Xiong, H., 2011. A semantic term weighting scheme for text categorization. Expert Syst. Appl. 38, 12708-12716.
    • (2011) Expert Syst. Appl , vol.38 , pp. 12708-12716
    • Luo, Q.1    Chen, E.2    Xiong, H.3
  • 40
  • 43
    • 84976702763 scopus 로고
    • WordNet: a lexical database for English
    • Miller, G.A., 1995. WordNet: a lexical database for English. Commun. ACM 38, 39-41.
    • (1995) Commun. ACM , vol.38 , pp. 39-41
    • Miller, G.A.1
  • 45
    • 0000465965 scopus 로고
    • A fuzzy document retrieval system using the keyword connection matrix and a learning method
    • Ogawa, Y., Morita, T., Kobayashi, K., 1991. A fuzzy document retrieval system using the keyword connection matrix and a learning method. Fuzzy Sets Syst 39, 163-179.
    • (1991) Fuzzy Sets Syst , vol.39 , pp. 163-179
    • Ogawa, Y.1    Morita, T.2    Kobayashi, K.3
  • 46
    • 79551697630 scopus 로고    scopus 로고
    • SimPaD: a word-similarity sentencebased plagiarism detection tool on Web documents
    • Pera, M.S., Ng, Y.-K., 2011. SimPaD: a word-similarity sentencebased plagiarism detection tool on Web documents. Web Intell. Agent Syst., IOS Press 9, 27-41.
    • (2011) Web Intell. Agent Syst., IOS Press , vol.9 , pp. 27-41
    • Pera, M.S.1    Ng, Y.-K.2
  • 52
    • 0003033112 scopus 로고
    • Using information content to evaluate semantic similarity in a taxonomy
    • Mellish, Chris S. (Ed.). vol. 1. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
    • Resnik, P., 1995. Using information content to evaluate semantic similarity in a taxonomy. In: Mellish, Chris S. (Ed.),. In: 14th International Joint Conference on Artificial Intelligence-Volume 1 (IJCAI'95), vol. 1. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp. 448-453.
    • (1995) 14th International Joint Conference on Artificial Intelligence (IJCAI'95) , vol.1 , pp. 448-453
    • Resnik, P.1
  • 54
    • 84887436954 scopus 로고    scopus 로고
    • Using Microsoft SQL server platform for plagiarism detection
    • Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), SEPLN'09. Donostia, Spain
    • Scherbinin, V., Butakov, S., 2009. Using Microsoft SQL server platform for plagiarism detection. In: Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), 25th Conference of the Spanish Society for Natural Language Processing, SEPLN'09. Donostia, Spain, pp. 36-37.
    • (2009) 25th Conference of the Spanish Society for Natural Language Processing , pp. 36-37
    • Scherbinin, V.1    Butakov, S.2
  • 55
    • 77956016296 scopus 로고    scopus 로고
    • An efficient concept-based mining model for enhancing text clustering
    • Shehata, S., Karray, F., Kamel, M., 2010. An efficient concept-based mining model for enhancing text clustering. IEEE Trans. Knowledge Data Eng. 22, 1360-1371.
    • (2010) IEEE Trans. Knowledge Data Eng , vol.22 , pp. 1360-1371
    • Shehata, S.1    Karray, F.2    Kamel, M.3
  • 57
    • 0345551404 scopus 로고    scopus 로고
    • Mining the web for synonyms: PMI-IR versus LSA on TOEFL
    • Springer-Verlag, London, UK
    • Turney, P.D., 2001. Mining the web for synonyms: PMI-IR versus LSA on TOEFL. In: 12th European Conference on Machine Learning. Springer-Verlag, London, UK.
    • (2001) 12th European Conference on Machine Learning
    • Turney, P.D.1
  • 59
    • 33749028969 scopus 로고    scopus 로고
    • A sentence-based copy detection approach for web documents
    • Yerra, R., Ng, Y.-K., 2005. A sentence-based copy detection approach for web documents. In: Fuzzy Systems and Knowledge Discovery. pp. 557-570.
    • (2005) Fuzzy Systems and Knowledge Discovery , pp. 557-570
    • Yerra, R.1    Ng, Y.-K.2
  • 60
    • 84879442725 scopus 로고    scopus 로고
    • External and intrinsic plagiarism detection using vector space models
    • Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), SEPLN'09. Donostia, Spain
    • Zechner, M., Muhr, M., Kern, R., Granitzer, M., 2009. External and intrinsic plagiarism detection using vector space models. In: Stein, B., Rosso, P., Stamatatos, E., Koppel, M., Agirre, E. (Eds.), 25th Conference of the Spanish Society for Natural Language Processing, SEPLN'09. Donostia, Spain, pp. 47-55.
    • (2009) 25th Conference of the Spanish Society for Natural Language Processing , pp. 47-55
    • Zechner, M.1    Muhr, M.2    Kern, R.3    Granitzer, M.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.