Volume , Issue , 2011, Pages 197-202

Detecting text similarity over Chinese research papers using MapReduce

Author keywords

Chinese Research Papers; Copy Detection; MapReduce; Parallel Algorithm; Similarity

Indexed keywords

2-TUPLE; COPY DETECTION; KEY-PHRASE; MAP-REDUCE; RESEARCH PAPERS; RUNNING TIME; SIMILARITY; SIMILARITY COEFFICIENTS; STATE-OF-THE-ART METHODS; TEXT SIMILARITY;

EID: 81255143639     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SNPD.2011.29     Document Type: Conference Paper
Times cited: 3

References (21)
  • 1
    • 34250638291
    • A Web-based Kernel Function for Measuring the Similarity of Short Text Snippets
    • Sahami M. and Heilman T. D., "A Web-based Kernel Function for Measuring the Similarity of Short Text Snippets," in Proceedings of WWW, 2006.
    • (2006) Proceedings of WWW
    • Sahami, M.1    Heilman, T.D.2
  • 2
    • 0037319544
    • Methods for Identifying Versioned and Plagiarized Documents
    • Hoad T. C. and Zobel J., "Methods for Identifying Versioned and Plagiarized Documents," in JASIST, vol. 54, 2003, pp. 203-215.
    • (2003) JASIST , vol.54 , pp. 203-215
    • Hoad, T.C.1    Zobel, J.2
  • 3
    • 32344452531
    • Evaluating Similarity Measures: A Large-scale Study in the Orkut Social Network
    • Spertus E., Sahami M. and Buyukkokten O., "Evaluating Similarity Measures: A Large-scale Study in the Orkut Social Network," in Proceedings of KDD, 2005.
    • (2005) Proceedings of KDD
    • Spertus, E.1    Sahami, M.2    Buyukkokten, O.3
  • 4
    • 79952095979
    • Features Based Text Similarity Detection
    • Kent C.W. and Salim N., "Features Based Text Similarity Detection," in Journal of Computing, vol. 2, 2010, pp. 53-57.
    • (2010) Journal of Computing , vol.2 , pp. 53-57
    • Kent, C.W.1    Salim, N.2
  • 7
  • 8
    • 85094045221
    • Document Representation and Multilevel Measures of Document Similarity
    • Matveeva I., "Document Representation and Multilevel Measures of Document Similarity," in Proceedings of ACL-HLT, 2006.
    • (2006) Proceedings of ACL-HLT
    • Matveeva, I.1
  • 9
    • 84863347445
    • Detecting Text Similarity over Short Passages: Exploring Linguistic Feature Combinations via Machine Learning
    • Hatzivassiloglou V., Klavans J.L. and Eskin E., "Detecting Text Similarity over Short Passages: Exploring Linguistic Feature Combinations via Machine Learning," in Proceedings of SIGDAT, 1999.
    • (1999) Proceedings of SIGDAT
    • Hatzivassiloglou, V.1    Klavans, J.L.2    Eskin, E.3
  • 10
    • 79957966387
    • Learning Term-weighting Functions for Similarity Measures
    • Yih W., "Learning Term-weighting Functions for Similarity Measures," in Proceedings of EMNLP, 2009.
    • (2009) Proceedings of EMNLP
    • Yih, W.1
  • 12
    • 4944224800
    • Identifying and Filtering Near Duplicate Documents
    • Broder A. Z., "Identifying and Filtering Near Duplicate Documents," in Proceedings of COM, 2000.
    • (2000) Proceedings of COM
    • Broder, A.Z.1
  • 13
    • 0001368373
    • Étude Comparative de la Distribution Florale Dans Une Portion des Alpes et des Jura
    • Jaccard P. E., "Étude Comparative de la Distribution Florale Dans Une Portion des Alpes et des Jura," in Bulletin de la Société Vaudoise des Sciences Naturelles, vol. 37, 1901, pp. 547-579.
    • (1901) Bulletin de la Société Vaudoise des Sciences Naturelles , vol.37 , pp. 547-579
    • Jaccard, P.E.1
  • 18
    • 84859921422
    • Pairwise Document Similarity in Large Collections with MapReduce
    • Elsayed T., Lin J. and Oard D.W., "Pairwise Document Similarity in Large Collections with MapReduce," in Proceedings of ACL-HLT, 2008.
    • (2008) Proceedings of ACL-HLT
    • Elsayed, T.1    Lin, J.2    Oard, D.W.3
  • 20
    • 37549003336
    • MapReduce: Simplified Data Processing on Large Clusters
    • Dean J. and Ghemawat S., "MapReduce: Simplified Data Processing on Large Clusters," in Commun. ACM, vol. 51, 2008, pp. 107-113.
    • (2008) Commun. ACM , vol.51 , pp. 107-113
    • Dean, J.1    Ghemawat, S.2
  • 21
    • 81255211426
    • Hadoop website
    • Hadoop website, http://hadoop.apache.org/, 2011.
    • (2011)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.