메뉴 건너뛰기




Volumn , Issue , 2008, Pages 25-28

Exploring linguistic features for web spam detection: A preliminary study

Author keywords

Content features; Linguistic features; Web spam detection

Indexed keywords

INFORMATION RETRIEVAL; INFORMATION SERVICES; LINGUISTICS; SPAMMING;

EID: 63049135689     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1451983.1451990     Document Type: Conference Paper
Times cited : (58)

References (15)
  • 1
    • 63049106076 scopus 로고    scopus 로고
    • J. Abernethy, O. Chapelle, and C. Castillo. Witch: A new approach to web spam detection, 2007. submitted.
    • J. Abernethy, O. Chapelle, and C. Castillo. Witch: A new approach to web spam detection, 2007. submitted.
  • 2
    • 35548932906 scopus 로고    scopus 로고
    • Web spam detection via commercial intent analysis
    • New York, NY, USA, ACM
    • A. Benczúr, I. Bíró, K. Csalogány, and T. Sarlós. Web spam detection via commercial intent analysis. In Proceedings of AIRWeb 2007, pages 89-92, New York, NY, USA, 2007. ACM.
    • (2007) Proceedings of AIRWeb , pp. 89-92
    • Benczúr, A.1    Bíró, I.2    Csalogány, K.3    Sarlós, T.4
  • 4
    • 33646432218 scopus 로고    scopus 로고
    • Thwarting the nigritude ultramarine: Learning to identify link spam
    • Proceedings of ECML 2005, of, Porto, Portugal
    • I. Drost and T. Scheffer. Thwarting the nigritude ultramarine: learning to identify link spam. In Proceedings of ECML 2005, volume 3720 of LNAI, pages 233-243, Porto, Portugal, 2005.
    • (2005) LNAI , vol.3720 , pp. 233-243
    • Drost, I.1    Scheffer, T.2
  • 6
    • 85031789656 scopus 로고    scopus 로고
    • SENTIWORDNET: A publicly available lexical resource for opinion mining
    • Genova, IT
    • A. Esuli and F. Sebastiani. SENTIWORDNET: A publicly available lexical resource for opinion mining. In Proceedings of LREC 2006, pages 417-422, Genova, IT, 2006.
    • (2006) Proceedings of LREC , pp. 417-422
    • Esuli, A.1    Sebastiani, F.2
  • 7
    • 27344433890 scopus 로고    scopus 로고
    • Spam, damn spam, and statistics: Using statistical analysis to locate spam web
    • New York, USA
    • D. Fetterly, M. Manasse, and M. Najork. Spam, damn spam, and statistics: using statistical analysis to locate spam web pages. In Proceedings of WebDB '04, New York, USA, 2004.
    • (2004) Proceedings of WebDB '04
    • Fetterly, D.1    Manasse, M.2    Najork, M.3
  • 8
    • 84885639910 scopus 로고    scopus 로고
    • Detecting phrase-level duplication on the world wide web
    • New York, NY, USA, ACM
    • D. Fetterly, M. Manasse, and M. Najork. Detecting phrase-level duplication on the world wide web. In Proceedings of SIGIR '05, pages 170-177, New York, NY, USA, 2005. ACM.
    • (2005) Proceedings of SIGIR '05 , pp. 170-177
    • Fetterly, D.1    Manasse, M.2    Najork, M.3
  • 9
    • 45949101532 scopus 로고    scopus 로고
    • Corleone - Core Linguistic Entity Extraction
    • Technical Report. JRC of the European Commission
    • Jakub Piskorski. Corleone - Core Linguistic Entity Extraction. Technical Report. JRC of the European Commission, 2008.
    • (2008)
    • Piskorski, J.1
  • 11
    • 34250653315 scopus 로고    scopus 로고
    • Detecting spam web pages through content analysis
    • Edinburgh, Scotland
    • A. Ntoulas, M. Najork, M. Manasse, and D. Fetterly. Detecting spam web pages through content analysis. In Proceedings of WWW 2006, Edinburgh, Scotland, pages 83-92, 2006.
    • (2006) Proceedings of WWW 2006 , pp. 83-92
    • Ntoulas, A.1    Najork, M.2    Manasse, M.3    Fetterly, D.4
  • 13
    • 63049134885 scopus 로고    scopus 로고
    • Tracking web spam with hidden style similarity
    • T. Urvoy, T. Lavergne, and P. Filoche. Tracking web spam with hidden style similarity. In AIRWeb 2006, pages 25-31, 2006.
    • (2006) AIRWeb 2006 , pp. 25-31
    • Urvoy, T.1    Lavergne, T.2    Filoche, P.3
  • 14
    • 63049096087 scopus 로고    scopus 로고
    • URL:, accessed February 21, 2008
    • Webspam corpora. URL: http://yr-bcn.es/webspam/datasets, accessed February 21, 2008.
    • Webspam corpora
  • 15
    • 3843099453 scopus 로고    scopus 로고
    • Automating Linguistics-Based Cues for Detecting Deception of Text-based Asynchronous Computer-Mediated Communication
    • A. Zhou, J. Burgoon, J. Nunamaker, and D. Twitchell. Automating Linguistics-Based Cues for Detecting Deception of Text-based Asynchronous Computer-Mediated Communication. Group Decision and Negotiations, 12:81-106, 2004.
    • (2004) Group Decision and Negotiations , vol.12 , pp. 81-106
    • Zhou, A.1    Burgoon, J.2    Nunamaker, J.3    Twitchell, D.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.