



Volume , Issue , 2008, Pages 274-282

Entity categorization over large document collections

Author keywords

Algorithms; Experimentation; Performance

Indexed keywords

COMPUTATIONAL CHALLENGES; DATA ANALYSIS; DOCUMENT COLLECTIONS; DOCUMENT CONTEXTS; EXPERIMENTAL STUDIES; EXPERIMENTATION; MULTIPLE DOCUMENTS; PERFORMANCE; REAL DATA SETS; UNSTRUCTURED DOCUMENTS;

EID: 65449152689     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1401890.1401927     Document Type: Conference Paper
Times cited : (19)

References (25)
  • 1
    • 36949004011
    • Scaling Information Extraction to Large Document Collections
    • E. Agichtein. Scaling Information Extraction to Large Document Collections. IEEE Data Eng. Bull., 28(4):3-10, 2005.
    • (2005) IEEE Data Eng. Bull , vol.28 , Issue.4 , pp. 3-10
    • Agichtein, E.1
  • 2
    • 0344065593
    • Querying Text Databases for Efficient Information Extraction
    • E. Agichtein and L. Gravano. Querying Text Databases for Efficient Information Extraction. In ICDE, 2003.
    • (2003) ICDE
    • Agichtein, E.1    Gravano, L.2
  • 3
    • 84857524978
    • Scalable Information Extraction and Integration
    • E. Agichtein and S. Sarawagi. Scalable Information Extraction and Integration. In ACM SIGKDD, 2006.
    • (2006) ACM SIGKDD
    • Agichtein, E.1    Sarawagi, S.2
  • 4
    • 40249094693
    • Introduction to Information Extraction Technology
    • D. E. Appelt and D. Israel. Introduction to Information Extraction Technology. IJCAI-99 Tutorial, 1999.
    • (1999) IJCAI-99 Tutorial
    • Appelt, D.E.1    Israel, D.2
  • 5
    • 36849011074
    • Show me the Money!: Deriving the Pricing Power of Product Features by Mining Consumer Reviews
    • N. Archak, A. Ghose, and P. G. Ipeirotis. Show me the Money!: Deriving the Pricing Power of Product Features by Mining Consumer Reviews. In ACM SIGKDD, pages 56-65, 2007.
    • (2007) ACM SIGKDD , pp. 56-65
    • Archak, N.1    Ghose, A.2    Ipeirotis, P.G.3
  • 7
    • 0014814325
    • Space/Time Tradeoffs in Hash Coding with Allowable Errors
    • B. Bloom. Space/Time Tradeoffs in Hash Coding with Allowable Errors. In Communications of the ACM 13(7), pages 422-426, 1970.
    • (1970) Communications of the ACM , vol.13 , Issue.7 , pp. 422-426
    • Bloom, B.1
  • 9
    • 33750818017
    • A Search Engine for Natural Language Applications
    • M. J. Cafarella and O. Etzioni. A Search Engine for Natural Language Applications. In WWW Conference, 2005.
    • (2005) WWW Conference
    • Cafarella, M.J.1    Etzioni, O.2
  • 10
    • 33749624541
    • Efficient Batch top-k Search for Dictionary-based Entity Recognition
    • A. Chandel, P. C. Nagesh, and S. Sarawagi. Efficient Batch top-k Search for Dictionary-based Entity Recognition. IEEE ICDE Conf., 2006.
    • (2006) IEEE ICDE Conf
    • Chandel, A.1    Nagesh, P.C.2    Sarawagi, S.3
  • 11
    • 79952557550
    • Information Extraction and Integration: An Overview
    • W. Cohen and A. McCallum. Information Extraction and Integration: An Overview. In SIGKDD, 2004.
    • (2004) SIGKDD
    • Cohen, W.1    McCallum, A.2
  • 12
    • 14844367057
    • An Improved Data Stream Summary: The Count-Min Sketch and its Applications
    • G. Cormode and S. Muthukrishnan. An Improved Data Stream Summary: the Count-Min Sketch and its Applications. In Journal of Algorithms, 55(1), pages 58-75, 2005.
    • (2005) Journal of Algorithms , vol.55 , Issue.1 , pp. 58-75
    • Cormode, G.1    Muthukrishnan, S.2
  • 13
    • 23944436942
    • What's Hot and What's Not: Tracking Most Frequent Items Dynamically
    • G. Cormode and S. Muthukrishnan. What's Hot and What's Not: Tracking Most Frequent Items Dynamically. ACM TODS, 30(1):249-278, 2005.
    • (2005) ACM TODS , vol.30 , Issue.1 , pp. 249-278
    • Cormode, G.1    Muthukrishnan, S.2
  • 14
    • 84880742675
    • A Probabilistic Model of Redundancy in Information Extraction
    • D. Downey, O. Etzioni, and S. Soderland. A Probabilistic Model of Redundancy in Information Extraction. In IJCAI, 2005.
    • (2005) IJCAI
    • Downey, D.1    Etzioni, O.2    Soderland, S.3
  • 16
    • 0027608375
    • Query Evaluation Techniques for Large Databases
    • G. Graefe. Query Evaluation Techniques for Large Databases. ACM Comput. Surv., 25(2), 1993.
    • (1993) ACM Comput. Surv , vol.25 , Issue.2
    • Graefe, G.1
  • 17
    • 34250654176
    • To Search or to Crawl?: Towards a Query Optimizer for Text-Centric Tasks
    • P. G. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano. To Search or to Crawl?: towards a Query Optimizer for Text-Centric Tasks. In SIGMOD, pages 265-276, 2006.
    • (2006) SIGMOD , pp. 265-276
    • Ipeirotis, P.G.1    Agichtein, E.2    Jain, P.3    Gravano, L.4
  • 18
    • 33749555540
    • Reducing the Human Overhead in Text Categorization
    • A. C. König and E. Brill. Reducing the Human Overhead in Text Categorization. In SIGKDD, 2006.
    • (2006) SIGKDD
    • König, A.C.1    Brill, E.2
  • 19
    • 65449173097
    • Early Results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons
    • A. McCallum and W. Li. Early Results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons. In CoNLL, 2003.
  • 20
    • 33749551461
    • A Mixture Model for Contextual Text Mining
    • Q. Mei and C. Zhai. A Mixture Model for Contextual Text Mining. In ACM SIGKDD, pages 649-655, 2006.
    • (2006) ACM SIGKDD , pp. 649-655
    • Mei, Q.1    Zhai, C.2
  • 24
    • 65449153129
    • The State of Record Linkage and Current Research Problems
    • W. Winkler. The State of Record Linkage and Current Research Problems. Technical report, U.S. Bureau of the Census, 1999.
  • 25
    • 65449143828
    • Named Entity Recognition using an HMM-based Chunk Tagger
    • G. Zhou and J. Su. Named Entity Recognition using an HMM-based Chunk Tagger. In ACL, 2002.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.