SCOPUS 정보 검색 플랫폼

WSDM 2012 - Proceedings of the 5th ACM International Conference on Web Search and Data Mining

Volumn , Issue , 2012, Pages 243-252

WebSets: Extracting sets of entities from the Web using unsupervised information extraction

(3) Dalvi, Bhavana a Cohen, William W a Callan, Jamie a

a Carnegie Mellon University (United States)

Author keywords

Clustering; Hyponymy relation acquisition; Web mining

Indexed keywords

CLUSTERING; DATA SETS; HTML TABLES; HYPONYMY RELATION; INFORMATION EXTRACTION; INFORMATION EXTRACTION METHODS; WEB MINING;

DATA MINING; HTML; INFORMATION RETRIEVAL;

WEBSITES;

EID: 84858032933 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2124295.2124327 Document Type: Conference Paper

Times cited : (79)

References (30)

1
- 84858049690
- Html TIDY project. http://tidy.sourceforge.net/.

2
- 84859197607
- Webtables: Exploring the power of tables on the web
- M. J. Cafarella, E. Wu, A. Halevy, Y. Zhang, and D. Z. Wang. Webtables: Exploring the power of tables on the web. PVLDB, 2008.
- (2008) PVLDB
- Cafarella, M.J.¹ Wu, E.² Halevy, A.³ Zhang, Y.⁴ Wang, D.Z.⁵

3
- 77958609314
- J. Callan. The clueweb09 dataset.http://boston.lti.cs.cmu.edu/Data/ clueweb09/.
- The clueweb09 Dataset
- Callan, J.¹

4
- 77950891804
- Coupled semi-supervised learning for information extraction
- A. Carlson, J. Betteridge, R. C. Wang, E. R. Hruschka, Jr., and T. M. Mitchell. Coupled semi-supervised learning for information extraction. In WSDM, 2010.
- (2010) WSDM
- Carlson, A.¹ Betteridge, J.² Wang, R.C.³ Hruschka Jr., E.R.⁴ Mitchell, T.M.⁵

5
- 0002546287
- Efficient algorithms for agglomerative hierarchical clustering methods
- W. H. E. Day and H. Edelsbrunner. Efficient algorithms for agglomerative hierarchical clustering methods. In Journal of Classification, 1984.
- (1984) Journal of Classification
- Day, W.H.E.¹ Edelsbrunner, H.²

6
- 33746085343
- Web-scale information extraction in knowitall: (Preliminary results)
- O. Etzioni, M. Cafarella, D. Downey, S. Kok, A.-M. Popescu, T. Shaked, S. Soderland, D. S. Weld, and A. Yates. Web-scale information extraction in knowitall: (preliminary results). In WWW, 2004.
- (2004) WWW
- Etzioni, O.¹ Cafarella, M.² Downey, D.³ Kok, S.⁴ Popescu, A.-M.⁵ Shaked, T.⁶ Soderland, S.⁷ Weld, D.S.⁸ Yates, A.⁹

7
- 17644423946
- Unsupervised named-entity extraction from the web: An experimental study
- O. Etzioni, M. Cafarella, D. Downey, A.-M. Popescu, T. Shaked, S. Soderland, D. S. Weld, and A. Yate. Unsupervised named-entity extraction from the web: An experimental study. In AI, 2005.
- (2005) AI
- Etzioni, O.¹ Cafarella, M.² Downey, D.³ Popescu, A.-M.⁴ Shaked, T.⁵ Soderland, S.⁶ Weld, D.S.⁷ Yate, A.⁸

8
- 35348900845
- Towards domain-independent information extraction from web tables
- W. Gatterbauer, P. Bohunsky, M. Herzog, B. Krüpl, and B. Pollak. Towards domain-independent information extraction from web tables. In WWW, 2007.
- (2007) WWW
- Gatterbauer, W.¹ Bohunsky, P.² Herzog, M.³ Krüpl, B.⁴ Pollak, B.⁵

9
- 79952384867
- Answering table augmentation queries from unstructured lists on the web
- R. Gupta and S. Sarawagi. Answering table augmentation queries from unstructured lists on the web. In VLDB, 2009.
- (2009) VLDB
- Gupta, R.¹ Sarawagi, S.²

10
- 79952397061
- Joint training for open-domain extraction on the web: Exploiting overlap when supervision is limited
- R. Gupta and S. Sarawagi. Joint training for open-domain extraction on the web: exploiting overlap when supervision is limited. In WSDM, 2011.
- (2011) WSDM
- Gupta, R.¹ Sarawagi, S.²

11
- 0012990385
- Automatic acquisition of hyponyms from large text corpora
- M. A. Hearst. Automatic acquisition of hyponyms from large text corpora. In ACL, 1992.
- (1992) ACL
- Hearst, M.A.¹

12
- 84858049685
- Using anchor text, spam filtering and wikipedia for web search and entity ranking
- J. Kamps, R. Kaptein, and M. Koolen. Using anchor text, spam filtering and wikipedia for web search and entity ranking. TREC, 2010.
- (2010) TREC
- Kamps, J.¹ Kaptein, R.² Koolen, M.³

13
- 80053269546
- A semi-supervised method to learn and construct taxonomies using the web
- Z. Kozareva and E. Hovy. A semi-supervised method to learn and construct taxonomies using the web. In EMNLP, 2010.
- (2010) EMNLP
- Kozareva, Z.¹ Hovy, E.²

14
- 79960022996
- Annotating and searching web tables using entities, types and relationships
- G. Limaye, S. Sarawagi, and S. Chakrabarti. Annotating and searching web tables using entities, types and relationships. PVLDB, 2010.
- (2010) PVLDB
- Limaye, G.¹ Sarawagi, S.² Chakrabarti, S.³

15
- 0242496755
- Concept discovery from text
- D. Lin and P. Pantel. Concept discovery from text. In COLING, 2002.
- (2002) COLING
- Lin, D.¹ Pantel, P.²

16
- 34548080780
- Cambridge University Press
- C. D. Manning, P. Raghavan, and H. Schtze. Introduction to information retrieval. In Cambridge University Press, 2008.
- (2008) Introduction to Information Retrieval
- Manning, C.D.¹ Raghavan, P.² Schtze, H.³

17
- 85117710380
- Automatically labeling semantic classes
- P. Pantel and D. Ravichandran. Automatically labeling semantic classes. In HLT-NAACL, 2004.
- (2004) HLT-NAACL
- Pantel, P.¹ Ravichandran, D.²

18
- 82055172940
- Towards the web of concepts: Extracting concepts from large datasets
- A. Parameswaran, H. Garcia-Molina, and A. Rajaraman. Towards the web of concepts: Extracting concepts from large datasets. In VLDB, 2010.
- (2010) VLDB
- Parameswaran, A.¹ Garcia-Molina, H.² Rajaraman, A.³

19
- 78649855644
- Probabilistic metrics for soft-clustering and topic model validation
- E. Ramirez, R. Brena, D. Magatti, and F. Stella. Probabilistic metrics for soft-clustering and topic model validation. In Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010.
- (2010) Web Intelligence and Intelligent Agent Technology (WI-IAT)
- Ramirez, E.¹ Brena, R.² Magatti, D.³ Stella, F.⁴

20
- 70350543748
- What is this, anyway: Automatic hypernym discovery
- A. Ritter, S. Soderland, and O. Etzioni. What is this, anyway: Automatic hypernym discovery. In AAAI, 2009.
- (2009) AAAI
- Ritter, A.¹ Soderland, S.² Etzioni, O.³

21
- 84912540476
- Acquiring hyponymy relations from web documents
- K. Shinzato and K. Torisawa. Acquiring hyponymy relations from web documents. In HLT-NAACL, 2004.
- (2004) HLT-NAACL
- Shinzato, K.¹ Torisawa, K.²

22
- 34247123828
- Learning syntactic patterns for automatic hypernym discovery
- R. Snow, D. Jurafsky, and A. Y. Ng. Learning syntactic patterns for automatic hypernym discovery. In NIPS, 2004.
- (2004) NIPS
- Snow, R.¹ Jurafsky, D.² Ng, A.Y.³

23
- 80053360508
- Cheap and fast - But is it good? evaluating non-expert annotations for natural language tasks
- R. Snow, B. O'Connor, D. Jurafsky, and A. Y. Ng. Cheap and fast - but is it good? evaluating non-expert annotations for natural language tasks. In EMNLP, 2008.
- (2008) EMNLP
- Snow, R.¹ O'Connor, B.² Jurafsky, D.³ Ng, A.Y.⁴

24
- 80053380980
- Weakly-supervised acquisition of labeled class instances using graph random walks
- P. P. Talukdar, J. Reisinger, M. Paşca, D. Ravichandran, R. Bhagat, and F. Pereira. Weakly-supervised acquisition of labeled class instances using graph random walks. In EMNLP, 2008.
- (2008) EMNLP
- Talukdar, P.P.¹ Reisinger, J.² Paşca, M.³ Ravichandran, D.⁴ Bhagat, R.⁵ Pereira, F.⁶

25
- 84888147893
- M. Tom. Nell: Never-ending language learning. http://rtw.ml.cmu.edu/rtw/.
- Nell: Never-ending Language Learning
- Tom, M.¹

26
- 84858058168
- Finding cars, goddesses and enzymes: Parametrizable acquisition of labeled instances for open-domain information extraction
- B. Van Durme and M. Pasca. Finding cars, goddesses and enzymes: parametrizable acquisition of labeled instances for open-domain information extraction. In AAAI, 2008.
- (2008) AAAI
- Van Durme, B.¹ Pasca, M.²

27
- 80053408112
- Automatic set instance extraction using the web
- R. C. Wang and W. W. Cohen. Automatic set instance extraction using the web. In ACL, 2009.
- (2009) ACL
- Wang, R.C.¹ Cohen, W.W.²

28
- 80053426759
- Character-level analysis of semi-structured documents for set expansion
- R. C. Wang and W. W. Cohen. Character-level analysis of semi-structured documents for set expansion. In EMNLP, 2009.
- (2009) EMNLP
- Wang, R.C.¹ Cohen, W.W.²

29
- 67650651939
- Analyzing social bookmarking systems: A del.icio.us cookbook
- R. Wetzker, C. Zimmermann, and C. Bauckhage. Analyzing social bookmarking systems: A del.icio.us cookbook. Mining Social Data (MSoDa) Workshop Proceedings, ECAI, 2008. http://www.dai-labor.de/en/competence-centers/irml/ datasets/.
- (2008) Mining Social Data (MSoDa) Workshop Proceedings ECAI
- Wetzker, R.¹ Zimmermann, C.² Bauckhage, C.³

30
- 84873560385
- Textrunner: Open information extraction on the web
- A. Yates, M. Cafarella, M. Banko, O. Etzioni, M. Broadhead, and S. Soderland. Textrunner: Open information extraction on the web. In NAACL, 2007.
- (2007) NAACL
- Yates, A.¹ Cafarella, M.² Banko, M.³ Etzioni, O.⁴ Broadhead, M.⁵ Soderland, S.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.