-
1
-
-
0032092761
-
NoDoSE - A tool for semi-automatically extracting structured and semistructured data from text documents
-
B. Adelberg, "NoDoSE - A Tool for Semi-Automatically Extracting Structured and Semistructured Data from Text Documents," Proc. 1998 ACM SIGMOD Int'l Conf. Management of Data (SIGMOD), 1998.
-
(1998)
Proc. 1998 ACM SIGMOD Int'l Conf. Management of Data (SIGMOD)
-
-
Adelberg, B.1
-
2
-
-
4544334171
-
Efficient substructure discovery from large semistructured data
-
T. Asai, K. Abe, S. Kawasoe, H. Arimura, H. Sakamoto, and S. Arikawa, "Efficient Substructure Discovery from Large Semistructured Data," Proc. SIAM Int'l Conf. Data Mining (SDM), 2002.
-
(2002)
Proc. SIAM Int'l Conf. Data Mining (SDM)
-
-
Asai, T.1
Abe, K.2
Kawasoe, S.3
Arimura, H.4
Sakamoto, H.5
Arikawa, S.6
-
5
-
-
0002142781
-
Syntactic clustering of the web
-
A. Broder, S. Glassman, M. Manasse, and G. Zweig, "Syntactic Clustering of the Web," Proc. Sixth World Wide Web Conf. (WWW), 1997.
-
(1997)
Proc. Sixth World Wide Web Conf. (WWW)
-
-
Broder, A.1
Glassman, S.2
Manasse, M.3
Zweig, G.4
-
6
-
-
0034172483
-
Learning to construct knowledge bases from the World Wide Web
-
M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam, and S. Slattery, "Learning to Construct Knowledge Bases from the World Wide Web," Artificial Intelligence, vol. 118, nos. 1-2, pp. 69-113, 2000.
-
(2000)
Artificial Intelligence
, vol.118
, Issue.1-2
, pp. 69-113
-
-
Craven, M.1
DiPasquo, D.2
Freitag, D.3
McCallum, A.4
Mitchell, T.5
Nigam, K.6
Slattery, S.7
-
7
-
-
84944134642
-
Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction
-
S. Chakrabarti, "Integrating the Document Object Model with Hyperlinks for Enhanced Topic Distillation and Information Extraction," Proc. 10th World Wide Web Conf. (WWW), 2001.
-
(2001)
Proc. 10th World Wide Web Conf. (WWW)
-
-
Chakrabarti, S.1
-
10
-
-
34548800414
-
Discovering frequent substructures from hierarchical semi-structured data
-
G. Cong, L. Yi, B. Liu, and K. Wang, "Discovering Frequent Substructures from Hierarchical Semi-Structured Data," Proc. SIAM Int'l Conf. Data Mining (SIAM SDM), 2002.
-
(2002)
Proc. SIAM Int'l Conf. Data Mining (SIAM SDM)
-
-
Cong, G.1
Yi, L.2
Liu, B.3
Wang, K.4
-
13
-
-
19944409473
-
Extracting characteristic structures among words in semistructured documents
-
K. Furukawa, T. Uchida, K. Yamada, T. Miyahara, T. Shoudai, and Y. Nakamura, "Extracting Characteristic Structures among Words in Semistructured Documents," Proc. Sixth Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD), 2002.
-
(2002)
Proc. Sixth Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD)
-
-
Furukawa, K.1
Uchida, T.2
Yamada, K.3
Miyahara, T.4
Shoudai, T.5
Nakamura, Y.6
-
14
-
-
19944401340
-
Clipping and analyzing news using machine learning techniques
-
H. Grundel, T. Naphtali, C. Wiech, J.-M. Gluba, M. Rohdenburg, and T. Scheffer, "Clipping and Analyzing News Using Machine Learning Techniques," Proc. Int'l Conf. Discovery Science, 2001.
-
(2001)
Proc. Int'l Conf. Discovery Science
-
-
Grundel, H.1
Naphtali, T.2
Wiech, C.3
Gluba, J.-M.4
Rohdenburg, M.5
Scheffer, T.6
-
15
-
-
0032309862
-
Generating finite-state transducers for semi-structured data extraction from the web
-
C.N. Hsu and M.T. Dung, "Generating Finite-State Transducers for Semi-Structured Data Extraction from the Web," Information Systems, vol. 23, no. 8, pp. 521-538, 1998.
-
(1998)
Information Systems
, vol.23
, Issue.8
, pp. 521-538
-
-
Hsu, C.N.1
Dung, M.T.2
-
16
-
-
0037480829
-
Entropy-based link analysis for mining web informative structures
-
H.-Y. Kao, S.H. Lin, J.M. Ho, and M.-S. Chen, "Entropy-Based Link Analysis for Mining Web Informative Structures," Proc. ACM 11th Int'l Conf. Information and Knowledge Management (CIKM), 2002.
-
(2002)
Proc. ACM 11th Int'l Conf. Information and Knowledge Management (CIKM)
-
-
Kao, H.-Y.1
Lin, S.H.2
Ho, J.M.3
Chen, M.-S.4
-
17
-
-
0742268832
-
Mining web information structures and contents based on entropy analysis
-
Jan.
-
H.-Y. Kao, S.-H. Lin, J.-M. Ho, and M.-S. Chen, "Mining Web Information Structures and Contents Based on Entropy Analysis," IEEE Trans. Knowledge and Data Eng., vol. 16, no. 1, Jan. 2004.
-
(2004)
IEEE Trans. Knowledge and Data Eng.
, vol.16
, Issue.1
-
-
Kao, H.-Y.1
Lin, S.-H.2
Ho, J.-M.3
Chen, M.-S.4
-
20
-
-
0037806547
-
A brief survey of web data extraction tools
-
June
-
A. Laender, B. Ribeiro-Neto, A. Silva, and J. Teixeira, "A Brief Survey of Web Data Extraction Tools," SIGMOD Record, vol. 31, no. 2, June 2002.
-
(2002)
SIGMOD Record
, vol.31
, Issue.2
-
-
Laender, A.1
Ribeiro-Neto, B.2
Silva, A.3
Teixeira, J.4
-
23
-
-
0038002037
-
Using micro information units for internet search
-
X. Li, B. Liu, T.-H. Phang, and M. Hu, "Using Micro Information Units for Internet Search," Proc. ACM 11th Int'l Conf. Information and Knowledge Management (CIKM), 2002.
-
(2002)
Proc. ACM 11th Int'l Conf. Information and Knowledge Management (CIKM)
-
-
Li, X.1
Liu, B.2
Phang, T.-H.3
Hu, M.4
-
24
-
-
9444247787
-
Discovery of frequent tag tree patterns in semistructured web documents
-
T. Miyahara, Y. Suzuki, T. Shoudai, T. Uchida, K. Takahashi, and H. Ueda, "Discovery of Frequent Tag Tree Patterns in Semistructured Web Documents," Proc. Sixth Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD), 2002.
-
(2002)
Proc. Sixth Pacific-Asia Conf. Knowledge Discovery and Data Mining (PAKDD)
-
-
Miyahara, T.1
Suzuki, Y.2
Shoudai, T.3
Uchida, T.4
Takahashi, K.5
Ueda, H.6
-
25
-
-
84856043672
-
A mathematical theory of communication
-
C.E. Shannon, "A Mathematical Theory of Communication," Bell System Technical J., vol. 27, pp. 398-403, 1948.
-
(1948)
Bell System Technical J.
, vol.27
, pp. 398-403
-
-
Shannon, C.E.1
-
28
-
-
0033699332
-
Discovering structural association of semistructured data
-
May/June
-
K. Wang and H. Liu, "Discovering Structural Association of Semistructured Data," IEEE Trans. Knowledge and Eng., vol. 12, no. 3, May/June 2000.
-
(2000)
IEEE Trans. Knowledge and Eng.
, vol.12
, Issue.3
-
-
Wang, K.1
Liu, H.2
-
29
-
-
0036204132
-
Reverse engineering for web data: From visual to semantic structures
-
C. Yip, C. Gertz, and N. Sundaresan, "Reverse Engineering for Web Data: From Visual to Semantic Structures," Proc. 19th IEEE, Int'l Conf. Data Eng. (ICDE), 2002.
-
(2002)
Proc. 19th IEEE, Int'l Conf. Data Eng. (ICDE)
-
-
Yip, C.1
Gertz, C.2
Sundaresan, N.3
|