-
1
-
-
21144444733
-
Extracting content structure for web pages based on visual representation
-
Xian, China
-
Deng Cai1, Shipeng Yu, Ji-Rong Wen and Wei-Ying Ma, "Extracting Content Structure for Web Pages based on Visual Representation", In Proceedings of the 5th Asia-Pacific Web Conference on Web Technologies and Applications, pp. 406-417, Xian, China, 2003.
-
(2003)
Proceedings of the 5th Asia-Pacific Web Conference on Web Technologies and Applications
, pp. 406-417
-
-
Cai, D.1
Yu, S.2
Wen, J.-R.3
Ma, W.-Y.4
-
2
-
-
79953832571
-
Signed approach for mining web content outliers
-
G. Poonkuzhali, K.Thiagarajan, K.Sarukesi and G.V.Uma, "Signed Approach for Mining Web Content Outliers", World Academy of Science, Engineering and Technology, Vol.56, pp. 820-824, 2009.
-
(2009)
World Academy of Science, Engineering and Technology
, vol.56
, pp. 820-824
-
-
Poonkuzhali, G.1
Thiagarajan, K.2
Sarukesi, K.3
Uma, G.V.4
-
3
-
-
33644531525
-
Mining Web Content Outliers using Structure Oriented Weighting Techniques and N-Grams
-
New Mexico, March
-
Malik Agyemang, Ken Barker and Rada S. Alhajj, "Mining Web Content Outliers using Structure Oriented Weighting Techniques and N-Grams", In Proceedings of the ACM Annual Symposium on Applied Computing, pp. 482-487, New Mexico, March 2005.
-
(2005)
Proceedings of the ACM Annual Symposium on Applied Computing
, pp. 482-487
-
-
Malik, A.1
Barker, K.2
Alhajj, R.S.3
-
4
-
-
78651389387
-
Elimination of redundant links in web pages-mathematical approach
-
April
-
G. Poonkuzhali, K.Thiagarajan and K.Sarukesi, "Elimination of Redundant Links in Web Pages-Mathematical Approach", World Academy of Science, Engineering and Technology, No.52, pp. 562, April 2009.
-
(2009)
World Academy of Science, Engineering and Technology
, Issue.52
, pp. 562
-
-
Poonkuzhali, G.1
Thiagarajan, K.2
Sarukesi, K.3
-
5
-
-
46249083758
-
A web page segmentation algorithm for extracting product information
-
Weihai, China, August
-
Changjun Wu, Guosun Zeng and Guorong Xu, "A Web Page Segmentation Algorithm for Extracting Product Information", In Proceedings of the IEEE International Conference on Information Acquisition, pp. 1374-1379, Weihai, China, August 2006.
-
(2006)
Proceedings of the IEEE International Conference on Information Acquisition
, pp. 1374-1379
-
-
Wu, C.1
Zeng, G.2
Xu, G.3
-
6
-
-
35148846743
-
Frequent pattern mining in web log data", Acta polytechnica hungarica
-
Renata Ivancsy and Istvan Vajk, "Frequent Pattern Mining in Web Log Data", Acta Polytechnica Hungarica, Journal of Applied Sciences at Budapest Tech Hungary, Special Issue on Computational Intelligence, Vol. 3, No. 1, pp.77-99, 2006.
-
(2006)
Journal of Applied Sciences at Budapest Tech Hungary, Special Issue on Computational Intelligence
, vol.3
, Issue.1
, pp. 77-99
-
-
Ivancsy, R.1
Vajk, I.2
-
7
-
-
0042850436
-
Web log mining for predictive web caching
-
Qiang Yang, Haining Henry Zhang, "Web Log Mining For Predictive Web Caching", IEEE Transactions on Knowledge and Data Engineering, Vol. 15, No. 4, pp. 1050-1053, 2003.
-
(2003)
IEEE Transactions on Knowledge and Data Engineering
, vol.15
, Issue.4
, pp. 1050-1053
-
-
Yang, Q.1
Zhang, H.H.2
-
8
-
-
0001781295
-
Web mining research: A survey
-
Raymond Kosala and Hendrik Blockeel, "Web Mining Research: A Survey", ACM SIGKDD Explorations, Vol. 2, No.1, pp. 1-15, 2000.
-
(2000)
ACM SIGKDD Explorations
, vol.2
, Issue.1
, pp. 1-15
-
-
Kosala, R.1
Blockeel, H.2
-
9
-
-
78449236991
-
Performance modeling of a distributed web crawler using stochastic activity networks
-
Springer, Verlag, ISSN: 1865-0929
-
Mitra Nasri, Saeed Shariati and Mohammad Abdollahi Azgomi, "Performance Modeling of a Distributed Web Crawler using Stochastic Activity Networks", Communications in Computer and Information Science (CCIS), Springer-Verlag, Vol. 9, pp.535-542, ISSN: 1865-0929, 2008.
-
(2008)
Communications in Computer and Information Science (CCIS)
, vol.9
, pp. 535-542
-
-
Nasri, M.1
Shariati, S.2
Abdollahi Azgomi, M.3
-
10
-
-
33644658089
-
Editorial: Special issue on web content mining
-
Bing Liu and Kevin Chen-Chuan-Chang, "Editorial: Special Issue on Web Content Mining", ACM SIGKDD Explorations Newsletter, Vol.6, No.2, pp. 1-4, 2004.
-
(2004)
ACM SIGKDD Explorations Newsletter
, vol.6
, Issue.2
, pp. 1-4
-
-
Liu, B.1
Chen-Chuan-Chang, K.2
-
11
-
-
50149098792
-
Using Xpath to discover informative content blocks of web pages
-
October 29-31, Shan Xi, China
-
Yan Fu, Dongqing Yang, Shiwei Tang, Tengjiao Wang and Jun Gao, "Using XPath to Discover Informative Content Blocks of Web Pages", In Proceedings of the Third International Conference on Semantics, Knowledge and Grid (SKG 2007), pp. 450-453, October 29-31, Shan Xi, China, 2007.
-
(2007)
Proceedings of the Third International Conference on Semantics, Knowledge and Grid (SKG 2007)
, pp. 450-453
-
-
Fu, Y.1
Yang, D.2
Tang, S.3
Wang, T.4
Gao, J.5
-
13
-
-
38049108412
-
The characteristic analysis of web user clusters based on frequent browsing patterns
-
Zhiwang Zhang and Yong Shi, "The Characteristic Analysis of Web User Clusters Based on Frequent Browsing Patterns", Lecture Notes in Computer Science, Vol. 4488, pp. 490-493, 2007.
-
(2007)
Lecture Notes in Computer Science
, vol.4488
, pp. 490-493
-
-
Zhang, Z.1
Shi, Y.2
-
14
-
-
9744247654
-
An efficient method of eliminating noisy information in web pages for data mining
-
September 14-16, Wuhan, China
-
A. K. Tripathy and A. K. Singh, "An Efficient Method of Eliminating Noisy Information in Web Pages for Data Mining", In Proceedings of the Fourth International Conference on Computer and Information Technology (CIT'04), pp. 978-985, September 14-16, Wuhan, China, 2004.
-
(2004)
Proceedings of the Fourth International Conference on Computer and Information Technology (CIT'04)
, pp. 978-985
-
-
Tripathy, A.K.1
Singh, A.K.2
-
15
-
-
8844245490
-
A method of eliminating noises in web pages by style tree model and its applications
-
Zhao Cheng-li and Yi Dong-yun, "A Method of Eliminating Noises in Web Pages by Style Tree Model and Its Applications", Wuhan University Journal of Natural Sciences, Vol.9, No.5, pp. 611-616, 2004.
-
(2004)
Wuhan University Journal of Natural Sciences
, vol.9
, Issue.5
, pp. 611-616
-
-
Cheng-Li, Z.1
Dong-Yun, Y.2
-
16
-
-
79953819136
-
Extracting content blocks from web pages
-
November
-
Manisha Marathe, S. H. Patil, G. V. Garje and M. S. Bewoor, "Extracting Content Blocks from Web Pages", International Journal of Recent Trends in Engineering (IJRTE), Vol.2, No.4, pp. 62-64, November 2009.
-
(2009)
International Journal of Recent Trends in Engineering (IJRTE)
, vol.2
, Issue.4
, pp. 62-64
-
-
Marathe, M.1
Patil, S.H.2
Garje, G.V.3
Bewoor, M.S.4
-
17
-
-
26844469211
-
Automatic extraction of informative blocks from web pages
-
Santa Fe, New Mexico
-
Sandip Debnath, Prasenjit Mitra and C. Lee Giles, "Automatic Extraction of Informative Blocks from Web Pages", In Proceedings of the ACM symposium on applied computing, pp. 1722-1726, Santa Fe, New Mexico, 2005.
-
(2005)
Proceedings of the ACM Symposium on Applied Computing
, pp. 1722-1726
-
-
Debnath, S.1
Mitra, P.2
Lee Giles, C.3
-
18
-
-
18744381159
-
Learning block importance models for Web pages
-
Thirteenth International World Wide Web Conference Proceedings, WWW2004
-
Ruihua Song, Haifeng Liu, Ji-Rong Wen and Wei-Ying Ma, "Learning Block Importance Models for Web Pages", In Proceedings of the 13th International Conference on World Wide Web, pp. 203-211, 2004. (Pubitemid 40752755)
-
(2004)
Thirteenth International World Wide Web Conference Proceedings, WWW2004
, pp. 203-211
-
-
Song, R.1
Liu, H.2
Wen, J.-R.3
Ma, W.-Y.4
-
19
-
-
26444532019
-
Learning important models for web page blocks based on layout and content analysis
-
Ruihua Song, Haifeng Liu, Ji-Rong Wen and Wei-Ying Ma, "Learning Important Models for Web Page Blocks based on Layout and Content Analysis", ACM SIGKDD Explorations Newsletter, Vol. 6, No. 2, pp. 14-23, 2004.
-
(2004)
ACM SIGKDD Explorations Newsletter
, vol.6
, Issue.2
, pp. 14-23
-
-
Song, R.1
Liu, H.2
Wen, J.-R.3
Ma, W.-Y.4
-
20
-
-
79953811061
-
Noise elimination from the web documents by using url paths and information redundancy
-
June 26-29, Las Vegas, Nevada, US
-
Byeong Ho Kang and Yang Sok Kim, "Noise Elimination from The Web Documents By Using URL Paths and Information Redundancy", In Proceedings of the International Conference on Information & Knowledge Engineering, pp. 135-141, June 26-29, Las Vegas, Nevada, US, 2006.
-
(2006)
Proceedings of the International Conference on Information & Knowledge Engineering
, pp. 135-141
-
-
Kang, B.H.1
Sok Kim, Y.2
-
22
-
-
84880811191
-
Web page cleaning for web mining through feature weighting
-
August 09-15, Acapulco, Mexico
-
Lan Yi and Bing Liu, "Web Page Cleaning for Web Mining Through Feature Weighting", In Proceedings of the 18th International Joint Conference on Artificial Intelligence,Vol.18, pp. 43-50, August 09-15, Acapulco, Mexico, 2003.
-
(2003)
Proceedings of the 18th International Joint Conference on Artificial Intelligence
, vol.18
, pp. 43-50
-
-
Yi, L.1
Liu, B.2
-
23
-
-
77952370025
-
Eliminating noisy information in web pages for data mining
-
August 24-27, Washington, DC, USA
-
Lan Yi, Bing Liu and Xiaoli Li, "Eliminating Noisy Information in Web Pages for Data Mining", In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 296-305, August 24-27, Washington, DC, USA, 2003.
-
(2003)
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 296-305
-
-
Yi, L.1
Liu, B.2
Li, X.3
-
24
-
-
33846539282
-
Mining key information of web pages: A method and its application
-
DOI 10.1016/j.eswa.2006.05.017, PII S0957417406001588
-
Chao Wang, Jie Lua, and Guangquan Zhanga, "Mining Key Information of Web Pages: A Method and Its Application", Expert Systems with Applications, Vol.33, No.2, pp.425-433, August 2007. (Pubitemid 46157268)
-
(2007)
Expert Systems with Applications
, vol.33
, Issue.2
, pp. 425-433
-
-
Wang, C.1
Lu, J.2
Zhang, G.3
-
25
-
-
77954300056
-
ECON: An approach to extract content from web news page
-
April 06-08, Buscan, Korea
-
Yan Guo, Huifeng Tang, Linhai Song, Yu Wang and Guodong Ding, "ECON: An Approach to Extract Content from Web News Page", In Proceedings of the 12th International Asia-Pacific Web Conference (APWEB), pp. 314-320, April 06-08, Buscan, Korea, 2010.
-
Proceedings of the 12th International Asia-Pacific Web Conference (APWEB)
, vol.2010
, pp. 314-320
-
-
Guo, Y.1
Tang, H.2
Song, L.3
Wang, Y.4
Ding, G.5
-
27
-
-
35348911985
-
Detecting near-duplicates for web crawling
-
DOI 10.1145/1242572.1242592, 16th International World Wide Web Conference, WWW2007
-
Gurmeet Singh Manku, Arvind Jain and Anish Das Sarma, "Detecting Near-Duplicates for Web Crawling", In Proceedings of the 16th International Conference on World Wide Web, pp. 141-150, May 8-12, Banff, Alberta, Canada, 2007. (Pubitemid 47582246)
-
(2007)
16th International World Wide Web Conference, WWW2007
, pp. 141-150
-
-
Manku, G.S.1
Jain, A.2
Das Sarma, A.3
|