Volume , Issue , 2011, Pages 123-132

On identifying academic homepages for digital libraries

Author keywords

latent dirichlet allocation; mark recapture techniques; topic mixtures; webpage classification

Indexed keywords

ANIMAL POPULATIONS; AUTOMATIC METHOD; CONTENT-BASED FEATURES; DATA SETS; HOME PAGE; LATENT DIRICHLET ALLOCATION; MARK-RECAPTURE TECHNIQUES; MUTUAL INFORMATIONS; SCIENTIFIC RESEARCHES; SHORT SEGMENTS; TERM FREQUENCY; TOPIC MIXTURES; TOPIC MODEL; TRADITIONAL TECHNIQUES; WEB-PAGE

EID: 79960522068     PISSN: 15525996     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1998076.1998099     Document Type: Conference Paper
Times cited: 15

References (42)
  • 1
    • 84880892105
    • Determining expert profiles (with an application to expert finding)
    • K. Balog and M. De Rijke. Determining expert profiles (with an application to expert finding). In IJCAI, 2007.
    • (2007) IJCAI
    • Balog, K.1    De Rijke, M.2
  • 2
    • 34250665891
    • Random sampling from a search engine's index
    • Z. Bar-Yossef and M. Gurevich. Random sampling from a search engine's index. In WWW, 2006.
    • (2006) WWW
    • Bar-Yossef, Z.1    Gurevich, M.2
  • 3
    • 42149094804
    • A comparison of sampling techniques for web characterization
    • L. Becchetti, C. Castillo, D. Donato, and A. Fazzone. A comparison of sampling techniques for web characterization. In LinkKDD, 2006.
    • (2006) LinkKDD
    • Becchetti, L.1    Castillo, C.2    Donato, D.3    Fazzone, A.4
  • 4
    • 27544439829
    • A technique for measuring the relative size and overlap of public web search engines
    • K. Bharat and A. Broder. A technique for measuring the relative size and overlap of public web search engines. In WWW, 1998.
    • (1998) WWW
    • Bharat, K.1    Broder, A.2
  • 6
    • 84868100489
    • Text classification by augmenting the bag-of-words representation with redundancy-compensated bigrams
    • C. Boulis and M. Ostendorf. Text classification by augmenting the bag-of-words representation with redundancy-compensated bigrams. In FSDM, 2005.
    • (2005) FSDM
    • Boulis, C.1    Ostendorf, M.2
  • 10
    • 67049158089
    • Formal models for expert finding on DBLP bibliography data
    • H. Deng, I. King, and M. R. Lyu. Formal models for expert finding on DBLP bibliography data. In ICDM, 2008.
    • (2008) ICDM
    • Deng, H.1    King, I.2    Lyu, M.R.3
  • 13
    • 0000085642
    • Capture-recapture estimation via Gibbs sampling
    • E. I. George and C. P. Robert. Capture-recapture estimation via Gibbs sampling. In Biometrika, 1992.
    • (1992) Biometrika
    • George, E.I.1    Robert, C.P.2
  • 16
    • 84941274546
    • Automatic document metadata extraction using support vector machines
    • H. Han, C. L. Giles, E. Manavoglu, H. Zha, Z. Zhang, and E. A. Fox. Automatic document metadata extraction using support vector machines. In JCDL, 2003.
    • (2003) JCDL
    • Han, H.1    Giles, C.L.2    Manavoglu, E.3    Zha, H.4    Zhang, Z.5    Fox, E.A.6
  • 18
    • 36348945030
    • Efficient name disambiguation for large-scale databases
    • J. Huang, S. Ertekin, and C. Giles. Efficient name disambiguation for large-scale databases. In PKDD, 2006.
    • (2006) PKDD
    • Huang, J.1    Ertekin, S.2    Giles, C.3
  • 21
    • 0032478628
    • Searching the world wide web
    • S. Lawrence and C. L. Giles. Searching the world wide web. In Science, 1998.
    • (1998) Science
    • Lawrence, S.1    Giles, C.L.2
  • 25
    • 36348992066
    • Mining a digital library for influential authors
    • D. M. Mimno and A. McCallum. Mining a digital library for influential authors. In JCDL, 2007.
    • (2007) JCDL
    • Mimno, D.M.1    McCallum, A.2
  • 26
  • 27
    • 8644231803
    • Discriminative models for information retrieval
    • R. Nallapati. Discriminative models for information retrieval. In SIGIR, 2004.
    • (2004) SIGIR
    • Nallapati, R.1
  • 30
    • 12244299438
    • Estimating the size of the telephone universe: A Bayesian mark-recapture approach
    • D. Poole. Estimating the size of the telephone universe: A Bayesian mark-recapture approach. In KDD, 2004.
    • (2004) KDD
    • Poole, D.1
  • 31
    • 61949425675
    • Web page classification: Features and algorithms
    • X. Qi and B. D. Davison. Web page classification: Features and algorithms. ACM Comput. Surv., 2009.
    • (2009) ACM Comput. Surv.
    • Qi, X.1    Davison, B.D.2
  • 33
    • 77951191376
    • Combining super-structuring and abstraction on sequence classification
    • A. Silvescu, C. Caragea, and V. Honavar. Combining super-structuring and abstraction on sequence classification. In ICDM, 2009.
    • (2009) ICDM
    • Silvescu, A.1    Caragea, C.2    Honavar, V.3
  • 34
    • 77954597480
    • Estimating the web robot population
    • Y. Sun and C. L. Giles. Estimating the web robot population. In WWW, 2010.
    • (2010) WWW
    • Sun, Y.1    Giles, C.L.2
  • 35
    • 44649113555
    • Social network extraction of academic researchers
    • J. Tang, D. Zhang, and L. Yao. Social network extraction of academic researchers. In ICDM, 2007.
    • (2007) ICDM
    • Tang, J.1    Zhang, D.2    Yao, L.3
  • 36
    • 65449166085
    • ArnetMiner: Extraction and mining of academic social networks
    • J. Tang, J. Zhang, L. Yao, J. Li, L. Zhang, and Z. Su. ArnetMiner: Extraction and mining of academic social networks. In KDD, 2008.
    • (2008) KDD
    • Tang, J.1    Zhang, J.2    Yao, L.3    Li, J.4    Zhang, L.5    Su, Z.6
  • 38
    • 79960484191
    • Web page classification exploiting contents of surrounding pages for building a high-quality homepage collection
    • Y. Wang and K. Oyama. Web page classification exploiting contents of surrounding pages for building a high-quality homepage collection. In Digital Libraries: Achievements, Challenges and Opportunities, 2006.
    • (2006) Digital Libraries: Achievements, Challenges and Opportunities
    • Wang, Y.1    Oyama, K.2
  • 41
    • 36849062139
    • Joint optimization of wrapper generation and template detection
    • S. Zheng, R. Song, J.-R. Wen, and D. Wu. Joint optimization of wrapper generation and template detection. In KDD, 2007.
    • (2007) KDD
    • Zheng, S.1    Song, R.2    Wen, J.-R.3    Wu, D.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.