메뉴 건너뛰기




Volumn 42, Issue 5, 2006, Pages 1276-1293

Automatic extraction of titles from general documents using machine learning

Author keywords

Information extraction; Machine learning; Metadata extraction; Search

Indexed keywords

COMPUTER SOFTWARE; FORMAL LANGUAGES; INTRANETS; LEARNING SYSTEMS; METADATA; MODELS;

EID: 33646071859     PISSN: 03064573     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ipm.2005.12.001     Document Type: Article
Times cited : (33)

References (27)
  • 2
    • 33646064472 scopus 로고    scopus 로고
    • Crystal, A., & Land, P. (2003). Metadata and Search Global Corporate Circle DCMI 2003 Workshop. Available from http://dublincore.org/groups/corporate/Seattle/.
  • 3
    • 33646027927 scopus 로고    scopus 로고
    • Collins, M. (2002). Discriminative training methods for hidden markov models: theory and experiments with perceptron algorithms. In Proceedings of conference on empirical methods in natural language processing (pp. 1-8).
  • 4
    • 34249753618 scopus 로고
    • Support-vector networks
    • Cortes C., and Vapnik V. Support-vector networks. Machine Learning 20 (1995) 273-297
    • (1995) Machine Learning , vol.20 , pp. 273-297
    • Cortes, C.1    Vapnik, V.2
  • 5
    • 0036923221 scopus 로고    scopus 로고
    • Chieu, H. L., & Ng, H. T. (2002). A maximum entropy approach to information extraction from semi-structured and free text. In Proceedings of the eighteenth national conference on artificial intelligence (pp. 768-791).
  • 6
    • 33646024037 scopus 로고    scopus 로고
    • Evans, D. K., Klavans, J. L., & McKeown, K. R. (2004). Columbia newsblaster: multilingual news summarization on the Web. In Proceedings of human language technology conference/North American chapter of the association for computational linguistics annual meeting (pp. 1-4).
  • 7
  • 8
    • 27544443379 scopus 로고    scopus 로고
    • Gheel, J., & Anderson, T. (1999). Data and metadata for finding and reminding. In Proceedings of the 1999 international conference on information visualization (pp. 446-451).
  • 9
    • 1542377474 scopus 로고    scopus 로고
    • Giles, C. L., Petinot, Y., Teregowda, P. B., Han, H., Lawrence, S., & Rangaswamy, A., et al. (2003). eBizSearch: a niche search engine for e-Business. In Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval (pp. 413-414).
  • 10
    • 0033650832 scopus 로고    scopus 로고
    • Giuffrida, G., Shek, E. C., & Yang, J. (2000). Knowledge-based metadata extraction from PostScript files. In Proceedings of the fifth ACM conference on digital libraries (pp. 77-84).
  • 11
    • 84941274546 scopus 로고    scopus 로고
    • Han, H., Giles, C. L., Manavoglu, E., Zha, H., Zhang, Z., & Fox, E. A. (2003). Automatic document metadata extraction using support vector machines. In Proceedings of the third ACM/IEEE-CS joint conference on digital libraries (pp. 37-48).
  • 12
  • 13
    • 33646058930 scopus 로고    scopus 로고
    • Lafferty, J., McCallum, A., & Pereira, F. (2001). Conditional random fields: probabilistic models for segmenting and labeling sequence data. In Proceedings of the eighteenth international conference on machine learning (pp. 282-289).
  • 14
    • 33646054598 scopus 로고    scopus 로고
    • Li, Y., Zaragoza, H., Herbrich, R., Shawe-Taylor, J., & Kandola, J. S., (2002). The perceptron algorithm with uneven margins. In Proceedings of the nineteenth international conference on machine learning (pp. 379-386).
  • 15
    • 0036992518 scopus 로고    scopus 로고
    • Liddy, E. D., Sutton, S., Allen, E., Harwell, S., Corieri, S., & Yilmazel, O., et al. (2002). Automatic metadata generation & evaluation. In Proceedings of the 25th annual international ACM SIGIR conference on research and development in information retrieval (pp. 401-402).
  • 16
    • 33646041949 scopus 로고    scopus 로고
    • Littlefield, A. (2002). Effective enterprise information retrieval across new content formats. In Proceedings of the seventh search engine conference. Available from http://www.infonortics.com/searchengines/sh02/02prog.html.
  • 17
    • 1942516385 scopus 로고    scopus 로고
    • Mao, S., Kim, J. W., & Thoma, G. R. (2004). A dynamic feature generation system for automated metadata extraction in preservation of digital materials. In Proceedings of the first international workshop on document image analysis for libraries (pp. 225-232).
  • 18
    • 33646042918 scopus 로고    scopus 로고
    • McCallum, A., Freitag, D., & Pereira, F. (2000). Maximum entropy markov models for information extraction and segmentation. In Proceedings of the seventeenth international conference on machine learning (pp. 591-598).
  • 19
    • 0031599641 scopus 로고    scopus 로고
    • Murphy, L. D. (1998). Digital document metadata in organizations: roles, analytical approaches, and future research directions. In Proceedings of the thirty-first annual Hawaii international conference on system sciences (pp. 267-276).
  • 20
    • 33646046859 scopus 로고    scopus 로고
    • Peng, F., & McCallum, A. (2004). Accurate information extraction from research papers using conditional random fields. In Proceedings of the human language technology conference/North American chapter of the association for computational linguistics annual meeting (pp. 329-336).
  • 21
    • 1542287488 scopus 로고    scopus 로고
    • Pinto, D., McCallum, A., Wei, X., & Croft, W. B. (2003). Table extraction using conditional random fields. In Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval (pp. 235-242).
  • 22
    • 33646049093 scopus 로고    scopus 로고
    • Ratnaparkhi, A. (1998). Unsupervised statistical models for prepositional phrase attachment. In Proceedings of the seventeenth international conference on computational linguistics (pp. 1079-1085).
  • 23
    • 18744388867 scopus 로고    scopus 로고
    • Robertson, S., Zaragoza, H., & Taylor, M. (2004). Simple BM25 extension to multiple weighted fields. In Proceedings of ACM thirteenth conference on information and knowledge management (pp. 42-49).
  • 24
    • 0033716961 scopus 로고    scopus 로고
    • Yi, J., & Sundaresan, N. (2000). Metadata based Web mining for relevance. In Proceedings of the 2000 international symposium on database engineering & applications (pp. 113-121).
  • 25
    • 4944255256 scopus 로고    scopus 로고
    • Yilmazel, O., Finneran, C. M., & Liddy, E. D. (2004). MetaExtract: an NLP system to automatically assign metadata. In Proceedings of the 2004 joint ACM/IEEE conference on digital libraries (pp. 241-242).
  • 26
    • 4444299909 scopus 로고    scopus 로고
    • Internet search engines' response to metadata Dublin Core implementation
    • Zhang J., and Dimitroff A. Internet search engines' response to metadata Dublin Core implementation. Journal of Information Science 30 (2004) 310-320
    • (2004) Journal of Information Science , vol.30 , pp. 310-320
    • Zhang, J.1    Dimitroff, A.2
  • 27
    • 8644241114 scopus 로고    scopus 로고
    • Zhang, L., Pan, Y., & Zhang, T. (2004). Recognising and using named entities: focused named entity recognition using machine learning. In Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval (pp. 281-288).


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.