2009, Pages 105-112

Reducing class imbalance during active learning for named entity annotation

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVE LEARNING; CLASS IMBALANCE; DATA SETS; F-SCORE; LOW FREQUENCY; NAMED ENTITIES; NAMED ENTITY RECOGNITION; NATURAL LANGUAGE PROCESSING; PERFORMANCE OF CLASSIFIER; POOR PERFORMANCE; SKEWED DATA; TRAINING MATERIAL

EID: 70449647082     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1597735.1597754     Document Type: Conference Paper
Times cited: 61

References (20)
  • 2
    • 85059361118
    • Taking into account the differences between actively and passively acquired data: The case of active learning with support vector machines for imbalanced datasets
    • M. Bloodgood and V. Shanker. Taking into account the differences between actively and passively acquired data: The case of active learning with support vector machines for imbalanced datasets. In Proc. of the NAACL-HLT '09, pages 137-140, 2009.
    • (2009) Proc. of the NAACL-HLT '09 , pp. 137-140
    • Bloodgood, M.1    Shanker, V.2
  • 5
    • 84867577175
    • The foundations of cost-sensitive learning
    • C. Elkan. The foundations of cost-sensitive learning. In Proc. of the IJCAI '01, pages 973-978, 2001.
    • (2001) Proc. of the IJCAI '01 , pp. 973-978
    • Elkan, C.1
  • 6
    • 85149117258
    • Minimizing manual annotation cost in supervised training from corpora
    • S. Engelson and I. Dagan. Minimizing manual annotation cost in supervised training from corpora. In Proc. of the ACL '96, pages 319-326, 1996.
    • (1996) Proc. of the ACL '96 , pp. 319-326
    • Engelson, S.1    Dagan, I.2
  • 7
    • 63449090301
    • Learning on the border: Active learning in imbalanced data classification
    • S. Ertekin, J. Huang, L. Bottou, and L. Giles. Learning on the border: Active learning in imbalanced data classification. In Proc. of the CIKM '07, pages 127-136, 2007.
    • (2007) Proc. of the CIKM '07 , pp. 127-136
    • Ertekin, S.1    Huang, J.2    Bottou, L.3    Giles, L.4
  • 8
    • 0031209604
    • Selective sampling using the query by committee algorithm
    • Y. Freund, H. Seung, E. Shamir, and N. Tishby. Selective sampling using the query by committee algorithm. Machine Learning, 28(2-3):133-168, 1997.
    • (1997) Machine Learning , vol.28 , Issue.2-3 , pp. 133-168
    • Freund, Y.1    Seung, H.2    Shamir, E.3    Tishby, N.4
  • 9
    • 84896456544
    • Semantic annotations for biology: A corpus development initiative at the Jena University Language & Information Engineering Lab
    • U. Hahn, E. Beisswanger, E. Buyko, M. Poprat, K. Tomanek, and J. Wermter. Semantic annotations for biology: A corpus development initiative at the Jena University Language & Information Engineering Lab. In Proc. of the LREC '08, 2008.
    • (2008) Proc. of the LREC '08
    • Hahn, U.1    Beisswanger, E.2    Buyko, E.3    Poprat, M.4    Tomanek, K.5    Wermter, J.6
  • 10
    • 25844460429
    • Sample selection for statistical parsing
    • R. Hwa. Sample selection for statistical parsing. Computational Linguistics, 30(3):253-276, 2004.
    • (2004) Computational Linguistics , vol.30 , Issue.3 , pp. 253-276
    • Hwa, R.1
  • 11
    • 33845536164
    • The class imbalance problem: A systematic study
    • N. Japkowicz and S. Stephen. The class imbalance problem: A systematic study. Intelligent Data Analysis, 6(5):429-449, 2002.
    • (2002) Intelligent Data Analysis , vol.6 , Issue.5 , pp. 429-449
    • Japkowicz, N.1    Stephen, S.2
  • 13
    • 0142192295
    • Conditional Random Fields: Probabilistic models for segmenting and labeling sequence data
    • J. Lafferty, A. McCallum, and F. Pereira. Conditional Random Fields: Probabilistic models for segmenting and labeling sequence data. In Proc. of the ICML '01, pages 282-289, 2001.
    • (2001) Proc. of the ICML '01 , pp. 282-289
    • Lafferty, J.1    McCallum, A.2    Pereira, F.3
  • 14
    • 85013879626
    • A sequential algorithm for training text classifiers
    • D. D. Lewis and W. A. Gale. A sequential algorithm for training text classifiers. In Proc. of the SIGIR '94, pages 3-12, 1994.
    • (1994) Proc. of the SIGIR '94 , pp. 3-12
    • Lewis, D.D.1    Gale, W.A.2
  • 15
    • 85149124378
    • Rule writing or annotation: Cost-efficient resource usage for base noun phrase chunking
    • G. Ngai and D. Yarowsky. Rule writing or annotation: Cost-efficient resource usage for base noun phrase chunking. In Proc. of the ACL '00, pages 117-125, 2000.
    • (2000) Proc. of the ACL '00 , pp. 117-125
    • Ngai, G.1    Yarowsky, D.2
  • 19
    • 56649102769
    • An approach to text corpus construction which cuts annotation costs and maintains corpus reusability of annotated data
    • K. Tomanek, J. Wermter, and U. Hahn. An approach to text corpus construction which cuts annotation costs and maintains corpus reusability of annotated data. In Proc. of the EMNLP-CoNLL '07, pages 486-495, 2007.
    • (2007) Proc. of the EMNLP-CoNLL '07 , pp. 486-495
    • Tomanek, K.1    Wermter, J.2    Hahn, U.3
  • 20
    • 80053363150
    • Active learning for word sense disambiguation with methods for addressing the class imbalance problem
    • J. Zhu and E. Hovy. Active learning for word sense disambiguation with methods for addressing the class imbalance problem. In Proc. of the EMNLP-CoNLL '07, pages 783-790, 2007.
    • (2007) Proc. of the EMNLP-CoNLL '07 , pp. 783-790
    • Zhu, J.1    Hovy, E.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.