Volume 33, 2015, Pages 203-238

Big Data Driven Natural Language Processing Research and Applications

Author keywords

Big data; Natural language processing; Statistical models

EID: 84944593291     PISSN: 01697161     EISSN: None     Source Type: Book Series    
DOI: 10.1016/B978-0-444-63492-4.00009-5     Document Type: Chapter
Times cited: 35

References (96)
  • 2
    • 84928017675
    • Text classification using machine learning methods-a survey
    • Springer India, B.V. Babu, A. Nagar, K. Deep, M. Pant, J.C. Bansal, K. Ray, U. Gupta (Eds.)
    • Agarwal B., Mittal N. Text classification using machine learning methods-a survey. Advances in Intelligent Systems and Computing 2014, vol. 236:701-709. Springer India. B.V. Babu, A. Nagar, K. Deep, M. Pant, J.C. Bansal, K. Ray, U. Gupta (Eds.).
    • (2014) Advances in Intelligent Systems and Computing , vol.236 , pp. 701-709
    • Agarwal, B.1    Mittal, N.2
  • 7
    • 84861170800
    • Probabilistic topic models
    • Blei D.M. Probabilistic topic models. Commun. ACM 2012, 55(4):77-84.
    • (2012) Commun. ACM , vol.55 , Issue.4 , pp. 77-84
    • Blei, D.M.1
  • 9
    • 47749103248
    • Linguistic Data Consortium
    • Brants T., Franz A. Web 1T 5-gram version 1 2012, Linguistic Data Consortium. http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13.
    • (2012) Web 1T 5-gram version 1
    • Brants, T.1    Franz, A.2
  • 10
    • 84867919822
    • Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging
    • Brill E. Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging. Comput. Linguist. 1995, 21(4):543-565.
    • (1995) Comput. Linguist. , vol.21 , Issue.4 , pp. 543-565
    • Brill, E.1
  • 11
    • 84944531206
    • British National Corpus 2014, http://www.natcorp.ox.ac.uk/.
    • (2014)
  • 14
    • 84943757899
    • One billion word benchmark for measuring progress in statistical language modeling
    • Chelba C., Mikolov T., Schuster M., Ge Q., Brants T., Koehn P. One billion word benchmark for measuring progress in statistical language modeling. CoRR 2013, abs/1312.3005. http://arxiv.org/abs/1312.3005.
    • (2013) CoRR
    • Chelba, C.1    Mikolov, T.2    Schuster, M.3    Ge, Q.4    Brants, T.5    Koehn, P.6
  • 15
    • 0004291783
    • De Gruyter Mouton, Boston, Massachusetts
    • Chomsky N. Syntactic Structures 2002, De Gruyter Mouton, Boston, Massachusetts. second ed.
    • (2002) Syntactic Structures
    • Chomsky, N.1
  • 19
    • 84888424872
    • Data science and prediction
    • Dhar V. Data science and prediction. Commun. ACM 2013, 56(12):64-73.
    • (2013) Commun. ACM , vol.56 , Issue.12 , pp. 64-73
    • Dhar, V.1
  • 22
    • 84923310628
    • Max Planck Institute for Evolutionary Anthropology, Leipzig
    • Dryer M.S., Haspelmath M. WALS Online 2013, Max Planck Institute for Evolutionary Anthropology, Leipzig.
    • (2013) WALS Online
    • Dryer, M.S.1    Haspelmath, M.2
  • 23
    • 46749110035
    • Text mining infrastructure in R
    • Feinerer I., Hornik K., Meyer D. Text mining infrastructure in R. J. Stat. Softw. 2008, 25(5):1-54. http://www.jstatsoft.org/v25/i05.
    • (2008) J. Stat. Softw. , vol.25 , Issue.5 , pp. 1-54
    • Feinerer, I.1    Hornik, K.2    Meyer, D.3
  • 25
    • 0039430078
    • Francis N., Kucera H. The Brown Corpus 1964, http://www.essex.ac.uk/linguistics/external/clmt/w3c/corpus_ling/content/corpora/list/private/brown/brown.html.
    • (1964) The Brown Corpus
    • Francis, N.1    Kucera, H.2
  • 30
    • 84944623426
    • Google Syntactic N-grams 2014, http://storage.googleapis.com/books/syntactic-ngrams/index.html.
    • (2014) Syntactic N-grams
  • 31
    • 70849126253
    • The unreasonable effectiveness of data
    • Halevy A., Norvig P., Pereira F. The unreasonable effectiveness of data. IEEE Intell. Syst. 2009, 24(2):8-12.
    • (2009) IEEE Intell. Syst. , vol.24 , Issue.2 , pp. 8-12
    • Halevy, A.1    Norvig, P.2    Pereira, F.3
  • 36
    • 84897743792
    • Learning representations for weakly supervised natural language processing tasks
    • Huang F., Ahuja A., Downey D., Yang Y., Guo Y., Yates A. Learning representations for weakly supervised natural language processing tasks. Comput. Linguist. 2014, 40(1):85-120.
    • (2014) Comput. Linguist. , vol.40 , Issue.1 , pp. 85-120
    • Huang, F.1    Ahuja, A.2    Downey, D.3    Yang, Y.4    Guo, Y.5    Yates, A.6
  • 41
    • 1842712330
    • Text information extraction in images and video: a survey
    • Jung K., Kim K.I., Jain A.K. Text information extraction in images and video: a survey. Pattern Recogn. 2004, 37(5):977-997.
    • (2004) Pattern Recogn. , vol.37 , Issue.5 , pp. 977-997
    • Jung, K.1    Kim, K.I.2    Jain, A.K.3
  • 52
    • 84940448896
    • Linguistic Data Consortium Language resources 2015, https://www.ldc.upenn.edu/language-resources.
    • (2015) Language resources
  • 55
    • 34249852033
    • Building a large annotated corpus of English: the Penn Treebank
    • Cambridge, MA, USA
    • Marcus M.P., Marcinkiewicz M.A., Santorini B. Building a large annotated corpus of English: the Penn Treebank. Comput. Linguist. 1993, 19(2):313-330. Cambridge, MA, USA.
    • (1993) Comput. Linguist. , vol.19 , Issue.2 , pp. 313-330
    • Marcus, M.P.1    Marcinkiewicz, M.A.2    Santorini, B.3
  • 57
    • 47749122510
    • A survey of named entity recognition and classification
    • Nadeau D., Sekine S. A survey of named entity recognition and classification. Lingvist. Investig. 2007, 30(1):3-26.
    • (2007) Lingvist. Investig. , vol.30 , Issue.1 , pp. 3-26
    • Nadeau, D.1    Sekine, S.2
  • 58
    • 61949087310
    • Word sense disambiguation: a survey
    • 10:1-10:69
    • Navigli R. Word sense disambiguation: a survey. ACM Comput. Surv. 2009, 41(2). 10:1-10:69.
    • (2009) ACM Comput. Surv. , vol.41 , Issue.2
    • Navigli, R.1
  • 61
    • 84860848638
    • A survey of text summarization techniques
    • Springer, Heidelberg, C. Aggarwal, C. Zhai (Eds.)
    • Nenkova A., McKeown K. A survey of text summarization techniques. Mining Text Data 2012, 43-76. Springer, Heidelberg. C. Aggarwal, C. Zhai (Eds.).
    • (2012) Mining Text Data , pp. 43-76
    • Nenkova, A.1    McKeown, K.2
  • 62
    • 84890530341
    • Natural language corpus data
    • O'Reilly Media, T. Segaran, J. Hammerbacher (Eds.)
    • Norvig P. Natural language corpus data. Beautiful Data: The Stories Behind Elegant Data Solutions 2009, 219-242. O'Reilly Media. T. Segaran, J. Hammerbacher (Eds.).
    • (2009) Beautiful Data: The Stories Behind Elegant Data Solutions , pp. 219-242
    • Norvig, P.1
  • 63
    • 84944522690
    • OASIS Technical Committee Apache UIMA project 2015, http://uima.apache.org/.
    • (2015) Apache UIMA project
  • 64
    • 85072855288
    • A trainable rule-based algorithm for word segmentation
    • Association for Computational Linguistics, Stroudsburg, PA
    • Palmer D.D. A trainable rule-based algorithm for word segmentation. Proceedings of the 35th Annual Meeting of the ACL, Madrid, Spain 1997, 321-328. Association for Computational Linguistics, Stroudsburg, PA.
    • (1997) Proceedings of the 35th Annual Meeting of the ACL, Madrid, Spain , pp. 321-328
    • Palmer, D.D.1
  • 65
    • 33645983416
    • The proposition bank: an annotated corpus of semantic roles
    • Palmer M., Gildea D., Kingsbury P. The proposition bank: an annotated corpus of semantic roles. Comput. Linguist. 2005, 31(1):71-106.
    • (2005) Comput. Linguist. , vol.31 , Issue.1 , pp. 71-106
    • Palmer, M.1    Gildea, D.2    Kingsbury, P.3
  • 70
    • 84944535482
    • Gutenberg Project 2015, https://www.gutenberg.org/.
    • (2015)
    • Gutenberg, P.1
  • 71
    • 85124016637
    • A maximum entropy model for part-of-speech tagging
    • Philadelphia, PA, USA, E. Brill, K. Church (Eds.)
    • Ratnaparkhi A. A maximum entropy model for part-of-speech tagging. Proceedings of the Empirical Methods in Natural Language Processing 1996, 133-142. Philadelphia, PA, USA. E. Brill, K. Church (Eds.).
    • (1996) Proceedings of the Empirical Methods in Natural Language Processing , pp. 133-142
    • Ratnaparkhi, A.1
  • 77
    • 84868288681
    • Information extraction
    • Sarawagi S. Information extraction. Found. Trends Databases 2008, 1(3):261-377.
    • (2008) Found. Trends Databases , vol.1 , Issue.3 , pp. 261-377
    • Sarawagi, S.1
  • 81
    • 84944524892
    • Stanford NLP Group Stanford NLP tools 2014, http://nlp.stanford.edu/software/index.shtml.
    • (2014) Stanford NLP tools
  • 83
    • 83255167044
    • An introduction to conditional random fields
    • Sutton C., McCallum A. An introduction to conditional random fields. Found. Trends Mach. Learn. 2012, 4(4):267-373. 1935-8237.
    • (2012) Found. Trends Mach. Learn. , vol.4 , Issue.4 , pp. 267-373
    • Sutton, C.1    McCallum, A.2
  • 85
    • 84944628285
    • The Apache Software Foundation openNLP 2012, http://opennlp.apache.org/.
    • (2012) openNLP
  • 86
    • 84944585120
    • The Apache Software Foundation Apache lucene core 2014, http://lucene.apache.org/core/.
    • (2014) Apache lucene core
  • 87
    • 84944586135
    • The University of Sheffield GATE 2015, http://gate.ac.uk/.
    • (2015) GATE
  • 92
    • 84980078034
    • The unreasonable effectiveness of mathematics in the natural sciences
    • Wigner E. The unreasonable effectiveness of mathematics in the natural sciences. Commun. Pure Appl. Math. 1960, 13(1):1-14.
    • (1960) Commun. Pure Appl. Math. , vol.13 , Issue.1 , pp. 1-14
    • Wigner, E.1
  • 95
    • 84944567991
    • YouTube Viewership statistics 2015, http://www.youtube.com/yt/press/statistics.html.
    • (2015) Viewership statistics
  • 96
    • 79952778019
    • Syntactic processing using the generalized perceptron and beam search
    • Zhang Y., Clark S. Syntactic processing using the generalized perceptron and beam search. Comput. Linguist. 2011, 37(1):105-151.
    • (2011) Comput. Linguist. , vol.37 , Issue.1 , pp. 105-151
    • Zhang, Y.1    Clark, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.