메뉴 건너뛰기




Volumn 7, Issue 2, 2013, Pages 613-639

Model trees with topic model preprocessing: An approach for data journalism illustrated with the Wikileaks Afghanistan war logs

Author keywords

Afghanistan; Computational social science; Count data; Database data; Latent Dirichlet allocation; Model based recursive partitioning; Text mining; Tree stability; Tree validation; WikiLeaks

Indexed keywords


EID: 84879514447     PISSN: 19326157     EISSN: 19417330     Source Type: Journal    
DOI: 10.1214/12-AOAS618     Document Type: Article
Times cited : (21)

References (65)
  • 2
    • 84879540637 scopus 로고    scopus 로고
    • AMNESTY INTERNATIONAL, Available at
    • AMNESTY INTERNATIONAL (2009). Afghanistan: German government must investigate deadly Kunduz airstrikes. Available at http://www.amnesty.org/en/news-and-updates/news/ afghanistan-german-government-must-investigate-deadly-kunduz-airstrikes-20091030.
    • (2009) Afghanistan: German government must investigate deadly Kunduz airstrikes
  • 3
    • 0037045542 scopus 로고    scopus 로고
    • Children of war: The real casualties of the Afghan conflict
    • BHUTTA, Z. A. (2002). Children of war: The real casualties of the Afghan conflict. BMJ 324 349-352.
    • (2002) BMJ , vol.324 , pp. 349-352
    • Bhutta, Z.A.1
  • 4
    • 34548579305 scopus 로고    scopus 로고
    • Military fatality rates (by cause) in Afghanistan and Iraq: A measure of hostilities
    • BIRD, S. M. and FAIRWEATHER, C. B. (2007). Military fatality rates (by cause) in Afghanistan and Iraq: A measure of hostilities. Int. J. Epidemiol. 36 841-846.
    • (2007) Int. J. Epidemiol. , vol.36 , pp. 841-846
    • Bird, S.M.1    Fairweather, C.B.2
  • 5
    • 84861170800 scopus 로고    scopus 로고
    • Probabilistic topic models
    • BLEI, D. (2012). Probabilistic topic models. Communications of the ACM 55 77-84.
    • (2012) Communications of the ACM , vol.55 , pp. 77-84
    • Blei, D.1
  • 7
    • 52449116403 scopus 로고    scopus 로고
    • A correlated topic model of Science
    • MR2393839
    • BLEI, D. M. and LAFFERTY, J. D. (2007). A correlated topic model of Science. Ann. Appl. Stat. 1 17-35. MR2393839
    • (2007) Ann. Appl. Stat , vol.1 , pp. 17-35
    • Blei, D.M.1    Lafferty, J.D.2
  • 9
    • 79952499031 scopus 로고    scopus 로고
    • The war in Afghanistan. Counting the dead in Afghanistan
    • BOHANNON, J. (2011). The war in Afghanistan. Counting the dead in Afghanistan. Science 331 1256-1260.
    • (2011) Science , vol.331 , pp. 1256-1260
    • Bohannon, J.1
  • 10
  • 11
    • 33749999494 scopus 로고    scopus 로고
    • Mortality after the 2003 invasion of Iraq a cross-sectional cluster sample survey
    • BURNHAM, G., LAFTA, R., DOOCY, S. and ROBERTS, L. (2006). Mortality after the 2003 invasion of Iraq a cross-sectional cluster sample survey. Lancet 368 1421-1428.
    • (2006) Lancet , vol.368 , pp. 1421-1428
    • Burnham, G.1    Lafta, R.2    Doocy, S.3    Roberts, L.4
  • 14
    • 18744386102 scopus 로고    scopus 로고
    • Regression trees for analysis of count data with extra Poisson variation
    • MR2141425
    • CHOI, Y., AHN, H. and CHEN, J. J. (2005). Regression trees for analysis of count data with extra Poisson variation. Comput. Statist. Data Anal. 49 893-915. MR2141425
    • (2005) Comput. Statist. Data Anal , vol.49 , pp. 893-915
    • Choi, Y.1    Ahn, H.2    Chen, J.J.3
  • 16
    • 80053498197 scopus 로고    scopus 로고
    • Computational journalism: How computer scientists can empower journalists, democracy's watchdogs, in the production of news in the public interest
    • COHEN, S., HAMILTON, J. T. and TURNER, F. (2011). Computational journalism: How computer scientists can empower journalists, democracy's watchdogs, in the production of news in the public interest. Communciations of the ACM 54 66-71.
    • (2011) Communciations of the ACM , vol.54 , pp. 66-71
    • Cohen, S.1    Hamilton, J.T.2    Turner, F.3
  • 19
    • 74549179052 scopus 로고    scopus 로고
    • Patterns of mortality rates in Darfur conflict
    • DEGOMME, O. and GUHA-SAPIR, D. (2010). Patterns of mortality rates in Darfur conflict. Lancet 375 294-300.
    • (2010) Lancet , vol.375 , pp. 294-300
    • Degomme, O.1    Guha-Sapir, D.2
  • 20
    • 84879518553 scopus 로고    scopus 로고
    • Available at
    • FIRM (2011). Cluster@WU. Available at http://www.wu.ac.at/firm/cluster_folder.
    • (2011) Cluster@WU
    • Firm1
  • 22
    • 0025772337 scopus 로고
    • Epidemiologic analysis of warfare. A historical review
    • GARFIELD, R. M. and NEUGUT, A. I. (1991). Epidemiologic analysis of warfare. A historical review. J. Amer. Med. Assoc. 266 688-692.
    • (1991) J. Amer. Med. Assoc. , vol.266 , pp. 688-692
    • Garfield, R.M.1    Neugut, A.I.2
  • 25
    • 79961227421 scopus 로고    scopus 로고
    • Topicmodels: An R package for fitting topic models
    • GRÜN, B. and HORNIK, K. (2011). topicmodels: An R package for fitting topic models. Journal of Statistical Software 40 1-30.
    • (2011) Journal of Statistical Software , vol.40 , pp. 1-30
    • Grün, B.1    Hornik, K.2
  • 26
    • 84879533598 scopus 로고    scopus 로고
    • GUARDIAN. CO. UK, Available at
    • GUARDIAN. CO. UK (2010). Afghanistan war logs: 56 civilians killed in Nato bombing. Available at http://www.guardian.co.uk/world/afghanistan/warlogs/826B488C-EA6F-A132-511610DB68C2EDBD.
    • (2010) Afghanistan war logs: 56 civilians killed in Nato bombing
  • 27
    • 78149240063 scopus 로고    scopus 로고
    • Both sides retaliate in the Israeli-Palestinian conflict
    • USA
    • HAUSHOFER, J., BILETZKI, A. and KANWISHER, N. (2010). Both sides retaliate in the Israeli-Palestinian conflict. Proc. Natl. Acad. Sci. USA 107 17927-17932.
    • (2010) Proc. Natl. Acad. Sci. , vol.107 , pp. 17927-17932
    • Haushofer, J.1    Biletzki, A.2    Kanwisher, N.3
  • 28
    • 34548254914 scopus 로고    scopus 로고
    • Cluster-wise assessment of cluster stability
    • MR2409980
    • HENNIG, C. (2007). Cluster-wise assessment of cluster stability. Comput. Statist. Data Anal. 52 258-271. MR2409980
    • (2007) Comput. Statist. Data Anal , vol.52 , pp. 258-271
    • Hennig, C.1
  • 32
    • 33749677657 scopus 로고    scopus 로고
    • Unbiased recursive partitioning: A conditional inference framework
    • MR2291267
    • HOTHORN, T., HORNIK, K. and ZEILEIS, A. (2006). Unbiased recursive partitioning: A conditional inference framework. J. Comput. Graph. Statist. 15 651-674. MR2291267
    • (2006) J. Comput. Graph. Statist , vol.15 , pp. 651-674
    • Hothorn, T.1    Hornik, K.2    Zeileis, A.3
  • 33
    • 0001368374 scopus 로고
    • Distribution de la flore alpine dans le bassin des Dranses et dans quelques régions voisines [Distribution of alpine flora in the Dranse basin and several neighboring regions]
    • JACCARD, P. (1901). Distribution de la flore alpine dans le bassin des Dranses et dans quelques régions voisines [Distribution of alpine flora in the Dranse basin and several neighboring regions]. Bulletin de la Société Vaudoise des Sciences Naturelles 37 241-272.
    • (1901) Bulletin de la Société Vaudoise des Sciences Naturelles , vol.37 , pp. 241-272
    • Jaccard, P.1
  • 34
    • 70849135366 scopus 로고    scopus 로고
    • Beanplot: A boxplot alternative for visual comparison of distributions
    • KAMPSTRA, P. (2008). Beanplot: A boxplot alternative for visual comparison of distributions. Journal of Statistical Software, Code Snippets 28 1-9.
    • (2008) Journal of Statistical Software, Code Snippets , vol.28 , pp. 1-9
    • Kampstra, P.1
  • 35
    • 1542573450 scopus 로고    scopus 로고
    • Classification trees with unbiased multiway splits
    • MR1946427
    • KIM, H. and LOH, W.-Y. (2001). Classification trees with unbiased multiway splits. J. Amer. Statist. Assoc. 96 589-604. MR1946427
    • (2001) J. Amer. Statist. Assoc , vol.96 , pp. 589-604
    • Kim, H.1    Loh, W.-Y.2
  • 37
    • 18544383606 scopus 로고    scopus 로고
    • Israeli army casualties in the second Palestinian uprising
    • LAKSTEIN, D. and BLUMENFELD, A. (2005). Israeli army casualties in the second Palestinian uprising. Mil. Med. 170 427-430.
    • (2005) Mil. Med. , vol.170 , pp. 427-430
    • Lakstein, D.1    Blumenfeld, A.2
  • 38
    • 84988052086 scopus 로고
    • Negative binomial and mixed Poisson regression
    • MR0926553
    • LAWLESS, J. F. (1987). Negative binomial and mixed Poisson regression. Canad. J. Statist. 15 209-225. MR0926553
    • (1987) Canad. J. Statist , vol.15 , pp. 209-225
    • Lawless, J.F.1
  • 40
    • 0036556537 scopus 로고    scopus 로고
    • Regression trees with unbiased variable selection and interaction detection
    • MR1902715
    • LOH, W.-Y. (2002). Regression trees with unbiased variable selection and interaction detection. Statist. Sinica 12 361-386. MR1902715
    • (2002) Statist. Sinica , vol.12 , pp. 361-386
    • Loh, W.-Y.1
  • 41
    • 78651532074 scopus 로고    scopus 로고
    • Improving the precision of classification trees
    • MR2752155
    • LOH, W.-Y. (2009). Improving the precision of classification trees. Ann. Appl. Stat. 3 1710-1737. MR2752155
    • (2009) Ann. Appl. Stat , vol.3 , pp. 1710-1737
    • Loh, W.-Y.1
  • 42
    • 0031312210 scopus 로고    scopus 로고
    • Split selection methods for classification trees
    • MR1488644
    • LOH, W.-Y. and SHIH, Y.-S. (1997). Split selection methods for classification trees. Statist. Sinica 7 815-840. MR1488644
    • (1997) Statist. Sinica , vol.7 , pp. 815-840
    • Loh, W.-Y.1    Shih, Y.-S.2
  • 44
    • 0003779972 scopus 로고
    • 3rd ed. Longman, Green, Longman, Roberts, and Green, London
    • NIGHTINGALE, F. (1863). Notes on Hospitals, 3rd ed. Longman, Green, Longman, Roberts, and Green, London.
    • (1863) Notes on Hospitals
    • Nightingale, F.1
  • 45
    • 77955884352 scopus 로고    scopus 로고
    • Peering into the fog of war: The geography of the Wikileaks Afghanistan war logs, 2004-2009
    • O'LOUGHLIN, J., WITMER, F. D., LINKE, A. M. and THORWARDSON, N. (2010). Peering into the fog of war: The geography of the Wikileaks Afghanistan war logs, 2004-2009. Eurasian Geography and Economics 51 472-495.
    • (2010) Eurasian Geography and Economics , vol.51 , pp. 472-495
    • O'loughlin, J.1    Witmer, F.D.2    Linke, A.M.3    Thorwardson, N.4
  • 46
    • 84907095419 scopus 로고    scopus 로고
    • R: A language and environment for statistical computing
    • R DEVELOPMENT CORE TEAM Vienna, Austria
    • R DEVELOPMENT CORE TEAM (2012). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.
    • (2012) R Foundation for Statistical Computing
  • 49
    • 84879533886 scopus 로고    scopus 로고
    • Gaining insight with recursive partitioning of generalized linear models
    • To appear
    • RUSCH, T. and ZEILEIS, A. (2013). Gaining insight with recursive partitioning of generalized linear models. J. Stat. Comput. Simul. To appear.
    • (2013) J. Stat. Comput. Simul
    • Rusch, T.1    Zeileis, A.2
  • 54
    • 0034709721 scopus 로고    scopus 로고
    • War andmortality in Kosovo, 1998-99: An epidemiological testimony
    • SPIEGEL, P. B. and SALAMA, P. (2001). War andmortality in Kosovo, 1998-99: An epidemiological testimony. Lancet 355 2204-2209.
    • (2001) Lancet , vol.355 , pp. 2204-2209
    • Spiegel, P.B.1    Salama, P.2
  • 56
    • 0034965040 scopus 로고    scopus 로고
    • Accidents and injuries among US Navy crewmembers during extended submarine patrols, 1997 to 1999
    • THOMAS, T. L., PARKER, A. L., HORN, W. G., MOLE, D., SPIRO, T. R., HOOPER, T. I. and GARLAND, F. C. (2001). Accidents and injuries among US Navy crewmembers during extended submarine patrols, 1997 to 1999. Mil. Med. 166 534-540.
    • (2001) Mil. Med. , vol.166 , pp. 534-540
    • Thomas, T.L.1    Parker, A.L.2    Horn, W.G.3    Mole, D.4    Spiro, T.R.5    Hooper, T.I.6    Garland, F.C.7
  • 57
    • 26644472430 scopus 로고    scopus 로고
    • Cluster validation by prediction strength
    • MR2170199
    • TIBSHIRANI, R. and WALTHER, G. (2005). Cluster validation by prediction strength. J. Comput. Graph. Statist. 14 511-528. MR2170199
    • (2005) J. Comput. Graph. Statist , vol.14 , pp. 511-528
    • Tibshirani, R.1    Walther, G.2
  • 63
    • 35348978702 scopus 로고    scopus 로고
    • GeneralizedM-fluctuation tests for parameter instability
    • MR2351461
    • ZEILEIS, A. andHORNIK, K. (2007). GeneralizedM-fluctuation tests for parameter instability. Stat. Neerl. 61 488-508. MR2351461
    • (2007) Stat. Neerl , vol.61 , pp. 488-508
    • Zeileis, A.1    Hornik, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.