메뉴 건너뛰기




Volumn 6912 LNAI, Issue PART 2, 2011, Pages 341-357

Analyzing word frequencies in large text corpora using inter-arrival times and bootstrapping

Author keywords

burstiness; natural language modeling; sequence analysis

Indexed keywords

BAG OF WORDS; BERNOULLI PROCESS; BURSTINESS; FALSE POSITIVE; FIXED PARAMETERS; FREQUENCY COUNTS; GOLD STANDARDS; INTER-ARRIVAL TIME; METHOD MODEL; NATURAL LANGUAGES; NULL MODEL; OCCURRENCE PATTERN; SCIENTIFIC DISCIPLINE; SEQUENCE ANALYSIS; SIGNIFICANCE TESTING; SPATIAL PATTERNS; TEXT CORPORA; WORD FREQUENCIES;

EID: 80052405328     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-23783-6_22     Document Type: Conference Paper
Times cited : (13)

References (26)
  • 1
    • 70450169875 scopus 로고    scopus 로고
    • Beyond word frequency: Bursts, lulls, and scaling in the temporal distributions of words
    • Altmann, E.G., Pierrehumbert, J.B., Motter, A.E.: Beyond word frequency: Bursts, lulls, and scaling in the temporal distributions of words. PLoS ONE 4(11), e7678 (2009)
    • (2009) PLoS ONE , vol.4 , Issue.11
    • Altmann, E.G.1    Pierrehumbert, J.B.2    Motter, A.E.3
  • 3
    • 18744406314 scopus 로고    scopus 로고
    • The origin of bursts and heavy tails in human dynamics
    • Barabási, A.-L.: The origin of bursts and heavy tails in human dynamics. Nature 435, 207-211 (2005)
    • (2005) Nature , vol.435 , pp. 207-211
    • Barabási, A.-L.1
  • 5
    • 85055298348 scopus 로고
    • Accurate methods for the statistics of surprise and coincidence
    • Dunning, T.: Accurate methods for the statistics of surprise and coincidence. Computational Linguistics 19, 61-74 (1993)
    • (1993) Computational Linguistics , vol.19 , pp. 61-74
    • Dunning, T.1
  • 7
    • 0033204106 scopus 로고    scopus 로고
    • On power-law relationships of the internet topology
    • Faloutsos, M., Faloutsos, P., Faloutsos, C.: On power-law relationships of the internet topology. In: ACM SIGCOMM, pp. 251-262 (1999)
    • (1999) ACM SIGCOMM , pp. 251-262
    • Faloutsos, M.1    Faloutsos, P.2    Faloutsos, C.3
  • 8
    • 33745624002 scopus 로고    scopus 로고
    • Parameter free bursty events detection in text streams
    • Fung, G.P.C., Pui, G., Fung, C., Yu, J.X., Yu, P.S., Yu, S., Lu, H.: Parameter free bursty events detection in text streams. In: VLDB, pp. 181-192 (2005)
    • (2005) VLDB , pp. 181-192
    • Fung, G.P.C.1    Pui, G.2    Fung, C.3    Yu, J.X.4    Yu, P.S.5    Yu, S.6    Lu, H.7
  • 9
    • 34248692260 scopus 로고    scopus 로고
    • Null-hypothesis significance testing of word frequencies: A follow-up on Kilgarriff
    • Gries, S.T.: Null-hypothesis significance testing of word frequencies: a follow-up on Kilgarriff. Corpus Linguistics and Linguistic Theory 12, 277-294 (2005)
    • (2005) Corpus Linguistics and Linguistic Theory , vol.12 , pp. 277-294
    • Gries, S.T.1
  • 10
    • 24344466887 scopus 로고    scopus 로고
    • Syntactic priming: A corpus-based approach
    • Gries, S.T.: Syntactic priming: A corpus-based approach. Journal of Psycholinguistic Research 34(4), 365-399 (2005)
    • (2005) Journal of Psycholinguistic Research , vol.34 , Issue.4 , pp. 365-399
    • Gries, S.T.1
  • 11
    • 36448936021 scopus 로고    scopus 로고
    • Analyzing feature trajectories for event detection
    • He, Q., Chang, K., Lim, E.-P.: Analyzing feature trajectories for event detection. In: ACM SIGIR, pp. 207-214 (2007)
    • (2007) ACM SIGIR , pp. 207-214
    • He, Q.1    Chang, K.2    Lim, E.-P.3
  • 12
    • 49749133495 scopus 로고    scopus 로고
    • Using burstiness to improve clustering of topics in news streams
    • He, Q., Chang, K., Lim, E.-P.: Using burstiness to improve clustering of topics in news streams. In: IEEE ICDM, pp. 493-498 (2007)
    • (2007) IEEE ICDM , pp. 493-498
    • He, Q.1    Chang, K.2    Lim, E.-P.3
  • 13
    • 70350718061 scopus 로고    scopus 로고
    • Bursty Feature Representation for Clustering Text Streams
    • He, Q., Chang, K., Lim, E.-P., Zhang, J.: Bursty Feature Representation for Clustering Text Streams. In: SIAM SDM, pp. 491-496 (2007)
    • (2007) SIAM SDM , pp. 491-496
    • He, Q.1    Chang, K.2    Lim, E.-P.3    Zhang, J.4
  • 15
    • 0042209915 scopus 로고    scopus 로고
    • Bursty and hierarchical structure in streams
    • Kleinberg, J.: Bursty and hierarchical structure in streams. DMKD 7, 373-397 (2003)
    • (2003) DMKD , vol.7 , pp. 373-397
    • Kleinberg, J.1
  • 18
    • 34248205060 scopus 로고    scopus 로고
    • Graph evolution: Densification and shrinking diameters
    • Leskovec, J., Kleinberg, J., Faloutsos, C.: Graph evolution: Densification and shrinking diameters. ACM TKDD 1(1) (2007)
    • (2007) ACM TKDD , vol.1 , Issue.1
    • Leskovec, J.1    Kleinberg, J.2    Faloutsos, C.3
  • 19
    • 0036071235 scopus 로고    scopus 로고
    • A note on the calculation of empirical p-values from Monte Carlo procedures
    • North, B.V., Curtis, D., Sham, P.C.: A note on the calculation of empirical p-values from Monte Carlo procedures. The American Journal of Human Genetics 71(2), 439-441 (2002)
    • (2002) The American Journal of Human Genetics , vol.71 , Issue.2 , pp. 439-441
    • North, B.V.1    Curtis, D.2    Sham, P.C.3
  • 21
    • 84989402655 scopus 로고    scopus 로고
    • Social differentiation in the use of English vocabulary: Some analyses of the conversational component of the British National Corpus
    • Rayson, P., Leech, G., Hodges, M.: Social differentiation in the use of English vocabulary: some analyses of the conversational component of the British National Corpus. International Journal of Corpus Linguistics 2(1), 133-152 (1997)
    • (1997) International Journal of Corpus Linguistics , vol.2 , Issue.1 , pp. 133-152
    • Rayson, P.1    Leech, G.2    Hodges, M.3
  • 22
    • 84966534942 scopus 로고
    • Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval
    • Robertson, S.E., Walker, S.: Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In: ACM SIGIR, pp. 232-241 (1994)
    • (1994) ACM SIGIR , pp. 232-241
    • Robertson, S.E.1    Walker, S.2
  • 23
    • 33748791707 scopus 로고    scopus 로고
    • Language users as creatures of habit: A corpus-based analysis of persistence in spoken English
    • Szmrecsanyi, B.: Language users as creatures of habit: A corpus-based analysis of persistence in spoken English. Corpus Linguistics and Linguistic Theory 1(1), 113-149 (2005)
    • (2005) Corpus Linguistics and Linguistic Theory , vol.1 , Issue.1 , pp. 113-149
    • Szmrecsanyi, B.1
  • 25
    • 3142717571 scopus 로고    scopus 로고
    • Identifying similarities, periodicities and bursts for online search queries
    • Vlachos, M.: Identifying similarities, periodicities and bursts for online search queries. In: ACM SIGMOD, pp. 131-142 (2004)
    • (2004) ACM SIGMOD , pp. 131-142
    • Vlachos, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.