-
1
-
-
3242889975
-
Replacing suffix trees with enhanced suffix arrays
-
Mar
-
M. I. Abouelhoda, S. Kurtz, and E. Ohlebusch. Replacing Suffix Trees with Enhanced Suffix Arrays. Journal of Discrete Algorithms, 2(1):53-86, Mar. 2004.
-
(2004)
Journal of Discrete Algorithms
, vol.2
, Issue.1
, pp. 53-86
-
-
Abouelhoda, M.I.1
Kurtz, S.2
Ohlebusch, E.3
-
2
-
-
0037213089
-
An information-theoretic perspective of TF-IDF measures
-
Jan
-
A. Aizawa. An Information-Theoretic Perspective of TF-IDF Measures. Information Processing and Management, 39(1):45-65, Jan. 2003.
-
(2003)
Information Processing and Management
, vol.39
, Issue.1
, pp. 45-65
-
-
Aizawa, A.1
-
3
-
-
0032124110
-
Information distance
-
July
-
C. H. Bennett, P. Gács, M. Li, P. M. Vitányi, and W. H. Zurek. Information Distance. IEEE Transactions on Information Theory, 44(4):1407-1423, July 1998.
-
(1998)
IEEE Transactions on Information Theory
, vol.44
, Issue.4
, pp. 1407-1423
-
-
Bennett, C.H.1
Gács, P.2
Li, M.3
Vitányi, P.M.4
Zurek, W.H.5
-
4
-
-
0023383102
-
Complete inverted files for efficient text retrieval and analysis
-
July
-
A. Blumer, J. Blumer, D. Haussler, R. M. McConnell, and A. Ehrenfeucht. Complete Inverted Files for Efficient Text Retrieval and Analysis. Journal of the ACM, 34(3):578-595, July 1987.
-
(1987)
Journal of the ACM
, vol.34
, Issue.3
, pp. 578-595
-
-
Blumer, A.1
Blumer, J.2
Haussler, D.3
McConnell, R.M.4
Ehrenfeucht, A.5
-
9
-
-
84936824188
-
Word association norms, mutual information, and lexicography
-
Mar
-
K. W. Church and P. Hanks. Word Association Norms, Mutual Information, and Lexicography. Computational Linguistics, 16(1):22-29, Mar. 1990.
-
(1990)
Computational Linguistics
, vol.16
, Issue.1
, pp. 22-29
-
-
Church, K.W.1
Hanks, P.2
-
12
-
-
4944243732
-
A local maxima method and a fair dispersion normalization for extracting multi-word units from corpora
-
July
-
J. F. da Silva and J. G. P. Lopes. A Local Maxima Method and a Fair Dispersion Normalization for Extracting Multi-word Units from Corpora. In Proceedings of Meeting on Mathematics of Language (MOL), pages 369-381, July 1999.
-
(1999)
Proceedings of Meeting on Mathematics of Language (MOL)
, pp. 369-381
-
-
Da Silva, J.F.1
Lopes, J.G.P.2
-
15
-
-
84857913430
-
New algorithms on wavelet trees and applications to information retrieval
-
Apr
-
T. Gagie, G. Navarro, and S. J. Puglisi. New Algorithms on Wavelet Trees and Applications to Information Retrieval. Theoretical Computer Science, 426-427:25-41, Apr. 2012.
-
(2012)
Theoretical Computer Science
, vol.426-427
, pp. 25-41
-
-
Gagie, T.1
Navarro, G.2
Puglisi, S.J.3
-
18
-
-
0016526522
-
A probabilistic approach to automatic keyword indexing. Part I. On the distribution of specialty words in a technical literature
-
July/Aug
-
S. P. Harter. A Probabilistic Approach to Automatic Keyword Indexing. Part I. On the Distribution of Specialty Words in a Technical Literature. Journal of the American Society for Information Science, 26(4):197-206, July/Aug. 1975.
-
(1975)
Journal of the American Society for Information Science
, vol.26
, Issue.4
, pp. 197-206
-
-
Harter, S.P.1
-
20
-
-
0000600049
-
A probabilistic justification for using tf×idf term weighting in information retrieval
-
Aug
-
D. Hiemstra. A Probabilistic Justification for Using TF×IDF Term Weighting in Information Retrieval. International Journal on Digital Libraries, 3(2):131-139, Aug. 2000.
-
(2000)
International Journal on Digital Libraries
, vol.3
, Issue.2
, pp. 131-139
-
-
Hiemstra, D.1
-
21
-
-
84953744816
-
A statistical interpretation of term specificity and its application in retrieval
-
K. S. Jones. A Statistical Interpretation of Term Specificity and its Application in Retrieval. Journal of Documentation, 28:11-21, 1972.
-
(1972)
Journal of Documentation
, vol.28
, pp. 11-21
-
-
Jones, K.S.1
-
22
-
-
0007424433
-
On tables of random numbers
-
A. Kolmogorov. On Tables of Random Numbers. Sankhya Ser. A, 25:369-376, 1963.
-
(1963)
Sankhya Ser. A
, vol.25
, pp. 369-376
-
-
Kolmogorov, A.1
-
23
-
-
62249199807
-
Supervised and traditional term weighting methods for automatic text categorization
-
Apr
-
M. Lan, C. L. Tan, J. Su, and Y. Lu. Supervised and Traditional Term Weighting Methods for Automatic Text Categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(4):721-735, Apr. 2009.
-
(2009)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.31
, Issue.4
, pp. 721-735
-
-
Lan, M.1
Tan, C.L.2
Su, J.3
Lu, Y.4
-
24
-
-
84966611720
-
Irreversibility and heat generation in the computing process
-
July
-
R. Landauer. Irreversibility and Heat Generation in the Computing Process. IBM Journal of Research and Development, 5(3):183-191, July 1961.
-
(1961)
IBM Journal of Research and Development
, vol.5
, Issue.3
, pp. 183-191
-
-
Landauer, R.1
-
28
-
-
79955139538
-
Unsupervised query segmentation using only query logs
-
Mar./Apr
-
N. Mishra, R. S. Roy, N. Ganguly, S. Laxman, and M. Choudhury. Unsupervised Query Segmentation Using only Query Logs. In Proceedings of International World Wide Web Conference (WWW), pages 91-92, Mar./Apr. 2011.
-
(2011)
Proceedings of International World Wide Web Conference (WWW)
, pp. 91-92
-
-
Mishra, N.1
Roy, R.S.2
Ganguly, N.3
Laxman, S.4
Choudhury, M.5
-
29
-
-
37849015556
-
Efficient computation of substring equivalence classes with suffix arrays
-
July
-
K. Narisawa, S. Inenaga, H. Bannai, and M. Takeda. Efficient Computation of Substring Equivalence Classes with Suffix Arrays. In Proceedings of Symposium on Combinatorial Pattern Matching (CPM), pages 340-351, July 2007.
-
(2007)
Proceedings of Symposium on Combinatorial Pattern Matching (CPM)
, pp. 340-351
-
-
Narisawa, K.1
Inenaga, S.2
Bannai, H.3
Takeda, M.4
-
33
-
-
84859887839
-
An extensive empirical study of collocation extraction methods
-
June
-
P. Pecina. An Extensive Empirical Study of Collocation Extraction Methods. In Proceedings of ACL Student Research Workshop, pages 13-18, June 2005.
-
(2005)
Proceedings of ACL Student Research Workshop
, pp. 13-18
-
-
Pecina, P.1
-
34
-
-
34548062522
-
TF-ICF: A new term weighting scheme for clustering dynamic data streams
-
Dec
-
J. W. Reed, Y. Jiao, T. E. Potok, B. A. Klump, M. T. Elmore, and A. R. Hurson. TF-ICF: A New Term Weighting Scheme for Clustering Dynamic Data Streams. In Proceedings of International Conference on Machine Learning and Applications (ICMLA), pages 258-263, Dec. 2006.
-
(2006)
Proceedings of International Conference on Machine Learning and Applications (ICMLA)
, pp. 258-263
-
-
Reed, J.W.1
Jiao, Y.2
Potok, T.E.3
Klump, B.A.4
Elmore, M.T.5
Hurson, A.R.6
-
36
-
-
8844253324
-
Understanding inverse document frequency: On theoretical arguments for IDF
-
S. Robertson. Understanding Inverse Document Frequency: On theoretical arguments for IDF. Journal of Documentation, 60(5):503-520, 2004.
-
(2004)
Journal of Documentation
, vol.60
, Issue.5
, pp. 503-520
-
-
Robertson, S.1
-
37
-
-
0001319911
-
Okapi at TREC-3
-
S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. Okapi at TREC-3. In Proceedings of Text Retrieval Conference (TREC), pages 109-126, 1994.
-
(1994)
Proceedings of Text Retrieval Conference (TREC)
, pp. 109-126
-
-
Robertson, S.1
Walker, S.2
Jones, S.3
Hancock-Beaulieu, M.4
Gatford, M.5
-
40
-
-
84866594862
-
An IR-based evaluation framework for web search query segmentation
-
Aug
-
R. S. Roy, N. Ganguly, M. Choudhury, and S. Laxman. An IR-based Evaluation Framework for Web Search Query Segmentation. In Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), pages 881-890, Aug. 2012.
-
(2012)
Proceedings of International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
, pp. 881-890
-
-
Roy, R.S.1
Ganguly, N.2
Choudhury, M.3
Laxman, S.4
-
42
-
-
0016572913
-
A vector space model for automatic indexing
-
G. Salton, A. Wong, and C.-S. Yang. A Vector Space Model for Automatic Indexing. Communications of the ACM, 18(11):613-620, 1975.
-
(1975)
Communications of the ACM
, vol.18
, Issue.11
, pp. 613-620
-
-
Salton, G.1
Wong, A.2
Yang, C.-S.3
-
46
-
-
46249110298
-
Interpreting TF-IDF term weights as making relevance decisions
-
June
-
H. C. Wu, R. W. P. Luk, K. F. Wong, and K. L. Kwok. Interpreting TF-IDF Term Weights as Making Relevance Decisions. ACM Transactions on Information Systems, 26(3):13:1-13:37, June 2008.
-
(2008)
ACM Transactions on Information Systems
, vol.26
, Issue.3
, pp. 13:1-13:37
-
-
Wu, H.C.1
Luk, R.W.P.2
Wong, K.F.3
Kwok, K.L.4
-
47
-
-
0038632285
-
Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus
-
Mar
-
M. Yamamoto and K. W. Church. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus. Computational Linguistics, 27(1):1-30, Mar. 2001.
-
(2001)
Computational Linguistics
, vol.27
, Issue.1
, pp. 1-30
-
-
Yamamoto, M.1
Church, K.W.2
-
48
-
-
67349246433
-
Improving effectiveness of mutual information for substantival multiword expression extraction
-
Oct
-
W. Zhang, T. Yoshida, X. Tang, and T.-B. Ho. Improving Effectiveness of Mutual Information for Substantival Multiword Expression Extraction. Expert Systems with Applications, 36(8):10919-10930, Oct. 2009.
-
(2009)
Expert Systems with Applications
, vol.36
, Issue.8
, pp. 10919-10930
-
-
Zhang, W.1
Yoshida, T.2
Tang, X.3
Ho, T.-B.4