-
1
-
-
0004015324
-
-
Prentice Hall, Englewood Cliffs, NJ
-
T.C. Bell, J.G. Cleary, I.H. Witten, Text Compression, Prentice Hall, Englewood Cliffs, NJ, 1990.
-
(1990)
Text Compression
-
-
Bell, T.C.1
Cleary, J.G.2
Witten, I.H.3
-
2
-
-
0032686423
-
Data compression using long common strings
-
IEEE Press, Los Alamitos, CA
-
J. Bentley, D. McIlroy, Data compression using long common strings, in: Proc. Data Compression Conference, IEEE Press, Los Alamitos, CA, 1999, pp. 287-295.
-
(1999)
Proc. Data Compression Conference
, pp. 287-295
-
-
Bentley, J.1
McIlroy, D.2
-
3
-
-
0029370635
-
Automatic condensation of electronic publications by sentence selection
-
R. Brandow, K. Mitze, L.F. Rau, Automatic condensation of electronic publications by sentence selection, Information Processing and Management 31 (5) (1995) 675-685.
-
(1995)
Information Processing and Management
, vol.31
, Issue.5
, pp. 675-685
-
-
Brandow, R.1
Mitze, K.2
Rau, L.F.3
-
4
-
-
0030211964
-
Bagging predictors
-
L. Breiman, Bagging predictors, Machine Learning 24 (2) (1996) 123-140.
-
(1996)
Machine Learning
, vol.24
, Issue.2
, pp. 123-140
-
-
Breiman, L.1
-
6
-
-
0021405335
-
Data compression using adaptive coding and partial string matching
-
J.G. Cleary, I.H. Witten, Data compression using adaptive coding and partial string matching, IEEE Trans. Comm. 32 (4) (1984) 396-402.
-
(1984)
IEEE Trans. Comm.
, vol.32
, Issue.4
, pp. 396-402
-
-
Cleary, J.G.1
Witten, I.H.2
-
7
-
-
85105809948
-
Inductive learning algorithms and representations for text categorization
-
S.T. Dumais, J. Platt, D. Heckerman, M. Sahami, Inductive learning algorithms and representations for text categorization, in: Proceedings of the 7th International Conference on Information and Knowledge Management, 1998.
-
(1998)
Proceedings of the 7th International Conference on Information and Knowledge Management
-
-
Dumais, S.T.1
Platt, J.2
Heckerman, D.3
Sahami, M.4
-
9
-
-
0002936192
-
Domain-specific keyphrase extraction
-
Stockholm, Sweden
-
E. Frank, G.W. Paynter, I.H. Witten, C. Gutwin, C. Nevill-Manning, Domain-specific keyphrase extraction, in: Proc. Internat. Joint Conference on Artificial Intelligence, Stockholm, Sweden, 1999, pp. 668-673.
-
(1999)
Proc. Internat. Joint Conference on Artificial Intelligence
, pp. 668-673
-
-
Frank, E.1
Paynter, G.W.2
Witten, I.H.3
Gutwin, C.4
Nevill-Manning, C.5
-
10
-
-
0033894701
-
Text categorization using compression models
-
(Poster Paper), IEEE Press, Los Alamitos, CA
-
E. Frank, C. Chiu, I.H. Witten, Text categorization using compression models, in: Proc. Data Compression Conference (Poster paper), IEEE Press, Los Alamitos, CA, 2000. Full version available as Working Paper 00/2, Department of Computer Science, University of Waikato.
-
(2000)
Proc. Data Compression Conference
-
-
Frank, E.1
Chiu, C.2
Witten, I.H.3
-
11
-
-
0033894701
-
-
Department of Computer Science, University of Waikato
-
E. Frank, C. Chiu, I.H. Witten, Text categorization using compression models, in: Proc. Data Compression Conference (Poster paper), IEEE Press, Los Alamitos, CA, 2000. Full version available as Working Paper 00/2, Department of Computer Science, University of Waikato.
-
Working Paper
, vol.2
-
-
-
13
-
-
0004137004
-
-
Cambridge University Press, Cambridge, UK
-
D. Gusfield, Algorithms on Strings, Trees, and Sequences, Cambridge University Press, Cambridge, UK, 1997.
-
(1997)
Algorithms on Strings, Trees, and Sequences
-
-
Gusfield, D.1
-
15
-
-
0039141039
-
The application of linguistic processing to automatic abstract generation
-
F.C. Johnson, C.D. Paice, W. Black, A. Neal, The application of linguistic processing to automatic abstract generation, J. Document and Text Management 1 (1993) 215-241.
-
(1993)
J. Document and Text Management
, vol.1
, pp. 215-241
-
-
Johnson, F.C.1
Paice, C.D.2
Black, W.3
Neal, A.4
-
16
-
-
0029193387
-
A trainable document summarizer
-
ACM Press
-
J.M. Kupiec, J. Pedersen, F. Chen, A trainable document summarizer, in: Proc. ACM SIGIR Conference on Research and Development in Information Retrieval, ACM Press, 1995, pp. 68-73.
-
(1995)
Proc. ACM SIGIR Conference on Research and Development in Information Retrieval
, pp. 68-73
-
-
Kupiec, J.M.1
Pedersen, J.2
Chen, F.3
-
17
-
-
0032647886
-
Offline dictionary-based compression
-
IEEE Press, Los Alamitos, CA
-
N.J. Larsson, A. Moffat, Offline dictionary-based compression, in: Proc. Data Compression Conference, IEEE Press, Los Alamitos, CA, 1999, pp. 296-305.
-
(1999)
Proc. Data Compression Conference
, pp. 296-305
-
-
Larsson, N.J.1
Moffat, A.2
-
21
-
-
0032010306
-
Collaborative, programmable intelligent agents
-
B.A. Nardi, J.R. Miller, D.J. Wright, Collaborative, programmable intelligent agents, Comm. ACM 41 (3) (1998) 96-104.
-
(1998)
Comm. ACM
, vol.41
, Issue.3
, pp. 96-104
-
-
Nardi, B.A.1
Miller, J.R.2
Wright, D.J.3
-
22
-
-
0002044093
-
Identifying hierarchical structure in sequences: A linear-time algorithm
-
C.G. Nevill-Manning, I.H. Witten, Identifying hierarchical structure in sequences: a linear-time algorithm, J. Artificial Intelligence Res. 7 (1997) 67-82.
-
(1997)
J. Artificial Intelligence Res.
, vol.7
, pp. 67-82
-
-
Nevill-Manning, C.G.1
Witten, I.H.2
-
23
-
-
0031702890
-
Phrase hierarchy inference and compression in bounded space
-
J.A. Storer, M. Cohn (Eds.), IEEE Press, Los Alamitos, CA
-
C.G. Nevill-Manning, I.H. Witten, Phrase hierarchy inference and compression in bounded space, in: J.A. Storer, M. Cohn (Eds.), Proc. Data Compression Conference, IEEE Press, Los Alamitos, CA, 1998, pp. 179-188.
-
(1998)
Proc. Data Compression Conference
, pp. 179-188
-
-
Nevill-Manning, C.G.1
Witten, I.H.2
-
24
-
-
0000033413
-
Lexically-generated subject hierarchies for browsing large collections
-
C.G. Nevill-Manning, I.H. Witten, G.W. Paynter, Lexically-generated subject hierarchies for browsing large collections, Internat. J. Digital Libraries 2 (2-3) (1999) 111-123.
-
(1999)
Internat. J. Digital Libraries
, vol.2
, Issue.2-3
, pp. 111-123
-
-
Nevill-Manning, C.G.1
Witten, I.H.2
Paynter, G.W.3
-
25
-
-
10644257127
-
Online and offline heuristics for inferring hierarchies of repetitions in sequences
-
C.G. Nevill-Manning, I.H. Witten. Online and offline heuristics for inferring hierarchies of repetitions in sequences, Proc. IEEE 88 (11) (2000) 1745-1755.
-
(2000)
Proc. IEEE
, vol.88
, Issue.11
, pp. 1745-1755
-
-
Nevill-Manning, C.G.1
Witten, I.H.2
-
28
-
-
0001277731
-
A compression-based algorithm for Chinese word segmentation
-
W.J. Teahan, Y. Wen, R. McNab, I.H. Witten, A compression-based algorithm for Chinese word segmentation, Comput. Linguistics 26 (3) (2000) 375-393.
-
(2000)
Comput. Linguistics
, vol.26
, Issue.3
, pp. 375-393
-
-
Teahan, W.J.1
Wen, Y.2
McNab, R.3
Witten, I.H.4
-
29
-
-
0009151655
-
Text mining technology: Turning information into knowledge
-
D. Tkach, Text mining technology: Turning information into knowledge, IBM White paper, 1997.
-
(1997)
IBM White Paper
-
-
Tkach, D.1
-
30
-
-
21844478478
-
Learning algorithms for keyphrase extraction
-
P. Turney, Learning algorithms for keyphrase extraction, Information Retrieval 2 (4) (2000) 303-336.
-
(2000)
Information Retrieval
, vol.2
, Issue.4
, pp. 303-336
-
-
Turney, P.1
-
31
-
-
84935113569
-
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
-
A.J. Viterbi, Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Trans. Inform. Theory (1967) 260-269.
-
(1967)
IEEE Trans. Inform. Theory
, pp. 260-269
-
-
Viterbi, A.J.1
-
32
-
-
0032650194
-
Text mining: A new frontier for lossless compression
-
IEEE Press, Los Alamitos, CA
-
I.H. Witten, Z. Bray, M. Mahoui, W.J. Teahan, Text mining: a new frontier for lossless compression, in: Proc. Data Compression Conference, IEEE Press, Los Alamitos, CA, 1999, pp. 198-207.
-
(1999)
Proc. Data Compression Conference
, pp. 198-207
-
-
Witten, I.H.1
Bray, Z.2
Mahoui, M.3
Teahan, W.J.4
-
33
-
-
0003756969
-
-
Morgan Kaufmann, San Francisco, CA
-
I.H. Witten, A. Moffat, T.C. Bell, Managing Gigabytes: Compressing and Indexing Documents and Images, second ed., Morgan Kaufmann, San Francisco, CA, 1999.
-
(1999)
Managing Gigabytes: Compressing and Indexing Documents and Images, Second Ed.
-
-
Witten, I.H.1
Moffat, A.2
Bell, T.C.3
-
34
-
-
84982036565
-
An algorithm for the segmentation of an artificial language analogue
-
J.G. Wolff, An algorithm for the segmentation of an artificial language analogue, British J. Psychol. 66 (1975) 79-90.
-
(1975)
British J. Psychol.
, vol.66
, pp. 79-90
-
-
Wolff, J.G.1
-
35
-
-
0033891710
-
Using compression to identify acronyms in text
-
(Poster Paper), IEEE Press, Los Alamitos, CA, 2000
-
S. Yeates, D. Bainbridge, I.H. Witten, Using compression to identify acronyms in text, in: Proc. Data Compression Conference (Poster paper), IEEE Press, Los Alamitos, CA, 2000. Full version available as Working Paper 00/1, Department of Computer Science, University of Waikato.
-
Proc. Data Compression Conference
-
-
Yeates, S.1
Bainbridge, D.2
Witten, I.H.3
-
36
-
-
0033891710
-
-
Department of Computer Science, University of Waikato
-
S. Yeates, D. Bainbridge, I.H. Witten, Using compression to identify acronyms in text, in: Proc. Data Compression Conference (Poster paper), IEEE Press, Los Alamitos, CA, 2000. Full version available as Working Paper 00/1, Department of Computer Science, University of Waikato.
-
Working Paper
, vol.1
-
-
-
37
-
-
0017493286
-
A universal algorithm for sequential data compression
-
J. Ziv, A. Lempel, A universal algorithm for sequential data compression, IEEE Trans. Inform. Theory IT-23 (3) (1977) 337-343.
-
(1977)
IEEE Trans. Inform. Theory
, vol.IT-23
, Issue.3
, pp. 337-343
-
-
Ziv, J.1
Lempel, A.2
-
38
-
-
0018019231
-
Compression of individual sequences via variable-rate coding
-
J. Ziv, A. Lempel, Compression of individual sequences via variable-rate coding, IEEE Trans. Inform. Theory IT-24 (5) (1978) 530-536.
-
(1978)
IEEE Trans. Inform. Theory
, vol.IT-24
, Issue.5
, pp. 530-536
-
-
Ziv, J.1
Lempel, A.2
|