메뉴 건너뛰기




Volumn 43, Issue 3, 2007, Pages 769-790

Using structural contexts to compress semistructured text collections

Author keywords

Compressed text databases; Semistructured documents; Text compression

Indexed keywords

DATA STRUCTURES; DATABASE SYSTEMS; HEURISTIC METHODS; INFORMATION RETRIEVAL; NATURAL LANGUAGE PROCESSING SYSTEMS; RANDOM ACCESS STORAGE;

EID: 33846470515     PISSN: 03064573     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ipm.2006.07.001     Document Type: Article
Times cited : (11)

References (35)
  • 1
    • 2642583476 scopus 로고    scopus 로고
    • Adiego, J., Navarro, G., & de la Fuente, P. (2004). Lempel-Ziv compression of structured text. In Proceedings 14th IEEE data compression conference (DCC'04) (pp. 112-121).
  • 2
    • 33846497677 scopus 로고    scopus 로고
    • Adiego, J., Navarro, G., & de la Fuente, P. (in press). Lempel-ziv compression of highly structured documents. Journal of the American Society for Information Science and Technology (JASIST).
  • 5
    • 0142249976 scopus 로고    scopus 로고
    • (s,c)-Dense coding: an optimized compression code for natural language text databases
    • Proceedings of 10th international symposium on string processing and information retrieval (SPIRE 2003), Springer
    • Brisaboa N., Fariña A., Navarro G., and Esteller M. (s,c)-Dense coding: an optimized compression code for natural language text databases. Proceedings of 10th international symposium on string processing and information retrieval (SPIRE 2003). LNCS 2857 (2003), Springer 122-136
    • (2003) LNCS 2857 , pp. 122-136
    • Brisaboa, N.1    Fariña, A.2    Navarro, G.3    Esteller, M.4
  • 6
    • 33846538288 scopus 로고    scopus 로고
    • Burrows, M., & Wheeler, D. (1994). A block sorting lossless data compression algorithm. Technical Report 124, Digital Equipment Corporation.
  • 7
    • 0035008559 scopus 로고    scopus 로고
    • Cheney, J. (2001). Compressing XML with multiplexed hierarchical PPM models. In Proceedings of 11th IEEE data compression conference (DCC'01) (pp. 163-172).
  • 8
    • 0021405335 scopus 로고
    • Data compression using adaptive coding and partial string matching
    • Cleary J., and Witten I. Data compression using adaptive coding and partial string matching. IEEE Transactions on Communication 32 (1984) 396-402
    • (1984) IEEE Transactions on Communication , vol.32 , pp. 396-402
    • Cleary, J.1    Witten, I.2
  • 9
    • 84958980272 scopus 로고    scopus 로고
    • Word-based compression methods and indexing for text retrieval systems
    • Proceedings of 2nd East European symposium on advances in databases and information systems (ADBIS'99), Springer
    • Dvorský J., Pokorný J., and Snásel V. Word-based compression methods and indexing for text retrieval systems. Proceedings of 2nd East European symposium on advances in databases and information systems (ADBIS'99). LNCS 1691 (1999), Springer 75-84
    • (1999) LNCS 1691 , pp. 75-84
    • Dvorský, J.1    Pokorný, J.2    Snásel, V.3
  • 10
    • 0033683372 scopus 로고    scopus 로고
    • Girardot, M., & Sundaresan, N. (2000). Millau: an encoding format for efficient representation and exchange of XML documents over the WWW. In Proceedings of 9th international World wide Web conference on computer networks (pp. 747-765).
  • 11
    • 33846470837 scopus 로고    scopus 로고
    • Harman, D. (1995). Overview of the Third Text REtrieval Conference. In Proceedings of third Text REtrieval Conference (TREC-3) (pp. 1-19). NIST Special Publication 500-207.
  • 13
    • 84959015234 scopus 로고    scopus 로고
    • Horspool, R., & Cormack, G. (1992). Constructing word-based text compression algorithms. In Proceedings of 2nd IEEE data compression conference (DCC'92) (pp. 62-71).
  • 14
    • 84938015047 scopus 로고    scopus 로고
    • Huffman, D. (1952). A method for the construction of minimum-redundancy codes. In Proceedings of the Institute of Radio Engineers, 40(9), 1098-1101.
  • 15
    • 33846507731 scopus 로고    scopus 로고
    • Lam, W., Wood, P., & Levene, M. (2003). XCQ: XML compression and querying system. In Proceedings of 12th international conference on the World wide Web (WWW'03). Poster.
  • 16
    • 33846544122 scopus 로고    scopus 로고
    • Levene, M., & Wood, P. (2002). XML structure compression. In Proceedings of 2nd international workshop on Web dynamics.
  • 17
    • 0039785296 scopus 로고    scopus 로고
    • Liefke, H., & Suciu, D. (2000). XMill: an efficient compressor for XML data. In Proceedings of international ACM conference on management of data (SIGMOD'00) (pp. 153-164).
  • 19
    • 2642527811 scopus 로고    scopus 로고
    • Moffat, A., & Wan, R. (2001). RE-store: a system for compressing, browsing and searching large documents. In Proceedings of 8th international symposium on string processing and information retrieval (SPIRE'01) (pp. 162-174). IEEE CS Press.
  • 23
    • 84856043672 scopus 로고
    • A mathematical theory of communication
    • Shannon C. A mathematical theory of communication. Bell System Technical Journal 27 (1948) 398-403
    • (1948) Bell System Technical Journal , vol.27 , pp. 398-403
    • Shannon, C.1
  • 24
    • 84948416199 scopus 로고    scopus 로고
    • Shkarin, D. (2002). PPM: one step to practicality. In Proceedings of 12th IEEE data compression conference (DCC 2002) (pp. 202-211).
  • 26
    • 0036203985 scopus 로고    scopus 로고
    • Tolani, P., & Haritsa, J. (2002). XGRIND: a query-friendly XML compressor. In Proceedings of 18th international conference of data engineering (ICDE'02) (pp. 225-234).
  • 27
    • 33846534491 scopus 로고    scopus 로고
    • Toman, V. (2004). Syntactical compression of XML data. Presented at 16th International conference on advanced information systems engineering (CAiSE'04), Riga, Latvia, June 7-11.
  • 28
    • 33846503016 scopus 로고    scopus 로고
    • Turpin, A., & Moffat, A. (1997). Fast file search using text compression. In Proceedings of the 20th Australian computer science conference (pp. 1-8).
  • 29
    • 0021439618 scopus 로고
    • A technique for high-performance data compression
    • Welch T. A technique for high-performance data compression. IEEE Computer 17 6 (1984) 8-19
    • (1984) IEEE Computer , vol.17 , Issue.6 , pp. 8-19
    • Welch, T.1
  • 33
    • 0342521304 scopus 로고    scopus 로고
    • Compression: a key for next-generation text retrieval systems
    • Ziviani N., Moura E., Navarro G., and Baeza-Yates R. Compression: a key for next-generation text retrieval systems. IEEE Computer 33 11 (2000) 37-44
    • (2000) IEEE Computer , vol.33 , Issue.11 , pp. 37-44
    • Ziviani, N.1    Moura, E.2    Navarro, G.3    Baeza-Yates, R.4
  • 34
    • 0017493286 scopus 로고
    • An universal algorithm for sequential data compression
    • Ziv J., and Lempel A. An universal algorithm for sequential data compression. IEEE Transactions on Information Theory 23 3 (1977) 337-343
    • (1977) IEEE Transactions on Information Theory , vol.23 , Issue.3 , pp. 337-343
    • Ziv, J.1    Lempel, A.2
  • 35
    • 0018019231 scopus 로고
    • Compression of individual sequences via variable-rate coding
    • Ziv J., and Lempel A. Compression of individual sequences via variable-rate coding. IEEE Transactions on Information Theory 24 5 (1978) 530-536
    • (1978) IEEE Transactions on Information Theory , vol.24 , Issue.5 , pp. 530-536
    • Ziv, J.1    Lempel, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.