



Volume 9, 2008, Pages 23-46

Linear-time computation of similarity measures for sequential data

Author keywords

Learning with sequential data; String distances; String kernels

Indexed keywords

COMPUTATION THEORY; EMBEDDED SYSTEMS; FORMAL LANGUAGES; LEARNING SYSTEMS; LINEAR SYSTEMS; WORD PROCESSING;

EID: 38949156579     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (88)

References (79)
  • 3
    • 0026966646
    • A training algorithm for optimal margin classifiers
    • D. Haussler, editor, ACM Press
    • B. Boser, I. Guyon, and V. Vapnik. A training algorithm for optimal margin classifiers. In D. Haussler, editor, Proceedings of COLT, pages 144-152. ACM Press, 1992.
    • (1992) Proceedings of COLT , pp. 144-152
    • Boser, B.1    Guyon, I.2    Vapnik, V.3
  • 4
    • 0002636321
    • N-gram-based text categorization
    • Las Vegas, NV, USA, Apr
    • W. B. Cavnar and J. M. Trenkle. N-gram-based text categorization. In Proceedings of SDAIR, pages 161-175, Las Vegas, NV, USA, Apr. 1994.
    • (1994) Proceedings of SDAIR , pp. 161-175
    • Cavnar, W.B.1    Trenkle, J.M.2
  • 8
    • 0028911698
    • Gauging similarity with n-grams: Language-independent categorization of text
    • M. Damashek. Gauging similarity with n-grams: Language-independent categorization of text. Science, 267(5199):843-848, 1995.
    • (1995) Science , vol.267 , Issue.5199 , pp. 843-848
    • Damashek, M.1
  • 11
  • 12
    • 4444288656
    • Kernels and distances for structured data
    • T. Gärtner, J. Lloyd, and P. Flach. Kernels and distances for structured data. Machine Learning, 57(3):205-232, 2004.
    • (2004) Machine Learning , vol.57 , Issue.3 , pp. 205-232
    • Gärtner, T.1    Lloyd, J.2    Flach, P.3
  • 16
    • 84943817322
    • Error-detecting and error-correcting codes
    • R. W. Hamming. Error-detecting and error-correcting codes. Bell System Technical Journal, 29(2):147-160, 1950.
    • (1950) Bell System Technical Journal , vol.29 , Issue.2 , pp. 147-160
    • Hamming, R.W.1
  • 19
    • 38949179322
    • Introduction to Automata Theory, Languages, and Computation
    • J. Hopcroft, R. Motwani, and J. Ullman. Introduction to Automata Theory, Languages, and Computation. Addison-Wesley, 2nd edition, 2001.
  • 23
    • 84957069814
    • Text categorization with support vector machines: Learning with many relevant features
    • Springer
    • T. Joachims. Text categorization with support vector machines: Learning with many relevant features. In Proceedings of ECML, pages 137-142. Springer, 1998.
    • (1998) Proceedings of ECML , pp. 137-142
    • Joachims, T.1
  • 24
    • 14744279495
    • Efficient substring traversal with suffix arrays
    • Technical report, 185, Department of Informatics, Kyushu University
    • T. Kasai, H. Arimura, and S. Arikawa. Efficient substring traversal with suffix arrays. Technical report, 185, Department of Informatics, Kyushu University, 2001.
    • (2001)
    • Kasai, T.1    Arimura, H.2    Arikawa, S.3
  • 27
    • 33745428531
    • Feature discovery in non-metric pairwise data
    • 5(Jul):801-818, July
    • J. Laub and K.-R. Müller. Feature discovery in non-metric pairwise data. Journal of Machine Learning Research, 5(Jul):801-818, July 2004.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 801-818
    • Laub, J.1    Müller, K.-R.2
  • 28
    • 33745428009
    • On the information and representation of noneuclidean pairwise data
    • Oct
    • J. Laub, V. Roth, J. Buhmann, and K.-R. Müller. On the information and representation of noneuclidean pairwise data. Pattern Recognition, 39(10):1815-1826, Oct. 2006.
    • (2006) Pattern Recognition , vol.39 , Issue.10 , pp. 1815-1826
    • Laub, J.1    Roth, V.2    Buhmann, J.3    Müller, K.-R.4
  • 29
    • 84883575579
    • Fast string kernels using inexact matching for protein sequences
    • C. Leslie and R. Kuang. Fast string kernels using inexact matching for protein sequences. Journal of Machine Learning Research, 5:1435-1455, 2004.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 1435-1455
    • Leslie, C.1    Kuang, R.2
  • 30
    • 0036358995
    • The spectrum kernel: A string kernel for SVM protein classification
    • C. Leslie, E. Eskin, and W. Noble. The spectrum kernel: A string kernel for SVM protein classification. In Proc. Pacific Symp. Biocomputing, pages 564-575, 2002.
    • (2002) Proc. Pacific Symp. Biocomputing , pp. 564-575
    • Leslie, C.1    Eskin, E.2    Noble, W.3
  • 31
    • 25144516978
    • Mismatch string kernel for discriminative protein classification
    • C. Leslie, E. Eskin, A. Cohen, J. Weston, and W. Noble. Mismatch string kernel for discriminative protein classification. Bioinformatics, 1(1):1-10, 2003.
    • (2003) Bioinformatics , vol.1 , Issue.1 , pp. 1-10
    • Leslie, C.1    Eskin, E.2    Cohen, A.3    Weston, J.4    Noble, W.5
  • 32
    • 0000390142
    • Binary codes capable of correcting deletions, insertions, and reversals
    • V. I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. Doklady Akademii Nauk SSSR, 163(4):845-848, 1966.
    • (1966) Doklady Akademii Nauk SSSR , vol.163 , Issue.4 , pp. 845-848
    • Levenshtein, V.I.1
  • 34
    • 0742287001
    • Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships
    • L. Liao and W. Noble. Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships. Journal of Computational Biology, 10:857-868, 2003.
    • (2003) Journal of Computational Biology , vol.10 , pp. 857-868
    • Liao, L.1    Noble, W.2
  • 35
    • 0034301517
    • The 1999 DARPA off-line intrusion detection evaluation
    • R. Lippmann, J. Haines, D. Fried, J. Korba, and K. Das. The 1999 DARPA off-line intrusion detection evaluation. Computer Networks, 34(4):579-595, 2000.
    • (2000) Computer Networks , vol.34 , Issue.4 , pp. 579-595
    • Lippmann, R.1    Haines, J.2    Fried, D.3    Korba, J.4    Das, K.5
  • 37
    • 0027681165
    • Suffix arrays: A new method for on-line string searches
    • U. Manber and G. Myers. Suffix arrays: a new method for on-line string searches. SIAM Journal on Computing, 22(5):935-948, 1993.
    • (1993) SIAM Journal on Computing , vol.22 , Issue.5 , pp. 935-948
    • Manber, U.1    Myers, G.2
  • 38
    • 51149095385
    • An efficient, versatile approach to suffix sorting
    • Article
    • M. Maniscalco and S. Puglisi. An efficient, versatile approach to suffix sorting. Journal of Experimental Algorithmics, 12, Article No. 1.2, 2007.
    • (2007) Journal of Experimental Algorithmics , vol.12 , Article No. 1.2
    • Maniscalco, M.1    Puglisi, S.2
  • 39
    • 21044442250
    • Engineering a lightweight suffix array construction algorithm
    • G. Manzini and P. Ferragina. Engineering a lightweight suffix array construction algorithm. Algorithmica, 40:33-50, 2004.
    • (2004) Algorithmica , vol.40 , pp. 33-50
    • Manzini, G.1    Ferragina, P.2
  • 40
  • 41
    • 0016942292
    • A space-economical suffix tree construction algorithm
    • E. M. McCreight. A space-economical suffix tree construction algorithm. Journal of the ACM, 23(2):262-272, 1976.
    • (1976) Journal of the ACM , vol.23 , Issue.2 , pp. 262-272
    • McCreight, E.M.1
  • 42
    • 85019691440
    • Testing intrusion detection systems: A critique of the 1998 and 1999 DARPA intrusion detection system evaluations as performed by Lincoln Laboratory
    • J. McHugh. Testing intrusion detection systems: a critique of the 1998 and 1999 DARPA intrusion detection system evaluations as performed by Lincoln Laboratory. ACM Transactions on Information and System Security, 3(4):262-294, 2000.
    • (2000) ACM Transactions on Information and System Security , vol.3 , Issue.4 , pp. 262-294
    • McHugh, J.1
  • 45
    • 0014757386
    • A general method applicable to the search for similarities in the amino acid sequence of two proteins
    • S. B. Needleman and C. D. Wunsch. A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology, 48:443-453, 1970.
    • (1970) Journal of Molecular Biology , vol.48 , pp. 443-453
    • Needleman, S.B.1    Wunsch, C.D.2
  • 46
    • 13244273801
    • Building kernels from binary strings for image matching
    • F. Odone, A. Barla, and A. Verri. Building kernels from binary strings for image matching. IEEE Transactions on Image Processing, 14(2):169-180, 2005.
    • (2005) IEEE Transactions on Image Processing , vol.14 , Issue.2 , pp. 169-180
    • Odone, F.1    Barla, A.2    Verri, A.3
  • 48
    • 14344254996
    • Learning with non-positive kernels
    • R. Greiner and D. Schuurmans, editors, ACM Press
    • C. Ong, X. Mary, S. Canu, and A. Smola. Learning with non-positive kernels. In R. Greiner and D. Schuurmans, editors, Proceedings of ICML, pages 639-646. ACM Press, 2004.
    • (2004) Proceedings of ICML , pp. 639-646
    • Ong, C.1    Mary, X.2    Canu, S.3    Smola, A.4
  • 49
    • 29144515642
    • Accurate splice site prediction for Caenorhabditis elegans
    • MIT Press
    • G. Rätsch and S. Sonnenburg. Accurate splice site prediction for Caenorhabditis elegans. In Kernel Methods in Computational Biology, pages 277-298. MIT Press, 2004.
    • (2004) Kernel Methods in Computational Biology , pp. 277-298
    • Rätsch, G.1    Sonnenburg, S.2
  • 51
    • 33846910249
    • Language models for detection of unknown attacks in network traffic
    • K. Rieck and P. Laskov. Language models for detection of unknown attacks in network traffic. Journal in Computer Virology, 2(4):243-256, 2007.
    • (2007) Journal in Computer Virology , vol.2 , Issue.4 , pp. 243-256
    • Rieck, K.1    Laskov, P.2
  • 52
    • 33750258120
    • Efficient algorithms for similarity measures over sequential data: A look beyond kernels
    • Pattern Recognition, Proc. of 28th DAGM Symposium, Sept
    • K. Rieck, P. Laskov, and K.-R. Müller. Efficient algorithms for similarity measures over sequential data: A look beyond kernels. In Pattern Recognition, Proc. of 28th DAGM Symposium, LNCS, pages 374-383, Sept. 2006.
    • (2006) LNCS , pp. 374-383
    • Rieck, K.1    Laskov, P.2    Müller, K.-R.3
  • 53
    • 38949213119
    • Computation of similarity measures for sequential data using generalized suffix trees
    • Cambridge, MA, MIT Press
    • K. Rieck, P. Laskov, and S. Sonnenburg. Computation of similarity measures for sequential data using generalized suffix trees. In Advances in Neural Information Processing Systems 19, pages 1177-1184, Cambridge, MA, 2007. MIT Press.
    • (2007) Advances in Neural Information Processing Systems 19 , pp. 1177-1184
    • Rieck, K.1    Laskov, P.2    Sonnenburg, S.3
  • 54
    • 0347380866
    • Optimal cluster preserving embedding of nonmetric proximity data
    • Dec
    • V. Roth, J. Laub, M. Kawanabe, and J. Buhmann. Optimal cluster preserving embedding of nonmetric proximity data. IEEE Trans. PAMI, 25:1540-1551, Dec. 2003.
    • (2003) IEEE Trans. PAMI , vol.25 , pp. 1540-1551
    • Roth, V.1    Laub, J.2    Kawanabe, M.3    Buhmann, J.4
  • 55
    • 24944584599
    • Efficient computation of gapped substring kernels for large alphabets
    • J. Rousu and J. Shawe-Taylor. Efficient computation of gapped substring kernels for large alphabets. Journal of Machine Learning Research, 6:1323-1344, 2005.
    • (2005) Journal of Machine Learning Research , vol.6 , pp. 1323-1344
    • Rousu, J.1    Shawe-Taylor, J.2
  • 56
    • 0018446498
    • Mathematics and information retrieval
    • G. Salton. Mathematics and information retrieval. Journal of Documentation, 35(1):1-29, 1979.
    • (1979) Journal of Documentation , vol.35 , Issue.1 , pp. 1-29
    • Salton, G.1
  • 57
    • 0016572913
    • A vector space model for automatic indexing
    • G. Salton, A. Wong, and C. Yang. A vector space model for automatic indexing. Communications of the ACM, 18(11):613-620, 1975.
    • (1975) Communications of the ACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Wong, A.2    Yang, C.3
  • 61
    • 0347243182
    • Nonlinear component analysis as a kernel eigenvalue problem
    • B. Schölkopf, A. Smola, and K.-R. Müller. Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation, 10:1299-1319, 1998.
    • (1998) Neural Computation , vol.10 , pp. 1299-1319
    • Schölkopf, B.1    Smola, A.2    Müller, K.-R.3
  • 63
    • 0019887799
    • Identification of common molecular subsequences
    • T. F. Smith and M. S. Waterman. Identification of common molecular subsequences. Journal of Molecular Biology, 147:195-197, 1981.
    • (1981) Journal of Molecular Biology , vol.147 , pp. 195-197
    • Smith, T.F.1    Waterman, M.S.2
  • 65
    • 33747871991
    • ARTS: Accurate Recognition of Transcription Starts in Human
    • S. Sonnenburg, A. Zien, and G. Rätsch. ARTS: Accurate Recognition of Transcription Starts in Human. Bioinformatics, 22(14):e472-e480, 2006.
    • (2006) Bioinformatics , vol.22 , Issue.14 , pp. e472-e480
    • Sonnenburg, S.1    Zien, A.2    Rätsch, G.3
  • 67
    • 0017952955
    • N-gram statistics for natural language understanding and text processing
    • Apr
    • C. Y. Suen. N-gram statistics for natural language understanding and text processing. IEEE Trans. Pattern Analysis and Machine Intelligence, 1(2):164-172, Apr. 1979.
    • (1979) IEEE Trans. Pattern Analysis and Machine Intelligence , vol.1 , Issue.2 , pp. 164-172
    • Suen, C.Y.1
  • 69
    • 34250766728
    • Fast and space efficient string kernels using suffix arrays
    • ACM Press
    • C. Teo and S. Vishwanathan. Fast and space efficient string kernels using suffix arrays. In Proceedings of ICML, pages 929-936. ACM Press, 2006.
    • (2006) Proceedings of ICML , pp. 929-936
    • Teo, C.1    Vishwanathan, S.2
  • 71
    • 0001704377
    • Online construction of suffix trees
    • E. Ukkonen. Online construction of suffix trees. Algorithmica, 14(3):249-260, 1995.
    • (1995) Algorithmica , vol.14 , Issue.3 , pp. 249-260
    • Ukkonen, E.1
  • 74
    • 33749236901
    • Fast kernels for string and tree matching
    • K. Tsuda, B. Schölkopf, and J. Vert, editors, MIT Press
    • S. Vishwanathan and A. Smola. Fast kernels for string and tree matching. In K. Tsuda, B. Schölkopf, and J. Vert, editors, Kernels and Bioinformatics, pages 113-130. MIT Press, 2004.
    • (2004) Kernels and Bioinformatics , pp. 113-130
    • Vishwanathan, S.1    Smola, A.2
  • 76
    • 0002531715
    • Dynamic alignment kernels
    • A. Smola, P. Bartlett, B. Schölkopf, and D. Schuurmans, editors, Cambridge, MA, MIT Press
    • C. Watkins. Dynamic alignment kernels. In A. Smola, P. Bartlett, B. Schölkopf, and D. Schuurmans, editors, Advances in Large Margin Classifiers, pages 39-50, Cambridge, MA, 2000. MIT Press.
    • (2000) Advances in Large Margin Classifiers , pp. 39-50
    • Watkins, C.1
  • 79
    • 0033670134
    • Engineering Support Vector Machine Kernels That Recognize Translation Initiation Sites
    • Sept
    • A. Zien, G. Rätsch, S. Mika, B. Schölkopf, T. Lengauer, and K.-R. Müller. Engineering Support Vector Machine Kernels That Recognize Translation Initiation Sites. Bioinformatics, 16(9):799-807, Sept. 2000.
    • (2000) Bioinformatics , vol.16 , Issue.9 , pp. 799-807
    • Zien, A.1    Rätsch, G.2    Mika, S.3    Schölkopf, B.4    Lengauer, T.5    Müller, K.-R.6


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.