Volume 20, Issue 7, 2013, Pages 471-485

A geometric interpretation for local alignment-free sequence comparison

Author keywords

algorithms; alignment; dynamic programming; metagenomics

Indexed keywords

ALGORITHM; ARTICLE; COMPUTER PROGRAM; COMPUTER SIMULATION; DNA SEQUENCE; HUMAN; METHODOLOGY; SEQUENCE ALIGNMENT; SEQUENCE ANALYSIS; STATISTICAL ANALYSIS;

EID: 84880126573     PISSN: 10665277     EISSN: None     Source Type: Journal    
DOI: 10.1089/cmb.2012.0280     Document Type: Article
Times cited: 10

References (46)
  • 2
    • 33751429178
    • The evolution of two-component systems in bacteria reveals different strategies for niche adaptation
    • Alm, E., Huang, K., and Arkin, A. 2006. The evolution of two-component systems in bacteria reveals different strategies for niche adaptation. PLoS Comput. Biol. 2, e143.
    • (2006) PLoS Comput. Biol. , vol.2
    • Alm, E.1    Huang, K.2    Arkin, A.3
  • 3
    • 0025183708
    • Basic local alignment search tool
    • Altschul, S., Gish, W., Miller, W., et al. 1990. Basic local alignment search tool. J. Mol. Biol. 215, 403-410.
    • (1990) J. Mol. Biol. , vol.215 , pp. 403-410
    • Altschul, S.1    Gish, W.2    Miller, W.3
  • 4
    • 0030801002
    • Gapped BLAST and PSI-BLAST: A new generation of protein database search programs
    • Altschul, S., Madden, T., Schaffer, A., et al. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389-3402.
    • (1997) Nucleic Acids Res. , vol.25 , pp. 3389-3402
    • Altschul, S.1    Madden, T.2    Schaffer, A.3
  • 5
    • 0037154273
    • Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome
    • Berman, B., Nibu, Y., Pfeiffer, B., et al. 2002. Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome. Proc. Natl. Acad. Sci. 99, 757-762.
    • (2002) Proc. Natl. Acad. Sci. , vol.99 , pp. 757-762
    • Berman, B.1    Nibu, Y.2    Pfeiffer, B.3
  • 6
    • 0035024494
    • Efficient large-scale sequence comparison by locality-sensitive hashing
    • Buhler, J. 2001. Efficient large-scale sequence comparison by locality-sensitive hashing. Bioinformatics 17, 419-428.
    • (2001) Bioinformatics , vol.17 , pp. 419-428
    • Buhler, J.1
  • 9
    • 79957873906
    • Alignment-free detection of local similarity among viral and bacterial genomes
    • Domazet-Lošo, M., and Haubold, B. 2011. Alignment-free detection of local similarity among viral and bacterial genomes. Bioinformatics 27, 1466-1472.
    • (2011) Bioinformatics , vol.27 , pp. 1466-1472
    • Domazet-Lošo, M.1    Haubold, B.2
  • 10
    • 33244462202
    • Scalable partitioning and exploration of chemical spaces using geometric hashing
    • Dutta, D., Guha, R., Jurs, P., et al. 2006. Scalable partitioning and exploration of chemical spaces using geometric hashing. J. Chem. Inf. Model. 46, 321-333.
    • (2006) J. Chem. Inf. Model. , vol.46 , pp. 321-333
    • Dutta, D.1    Guha, R.2    Jurs, P.3
  • 11
    • 57149115764
    • Empirical distribution of k-word matches in biological sequences
    • Forêt, S., Wilson, S., and Burden, C. 2009. Empirical distribution of k-word matches in biological sequences. Pattern Recognition 42, 539-548.
    • (2009) Pattern Recognition , vol.42 , pp. 539-548
    • Forêt, S.1    Wilson, S.2    Burden, C.3
  • 12
    • 84893574327
    • Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming
    • Goemans, M., and Williamson, D. 1995. Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. Journal of the ACM 42, 1115-1145.
    • (1995) Journal of the ACM , vol.42 , pp. 1115-1145
    • Goemans, M.1    Williamson, D.2
  • 13
    • 84857867828
    • Estimation of pairwise sequence similarity of mammalian enhancers with word neighbourhood counts
    • Göke, J., Schulz, M., Lasserre, J., et al. 2012. Estimation of pairwise sequence similarity of mammalian enhancers with word neighbourhood counts. Bioinformatics 28, 656-663.
    • (2012) Bioinformatics , vol.28 , pp. 656-663
    • Göke, J.1    Schulz, M.2    Lasserre, J.3
  • 14
    • 79951527464
    • Alignment-free estimation of nucleotide diversity
    • Haubold, B., Reed, F., and Pfaffelhuber, P. 2011. Alignment-free estimation of nucleotide diversity. Bioinformatics 27, 449-455.
    • (2011) Bioinformatics , vol.27 , pp. 449-455
    • Haubold, B.1    Reed, F.2    Pfaffelhuber, P.3
  • 16
    • 0013404624
    • [Ph.D. thesis], Department of Computer Science, Stanford University, Stanford, CA
    • Indyk, P. 2001. High-dimensional computational geometry. [Ph.D. thesis], Department of Computer Science, Stanford University, Stanford, CA.
    • (2001) High-dimensional Computational Geometry
    • Indyk, P.1
  • 19
    • 34547844142
    • A statistical method for alignment-free comparison of regulatory sequences
    • Kantorovitz, M., Robinson, G., and Sinha, S. 2007. A statistical method for alignment-free comparison of regulatory sequences. Bioinformatics 23, 1249-1255.
    • (2007) Bioinformatics , vol.23 , pp. 1249-1255
    • Kantorovitz, M.1    Robinson, G.2    Sinha, S.3
  • 20
    • 0025259313
    • Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes
    • Karlin, S., and Altschul, S. 1990. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc. Natl. Acad. Sci. 87, 2264-2268.
    • (1990) Proc. Natl. Acad. Sci. , vol.87 , pp. 2264-2268
    • Karlin, S.1    Altschul, S.2
  • 21
    • 80053321076
    • Improved accuracy of supervised CRM discovery with interpolated Markov models and cross-species comparison
    • Kazemian, M., Zhu, Q., Halfon, M., et al. 2011. Improved accuracy of supervised CRM discovery with interpolated Markov models and cross-species comparison. Nucleic Acids Res. 39, 9463-9472.
    • (2011) Nucleic Acids Res , vol.39 , pp. 9463-9472
    • Kazemian, M.1    Zhu, Q.2    Halfon, M.3
  • 22
    • 0040304282
    • A simple randomized sieve algorithm for the closest-pair problem
    • Khuller, S., and Matias, Y. 1995. A simple randomized sieve algorithm for the closest-pair problem. Information and Computation 118, 34-37.
    • (1995) Information and Computation , vol.118 , pp. 34-37
    • Khuller, S.1    Matias, Y.2
  • 23
    • 0037195172
    • Distributional regimes for the number of k-word matches between two random sequences
    • Lippert, R., Huang, H.Y., and Waterman, M. 2002. Distributional regimes for the number of k-word matches between two random sequences. Proc. Natl. Acad. Sci. 99, 13980-13989.
    • (2002) Proc. Natl. Acad. Sci. , vol.99 , pp. 13980-13989
    • Lippert, R.1    Huang, H.Y.2    Waterman, M.3
  • 24
    • 79959929795
    • New powerful statistics for alignment-free sequence comparison under a pattern transfer model
    • Liu, X., Wan, L., Li, J., et al. 2011. New powerful statistics for alignment-free sequence comparison under a pattern transfer model. J. Theor. Biol. 284, 106-116.
    • (2011) J. Theor. Biol. , vol.284 , pp. 106-116
    • Liu, X.1    Wan, L.2    Li, J.3
  • 25
    • 84859313678
    • Efficient large-scale protein sequence comparison and gene matching to identify orthologs and co-orthologs
    • Mahmood, K., Webb, G., Song, J., et al. 2012. Efficient large-scale protein sequence comparison and gene matching to identify orthologs and co-orthologs. Nucleic Acids Res. 40, e44.
    • (2012) Nucleic Acids Res , vol.40
    • Mahmood, K.1    Webb, G.2    Song, J.3
  • 26
    • 77957355684
    • High frequency of horizontal gene transfer in the oceans
    • McDaniel, L., Young, E., Delaney, J., et al. 2010. High frequency of horizontal gene transfer in the oceans. Science 330, 50.
    • (2010) Science , vol.330 , pp. 50
    • McDaniel, L.1    Young, E.2    Delaney, J.3
  • 27
    • 78149408021
    • Massive turnover of functional sequence in human and other mammalian genomes
    • Meader, S., Ponting, C., and Lunter, G. 2010. Massive turnover of functional sequence in human and other mammalian genomes. Genome Res. 20, 1335-1343.
    • (2010) Genome Res , vol.20 , pp. 1335-1343
    • Meader, S.1    Ponting, C.2    Lunter, G.3
  • 28
    • 0004168557
    • Cambridge University Press, Cambridge, United Kingdom
    • Motwani, R., and Raghavan, P. 1995. Randomized algorithms. Cambridge University Press, Cambridge, United Kingdom.
    • (1995) Randomized Algorithms
    • Motwani, R.1    Raghavan, P.2
  • 29
    • 0004918181
    • A note on a method for generating points uniformly on n-dimensional spheres
    • Muller, M. 1959. A note on a method for generating points uniformly on n-dimensional spheres. Communications of the Association for Computing Machinery 2, 19-20.
    • (1959) Communications of the Association for Computing Machinery , vol.2 , pp. 19-20
    • Muller, M.1
  • 30
    • 0023989064
    • Improved tools for biological sequence comparison
    • Pearson, W., and Lipman, D. 1988. Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. 85, 2444-2448.
    • (1988) Proc. Natl. Acad. Sci. , vol.85 , pp. 2444-2448
    • Pearson, W.1    Lipman, D.2
  • 33
    • 75149164526
    • Alignment-free sequence comparison (I): Statistics and power
    • Reinert, G., Chew, D., Sun, F., et al. 2009. Alignment-free sequence comparison (I): statistics and power. J. Comput. Biol. 16, 1615-1634.
    • (2009) J. Comput. Biol. , vol.16 , pp. 1615-1634
    • Reinert, G.1    Chew, D.2    Sun, F.3
  • 34
    • 79957787637
    • Whole-genome phylogeny of Escherichia coli/Shigella group by feature frequency profiles (FFPs)
    • Sims, G., and Kim, S. 2011. Whole-genome phylogeny of Escherichia coli/Shigella group by feature frequency profiles (FFPs). Proc. Natl. Acad. Sci. 108, 8329-8334.
    • (2011) Proc. Natl. Acad. Sci. , vol.108 , pp. 8329-8334
    • Sims, G.1    Kim, S.2
  • 35
    • 16344388566
    • Sequence turnover and tandem repeats in cis-regulatory modules in Drosophila
    • Sinha, S., and Siggia, E. 2005. Sequence turnover and tandem repeats in cis-regulatory modules in Drosophila. Mol. Biol. Evol. 22, 874-885.
    • (2005) Mol. Biol. Evol. , vol.22 , pp. 874-885
    • Sinha, S.1    Siggia, E.2
  • 36
    • 0019887799
    • Identification of common molecular subsequences
    • Smith, T., and Waterman, M. 1981. Identification of common molecular subsequences. J. Mol. Biol. 147, 195-197.
    • (1981) J. Mol. Biol. , vol.147 , pp. 195-197
    • Smith, T.1    Waterman, M.2
  • 38
    • 79959887501
    • Genome-wide identification of conserved regulatory function in diverged sequences
    • Taher, L., McGaughey, D., Maragh, S., et al. 2011. Genome-wide identification of conserved regulatory function in diverged sequences. Genome Res. 21, 1139-1149.
    • (2011) Genome Res , vol.21 , pp. 1139-1149
    • Taher, L.1    McGaughey, D.2    Maragh, S.3
  • 40
    • 85170194974
    • Computation of D2: A measure of sequence dissimilarity
    • Torney, D., Burks, C., Davison, D., et al. 1990. Computation of D2: a measure of sequence dissimilarity. Computers and DNA, 109-125.
    • (1990) Computers and DNA , pp. 109-125
    • Torney, D.1    Burks, C.2    Davison, D.3
  • 42
    • 79952276786
    • Is transcription factor binding site turnover a sufficient explanation for cis-regulatory sequence divergence?
    • Venkataram, S., and Fay, J. 2010. Is transcription factor binding site turnover a sufficient explanation for cis-regulatory sequence divergence? Genome Biol Evol. 2, 851-858.
    • (2010) Genome Biol Evol. , vol.2 , pp. 851-858
    • Venkataram, S.1    Fay, J.2
  • 43
    • 78349292948
    • Alignment-free sequence comparison (II): Theoretical power of comparison statistics
    • Wan, L., Reinert, G., Sun, F., et al. 2010. Alignment-free sequence comparison (II): theoretical power of comparison statistics. J. Comput. Biol. 17, 1467-1490.
    • (2010) J. Comput. Biol. , vol.17 , pp. 1467-1490
    • Wan, L.1    Reinert, G.2    Sun, F.3
  • 44
    • 0028234758
    • Rapid and accurate estimates of statistical significance for sequence data base searches
    • Waterman, M., and Vingron, M. 1994. Rapid and accurate estimates of statistical significance for sequence data base searches. Proc. Natl. Acad. Sci. 91, 4625-4628.
    • (1994) Proc. Natl. Acad. Sci. , vol.91 , pp. 4625-4628
    • Waterman, M.1    Vingron, M.2
  • 45
    • 0001154535
    • On constructing minimum spanning trees in k-dimensional spaces and related problems
    • Yao, A. 1982. On constructing minimum spanning trees in k-dimensional spaces and related problems. SIAM J. Comput. 11, 721-736.
    • (1982) SIAM J. Comput. , vol.11 , pp. 721-736
    • Yao, A.1
  • 46
    • 0032169849
    • Protein sequence similarity searches using patterns as seeds
    • Zhang, Z., Schaffer, A., Miller, W., et al. 1998. Protein sequence similarity searches using patterns as seeds. Nucleic Acids Res. 26, 3986-3990.
    • (1998) Nucleic Acids Res , vol.26 , pp. 3986-3990
    • Zhang, Z.1    Schaffer, A.2    Miller, W.3


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.