메뉴 건너뛰기




Volumn , Issue , 2011, Pages 1223-1233

Parallel metagenomic sequence clustering via sketching and maximal quasi-clique enumeration on map-reduce clouds

Author keywords

cloud computing; MapReduce; metagenomics; next generation sequencing; quasi clique enumeration; sequence clustering; sketching

Indexed keywords

MAP-REDUCE; METAGENOMICS; NEXT GENERATION SEQUENCING; QUASI CLIQUE ENUMERATION; SEQUENCE CLUSTERING; SKETCHING;

EID: 80053243671     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2011.116     Document Type: Conference Paper
Times cited : (23)

References (48)
  • 4
    • 67349209853 scopus 로고    scopus 로고
    • Next-generation DNA sequencing techniques
    • W. Ansorge, "Next-generation DNA sequencing techniques."Nat. Biotechnol., vol. 25, no. 4, pp. 195-203, 2009.
    • (2009) Nat. Biotechnol. , vol.25 , Issue.4 , pp. 195-203
    • Ansorge, W.1
  • 5
    • 64849100345 scopus 로고    scopus 로고
    • Sanger who? sequencing the next generation
    • J. M. Perkel, "Sanger who? sequencing the next generation. "Science, vol. 10, pp. 275-279, 2009.
    • (2009) Science , vol.10 , pp. 275-279
    • Perkel, J.M.1
  • 8
    • 72949116371 scopus 로고    scopus 로고
    • Phylogenetic diversity and metabolic potential revealed in a glacier ice metagenome
    • C. Simon et al., "Phylogenetic diversity and metabolic potential revealed in a glacier ice metagenome." Appl. Environ. Microbiol., vol. 75, no. 23, pp. 7519-7526, 2009.
    • (2009) Appl. Environ. Microbiol. , vol.75 , Issue.23 , pp. 7519-7526
    • Simon, C.1
  • 9
    • 58749112734 scopus 로고    scopus 로고
    • A core gut microbiome in obese and lean twins
    • P. Turnbaugh et al., "A core gut microbiome in obese and lean twins." Nature, vol. 457, no. 7228, pp. 480-484, 2009.
    • (2009) Nature , vol.457 , Issue.7228 , pp. 480-484
    • Turnbaugh, P.1
  • 11
    • 54549106898 scopus 로고    scopus 로고
    • Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers
    • Z. Liu et al., "Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers."Nucleic Acids Res., vol. 36, no. 18, p. e120, 2008.
    • (2008) Nucleic Acids Res. , vol.36 , Issue.18
    • Liu, Z.1
  • 12
    • 58549089276 scopus 로고    scopus 로고
    • Shotgun metaproteomics of the human distal gut microbiota
    • N. Verberkmoes et al., "Shotgun metaproteomics of the human distal gut microbiota." J. ISME, vol. 3, no. 2, pp. 179-189, 2009.
    • (2009) J. ISME , vol.3 , Issue.2 , pp. 179-189
    • Verberkmoes, N.1
  • 13
    • 58149200948 scopus 로고    scopus 로고
    • The ribosomal database project: Improved alignments and new tools for rRNA analysis
    • J. Cole et al., "The ribosomal database project: Improved alignments and new tools for rRNA analysis." Nucleic Acids Res., vol. 37, no. Database issue, pp. D141-D145, 2009.
    • (2009) Nucleic Acids Res. , vol.37 , Issue.DATABASE ISSUE
    • Cole, J.1
  • 14
    • 33747827586 scopus 로고    scopus 로고
    • NAST: A multiple sequence alignment server for comparative analysis of 16S rRNA genes
    • T. DeSantis et al., "NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes." Nucleic Acids Res., vol. 34, no. Web Server issue, pp. W394-W399, 2006.
    • (2006) Nucleic Acids Res. , vol.34 , Issue.WEB SERVER ISSUE
    • DeSantis, T.1
  • 15
    • 33746061683 scopus 로고    scopus 로고
    • Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB
    • -, "Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB." Appl. Environ. Microbiol., vol. 72, no. 7, pp. 5069-5072, 2006.
    • (2006) Appl. Environ. Microbiol. , vol.72 , Issue.7 , pp. 5069-5072
    • DeSantis, T.1
  • 16
    • 62549109116 scopus 로고    scopus 로고
    • TACOA: Taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach
    • N. Diaz et al., "TACOA: Taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach." BMC Bioinf., vol. 10, p. 56, 2009.
    • (2009) BMC Bioinf. , vol.10 , pp. 56
    • Diaz, N.1
  • 17
    • 33847702910 scopus 로고    scopus 로고
    • MEGAN analysis of metagenomic data
    • DOI 10.1101/gr.5969107
    • D. Huson et al., "MEGAN analysis of metagenomic data."Genome Res., vol. 17, no. 3, pp. 377-386, 2007. (Pubitemid 46376747)
    • (2007) Genome Research , vol.17 , Issue.3 , pp. 377-386
    • Huson, D.H.1    Auch, A.F.2    Qi, J.3    Schuster, S.C.4
  • 18
    • 33845957530 scopus 로고    scopus 로고
    • Accurate phylogenetic classification of variable-length DNA fragments
    • DOI 10.1038/nmeth976, PII NMETH976
    • A. McHardy et al., "Accurate phylogenetic classification of variable-length DNA fragments." Nat. Methods, vol. 4, no. 1, pp. 63-72, 2007. (Pubitemid 46029478)
    • (2007) Nature Methods , vol.4 , Issue.1 , pp. 63-72
    • McHardy, A.C.1    Martin, H.G.2    Tsirigos, A.3    Hugenholtz, P.4    Rigoutsos, I.5
  • 19
    • 53549118607 scopus 로고    scopus 로고
    • The metagenomics RAST server - A public resource for the automatic phylogenetic and functional analysis of metagenomes
    • F. Meyer et al., "The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes." BMC Bioinf., vol. 9, no. 1, p. 386, 2008.
    • (2008) BMC Bioinf. , vol.9 , Issue.1 , pp. 386
    • Meyer, F.1
  • 20
    • 34548293679 scopus 로고    scopus 로고
    • Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy
    • DOI 10.1128/AEM.00062-07
    • Q. Wang et al., "Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy."Appl. Environ. Microbiol., vol. 73, no. 16, pp. 5261-5267, 2007. (Pubitemid 47326658)
    • (2007) Applied and Environmental Microbiology , vol.73 , Issue.16 , pp. 5261-5267
    • Wang, Q.1    Garrity, G.M.2    Tiedje, J.M.3    Cole, J.R.4
  • 21
    • 44449119442 scopus 로고    scopus 로고
    • Metagenomics: Exploring unseen communities
    • N. Blow, "Metagenomics: Exploring unseen communities."Nature, vol. 453, no. 7195, pp. 687-690, 2008.
    • (2008) Nature , vol.453 , Issue.7195 , pp. 687-690
    • Blow, N.1
  • 22
    • 54449096251 scopus 로고    scopus 로고
    • Probing metagenomics by rapid cluster analysis of very large datasets
    • W. Li, J. C. Wooley, and A. Godzik, "Probing metagenomics by rapid cluster analysis of very large datasets." PLoS One, vol. 3, no. 10, p. e3375, 2008.
    • (2008) PLoS One , vol.3 , Issue.10
    • Li, W.1    Wooley, J.C.2    Godzik, A.3
  • 24
    • 34447248788 scopus 로고    scopus 로고
    • Clustered sequence representation for fast homology search
    • DOI 10.1089/cmb.2007.R005
    • M. Cameron, Y. Bernstein, and H. Williams, "Clustered sequence representation for fast homology search." J. Comput. Biol., vol. 14, no. 5, pp. 594-614, 2007. (Pubitemid 47047816)
    • (2007) Journal of Computational Biology , vol.14 , Issue.5 , pp. 594-614
    • Cameron, M.1    Bernstein, Y.2    Williams, H.E.3
  • 25
    • 0037790561 scopus 로고    scopus 로고
    • Efficient clustering of large EST data sets on parallel computers
    • DOI 10.1093/nar/gkg379
    • A. Kalyanaraman et al., "Efficient clustering of large EST data sets on parallel computers." Nucleic Acids Res., vol. 31, no. 11, pp. 2963-2974, 2003. (Pubitemid 37442139)
    • (2003) Nucleic Acids Research , vol.31 , Issue.11 , pp. 2963-2974
    • Kalyanaraman, A.1    Aluru, S.2    Kothari, S.3    Brendel, V.4
  • 28
    • 0010362121 scopus 로고    scopus 로고
    • Syntactic clustering of the web
    • A. Broder et al., "Syntactic clustering of the web." Comput. Networks ISDN Systems, vol. 29, no. 8-13, pp. 1157-1166, 1997.
    • (1997) Comput. Networks ISDN Systems , vol.29 , Issue.8-13 , pp. 1157-1166
    • Broder, A.1
  • 30
    • 77956295988 scopus 로고    scopus 로고
    • The genome analysis toolkit: A mapreduce framework for analyzing next-generation DNA sequencing data
    • A. McKenna et al., "The genome analysis toolkit: a mapreduce framework for analyzing next-generation DNA sequencing data." Genome Res., vol. 20, no. 9, pp. 1297-1303, 2010.
    • (2010) Genome Res. , vol.20 , Issue.9 , pp. 1297-1303
    • McKenna, A.1
  • 31
    • 77954492012 scopus 로고    scopus 로고
    • Cloud computing and the DNA data race
    • M. Schatz, B. Langmead, and S. Salzberg, "Cloud computing and the DNA data race." Nat. Biotechnol., vol. 28, no. 7, pp. 691-693, 2010.
    • (2010) Nat. Biotechnol. , vol.28 , Issue.7 , pp. 691-693
    • Schatz, M.1    Langmead, B.2    Salzberg, S.3
  • 32
    • 77954055600 scopus 로고    scopus 로고
    • Cloud computing for comparative genomics
    • D. Wall et al., "Cloud computing for comparative genomics."BMC Bioinf., vol. 11, p. 259, 2010.
    • (2010) BMC Bioinf. , vol.11 , pp. 259
    • Wall, D.1
  • 33
    • 58149234737 scopus 로고    scopus 로고
    • Real-time dna sequencing from single polymerase molecules
    • J. Eid, A. Fehr, J. Gray, and K. L. et al., "Real-time dna sequencing from single polymerase molecules." Science, vol. 323, no. 5910, pp. 133-138, 2009.
    • (2009) Science , vol.323 , Issue.5910 , pp. 133-138
    • Eid, J.1    Fehr, A.2    Gray, J.3    L, K.4
  • 34
    • 16444383160 scopus 로고    scopus 로고
    • Survey of clustering algorithms
    • DOI 10.1109/TNN.2005.845141
    • R. Xu and D. Wunsch, "Survey of clustering algorithms."IEEE Trans. Neural Netw., vol. 16, no. 3, pp. 645-678, 2005. (Pubitemid 40718010)
    • (2005) IEEE Transactions on Neural Networks , vol.16 , Issue.3 , pp. 645-678
    • Xu, R.1    Wunsch II, D.2
  • 35
    • 0012834533 scopus 로고    scopus 로고
    • Why so many clustering algorithms: A position paper
    • V. Estivill-Castro, "Why so many clustering algorithms: a position paper." ACM SIGKDD Explor. Newsl., vol. 4, no. 1, pp. 65-75, 2002.
    • (2002) ACM SIGKDD Explor. Newsl. , vol.4 , Issue.1 , pp. 65-75
    • Estivill-Castro, V.1
  • 36
    • 0036040277 scopus 로고    scopus 로고
    • Similarity estimation techniques from rounding algorithms
    • M. Charikar, "Similarity estimation techniques from rounding algorithms." in Proc. of ACM STOC, 2002, pp. 380-388.
    • Proc. of ACM STOC, 2002 , pp. 380-388
    • Charikar, M.1
  • 37
    • 35348911985 scopus 로고    scopus 로고
    • Detecting near-duplicates for web crawling
    • G. Manku, A. Jain, and A. D. Sarma, "Detecting near-duplicates for web crawling." in Proc. of WWW, 2007, pp. 141-150.
    • Proc. of WWW, 2007 , pp. 141-150
    • Manku, G.1    Jain, A.2    Sarma, A.D.3
  • 38
    • 0034207121 scopus 로고    scopus 로고
    • Min-wise independent permutations
    • A. Broder et al., "Min-wise independent permutations." J. Comput. System Sci., vol. 60, no. 3, pp. 630-659, 2000.
    • (2000) J. Comput. System Sci. , vol.60 , Issue.3 , pp. 630-659
    • Broder, A.1
  • 39
    • 33750296887 scopus 로고    scopus 로고
    • Finding near-duplicate web pages: A large-scale evaluation of algorithms
    • M. Henzinger, "Finding near-duplicate web pages: A large-scale evaluation of algorithms." in Proc. of ACM SIGIR, 2006, pp. 284-291.
    • Proc. of ACM SIGIR, 2006 , pp. 284-291
    • Henzinger, M.1
  • 42
    • 80053246211 scopus 로고    scopus 로고
    • A parallel algorithm for enumerating all the maximal k-plexes
    • Emerging technologies in knowledge discovery and data mining, ser.
    • B. Wu and X. Pei, "A parallel algorithm for enumerating all the maximal k-plexes," in Emerging technologies in knowledge discovery and data mining, ser. Lecture notes in computer science, vol. 4819, 2009, p. 476483.
    • (2009) Lecture Notes in Computer Science , vol.4819 , pp. 476483
    • Wu, B.1    Pei, X.2
  • 44
    • 84885573531 scopus 로고    scopus 로고
    • "Amazon EC2," http://aws.amazon.com/ec2/.
    • Amazon EC2
  • 46
    • 0038483826 scopus 로고    scopus 로고
    • Emergence of scaling in random networks
    • A.-L. Barabasi and R. Albert, "Emergence of scaling in random networks," Science, vol. 286, no. 5439, pp. 509-512, 1999.
    • (1999) Science , vol.286 , Issue.5439 , pp. 509-512
    • Barabasi, A.-L.1    Albert, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.