메뉴 건너뛰기




Volumn , Issue , 2010, Pages 351-362

Fast sort on CPUs and GPUs: A case for bandwidth oblivious SIMD sort

Author keywords

buffer; databases; many core; merge; merge network; performance; radix; simd; sorting; tlp

Indexed keywords

ANALYTICAL MODEL; ARCHITECTURAL FEATURES; BUFFER; CARDINALITIES; COMPETITIVE ANALYSIS; CONSTANT BANDWIDTH; DATABASES; KEY SIZES; LARGE DATA; LARGE SIZES; LOW-BANDWIDTH; MAIN MEMORY; MANY-CORE; MODERN ARCHITECTURES; ON CURRENTS; RADIX SORT; SORTING ALGORITHM; SORTING METHOD;

EID: 77954743119     PISSN: 07308078     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1807167.1807207     Document Type: Conference Paper
Times cited : (192)

References (27)
  • 4
    • 85154002090 scopus 로고
    • Sorting networks and their applications
    • K. E. Batcher. Sorting networks and their applications. In Spring Joint Computer Conference, pages 307-314, 1968.
    • (1968) Spring Joint Computer Conference , pp. 307-314
    • Batcher, K.E.1
  • 5
    • 70849107602 scopus 로고    scopus 로고
    • Dictionary-based order-preserving string compression for column stores
    • C. Binnig, S. Hildenbrand, and F. Färber. Dictionary-based order-preserving string compression for column stores. In SIGMOD, pages 283-296, 2009.
    • (2009) SIGMOD , pp. 283-296
    • Binnig, C.1    Hildenbrand, S.2    Färber, F.3
  • 7
    • 0040748316 scopus 로고    scopus 로고
    • Main-memory index structures with fixed-size partial keys
    • P. Bohannon, P. Mcllroy, and R. Rastogi. Main-memory index structures with fixed-size partial keys. In SIGMOD, pages 163-174, 2001.
    • (2001) SIGMOD , pp. 163-174
    • Bohannon, P.1    Mcllroy, P.2    Rastogi, R.3
  • 8
    • 65549167440 scopus 로고    scopus 로고
    • Rock: A High-Performance Sparc CMT Processor
    • S. Chaudhry, R. Cypher, M. Ekman, M. Karlsson, et al. Rock: A High-Performance Sparc CMT Processor. IEEE Micro, 29(2):6-16, 2009.
    • (2009) IEEE Micro , vol.29 , Issue.2 , pp. 6-16
    • Chaudhry, S.1    Cypher, R.2    Ekman, M.3    Karlsson, M.4
  • 9
    • 84865096511 scopus 로고    scopus 로고
    • Efficient implementation of sorting on multi-core SIMD CPU architectures
    • J. Chhugani, A. D. Nguyen, V.W. Lee, et al. Efficient implementation of sorting on multi-core SIMD CPU architectures. VLDB, 1(2):1313-1324, 2008.
    • (2008) VLDB , vol.1 , Issue.2 , pp. 1313-1324
    • Chhugani, J.1    Nguyen, A.D.2    Lee, V.W.3
  • 12
    • 33947607609 scopus 로고    scopus 로고
    • GPUTeraSort: High Performance Graphics Co-processor Sorting
    • N. Govindaraju, J. Gray, R. Kumar, et al. GPUTeraSort: High Performance Graphics Co-processor Sorting. In SIGMOD, pages 325-336, 2006.
    • (2006) SIGMOD , pp. 325-336
    • Govindaraju, N.1    Gray, J.2    Kumar, R.3
  • 13
    • 47849101190 scopus 로고    scopus 로고
    • AA-Sort: A New Parallel Sorting Algorithm for Multi-Core SIMD Processors
    • H. Inoue, T. Moriyama, H. Komatsu, et al. AA-Sort: A New Parallel Sorting Algorithm for Multi-Core SIMD Processors. In PACT, pages 189-198, 2007.
    • (2007) PACT , pp. 189-198
    • Inoue, H.1    Moriyama, T.2    Komatsu, H.3
  • 16
    • 77954705147 scopus 로고    scopus 로고
    • Sort vs. hash revisited: Fast join implementation on multi-core cpus
    • C. Kim, E. Sedlar, J. Chhugani, T. Kaldewey, et al. Sort vs. hash revisited: Fast join implementation on multi-core cpus. PVLDB, 2(2):1378-1389, 2009.
    • (2009) PVLDB , vol.2 , Issue.2 , pp. 1378-1389
    • Kim, C.1    Sedlar, E.2    Chhugani, J.3    Kaldewey, T.4
  • 17
    • 0030733703 scopus 로고    scopus 로고
    • The Influence of Caches on the Performance of Sorting
    • A. Lamarca and R. E. Ladner. The Influence of Caches on the Performance of Sorting. In Journal of Algorithms, pages 370-379, 1997.
    • (1997) Journal of Algorithms , pp. 370-379
    • Lamarca, A.1    Ladner, R.E.2
  • 21
    • 80052538275 scopus 로고    scopus 로고
    • When multicore isn't enough: Trends and the future for multi-multicore systems
    • M. Reilly. When multicore isn't enough: Trends and the future for multi-multicore systems. In HPEC, 2008.
    • (2008) HPEC
    • Reilly, M.1
  • 22
    • 70450077484 scopus 로고    scopus 로고
    • Designing efficient sorting algorithms for manycore GPUs
    • N. Satish, , M. Harris, and M. Garland. Designing efficient sorting algorithms for manycore GPUs. In IPDPS, pages 1-10, 2009.
    • (2009) IPDPS , pp. 1-10
    • Satish, N.1    Harris, M.2    Garland, M.3
  • 23
    • 49249086142 scopus 로고    scopus 로고
    • Larrabee: A Many-Core x86 Architecture for Visual Computing
    • L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, et al. Larrabee: A Many-Core x86 Architecture for Visual Computing. SIGGRAPH, 27(3), 2008.
    • (2008) SIGGRAPH , vol.27 , Issue.3
    • Seiler, L.1    Carmean, D.2    Sprangle, E.3    Forsyth, T.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.