



Volume , Issue , 2012, Pages 247-258

Nested data-parallelism on the GPU

Author keywords

GPGPU; GPU; NESL; nested data parallelism

Indexed keywords

ARITHMETIC PERFORMANCE; DATA PARALLELISM; DATA-LEVEL PARALLELISM; DIVIDE-AND-CONQUER ALGORITHM; EMPIRICAL EVIDENCE; FIRST-ORDER FUNCTIONAL LANGUAGES; GPGPU; GPU; GRAPHICS PROCESSING UNITS; LANGUAGE IMPLEMENTATIONS; MEMORY BANDWIDTHS; NESL; NESTED DATA; PARALLEL COMPUTER

EID: 84867546922     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2364527.2364563     Document Type: Conference Paper
Times cited: 38

References (38)
  • 1
    • 0025545476
    • VCODE: A data-parallel intermediate language
    • Blelloch, G. and S. Chatterjee. VCODE: A data-parallel intermediate language. In FOMPC3, 1990, pp. 471-480.
    • (1990) FOMPC3 , pp. 471-480
    • Blelloch, G.1    Chatterjee, S.2
  • 3
    • 43949161602
    • Implementation of a portable nested data-parallel language
    • Blelloch, G. E., S. Chatterjee, J. C. Hardwick, J. Sipelstein, and M. Zagha. Implementation of a portable nested data-parallel language. JPDC, 21(1), 1994, pp. 4-14.
    • (1994) JPDC , vol.21 , Issue.1 , pp. 4-14
    • Blelloch, G.E.1    Chatterjee, S.2    Hardwick, J.C.3    Sipelstein, J.4    Zagha, M.5
  • 4
    • 0030381077
    • The quickhull algorithm for convex hulls
    • Barber, C. B., D. P. Dobkin, and H. Huhdanpaa. The quickhull algorithm for convex hulls. ACM TOMS, 22(4), 1996, pp. 469-483.
    • (1996) ACM TOMS , vol.22 , Issue.4 , pp. 469-483
    • Barber, C.B.1    Dobkin, D.P.2    Huhdanpaa, H.3
  • 6
    • 33846349887
    • A hierarchical O(N log N) force calculation algorithm
    • December
    • Barnes, J. and P. Hut. A hierarchical O(N log N) force calculation algorithm. Nature, 324, December 1986, pp. 446-449.
    • (1986) Nature , vol.324 , pp. 446-449
    • Barnes, J.1    Hut, P.2
  • 7
    • 0030105185
    • Programming parallel algorithms
    • March
    • Blelloch, G. E. Programming parallel algorithms. CACM, 39(3), March 1996, pp. 85-97.
    • (1996) CACM , vol.39 , Issue.3 , pp. 85-97
    • Blelloch, G.E.1
  • 8
    • 84858427151
    • An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm
    • chapter 6, Elsevier Science Publishers, New York, NY
    • Burtscher, M. and K. Pingali. An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm. In GPU Computing Gems Emerald Edition, chapter 6, pp. 75-92. Elsevier Science Publishers, New York, NY, 2011.
    • (2011) GPU Computing Gems Emerald Edition , pp. 75-92
    • Burtscher, M.1    Pingali, K.2
  • 9
    • 85015692260
    • The pricing of options and corporate liabilities
    • Black, F. and M. Scholes. The pricing of options and corporate liabilities. JPE, 81(3), 1973, pp. 637-654.
    • (1973) JPE , vol.81 , Issue.3 , pp. 637-654
    • Black, F.1    Scholes, M.2
  • 10
    • 0025380943
    • Compiling collection-oriented languages onto massively parallel computers
    • Blelloch, G. E. and G.W. Sabot. Compiling collection-oriented languages onto massively parallel computers. JPDC, 8(2), 1990, pp. 119-134.
    • (1990) JPDC , vol.8 , Issue.2 , pp. 119-134
    • Blelloch, G.E.1    Sabot, G.W.2
  • 13
    • 0027632582
    • Compiling nested data-parallel programs for shared-memory multiprocessors
    • July
    • Chatterjee, S. Compiling nested data-parallel programs for shared-memory multiprocessors. ACM TOPLAS, 15(3), July 1993, pp. 400-462.
    • (1993) ACM TOPLAS , vol.15 , Issue.3 , pp. 400-462
    • Chatterjee, S.1
  • 15
    • 84937389888
    • Nepal - Nested data parallelism in Haskell
    • Euro-Par '01, Springer-Verlag, August
    • Chakravarty, M. M. T., G. Keller, R. Leshchinskiy, and W. Pfannenstiel. Nepal - nested data parallelism in Haskell. In Euro-Par '01, vol. 2150 of LNCS. Springer-Verlag, August 2001, pp. 524-534.
    • (2001) LNCS , vol.2150 , pp. 524-534
    • Chakravarty, M.M.T.1    Keller, G.2    Leshchinskiy, R.3    Pfannenstiel, W.4
  • 16
    • 79551658111
    • Partial vectorisation of Haskell programs
    • ACM, January Available from
    • Chakravarty, M. M. T., R. Leshchinskiy, S. Peyton Jones, and G. Keller. Partial vectorisation of Haskell programs. In DAMP '08. ACM, January 2008, pp. 2-16. Available from http://clip.dia.fi.upm.es/Conferences/DAMP08/.
    • (2008) DAMP '08 , pp. 2-16
    • Chakravarty, M.M.T.1    Leshchinskiy, R.2    Jones, S.P.3    Keller, G.4
  • 17
  • 18
  • 20
  • 21
    • 33747508171
    • SAC - A Functional Array Language for Efficient Multi-threaded Execution
    • August
    • Grelck, C. and S.-B. Scholz. SAC - A Functional Array Language for Efficient Multi-threaded Execution. IJPP, 34(4), August 2006, pp. 383-427.
    • (2006) IJPP , vol.34 , Issue.4 , pp. 383-427
    • Grelck, C.1    Scholz, S.-B.2
  • 22
    • 79952162843
    • Breaking the GPU programming barrier with the auto-parallelising SAC compiler
    • ACM
    • Guo, J., J. Thiyagalingam, and S.-B. Scholz. Breaking the GPU programming barrier with the auto-parallelising SAC compiler. In DAMP '11, Austin, January 2011. ACM, pp. 15-24.
    • DAMP '11, Austin, January 2011 , pp. 15-24
    • Guo, J.1    Thiyagalingam, J.2    Scholz, S.-B.3
  • 23
    • 84882564541
    • Thrust: A productivity-oriented library for CUDA
    • W.W. Hwu (ed.), chapter 26, Morgan Kaufmann Publishers, October
    • Hoberock, J. and N. Bell. Thrust: A productivity-oriented library for CUDA. In W.W. Hwu (ed.), GPU Computing Gems, Jade Edition, chapter 26, pp. 359-372. Morgan Kaufmann Publishers, October 2011.
    • (2011) GPU Computing Gems, Jade Edition , pp. 359-372
    • Hoberock, J.1    Bell, N.2
  • 25
    • 70349100958
    • November Available from
    • Khronos OpenCL Working Group. OpenCL 1.2 Specification, November 2011. Available from http://www.khronos.org/registry/cl/specs/opencl-1.2.pdf.
    • (2011) OpenCL 1.2 Specification
  • 26
    • 79952182078
    • Simple optimizations for an applicative array language for graphics processors
    • ACM
    • Larsen, B. Simple optimizations for an applicative array language for graphics processors. In DAMP '11, Austin, January 2011. ACM, pp. 25-34.
    • DAMP '11, Austin, January 2011 , pp. 25-34
    • Larsen, B.1
  • 27
    • 33746637093
    • Higher order flattening
    • V. Alexandrov, D. van Albada, P. Sloot, and J. Dongarra (eds.), ICCS '06, Springer-Verlag, May
    • Leshchinskiy, R., M. M. T. Chakravarty, and G. Keller. Higher order flattening. In V. Alexandrov, D. van Albada, P. Sloot, and J. Dongarra (eds.), ICCS '06, number 3992 in LNCS. Springer-Verlag, May 2006, pp. 920-928.
    • (2006) LNCS , vol.3992 , pp. 920-928
    • Leshchinskiy, R.1    Chakravarty, M.M.T.2    Keller, G.3
  • 33
    • 79551704836
    • Available from
    • NVIDIA. NVIDIA CUDA C Programming Guide, 2011. Available from http://developer.nvidia.com/category/zone/cuda-zone.
    • (2011) NVIDIA CUDA C Programming Guide
  • 35
    • 0029196596
    • Work-efficient nested data-parallelism
    • IEEE Computer Society Press
    • Palmer, D. W., J. F. Prins, and S. Westfold. Work-efficient nested data-parallelism. In FoMPP5. IEEE Computer Society Press, 1995, pp. 186-193.
    • (1995) FoMPP5 , pp. 186-193
    • Palmer, D.W.1    Prins, J.F.2    Westfold, S.3
  • 36


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.