메뉴 건너뛰기




Volumn , Issue , 2009, Pages 43-49

A memory optimization technique for software managed scratchpad memory in GPUs

Author keywords

CUDA; GPU computing; Memory optimization

Indexed keywords

APPLICATION PERFORMANCE; BENCHMARK SUITES; CUDA; EXECUTION TIME; GPU COMPUTING; GPU IMPLEMENTATION; GRAPH COLORINGS; MEMORY OPTIMIZATION; MEMORY SPACE; ON CHIPS; REAL-LIFE APPLICATIONS; SCRATCH PAD MEMORY; TRANSFORMATION METHODS;

EID: 70350786536     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SASP.2009.5226334     Document Type: Conference Paper
Times cited : (23)

References (30)
  • 1
    • 70350779165 scopus 로고    scopus 로고
    • GPGPU to many-core processing: Higher performance for mass market applications
    • D. Manocha, M. C. Lin, N. Govindaraju. GPGPU to Many-Core Processing: Higher Performance for Mass Market Applications. Manycore Computing Workshop, 2007.
    • (2007) Manycore Computing Workshop
    • Manocha, D.1    Lin, M.C.2    Govindaraju, N.3
  • 2
    • 56749110402 scopus 로고    scopus 로고
    • AMD Stream Processor. http://ati.amd.com/products/streamprocessor/index. html.
    • AMD Stream Processor
  • 3
    • 35948991669 scopus 로고    scopus 로고
    • NVIDIA Corporation, version 1.1
    • NVIDIA Corporation. NVIDIA CUDA Programming Guide, version 1.1, 2007.
    • (2007) NVIDIA CUDA Programming Guide
  • 5
    • 43449128019 scopus 로고    scopus 로고
    • NVIDIA CUDA software and GPU parallel computing architecture
    • May
    • J. Nickolls and I. Buck. NVIDIA CUDA software and GPU parallel computing architecture. Microprocessor Forum, May 2007.
    • (2007) Microprocessor Forum
    • Nickolls, J.1    Buck, I.2
  • 6
    • 0036053351 scopus 로고    scopus 로고
    • Compiler-directed scratch pad memory hierarchy design and management
    • New Orleans, Louisiana, USA, June 10-14, DAC'02. ACM, New York, NY, 2002
    • Kandemir, M. and Choudhary, A. 2002. Compiler-directed scratch pad memory hierarchy design and management. In Proceedings of the 39th Conference on Design Automation (New Orleans, Louisiana, USA, June 10 - 14, 2002). DAC '02. ACM, New York, NY, 628-633.
    • (2002) Proceedings of the 39th Conference on Design Automation , pp. 628-633
    • Kandemir, M.1    Choudhary, A.2
  • 7
    • 84893649314 scopus 로고    scopus 로고
    • Static memory allocation by pointer analysis and coloring
    • Munich, Germany., W. Nebel and A. Jerraya, Eds. Design, Automation, and Test in Europe. IEEE Press, Piscataway, NJ
    • Zhu, J. 2001. Static memory allocation by pointer analysis and coloring. In Proceedings of the Conference on Design, Automation and Test in Europe (Munich, Germany). W. Nebel and A. Jerraya, Eds. Design, Automation, and Test in Europe. IEEE Press, Piscataway, NJ, 785-790.
    • (2001) Proceedings of the Conference on Design, Automation and Test in Europe , pp. 785-790
    • Zhu, J.1
  • 17
    • 51449118065 scopus 로고    scopus 로고
    • A performance study of general-purpose applications on graphics processors using CUDA
    • Oct.
    • Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J. W., and Skadron, K. A performance study of general-purpose applications on graphics processors using CUDA. J. Parallel Distrib. Comput. 68, 10 (Oct. 2008), 1370-1380.
    • (2008) J. Parallel Distrib. Comput. , vol.68 , Issue.10 , pp. 1370-1380
    • Che, S.1    Boyer, M.2    Meng, J.3    Tarjan, D.4    Sheaffer, J.W.5    Skadron, K.6
  • 20
    • 0028429472 scopus 로고
    • Improvements to graph coloring register allocation
    • P. Briggs, K. D. Cooper, and L. Torczon, "Improvements to graph coloring register allocation, " ACM Trans. Program. Lang. Syst., vol.16, no.3, pp. 428-455, 1994.
    • (1994) ACM Trans. Program. Lang. Syst. , vol.16 , Issue.3 , pp. 428-455
    • Briggs, P.1    Cooper, K.D.2    Torczon, L.3
  • 21
    • 0003422462 scopus 로고    scopus 로고
    • New York, NY, USA: Springer-Verlag New York, Inc.
    • V. V. Vazirani, Approximation algorithmms. New York, NY, USA: Springer-Verlag New York, Inc., 2001.
    • (2001) Approximation Algorithmms
    • Vazirani, V.V.1
  • 24
    • 70350768152 scopus 로고    scopus 로고
    • Geometric methods in bio-medical image processing
    • Active Contour and Segmentation Models Using Geometric PDE's for Medical Imaging Tony F. Chan and Luminita A. Vese, in Malladi, R. (Ed.), Springer
    • Active Contour and Segmentation Models Using Geometric PDE's for Medical Imaging Tony F. Chan and Luminita A. Vese, in Malladi, R. (Ed.), "Geometric Methods in Bio-Medical Image Processing", Series: Mathematics and Visualization, Springer, 2002, pp. 63-75.
    • (2002) Series: Mathematics and Visualization , pp. 63-75
  • 25
    • 0037272745 scopus 로고    scopus 로고
    • Curvature based image registration
    • B. Fischer and J. Modersitzki, "Curvature based image registration, " J.Math. Imaging Vis., vol.18, no.1, pp. 81-85, 2003.
    • (2003) J.Math. Imaging Vis. , vol.18 , Issue.1 , pp. 81-85
    • Fischer, B.1    Modersitzki, J.2
  • 26
    • 0242592206 scopus 로고    scopus 로고
    • Curvature-based transfer functions for direct volume rendering: Methods and applications
    • VIS 2003. IEEE, 24-24 Oct.
    • Kindlmann, G.; Whitaker, R.; Tasdizen, T.; Moller, T., "Curvature-based transfer functions for direct volume rendering: methods and applications, " Visualization, 2003. VIS 2003. IEEE, vol., no., pp.513- 520, 24-24 Oct. 2003.
    • (2003) Visualization, 2003 , pp. 513-520
    • Kindlmann, G.1    Whitaker, R.2    Tasdizen, T.3    Moller, T.4
  • 27
    • 33745771700 scopus 로고    scopus 로고
    • Fitting B-spline curves to point clouds by curvature-based squared distance minimization
    • Apr.
    • Wang, W., Pottmann, H., and Liu, Y. 2006. Fitting B-spline curves to point clouds by curvature-based squared distance minimization. ACM Trans. Graph. 25, 2 (Apr. 2006), 214-238.
    • (2006) ACM Trans. Graph. , vol.25 , Issue.2 , pp. 214-238
    • Wang, W.1    Pottmann, H.2    Liu, Y.3
  • 28
    • 35648995967 scopus 로고    scopus 로고
    • Introduction to the cell broadband engine architecture
    • C. R. Johns and D. A. Brokenshire, "Introduction to the Cell Broadband Engine Architecture", IBM Journal of Research and Development, Vol 51, Number 5, 2007, pp 503-520.
    • (2007) IBM Journal of Research and Development , vol.51 , Issue.5 , pp. 503-520
    • Johns, C.R.1    Brokenshire, D.A.2
  • 30
    • 33747437310 scopus 로고    scopus 로고
    • Overlay techniques for scratchpad memories in low-power embedded processors
    • Aug.
    • M. Verma and P. Marwedel. Overlay techniques for scratchpad memories in low-power embedded processors. IEEE Transactions on Very Large Scale Integration Systems, 4(8):802815, Aug. 2006.
    • (2006) IEEE Transactions on Very Large Scale Integration Systems , vol.4 , Issue.8 , pp. 802815
    • Verma, M.1    Marwedel, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.