메뉴 건너뛰기




Volumn 5952 LNCS, Issue , 2010, Pages 322-336

Analysis of task offloading for accelerators

Author keywords

[No Author keywords available]

Indexed keywords

CELL ARCHITECTURES; CELL PROCESSOR; COMMUNICATION OVERLAP; HETEROGENEOUS MULTICORE; NAS BENCHMARKS; PRAGMAS; RUNTIME SYSTEMS; WHOLE SYSTEMS;

EID: 77949621946     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-11515-8_24     Document Type: Conference Paper
Times cited : (8)

References (32)
  • 2
    • 77949639334 scopus 로고    scopus 로고
    • NVIDIA corporation: NVIDIA CUDA Compute Unified Device Architecture Version 2.0 2008
    • NVIDIA corporation: NVIDIA CUDA Compute Unified Device Architecture Version 2.0 (2008)
  • 3
    • 74549192511 scopus 로고    scopus 로고
    • NVIDIA corporation: Technical Brief
    • NVIDIA corporation: NVIDIA Tesla GPU Computing Technical Brief (2008)
    • (2008) NVIDIA Tesla GPU Computing
  • 4
    • 77949604224 scopus 로고    scopus 로고
    • OpenMP Architecture Review Board: OpenMP Application Program Interface. Version 3.0 May 2008
    • OpenMP Architecture Review Board: OpenMP Application Program Interface. Version 3.0 (May 2008), http://www.openmp.org
  • 7
    • 0003648799 scopus 로고    scopus 로고
    • The OpenMP Implementation of NAS Parallel Benchmarks and Its Performance
    • Technical Report NAS-99-011, NASA Ames Research Center
    • Jin, H., Frumkin, M., Yan, J.: The OpenMP Implementation of NAS Parallel Benchmarks and Its Performance. Technical Report NAS-99-011, NASA Ames Research Center (1999)
    • (1999)
    • Jin, H.1    Frumkin, M.2    Yan, J.3
  • 11
    • 77949601288 scopus 로고    scopus 로고
    • AMD Corporation: AMD 2007 Technology Analyst Day, http://www2.amd.com/us- en/assets/content-type/DownloadableAssets/ FinancialA-DayNewsSummary121307FINAL. pdf
    • AMD Corporation: AMD 2007 Technology Analyst Day, http://www2.amd.com/us- en/assets/content-type/DownloadableAssets/ FinancialA-DayNewsSummary121307FINAL. pdf
  • 12
    • 77949642925 scopus 로고    scopus 로고
    • Stanford University: BrookGPU, http://graphics.stanford.edu/projects/ brookgpu/
    • BrookGPU
  • 13
    • 84871286731 scopus 로고    scopus 로고
    • Stanford University: Brook Language, http://merrimac.stanford.edu/brook/
    • Brook Language
  • 15
    • 48949090561 scopus 로고    scopus 로고
    • A Proposal for Task Parallelism in OpenMP
    • Chapman, B, Zheng, W, Gao, G.R, Sato, M, Ayguadé, E, Wang, D, eds, IWOMP 2007, Springer, Heidelberg
    • Ayguadé, E., Copty, N., Duran, A., Hoeflinger, J., Lin, Y., Massaioli, F., Su, E., Unnikrishnan, P., Zhang, G.: A Proposal for Task Parallelism in OpenMP. In: Chapman, B., Zheng, W., Gao, G.R., Sato, M., Ayguadé, E., Wang, D. (eds.) IWOMP 2007. LNCS, vol. 4935, pp. 1-12. Springer, Heidelberg (2008)
    • (2008) LNCS , vol.4935 , pp. 1-12
    • Ayguadé, E.1    Copty, N.2    Duran, A.3    Hoeflinger, J.4    Lin, Y.5    Massaioli, F.6    Su, E.7    Unnikrishnan, P.8    Zhang, G.9
  • 17
    • 67650056929 scopus 로고    scopus 로고
    • Extending the OpenMP Tasking Model to Allow Dependent Tasks
    • Eigenmann, R, de Supinski, B.R, eds, IWOMP 2008, Springer, Heidelberg
    • Duran, A., Pérez, J.M., Ayguadé, E., Badia, R.M., Labarta, J.: Extending the OpenMP Tasking Model to Allow Dependent Tasks. In: Eigenmann, R., de Supinski, B.R. (eds.) IWOMP 2008. LNCS, vol. 5004, pp. 111-122. Springer, Heidelberg (2008)
    • (2008) LNCS , vol.5004 , pp. 111-122
    • Duran, A.1    Pérez, J.M.2    Ayguadé, E.3    Badia, R.M.4    Labarta, J.5
  • 18
    • 77949650408 scopus 로고    scopus 로고
    • Dolbeau, R., Bihan, S., Bodin, F.: HMPP: A Hybrid Multi-core Parallel Programming Environment. In: Workshop on General Processing Using GPUs (2006)
    • Dolbeau, R., Bihan, S., Bodin, F.: HMPP: A Hybrid Multi-core Parallel Programming Environment. In: Workshop on General Processing Using GPUs (2006)
  • 19
    • 77949644831 scopus 로고    scopus 로고
    • January 2009
    • IBM Corporation: XL C/C++ for Multicore Acceleration (January 2009), http://www-01.ibm.com/software/awdtools/xlcpp/multicore/
    • C++ for Multicore Acceleration
    • XL, C.1
  • 21
    • 54249087677 scopus 로고    scopus 로고
    • Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Sura, Z., Chen, T., Zhang, T., O'Brien, K., O'Brien, K.: A Novel Asynchronous Software Cache Implementation for the CELL/BE Processor. In: Adve, V., Garzarán, M.J., Petersen, P. (eds.) LCPC 2007. LNCS, 5234, pp. 125-140. Springer, Heidelberg (2008)
    • Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Sura, Z., Chen, T., Zhang, T., O'Brien, K., O'Brien, K.: A Novel Asynchronous Software Cache Implementation for the CELL/BE Processor. In: Adve, V., Garzarán, M.J., Petersen, P. (eds.) LCPC 2007. LNCS, vol. 5234, pp. 125-140. Springer, Heidelberg (2008)
  • 26
    • 77952225553 scopus 로고    scopus 로고
    • Beltran, V., Carrera, D., Torres, J., Ayguadé, E.: CellMT: A Cooperative Multi-threading Library for the Cell/B.E. In: HiPC 2009: Proceedings of the 16th Annual IEEE International Conference on High Performance Computing. IEEE Computer Society, Los Alamitos (2009)
    • Beltran, V., Carrera, D., Torres, J., Ayguadé, E.: CellMT: A Cooperative Multi-threading Library for the Cell/B.E. In: HiPC 2009: Proceedings of the 16th Annual IEEE International Conference on High Performance Computing. IEEE Computer Society, Los Alamitos (2009)
  • 27
    • 77949625831 scopus 로고    scopus 로고
    • Weltzer, J., Silha, E., May, C., Frey, B., Furukawa, J., Frazier, G.: PowerPC Architecture Book V. 2.02. IBM Corporation (2005)
    • Weltzer, J., Silha, E., May, C., Frey, B., Furukawa, J., Frazier, G.: PowerPC Architecture Book V. 2.02. IBM Corporation (2005)
  • 30
    • 77949601698 scopus 로고    scopus 로고
    • Corporation
    • Corporation, I.: Intel Xeon Processor 5000 Sequence (2009), http://www.intel. com/p/en-US/products/server/processor/xeon5000
    • (2009) I.: Intel Xeon Processor 5000 Sequence
  • 31
    • 38149061132 scopus 로고    scopus 로고
    • Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Labarta, J.: Runtime Address Space Computation for SDSM Systems. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, 4382, pp. 330-344. Springer, Heidelberg (2007)
    • Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Labarta, J.: Runtime Address Space Computation for SDSM Systems. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, vol. 4382, pp. 330-344. Springer, Heidelberg (2007)
  • 32
    • 38149004865 scopus 로고    scopus 로고
    • Chen, T., Sura, Z., O'Brien, K., O'Brien, J.K.: Optimizing the Use of Static Buffers for DMA on a CELL Chip. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, 4382, pp. 314-329. Springer, Heidelberg (2007)
    • Chen, T., Sura, Z., O'Brien, K., O'Brien, J.K.: Optimizing the Use of Static Buffers for DMA on a CELL Chip. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, vol. 4382, pp. 314-329. Springer, Heidelberg (2007)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.