SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 5952 LNCS, Issue , 2010, Pages 322-336

Analysis of task offloading for accelerators

(5) Ferrer, Roger a Beltran, Vicenç a Gonzàlez, Marc a,b Martorell, Xavier a,b Ayguadé, Eduard a,b

a BARCELONA SUPERCOMPUTING CENTER (Spain)

b UNIVERSITAT POLITÈCNICA DE CATALUNYA (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

CELL ARCHITECTURES; CELL PROCESSOR; COMMUNICATION OVERLAP; HETEROGENEOUS MULTICORE; NAS BENCHMARKS; PRAGMAS; RUNTIME SYSTEMS; WHOLE SYSTEMS;

COMPUTER ARCHITECTURE; PROFITABILITY;

PROGRAM COMPILERS;

EID: 77949621946 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-11515-8_24 Document Type: Conference Paper

Times cited : (8)

References (32)

1
- 34548753903
- IBM Developer Works November
- Chen, T., Raghavan, R., Dale, J., Iwata, E.: Cell Broadband Engine Architecture and its first implementation. IBM Developer Works (November 2005)
- (2005) Cell Broadband Engine Architecture and its first implementation
- Chen, T.¹ Raghavan, R.² Dale, J.³ Iwata, E.⁴

2
- 77949639334
- NVIDIA corporation: NVIDIA CUDA Compute Unified Device Architecture Version 2.0 2008
- NVIDIA corporation: NVIDIA CUDA Compute Unified Device Architecture Version 2.0 (2008)

3
- 74549192511
- NVIDIA corporation: Technical Brief
- NVIDIA corporation: NVIDIA Tesla GPU Computing Technical Brief (2008)
- (2008) NVIDIA Tesla GPU Computing

4
- 77949604224
- OpenMP Architecture Review Board: OpenMP Application Program Interface. Version 3.0 May 2008
- OpenMP Architecture Review Board: OpenMP Application Program Interface. Version 3.0 (May 2008), http://www.openmp.org

5
- 60449097203
- The Design of OpenMP Tasks
- Ayguadé, E., Copty, N., Duran, A., Hoeflinger, J., Lin, Y., Massaioli, F., Teruel, X., Unnikrishnan, P., Zhang, G.: The Design of OpenMP Tasks. IEEE Transactions on Parallel and Distributed Systems 20(3), 404-418 (2009)
- (2009) IEEE Transactions on Parallel and Distributed Systems , vol.20 , Issue.3 , pp. 404-418
- Ayguadé, E.¹ Copty, N.² Duran, A.³ Hoeflinger, J.⁴ Lin, Y.⁵ Massaioli, F.⁶ Teruel, X.⁷ Unnikrishnan, P.⁸ Zhang, G.⁹

6
- 77949625437
- A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures
- Ayguadé, E., Badia, R.M., Cabrera, D., Duran, A., Gonzalez, M., Igual, F., Jimenez, D., Labarta, J., Martorell, X., Mayo, R., Perez, J.M., Quintana-Orti, E.: A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures. In: Fifth International Workshop on OpenMP, IWOMP (2009)
- (2009) Fifth International Workshop on OpenMP, IWOMP
- Ayguadé, E.¹ Badia, R.M.² Cabrera, D.³ Duran, A.⁴ Gonzalez, M.⁵ Igual, F.⁶ Jimenez, D.⁷ Labarta, J.⁸ Martorell, X.⁹ Mayo, R.¹⁰ Perez, J.M.¹¹ Quintana-Orti, E.¹²

7
- 0003648799
- The OpenMP Implementation of NAS Parallel Benchmarks and Its Performance
- Technical Report NAS-99-011, NASA Ames Research Center
- Jin, H., Frumkin, M., Yan, J.: The OpenMP Implementation of NAS Parallel Benchmarks and Its Performance. Technical Report NAS-99-011, NASA Ames Research Center (1999)
- (1999)
- Jin, H.¹ Frumkin, M.² Yan, J.³

8
- 84944046879
- Performance evaluation of the Omni OpenMP compiler
- Kusano, K., Satoh, S., Sato, M.: Performance evaluation of the Omni OpenMP compiler. In: Third International Symposium on High Performance Computing, pp. 403-414 (2000)
- (2000) Third International Symposium on High Performance Computing , pp. 403-414
- Kusano, K.¹ Satoh, S.² Sato, M.³

9
- 77954443618
- Evaluation of Memory Performance on the Cell BE with the SARC Programming Model
- October
- Ferrer, R., Gonzalez, M., Silla, F., Martorell, X., Ayguadé, E.: Evaluation of Memory Performance on the Cell BE with the SARC Programming Model. In: Proceedings of the 9th Workshop on Memory Performance: Dealing with Applications, systems, and architecture (MEDEA 2008) (October 2008)
- (2008) Proceedings of the 9th Workshop on Memory Performance: Dealing with Applications, systems, and architecture (MEDEA
- Ferrer, R.¹ Gonzalez, M.² Silla, F.³ Martorell, X.⁴ Ayguadé, E.⁵

10
- 74549201478
- March
- Intel Corporation: Intel Corporation's Multicore Architecture Briefing (March 2008), http://www.intel.com/pressroom/archive/releases/20080317fact.htm
- (2008) Intel Corporation's Multicore Architecture Briefing

11
- 77949601288
- AMD Corporation: AMD 2007 Technology Analyst Day, http://www2.amd.com/us- en/assets/content-type/DownloadableAssets/ FinancialA-DayNewsSummary121307FINAL. pdf
- AMD Corporation: AMD 2007 Technology Analyst Day, http://www2.amd.com/us- en/assets/content-type/DownloadableAssets/ FinancialA-DayNewsSummary121307FINAL. pdf

12
- 77949642925
- Stanford University: BrookGPU, http://graphics.stanford.edu/projects/ brookgpu/
- BrookGPU

13
- 84871286731
- Stanford University: Brook Language, http://merrimac.stanford.edu/brook/
- Brook Language

14
- 77949578044
- Group, February 2009
- Group, K.O.W.: The OpenCL Specification (February 2009), http://www.khronos.org/registry/cl/
- K.O.W.: The OpenCL Specification

15
- 48949090561
- A Proposal for Task Parallelism in OpenMP
- Chapman, B, Zheng, W, Gao, G.R, Sato, M, Ayguadé, E, Wang, D, eds, IWOMP 2007, Springer, Heidelberg
- Ayguadé, E., Copty, N., Duran, A., Hoeflinger, J., Lin, Y., Massaioli, F., Su, E., Unnikrishnan, P., Zhang, G.: A Proposal for Task Parallelism in OpenMP. In: Chapman, B., Zheng, W., Gao, G.R., Sato, M., Ayguadé, E., Wang, D. (eds.) IWOMP 2007. LNCS, vol. 4935, pp. 1-12. Springer, Heidelberg (2008)
- (2008) LNCS , vol.4935 , pp. 1-12
- Ayguadé, E.¹ Copty, N.² Duran, A.³ Hoeflinger, J.⁴ Lin, Y.⁵ Massaioli, F.⁶ Su, E.⁷ Unnikrishnan, P.⁸ Zhang, G.⁹

16
- 35649006026
- CellSs: Making it easier to program the Cell Broadband Engine processor
- Perez, J.M., Bellens, P., Badia, R.M., Labarta, J.: CellSs: Making it easier to program the Cell Broadband Engine processor. IBM Journal of Research and Development 51(5), 593-604 (2007)
- (2007) IBM Journal of Research and Development , vol.51 , Issue.5 , pp. 593-604
- Perez, J.M.¹ Bellens, P.² Badia, R.M.³ Labarta, J.⁴

17
- 67650056929
- Extending the OpenMP Tasking Model to Allow Dependent Tasks
- Eigenmann, R, de Supinski, B.R, eds, IWOMP 2008, Springer, Heidelberg
- Duran, A., Pérez, J.M., Ayguadé, E., Badia, R.M., Labarta, J.: Extending the OpenMP Tasking Model to Allow Dependent Tasks. In: Eigenmann, R., de Supinski, B.R. (eds.) IWOMP 2008. LNCS, vol. 5004, pp. 111-122. Springer, Heidelberg (2008)
- (2008) LNCS , vol.5004 , pp. 111-122
- Duran, A.¹ Pérez, J.M.² Ayguadé, E.³ Badia, R.M.⁴ Labarta, J.⁵

18
- 77949650408
- Dolbeau, R., Bihan, S., Bodin, F.: HMPP: A Hybrid Multi-core Parallel Programming Environment. In: Workshop on General Processing Using GPUs (2006)
- Dolbeau, R., Bihan, S., Bodin, F.: HMPP: A Hybrid Multi-core Parallel Programming Environment. In: Workshop on General Processing Using GPUs (2006)

19
- 77949644831
- January 2009
- IBM Corporation: XL C/C++ for Multicore Acceleration (January 2009), http://www-01.ibm.com/software/awdtools/xlcpp/multicore/
- C++ for Multicore Acceleration
- XL, C.¹

20
- 85121084005
- International Journal of Parallel Programming
- O'Brien, K., O'Brien, K., Sura, Z., Chen, T., Zhang, T.: Supporting OpenMP on Cell. International Journal of Parallel Programming (2008)
- (2008) Supporting OpenMP on Cell
- O'Brien, K.¹ O'Brien, K.² Sura, Z.³ Chen, T.⁴ Zhang, T.⁵

21
- 54249087677
- Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Sura, Z., Chen, T., Zhang, T., O'Brien, K., O'Brien, K.: A Novel Asynchronous Software Cache Implementation for the CELL/BE Processor. In: Adve, V., Garzarán, M.J., Petersen, P. (eds.) LCPC 2007. LNCS, 5234, pp. 125-140. Springer, Heidelberg (2008)
- Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Sura, Z., Chen, T., Zhang, T., O'Brien, K., O'Brien, K.: A Novel Asynchronous Software Cache Implementation for the CELL/BE Processor. In: Adve, V., Garzarán, M.J., Petersen, P. (eds.) LCPC 2007. LNCS, vol. 5234, pp. 125-140. Springer, Heidelberg (2008)

22
- 77949598285
- Group, December 2008
- Group, T.P.: PGI Fortran & C Accelerator Programming Model (December 2008), http://www.pgroup.com/lit/whitepapers/pgi-whitepaper-accpre.pdf
- T.P.: PGI Fortran & C Accelerator Programming Model

23
- 56749157122
- Dma-based prefetching for i/o-intensive workloads on the cell architecture
- ACM, New York
- Rafique, M.M., Butt, A.R., Nikolopoulos, D.S.: Dma-based prefetching for i/o-intensive workloads on the cell architecture. In: CF 2008: Proceedings of the 2008 conference on Computing frontiers, pp. 23-32. ACM, New York (2008)
- (2008) CF 2008: Proceedings of the 2008 conference on Computing frontiers , pp. 23-32
- Rafique, M.M.¹ Butt, A.R.² Nikolopoulos, D.S.³

24
- 43449138842
- Prefetching irregular references for software cache on cell
- ACM, New York
- Chen, T., Zhang, T., Sura, Z., Gonzalez, M.: Prefetching irregular references for software cache on cell. In: CGO 2008: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization, pp. 155-164. ACM, New York (2008)
- (2008) CGO 2008: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization , pp. 155-164
- Chen, T.¹ Zhang, T.² Sura, Z.³ Gonzalez, M.⁴

25
- 77949601287
- SPENK: Adding Another Level of Parallelism on the Cell Broadband Engine
- ACM, New York
- Ahmed, M.F., Ammar, R.A., Rajasekaran, S.: SPENK: Adding Another Level of Parallelism on the Cell Broadband Engine. In: IFMT 2008: Proceedings of the 1st international forum on Next-generation multicore/manycore technologies, pp. 1-10. ACM, New York (2008)
- (2008) IFMT 2008: Proceedings of the 1st international forum on Next-generation multicore/manycore technologies , pp. 1-10
- Ahmed, M.F.¹ Ammar, R.A.² Rajasekaran, S.³

26
- 77952225553
- Beltran, V., Carrera, D., Torres, J., Ayguadé, E.: CellMT: A Cooperative Multi-threading Library for the Cell/B.E. In: HiPC 2009: Proceedings of the 16th Annual IEEE International Conference on High Performance Computing. IEEE Computer Society, Los Alamitos (2009)
- Beltran, V., Carrera, D., Torres, J., Ayguadé, E.: CellMT: A Cooperative Multi-threading Library for the Cell/B.E. In: HiPC 2009: Proceedings of the 16th Annual IEEE International Conference on High Performance Computing. IEEE Computer Society, Los Alamitos (2009)

27
- 77949625831
- Weltzer, J., Silha, E., May, C., Frey, B., Furukawa, J., Frazier, G.: PowerPC Architecture Book V. 2.02. IBM Corporation (2005)
- Weltzer, J., Silha, E., May, C., Frey, B., Furukawa, J., Frazier, G.: PowerPC Architecture Book V. 2.02. IBM Corporation (2005)

28
- 0345025793
- McCalpin, J.D.: STREAM: Sustainable Memory Bandwidth in High Performance Computers (2008), http://www.cs.virginia.edu/stream
- (2008) STREAM: Sustainable Memory Bandwidth in High Performance Computers
- McCalpin, J.D.¹

29
- 77949634083
- Istanbul
- Corder, S., Sheumaker, K.: STREAM Benchmarking: Intel Xeon 5500 Nehalem vs AMD Opteron 2400 Istanbul (2009), http://www.advancedclustering.com/company- blog/stream-benchmarking.html
- (2009) STREAM Benchmarking: Intel Xeon 5500 Nehalem vs AMD Opteron 2400
- Corder, S.¹ Sheumaker, K.²

30
- 77949601698
- Corporation
- Corporation, I.: Intel Xeon Processor 5000 Sequence (2009), http://www.intel. com/p/en-US/products/server/processor/xeon5000
- (2009) I.: Intel Xeon Processor 5000 Sequence

31
- 38149061132
- Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Labarta, J.: Runtime Address Space Computation for SDSM Systems. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, 4382, pp. 330-344. Springer, Heidelberg (2007)
- Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Labarta, J.: Runtime Address Space Computation for SDSM Systems. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, vol. 4382, pp. 330-344. Springer, Heidelberg (2007)

32
- 38149004865
- Chen, T., Sura, Z., O'Brien, K., O'Brien, J.K.: Optimizing the Use of Static Buffers for DMA on a CELL Chip. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, 4382, pp. 314-329. Springer, Heidelberg (2007)
- Chen, T., Sura, Z., O'Brien, K., O'Brien, J.K.: Optimizing the Use of Static Buffers for DMA on a CELL Chip. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, vol. 4382, pp. 314-329. Springer, Heidelberg (2007)

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.