SCOPUS 정보 검색 플랫폼

Volumn 53, Issue 6, 2011, Pages 266-273

Algorithm Engineering Challenges in Multicore and Manycore Systems;Algorithm Engineering Herausforderungen bei Mehrkern- und Manycore-Systemen

(3) Kang, Seunghwa a Ediger, David a Bader, David A a

a GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Algorithmen; GPU; parallele Programmierung; performance tuning

Indexed keywords

MEMORY ARCHITECTURE;

ALGORITHM ENGINEERING; ALGORITHMEN; ENGINEERING CHALLENGES; HIGH POWER EFFICIENCIES; MANY-CORE; MANYCORE SYSTEMS; MULTI-CORE SYSTEMS; PARALLELE PROGRAMMIERUNG; PERFORMANCE; PERFORMANCE TUNING;

GRAPHICS PROCESSING UNIT;

EID: 84884306681 PISSN: 16112776 EISSN: 21967032 Source Type: Journal
DOI: 10.1524/itit.2011.0652 Document Type: Article

Times cited : (2)

References (13)

1
- 35648995516
- The landscape of parallel computing research: A view from berkeley
- Dec
- K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S.W. Williams, and K. A. Yelick. The landscape of parallel computing research: A view from berkeley. Technical Report UCB/EECS-2006-183, Dec 2006.
- (2006) Technical Report UCB/EECS- , pp. 2006-2183
- Asanovic, K.¹ Bodik, R.² Catanzaro, B.C.³ Gebis, J.J.⁴ Husbands, P.⁵ Keutzer, K.⁶ Patterson, D.A.⁷ Plishker, W.L.⁸ Shalf, J.⁹ Williams, S.W.¹⁰ Yelick, K.A.¹¹

2
- 33745125067
- On the architectural requirements for efficient execution of graph algorithms
- Oslo, Norway Jun
- D. A. Bader and G. Cong. On the architectural requirements for efficient execution of graph algorithms. In: Proc. of Int?l Conf. on Parallel Processing, pages 547-556, Oslo, Norway, Jun 2005.
- (2005) Proc. Of Int?l Conf. Of Parallel Processing , pp. 547-556
- Bader, D.A.¹ Cong, G.²

3
- 24944539760
- High-performance algorithm engineering for parallel computation
- D. A. Bader, B.M. E. Moret, and P. Sanders. High-Performance Algorithm Engineering for Parallel Computation. In: Experimental Algorithmics, LNCS 2547, pages 1-23, 2002.
- (2002) Experimental Algorithmics LNCS 2547 , pp. 1-23
- Bader, D.A.¹ Moret, B.M.E.² Sanders, P.³

4
- 0027541302
- Automatic program parallelization
- U. Banerjee, R. Eigenmann, A. Nicolau, and D. A. Padua. Automatic program parallelization. In: Proc. of the IEEE, 81(2):211-243, 1993.
- (1993) Proc. Of the IEEE , vol.81 , Issue.2 , pp. 211-243
- Banerjee, U.¹ Eigenmann, R.² Nicolau, A.³ Padua, D.A.⁴

5
- 78650822594
- Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method. In: Proc. Of
- A. Chandramowlishwaran, K. Madduri, and R. Vuduc. Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method. In: Proc. of 2010 ACM/IEEE Int?l Conf. on High Performance Computing, Networking, Storage ansd Analysis, New Orleans, LA, USA, Nov 2010.
- (2010) ACM/IEEE Int?l Conf. On High Performance Computing, Networking, Storage Ansd Analysis, New Orleans, LA, USA, Nov 2010
- Chandramowlishwaran, A.¹ Madduri, K.² Vuduc, R.³

6
- 84966560559
- Parallel wavelet transform for large scale image processing. In
- Ft. Lauderdale, FL, USA Apr
- D. Chaver, M. Prieto, L. Pinuel, and F. Tirado. Parallel wavelet transform for large scale image processing. In: Proc. of the Int?l Parallel and Distributed Processing Symposium (IPDPS), pages 4-9, Ft. Lauderdale, FL, USA, Apr 2002.
- (2002) Proc. Of the Int?l Parallel and Distributed Processing Symposium (IPDPS , pp. 4-9
- Chaver, D.¹ Prieto, M.² Pinuel, L.³ Tirado, F.⁴

7
- 84945318819
- 2-d wavelet transform enhancement on general-purpose microprocessors: Memory hierarchy and simd parallelism exploitation
- Bangalore, India Dec
- D. Chaver, C. Tenllado, L. Piñuel, M. Prieto, and F. Tirado. 2-D Wavelet Transform Enhancement on General-Purpose Microprocessors: Memory Hierarchy and SIMD Parallelism Exploitation. In: Proc. of Int?l Conf. on High Performance Computing, LNCS 2552, pages 9-21, Bangalore, India, Dec 2002.
- (2002) Proc. Of Int?l Conf. Of High Performance Computing LNCS 2552 , pp. 9-21
- Chaver, D.¹ Tenllado, C.² Piñuel, L.³ Prieto, M.⁴ Tirado, F.⁵

8
- 70449975572
- Understanding the design trade-offs among current multicore systems for numerical computations. In
- May
- S. Kang, D. A. Bader, and R. Vuduc. Understanding the design trade-offs among current multicore systems for numerical computations. In: Proc. of Int?l Symp. on Parallel and Distributed Processing, Rome, Italy, May 2009.
- (2009) Proc. Of Int?l Symp. Of Parallel and Distributed Processing, Rome, Italy
- Kang, S.¹ Bader, D.A.² Vuduc, R.³

9
- 73649141632
- A NUMA API for Linux
- Apr
- A. Kleen. A NUMA API for Linux. Technical Report, Apr 2005.
- (2005) Technical Report
- Kleen, A.¹

10
- 60649099576
- Optimizing matrix multiplication for a short-vector SIMD architecture-CELL processor
- J. Kurzak, W. Alvaro, and J. Dongarra. Optimizing matrix multiplication for a short-vector SIMD architecture-CELL processor. In: Parallel Computing, 35(3):138-150, 2009.
- (2009) Parallel Computing , vol.35 , Issue.3 , pp. 138-150
- Kurzak, J.¹ Alvaro, W.² Dongarra, J.³

11
- 33947328378
- Performance, power efficiency and scalability of asymmetric cluster chip multiprocessors
- T. Y. Morad, U. C. Weiser, A. Kolodnyt, M. Valero, and E. Ayguade. Performance, power efficiency and scalability of asymmetric cluster chip multiprocessors. In: Computer Architecture Letters, 5(1):14-17, 2006.
- (2006) Computer Architecture Letters , vol.5 , Issue.1 , pp. 14-17
- Morad, T.Y.¹ Weiser, U.C.² Kolodnyt, A.³ Valero, M.⁴ Ayguade, E.⁵

12
- 56749158843
- Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In
- Nov
- S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In: Proc. of Int?l Conf. on Supercomputing, Reno, NV, Nov 2007.
- (2007) Proc. Of Int?l Conf. Of Supercomputing, Reno, NV
- Williams, S.¹ Oliker, L.² Vuduc, R.³ Shalf, J.⁴ Yelick, K.⁵ Demmel, J.⁶

13
- 79551702326
- Advanced MRI reconstruction toolbox with accelerating on GPU
- Jan
- X.-L. Wu, Y. Zhuo, J. Gai, F. Lam, M. Fu, J. P. Haldar, W.-M. Hwu, Z.-P. Liang, and B. P. Sutton. Advanced MRI reconstruction toolbox with accelerating on GPU. In: Proc. of Conf. on Parallel Processing for Imaging Applications, San Francisco, CA, Jan 2011.
- (2011) Proc. Of Conf. Of Parallel Processing for Imaging Applications, San Francisco, CA
- Wu, X.-L.¹ Zhuo, Y.² Gai, J.³ Lam, F.⁴ Fu, M.⁵ Haldar, J.P.⁶ Hwu, W.-M.⁷ Liang, Z.-P.⁸ Sutton, B.P.⁹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.