-
1
-
-
85060036181
-
Validity of the single processor approach to achieving large scale computing capabilities
-
Apr.
-
Gene M. Amdahl. Validity of the single processor approach to achieving large scale computing capabilities. In Proc. AFIPS 1967 Spring Joint Computer Conf., pages 483-485, Apr. 1967.
-
(1967)
Proc. AFIPS 1967 Spring Joint Computer Conf.
, pp. 483-485
-
-
Amdahl, G.M.1
-
2
-
-
28044454088
-
A performance prediction framework for scientific applications
-
February
-
Laura Carrington, Allan Snavely, and Nicole Wolter. A performance prediction framework for scientific applications. Future Generation Computer Systems, 22:3:336-346, February 2006.
-
(2006)
Future Generation Computer Systems
, vol.22
, Issue.3
, pp. 336-346
-
-
Carrington, L.1
Snavely, A.2
Wolter, N.3
-
3
-
-
77952568782
-
The performance effect of multi-core on scientific applications
-
May
-
Jonathan Carter, Yun (Helen) He, John Shalf, Hongzhang Shan, and Harvey Wasserman. The performance effect of multi-core on scientific applications. In Cray User Group 2007 conference (CUG 2007), May 2007.
-
Cray User Group 2007 Conference (CUG 2007), May 2007
-
-
Carter, J.1
He, Y.2
Shalf, J.3
Shan, H.4
Wasserman, H.5
-
4
-
-
31844441256
-
An evaluation of global address space languages: Co-Array Fortran and Unified Parallel C
-
Keshav Pingali, Katherine A. Yelick, and Andrew S. Grimshaw, editors, ACM
-
Cristian Coarfa, Yuri Dotsenko, John M. Mellor-Crummey, François Cantonnet, Tarek A. El-Ghazawi, Ashrujit Mohanti, Yiyi Yao, and Daniel G. Chavarría-Miranda. An evaluation of global address space languages: Co-Array Fortran and Unified Parallel C. In Keshav Pingali, Katherine A. Yelick, and Andrew S. Grimshaw, editors, PPOPP, pages 36-47. ACM, 2005.
-
(2005)
PPOPP
, pp. 36-47
-
-
Coarfa, C.1
Dotsenko, Y.2
Mellor-Crummey, J.M.3
Cantonnet, F.4
El-Ghazawi, T.A.5
Mohanti, A.6
Yao, Y.7
Chavarría-Miranda, D.G.8
-
5
-
-
51049115984
-
Data access optimizations for highly threaded multi-core CPUs with multiple memory controllers
-
April
-
G. Hager, T. Zeiser, and G. Wellein. Data access optimizations for highly threaded multi-core CPUs with multiple memory controllers. In Proceedings of IEEE International Symposium on Parallel and Distributed Processing, 2008 (IPDPS 2008), pages 1-7, April 2008.
-
(2008)
Proceedings of IEEE International Symposium on Parallel and Distributed Processing, 2008 (IPDPS 2008)
, pp. 1-7
-
-
Hager, G.1
Zeiser, T.2
Wellein, G.3
-
7
-
-
0003834102
-
-
Prentice-Hall, Inc., Upper Saddle River, NJ, USA
-
Edward D. Lazowska, John Zahorjan, G. Scott Graham, and Kenneth C. Sevcik. Quantitative system performance: computer system analysis using queueing network models. Prentice-Hall, Inc., Upper Saddle River, NJ, USA, 1984.
-
(1984)
Quantitative System Performance: Computer System Analysis Using Queueing Network Models
-
-
Lazowska, E.D.1
Zahorjan, J.2
Graham, G.S.3
Sevcik, K.C.4
-
8
-
-
0000861722
-
A proof for the queuing formula: L = λW
-
J.D.C. Little. A proof for the queuing formula: L = λW. Operations Research, 9, 1961.
-
(1961)
Operations Research
, vol.9
-
-
Little, J.D.C.1
-
16
-
-
48149094931
-
Memory hierarchy performance measurement of commercial dual-core desktop processors
-
August
-
Lu Peng, Jih-Kwon Peir, Tribuvan K. Prakash, Carl Staelin, Yen-Kuang Chen, and David Koppelman. Memory hierarchy performance measurement of commercial dual-core desktop processors. Journal of Systems Architecture, 54:8:816-828, August 2008.
-
(2008)
Journal of Systems Architecture
, vol.54
, Issue.8
, pp. 816-828
-
-
Peng, L.1
Peir, J.-K.2
Prakash, T.K.3
Staelin, C.4
Chen, Y.-K.5
Koppelman, D.6
-
17
-
-
77952569462
-
-
RENCI Technical Report TR-08-107, Renaissance Computing Institute
-
Allan Porterfield, Rob Fowler, Anirban Mandal, and Min Yeol Lim. Performance consistency on multi-socket AMD Opteron systems. RENCI Technical Report TR-08-107, Renaissance Computing Institute, 2008.
-
(2008)
Performance Consistency on Multi-socket AMD Opteron Systems
-
-
Porterfield, A.1
Fowler, R.2
Mandal, A.3
Lim, M.Y.4
-
18
-
-
80053252314
-
A framework for performance modeling and prediction
-
Allan Snavely, Laura Carrington, Nicole Wolter, Jesus Labarta, Rosa Badia, and Avi Purkayastha. A framework for performance modeling and prediction. In Proceedings of the 2002 ACM/IEEE conference on Supercomputing, pages 1-17, 2002.
-
(2002)
Proceedings of the 2002 ACM/IEEE Conference on Supercomputing
, pp. 1-17
-
-
Snavely, A.1
Carrington, L.2
Wolter, N.3
Labarta, J.4
Badia, R.5
Purkayastha, A.6
-
19
-
-
56749185811
-
A genetic algorithms approach to modeling the performance of memory-bound computations
-
Becky Verastegui, editor, ACM Press
-
Mustafa M. Tikir, Laura Carrington, Erich Strohmaier, and Allan Snavely. A genetic algorithms approach to modeling the performance of memory-bound computations. In Becky Verastegui, editor, SC, page 47. ACM Press, 2007.
-
(2007)
SC
, pp. 47
-
-
Tikir, M.M.1
Carrington, L.2
Strohmaier, E.3
Snavely, A.4
-
20
-
-
65649092566
-
PERI: Auto-tuning memory intensive kernels for multicore
-
S. Williams, K. Datta, J. Carter, L. Oliker, J. Shalf, K. Yelick, and D. Bailey. PERI: auto-tuning memory intensive kernels for multicore. Journal of Physics: Conference Series, 125 012001, 2008.
-
(2008)
Journal of Physics: Conference Series
, vol.125
, pp. 012001
-
-
Williams, S.1
Datta, K.2
Carter, J.3
Oliker, L.4
Shalf, J.5
Yelick, K.6
Bailey, D.7
-
21
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In Proceedings of the 2007 ACM/IEEE conference on Supercomputing, November 2007.
-
Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, November 2007
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
23
-
-
77952562358
-
-
Technical Report UCB/EECS-2008-2134, EECS Department, University of California, Berkeley, Oct
-
Samuel Webb Williams, Andrew Waterman, and David A. Patterson. Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Technical Report UCB/EECS-2008-2134, EECS Department, University of California, Berkeley, Oct 2008.
-
(2008)
Roofline: An Insightful Visual Performance Model for Floating-point Programs and Multicore Architectures.
-
-
Williams, S.W.1
Waterman, A.2
Patterson, D.A.3
-
24
-
-
77952557877
-
The roofline model: A pedagogical tool for auto-tuning kernels on multicore architectures
-
S.W. Williams, D.A. Patterson, L. Oliker, J. Shalf, and K. Yelick. The roofline model: A pedagogical tool for auto-tuning kernels on multicore architectures. In Hot Chips, A Symposium on High Performance Chips, 2008. Stanford, CA.
-
Hot Chips, A Symposium on High Performance Chips, 2008. Stanford, CA
-
-
Williams, S.W.1
Patterson, D.A.2
Oliker, L.3
Shalf, J.4
Yelick, K.5