-
1
-
-
34548012397
-
Scheduling FFT computation on SMP and multicore systems
-
New York, NY, USA, ACM. ISBN 978-1-59593-768-1
-
Ayaz Ali, Lennart Johnsson, and Jaspal Subhlok. Scheduling FFT computation on SMP and multicore systems. In Proceedings of the ACM/IEEE Conference on Supercomputing, pages 293-301, New York, NY, USA, 2007. ACM. ISBN 978-1-59593-768-1.
-
(2007)
Proceedings of the ACM/IEEE Conference on Supercomputing
, pp. 293-301
-
-
Ali, A.1
Johnsson, L.2
Subhlok, J.3
-
2
-
-
67650825996
-
-
Ed Anderson, Zhaojun Bai, Christian Bischof, Susan Blackford, James Demmel, Jack Dongarra, Jeremy Du Croz, Anne Greenbaum, Sven Hammarling, A. McKenney, and Danny Sorensen. LAPACK Users' Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, third edition, 1999. ISBN 0-89871-447-8.
-
Ed Anderson, Zhaojun Bai, Christian Bischof, Susan Blackford, James Demmel, Jack Dongarra, Jeremy Du Croz, Anne Greenbaum, Sven Hammarling, A. McKenney, and Danny Sorensen. LAPACK Users' Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, third edition, 1999. ISBN 0-89871-447-8.
-
-
-
-
3
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
New York, NY, USA, ACM. ISBN 0-89791-902-5
-
Jeff Bilmes, Krste Asanovic, Chee-Whye Chin, and Jim Demmel. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In Proceedings of the ACM/IEEE Conference on Supercomputing, pages 340-347, New York, NY, USA, 1997. ACM. ISBN 0-89791-902-5.
-
(1997)
Proceedings of the ACM/IEEE Conference on Supercomputing
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.-W.3
Demmel, J.4
-
4
-
-
0029193257
-
High-level optimization via automated statistical modeling
-
New York, NY, USA, ACM. ISBN 0-89791-701-6
-
Eric A. Brewer. High-level optimization via automated statistical modeling. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 80-91, New York, NY, USA, 1995. ACM. ISBN 0-89791-701-6.
-
(1995)
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 80-91
-
-
Brewer, E.A.1
-
5
-
-
0003252789
-
Applied Numerical Linear Algebra
-
August
-
James W. Demmel. Applied Numerical Linear Algebra. SIAM, August 1997.
-
(1997)
SIAM
-
-
Demmel, J.W.1
-
6
-
-
20744449792
-
-
Matteo Frigo and Steven G. Johnson. The design and implementation of FFTW3. Proceedings of the IEEE, 93(2):216-231, February 2005. Invited paper, special issue on Program Generation, Optimization, and Platform Adaptation.
-
Matteo Frigo and Steven G. Johnson. The design and implementation of FFTW3. Proceedings of the IEEE, 93(2):216-231, February 2005. Invited paper, special issue on "Program Generation, Optimization, and Platform Adaptation".
-
-
-
-
8
-
-
0031622953
-
The implementation of the Cilk-5 multithreaded language
-
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Montreal, Quebec, Canada, Jun, May
-
Matteo Frigo, Charles E. Leiserson, and Keith H. Randall. The implementation of the Cilk-5 multithreaded language. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 212-223, Montreal, Quebec, Canada, Jun 1998. Proceedings published ACM SIGPLAN Notices, Vol. 33, No. 5, May, 1998.
-
(1998)
Proceedings published ACM SIGPLAN Notices
, vol.33
, Issue.5
, pp. 212-223
-
-
Frigo, M.1
Leiserson, C.E.2
Randall, K.H.3
-
9
-
-
0039435412
-
-
John A. Gunnels, Fred G. Gustavson, Greg M. Henry, and Robert A. van de Geijn. FLAME: Formal Linear Algebra Methods Environment. ACM Transactions on Mathematical Software, 27(4):422-455, December 2001. ISSN 0098-3500.
-
John A. Gunnels, Fred G. Gustavson, Greg M. Henry, and Robert A. van de Geijn. FLAME: Formal Linear Algebra Methods Environment. ACM Transactions on Mathematical Software, 27(4):422-455, December 2001. ISSN 0098-3500.
-
-
-
-
15
-
-
49149088569
-
-
Justin Mazzola Paluska, Hubert Pham, Umar Saif, Grace Chau, Chris Terman, and Steve Ward. Structured decomposition of adaptive applications. In Proceedings of the Annual IEEE International Conference on Pervasive Computing and Communications, pages 1-10, Washington, DC, USA, 2008. IEEE Computer Society. ISBN 978-0-7695-3113-7.
-
Justin Mazzola Paluska, Hubert Pham, Umar Saif, Grace Chau, Chris Terman, and Steve Ward. Structured decomposition of adaptive applications. In Proceedings of the Annual IEEE International Conference on Pervasive Computing and Communications, pages 1-10, Washington, DC, USA, 2008. IEEE Computer Society. ISBN 978-0-7695-3113-7.
-
-
-
-
16
-
-
19344368072
-
-
Markus Puschel, Jose M. F. Moura, Jeremy R. Johnson, David Padua, Manuela M. Veloso, Bryan W. Singer, Jianxin Xiong, Aca Gacic Franz Franchetti, Robbert W. Johnson Yevgen Voronenko, Kang Chen, and Nicholas Rizzolo. SPIRAL: Code generation for DSP transforms. In Proceedings of the IEEE, 93, pages 232-275. IEEE, Feb 2005.
-
Markus Puschel, Jose M. F. Moura, Jeremy R. Johnson, David Padua, Manuela M. Veloso, Bryan W. Singer, Jianxin Xiong, Aca Gacic Franz Franchetti, Robbert W. Johnson Yevgen Voronenko, Kang Chen, and Nicholas Rizzolo. SPIRAL: Code generation for DSP transforms. In Proceedings of the IEEE, volume 93, pages 232-275. IEEE, Feb 2005.
-
-
-
-
18
-
-
31844454218
-
A framework for adaptive algorithm selection in STAPL
-
New York, NY, USA, ACM. ISBN 1-59593-080-9
-
Nathan Thomas, Gabriel Tanase, Olga Tkachyshyn, Jack Perdue, Nancy M. Amato, and Lawrence Rauchwerger. A framework for adaptive algorithm selection in STAPL. In Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 277-288, New York, NY, USA, 2005. ACM. ISBN 1-59593-080-9.
-
(2005)
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 277-288
-
-
Thomas, N.1
Tanase, G.2
Tkachyshyn, O.3
Perdue, J.4
Amato, N.M.5
Rauchwerger, L.6
-
20
-
-
0034819518
-
High-level adaptive program optimization with adapt
-
ISSN 0362-1340
-
Michael Voss and Rudolf Eigenmann. High-level adaptive program optimization with adapt. ACM SIGPLAN Notices, 36(7):93-102, 2001. ISSN 0362-1340.
-
(2001)
ACM SIGPLAN Notices
, vol.36
, Issue.7
, pp. 93-102
-
-
Voss, M.1
Eigenmann, R.2
-
21
-
-
1542710758
-
Statistical models for empirical search-based performance tuning
-
ISSN 1094-3420
-
Richard Vuduc, James W. Demmel, and Jeff A. Bilmes. Statistical models for empirical search-based performance tuning. International Journal of High Performance Computing Applications, 18(1):65-94, 2004. ISSN 1094-3420.
-
(2004)
International Journal of High Performance Computing Applications
, vol.18
, Issue.1
, pp. 65-94
-
-
Vuduc, R.1
Demmel, J.W.2
Bilmes, J.A.3
-
22
-
-
24344485098
-
-
Richard Vuduc, James W. Demmel, and Katherine A. Yelick. OSKI: A library of automatically tuned sparse matrix kernels. In Proceedings of the Scientific Discovery through Advanced Computing Conference, Journal of Physics: Conference Series, San Francisco, CA, USA, June 2005. Institute of Physics Publishing.
-
Richard Vuduc, James W. Demmel, and Katherine A. Yelick. OSKI: A library of automatically tuned sparse matrix kernels. In Proceedings of the Scientific Discovery through Advanced Computing Conference, Journal of Physics: Conference Series, San Francisco, CA, USA, June 2005. Institute of Physics Publishing.
-
-
-
-
23
-
-
84943297310
-
Automatically tuned linear algebra software
-
Washington, DC, USA, IEEE Computer Society. ISBN 0-89791-984-X
-
Richard Clint Whaley and Jack J. Dongarra. Automatically tuned linear algebra software. In Proceedings of the ACM/IEEE Conference on Supercomputing, pages 1-27, Washington, DC, USA, 1998. IEEE Computer Society. ISBN 0-89791-984-X.
-
(1998)
Proceedings of the ACM/IEEE Conference on Supercomputing
, pp. 1-27
-
-
Clint Whaley, R.1
Dongarra, J.J.2
-
24
-
-
13244279577
-
Minimizing development and maintenance costs in supporting persistently optimized BLAS
-
February
-
Richard Clint Whaley and Antoine Petitet. Minimizing development and maintenance costs in supporting persistently optimized BLAS. Software: Practice and Experience, 35(2):101-121, February 2005.
-
(2005)
Software: Practice and Experience
, vol.35
, Issue.2
, pp. 101-121
-
-
Clint Whaley, R.1
Petitet, A.2
-
25
-
-
67650832222
-
-
Samuel Webb Williams, Andrew Waterman, and David A. Patterson. Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Technical Report UCB/EECS-2008-134, EECS Department, University of California, Berkeley, Oct 2008.
-
Samuel Webb Williams, Andrew Waterman, and David A. Patterson. Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Technical Report UCB/EECS-2008-134, EECS Department, University of California, Berkeley, Oct 2008.
-
-
-
-
26
-
-
0034826555
-
SPL: A language and compiler for DSP algorithms
-
New York, NY, USA, ACM. ISBN 1-58113-414-2
-
Jianxin Xiong, Jeremy Johnson, Robert Johnson, and David Padua. SPL: a language and compiler for DSP algorithms. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 298-308, New York, NY, USA, 2001. ACM. ISBN 1-58113-414-2.
-
(2001)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 298-308
-
-
Xiong, J.1
Johnson, J.2
Johnson, R.3
Padua, D.4
-
28
-
-
0038378242
-
A comparison of empirical and model-driven optimization
-
New York, NY, USA, ACM. ISBN 1-58113-662-5
-
Kamen Yotov, Xiaoming Li, Gang Ren, Michael Cibulskis, Gerald DeJong, Maria Garzaran, David Padua, Keshav Pingali, Paul Stodghill, and Peng Wu. A comparison of empirical and model-driven optimization. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 63-76, New York, NY, USA, 2003. ACM. ISBN 1-58113-662-5.
-
(2003)
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 63-76
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Cibulskis, M.4
DeJong, G.5
Garzaran, M.6
Padua, D.7
Pingali, K.8
Stodghill, P.9
Wu, P.10
-
29
-
-
10444220869
-
An adaptive algorithm selection framework
-
Washington, DC, USA, IEEE Computer Society. ISBN 0-7695-2229-7
-
Hao Yu, Dongmin Zhang, and Lawrence Rauchwerger. An adaptive algorithm selection framework. In Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, pages 278-289, Washington, DC, USA, 2004. IEEE Computer Society. ISBN 0-7695-2229-7.
-
(2004)
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques
, pp. 278-289
-
-
Yu, H.1
Zhang, D.2
Rauchwerger, L.3
|