-
2
-
-
0027756433
-
URSA: A unified resource allocator for registers and functional units in vliw architectures
-
Orlando, FL, Jan
-
D. A. Berson, R. Gupta, and M. L. Soffa. URSA: A unified resource allocator for registers and functional units in vliw architectures. In Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism, Orlando, FL, Jan. 1993.
-
(1993)
Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism
-
-
Berson, D.A.1
Gupta, R.2
Soffa, M.L.3
-
3
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
J. Bilmes, K. Asanović, C.-W. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In International Conference on Supercomputing, 1997.
-
(1997)
International Conference on Supercomputing
-
-
Bilmes, J.1
Asanović, K.2
Chin, C.-W.3
Demmel, J.4
-
6
-
-
84949671069
-
Tempo: Specializing systems applications and beyond
-
C. Consel, L. Hornof, J. Lawall, R. Marlet, G. Muller, J. Noyé, S. Thibault, and N. Volanschi. Tempo: Specializing systems applications and beyond. In Symposium on Partial Evaluation, 1998.
-
(1998)
Symposium on Partial Evaluation
-
-
Consel, C.1
Hornof, L.2
Lawall, J.3
Marlet, R.4
Muller, G.5
Noyé, J.6
Thibault, S.7
Volanschi, N.8
-
8
-
-
0029712698
-
`C: A language for high-level, efficient, and machine-independent dynamic code generation
-
Saint Petersburg, FL, Jan
-
D. R. Engler, W. C. Hsieh, and M. F. Kaashoek. `C: A language for high-level, efficient, and machine-independent dynamic code generation. In Principles of Programming Languages, Saint Petersburg, FL, Jan. 1996.
-
(1996)
Principles of Programming Languages
-
-
Engler, D.R.1
Hsieh, W.C.2
Kaashoek, M.F.3
-
9
-
-
0001878903
-
On the partial computation principle
-
A. P. Ershov. On the partial computation principle. Inf. Process. Lett., 1977.
-
(1977)
Inf. Process. Lett
-
-
Ershov, A.P.1
-
12
-
-
0003533835
-
-
Technical Report MIT-LCS-TR-728, Massachusetts Institute of Technology, Laboratory for Computer Science, Sept
-
M. Frigo and S. G. Johnson. The fastest Fourier transform in the west. Technical Report MIT-LCS-TR-728, Massachusetts Institute of Technology, Laboratory for Computer Science, Sept. 1997.
-
(1997)
The Fastest Fourier Transform in the West
-
-
Frigo, M.1
Johnson, S.G.2
-
13
-
-
77949563167
-
Architecture-cognizant divide and conquer algorithms
-
Nov
-
K. S. Gatlin and L. Carter. Architecture-cognizant divide and conquer algorithms. In Supercomputing, Nov. 1999.
-
(1999)
Supercomputing
-
-
Gatlin, K.S.1
Carter, L.2
-
15
-
-
0003526480
-
-
Technical Report UWCSE-97-03-03, University of Washington, Department of Computer Science and Engineering, June
-
B. Grant, M. Mock, M. Philipose, C. Chambers, and S. J. Eggers. DyC: An expressive annotation-directed dynamic compiler for C. Technical Report UWCSE-97-03-03, University of Washington, Department of Computer Science and Engineering, June 1998.
-
(1998)
DyC: An Expressive Annotation-Directed Dynamic Compiler for C
-
-
Grant, B.1
Mock, M.2
Philipose, M.3
Chambers, C.4
Eggers, S.J.5
-
16
-
-
85008989421
-
HINT-a new way to measure computer performance
-
Wailea, Maui, Hawaii, Jan
-
J. L. Gustafson and Q. O. Snell. HINT-a new way to measure computer performance. In HICSS-28, Wailea, Maui, Hawaii, Jan. 1995.
-
(1995)
HICSS-28
-
-
Gustafson, J.L.1
Snell, Q.O.2
-
17
-
-
0029724964
-
Generating machine specific optimizing compilers
-
St. Petersburg, FL
-
R. Hoover and K. Zadeck. Generating machine specific optimizing compilers. In Principles of Programming Languages, St. Petersburg, FL, 1996.
-
(1996)
Principles of Programming Languages
-
-
Hoover, R.1
Zadeck, K.2
-
21
-
-
84949668652
-
-
Technical Report LA-UR-97-3462, Los Alamos National Laboratory
-
O. M. Lubeck, Y. Luo, H. J. Wasserman, and F. Bassetti. Development and validation of a hierarchical memory model incorporating CPU- and memory-operation overlap. Technical Report LA-UR-97-3462, Los Alamos National Laboratory, 1998.
-
(1998)
Development and Validation of a Hierarchical Memory Model Incorporating CPU- and Memory-Operation Overlap
-
-
Lubeck, O.M.1
Luo, Y.2
Wasserman, H.J.3
Bassetti, F.4
-
23
-
-
0002619341
-
lmbench: Portable tools for performance analysis
-
Jan
-
L. McVoy and C. Staelin. lmbench: Portable tools for performance analysis. In Usenix Proceedings, Jan. 1995.
-
(1995)
Usenix Proceedings
-
-
McVoy, L.1
Staelin, C.2
-
26
-
-
0029727692
-
Improving the effectiveness of software prefetching with adaptive execution
-
Boston, MA, Oct
-
R. H. Saavedra and D. Park. Improving the effectiveness of software prefetching with adaptive execution. In Parallel Architectures and Compilation Techniques, Boston, MA, Oct. 1996.
-
(1996)
Parallel Architectures and Compilation Techniques
-
-
Saavedra, R.H.1
Park, D.2
-
27
-
-
0000718681
-
Measuring cache and TLB performance and their effect on benchmark run times
-
Oct
-
R. H. Saavedra and A. J. Smith. Measuring cache and TLB performance and their effect on benchmark run times. IEEE Trans. Comput., 44(10):1223-1235, Oct. 1995.
-
(1995)
IEEE Trans. Comput
, vol.44
, Issue.10
, pp. 1223-1235
-
-
Saavedra, R.H.1
Smith, A.J.2
-
28
-
-
0031140581
-
Automatic selection of high-order transformations in the IBM XL FORTRAN compilers
-
May
-
V. Sarkar. Automatic selection of high-order transformations in the IBM XL FORTRAN compilers. IBM J. Res. Dev., 41(3), May 1997.
-
(1997)
IBM J. Res. Dev
, vol.41
, Issue.3
-
-
Sarkar, V.1
-
30
-
-
0027764718
-
To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
-
Portland, Oregon, Nov
-
O. Temam, E. D. Granston, and W. Jalby. To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts. In Supercomputing’93, pages 410-419, Portland, Oregon, Nov. 1993.
-
(1993)
Supercomputing’93
, pp. 410-419
-
-
Temam, O.1
Granston, E.D.2
Jalby, W.3
-
31
-
-
0026987137
-
Sharlit—a tool for building optimizers
-
San Francisco, California, June 1992. SIGPLAN Notices, July
-
S. W. K. Tjiang and J. L. Hennessy. Sharlit—a tool for building optimizers. In Programming Language Design and Implementation, pages 82-93, San Francisco, California, June 1992. SIGPLAN Notices 27(7), July 1992.
-
(1992)
Programming Language Design and Implementation
, vol.27
, Issue.7
, pp. 82-93
-
-
Tjiang, S.W.K.1
Hennessy, J.L.2
-
34
-
-
84943297310
-
Automatically tuned linear algebra software
-
Nov
-
R. C. Whaley and J. Dongarra. Automatically tuned linear algebra software. In Supercomputing, Nov. 1998.
-
(1998)
Supercomputing
-
-
Whaley, R.C.1
Dongarra, J.2