-
1
-
-
23944518246
-
Near-optimal loop tiling by means of cache miss equations and genetic algorithms
-
Abella, J., González, A., Llosa, J., and Vera, X. 2002. Near-optimal loop tiling by means of cache miss equations and genetic algorithms. In Proceedings of 31st International Conference on Parallel Processing (ICPP'02).
-
(2002)
Proceedings of 31st International Conference on Parallel Processing (ICPP'02)
-
-
Abella, J.1
González, A.2
Llosa, J.3
Vera, X.4
-
2
-
-
0000695168
-
DBMSs on a modern processor: Where does time go?
-
(Edinburgh, Scotland)
-
Ailamaki, A., DeWitt, D. J., Hill, M. D., and Wood, D. A. 1999. DBMSs on a modern processor: where does time go? In Proceedings of the 25th VLDB Conference (Edinburgh, Scotland).
-
(1999)
Proceedings of the 25th VLDB Conference
-
-
Ailamaki, A.1
DeWitt, D.J.2
Hill, M.D.3
Wood, D.A.4
-
4
-
-
1842822179
-
A uniform internal representation for high-level and instruction level transformations
-
Tech rep. UPC-DAC-95-02. Universitat Politècnica de Catalunya, Barcelona, Spain
-
Ayguadé, E. et al. 1995. A uniform internal representation for high-level and instruction level transformations. Tech rep. UPC-DAC-95-02. Universitat Politècnica de Catalunya, Barcelona, Spain.
-
(1995)
-
-
Ayguadé, E.1
-
10
-
-
0032652980
-
Nonlinear array layout for hierarchical memory systems
-
Chatterjee, S., Jain, V. V., Lebeck, A. R., Mundhra, S., and Thottethodi, M. 1999. Nonlinear array layout for hierarchical memory systems. In Proceedings of the ACM International Conference on Supercomputing (Rhodes, Greece) (ICS'99). 444-453.
-
(1999)
Proceedings of the ACM International Conference on Supercomputing (Rhodes, Greece) (ICS'99)
, pp. 444-453
-
-
Chatterjee, S.1
Jain, V.V.2
Lebeck, A.R.3
Mundhra, S.4
Thottethodi, M.5
-
11
-
-
0034832018
-
Exact analysis of the cache behavior of nested loops
-
Chatterjee, S., Parker, E., Hanlon, P. J., and Lebeck, A. R. 2001. Exact analysis of the cache behavior of nested loops. In ACM SIGPLAN '01 Conference on Programming Language Design and Implementation (PLDI'01). 286-297.
-
(2001)
ACM SIGPLAN '01 Conference on Programming Language Design and Implementation (PLDI'01)
, pp. 286-297
-
-
Chatterjee, S.1
Parker, E.2
Hanlon, P.J.3
Lebeck, A.R.4
-
15
-
-
84957027384
-
Automatic parallelization in the polytope model
-
G. R. Perrin and A. Darte, Eds. Lecture Notes in Computer Science; Springer-Verlag, Berlin, Germany
-
Feautrier, P. 1996. Automatic parallelization in the polytope model. In The Data Parallel Programming Model, G. R. Perrin and A. Darte, Eds. Lecture Notes in Computer Science, vol. 1132. Springer-Verlag, Berlin, Germany, 79-103.
-
(1996)
The Data Parallel Programming Model
, vol.1132
, pp. 79-103
-
-
Feautrier, P.1
-
17
-
-
0001366267
-
Strategies for cache and local memory management by global program transformations
-
Gannon, D., Jalby, W., and Gallivan, K. 1988. Strategies for cache and local memory management by global program transformations. J. Parallel. Distrib. Comput. 5, 587-616.
-
(1988)
J. Parallel. Distrib. Comput.
, vol.5
, pp. 587-616
-
-
Gannon, D.1
Jalby, W.2
Gallivan, K.3
-
18
-
-
0027640963
-
Cache performance of the spec92 benchmark suite
-
(Aug.)
-
Gee, J., Hill, M., Pnevmatikatos, D., and Smith, A. 1993. Cache performance of the spec92 benchmark suite. IEEE Micro 13, 4 (Aug.), 17-27.
-
(1993)
IEEE Micro
, vol.13
, Issue.4
, pp. 17-27
-
-
Gee, J.1
Hill, M.2
Pnevmatikatos, D.3
Smith, A.4
-
19
-
-
0345857248
-
Compiler analysis framework for tuning memory behavior
-
Ph.D. dissertation. Princeton University, Princeton, NJ
-
Ghosh, S. 1999. Compiler analysis framework for tuning memory behavior. Ph.D. dissertation. Princeton University, Princeton, NJ.
-
(1999)
-
-
Ghosh, S.1
-
21
-
-
0001714824
-
Cache miss equations: A compiler framework for analyzing and tuning memory behavior
-
Ghosh, S., Martonosi, M., and Malik, S. 1999. Cache miss equations: A compiler framework for analyzing and tuning memory behavior. ACM Trans. Programm. Lang. Syst. 21, 4, 703-746.
-
(1999)
ACM Trans. Programm. Lang. Syst.
, vol.21
, Issue.4
, pp. 703-746
-
-
Ghosh, S.1
Martonosi, M.2
Malik, S.3
-
23
-
-
0026284572
-
Performance debugging shared memory multiprocessor programs with mtool
-
Goldberg, A. and Hennessy, J. 1991. Performance debugging shared memory multiprocessor programs with mtool. In Proceedings of Supercomputing (SC'91). 481-490.
-
(1991)
Proceedings of Supercomputing (SC'91)
, pp. 481-490
-
-
Goldberg, A.1
Hennessy, J.2
-
25
-
-
0348126353
-
Symbolic analysis: A basis for parallelization, optimization and scheduling of programs
-
Springer Verlag, Portland, Ore.
-
Haghighat, M. R. and Polychronopoulos, C. D. 1993. Symbolic analysis: A basis for parallelization, optimization and scheduling of programs. In 1993 Workshop on Languages and Compilers for Parallel Computing (LCPC'93). Springer Verlag, Portland, Ore., 567-585.
-
(1993)
1993 Workshop on Languages and Compilers for Parallel Computing (LCPC'93)
, pp. 567-585
-
-
Haghighat, M.R.1
Polychronopoulos, C.D.2
-
27
-
-
0033077834
-
A linear algebra framework for automatic determination of optimal data layouts
-
(Feb.)
-
Kandemir, M., Choudhary, A., Banerjee, P., and Ramanujam, J. 1999. A linear algebra framework for automatic determination of optimal data layouts. IEEE Trans. Parallel Distrib. Syst. 10, 2 (Feb.), 115-135.
-
(1999)
IEEE Trans. Parallel Distrib. Syst.
, vol.10
, Issue.2
, pp. 115-135
-
-
Kandemir, M.1
Choudhary, A.2
Banerjee, P.3
Ramanujam, J.4
-
28
-
-
1442337836
-
Analyzing and visualizing performance of memory hierarchy
-
ACM Press, New York, NY
-
Kennedy, K., Callahan, D., and Porterfield, A. 1990. Analyzing and visualizing performance of memory hierarchy. In Instrumentation for Visualization. ACM Press, New York, NY.
-
(1990)
Instrumentation for Visualization
-
-
Kennedy, K.1
Callahan, D.2
Porterfield, A.3
-
31
-
-
0028517833
-
Cache profiling and the spec benchmarks: A case study
-
(Oct.)
-
Lebeck, A. and Wood, D. 1994. Cache profiling and the spec benchmarks: A case study. IEEE Comput. 27, 10 (Oct.) 15-26.
-
(1994)
IEEE Comput.
, vol.27
, Issue.10
, pp. 15-26
-
-
Lebeck, A.1
Wood, D.2
-
39
-
-
1842721363
-
-
Padua, D. et al. 1994. Polaris Developer's Document. Available online at http://polaris.uiuc.edu/polaris/polaris-developer/polaris-developer.html
-
(1994)
-
-
Padua, D.1
-
46
-
-
0013009642
-
Multi-configuration simulation algorithms for the evaluation of computer designs
-
Ph.D. thesis, University of Michigan
-
Sugumar, R. 1993. Multi-configuration simulation algorithms for the evaluation of computer designs. Ph.D. thesis, University of Michigan.
-
(1993)
-
-
Sugumar, R.1
-
48
-
-
0027764718
-
To copy or not to copy: A compile-time technique for accessing when data copying should be used to eliminate cache conflicts
-
Temam, O., Granston, E., and Jalby, W. 1993. To copy or not to copy: A compile-time technique for accessing when data copying should be used to eliminate cache conflicts. In Proceedings of Supercomputing (SC'93). 410-419.
-
(1993)
Proceedings of Supercomputing (SC'93)
, pp. 410-419
-
-
Temam, O.1
Granston, E.2
Jalby, W.3
-
49
-
-
0031153459
-
Trace-driven memory simulation: A survey
-
(Sept.)
-
Uhlig, R. A. and Mudge, T. N. 1997. Trace-driven memory simulation: a survey. ACM Comput. Surv. 29, 3 (Sept.), 128-170.
-
(1997)
ACM Comput. Surv.
, vol.29
, Issue.3
, pp. 128-170
-
-
Uhlig, R.A.1
Mudge, T.N.2
-
50
-
-
0031186412
-
A cache visualization tool
-
(July)
-
van der Deijl, E., Kanbier, G., Temam, O., and Granston, E. 1997. A cache visualization tool. IEEE Comput. 30, 7 (July), 71-78.
-
(1997)
IEEE Comput.
, vol.30
, Issue.7
, pp. 71-78
-
-
Van Der Deijl, E.1
Kanbier, G.2
Temam, O.3
Granston, E.4
-
53
-
-
0004005802
-
A library for doing polyhedral operations
-
Tech. rep. 785, Oregon State University
-
Wilde, D. 1993. A library for doing polyhedral operations. Tech. rep. 785, Oregon State University.
-
(1993)
-
-
Wilde, D.1
|