-
1
-
-
0003913538
-
-
Norwell, MA, USA: Kluwer Academic Publishers
-
F. Catthoor, E. d. Greef, and S. Suytack, Custom Memory Management Methodology: Exploration of Memory Organisation for Embedded Multimedia System Design. Norwell, MA, USA: Kluwer Academic Publishers, 1998.
-
(1998)
Custom Memory Management Methodology: Exploration of Memory Organisation for Embedded Multimedia System Design
-
-
Catthoor, F.1
Greef, E.D.2
Suytack, S.3
-
3
-
-
0030190854
-
Improving Data Locality with Loop Transformations
-
K. S. McKinley, S. Carr, and C.-W. Tseng, "Improving data locality with loop transformations," ACM Trans. Program. Lang. Syst., vol. 18, pp. 424-453, July 1996. (Pubitemid 126422522)
-
(1996)
ACM Transactions on Programming Languages and Systems
, vol.18
, Issue.4
, pp. 424-453
-
-
Mckinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
4
-
-
3142688389
-
A data locality optimizing algorithm
-
New York, NY, USA
-
M. E. Wolf and M. S. Lam, "A data locality optimizing algorithm," in PLDI '91. New York, NY, USA.
-
PLDI '91
-
-
Wolf, M.E.1
Lam, M.S.2
-
5
-
-
0026232450
-
A loop transformation theory and an algorithm to maximize parallelism
-
Oct.
-
M. E. Wolf and M. S. Lam, "A loop transformation theory and an algorithm to maximize parallelism," IEEE Trans. Parallel Distrib. Syst., vol. 2, pp. 452-471, Oct. 1991.
-
(1991)
IEEE Trans. Parallel Distrib. Syst.
, vol.2
, pp. 452-471
-
-
Wolf, M.E.1
Lam, M.S.2
-
6
-
-
0001448065
-
Some efficient solutions to the affine scheduling problem: Part II. multidimensional time
-
P. Feautrier, "Some efficient solutions to the affine scheduling problem: Part II. multidimensional time," International Journal of Parallel Programming, vol. 21, pp. 389-420, 1992.
-
(1992)
International Journal of Parallel Programming
, vol.21
, pp. 389-420
-
-
Feautrier, P.1
-
7
-
-
0032662841
-
An affine partitioning algorithm to maximize parallelism and minimize communication
-
New York, Aug.
-
A. W. Lim, G. I. Cheong, and M. S. Lam, "An affine partitioning algorithm to maximize parallelism and minimize communication," in ICS '99. New York, Aug. 1999.
-
(1999)
ICS '99
-
-
Lim, A.W.1
Cheong, G.I.2
Lam, M.S.3
-
8
-
-
10844259103
-
Synthesizing transformations for locality enhancement of imperfectly-nested loop nests
-
N. Ahmed, N. Mateev, and K. Pingali, "Synthesizing transformations for locality enhancement of imperfectly-nested loop nests," IJPP, vol. 29, pp. 493-544, Oct. 2001. (Pubitemid 33818443)
-
(2001)
International Journal of Parallel Programming
, vol.29
, Issue.5
, pp. 493-544
-
-
Ahmed, N.1
Mateev, N.2
Pingali, K.3
-
9
-
-
57349139452
-
A practical automatic polyhedral parallelizer and locality optimizer
-
New York, NY, USA
-
U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, "A practical automatic polyhedral parallelizer and locality optimizer," in PLDI '08, pp. 101-113, New York, NY, USA,
-
PLDI '08
, pp. 101-113
-
-
Bondhugula, U.1
Hartono, A.2
Ramanujam, J.3
Sadayappan, P.4
-
10
-
-
78149255973
-
A model for fusion and code motion in an automatic parallelizing compiler
-
Sept.
-
U. Bondhugula, O. Gunluk, S. Dash, and L. Renganarayanan, "A model for fusion and code motion in an automatic parallelizing compiler," in PACT '10, pp. 343-352, Sept. 2010,
-
(2010)
PACT '10
, pp. 343-352
-
-
Bondhugula, U.1
Gunluk, O.2
Dash, S.3
Renganarayanan, L.4
-
11
-
-
79251560668
-
Loop transformations: Convexity, pruning and optimization
-
Jan.
-
L.-N. Pouchet, U. Bondhugula, C. Bastoul, A. Cohen, J. Ramanujam, P. Sadayappan, and N. Vasilache. "Loop transformations: convexity, pruning and optimization," in POPL '11, pp. 549-562, Jan. 2011.
-
(2011)
POPL '11
, pp. 549-562
-
-
Pouchet, L.-N.1
Bondhugula, U.2
Bastoul, C.3
Cohen, A.4
Ramanujam, J.5
Sadayappan, P.6
Vasilache, N.7
-
12
-
-
0003502725
-
-
Norwell, MA, USA: Kluwer Academic Publishers
-
F. Catthoor, K. Danckaert, K. Kulkarni, E. Brockmeyer, P. Kjeldsberg, T. v. Achteren, and T. Omnes, Data access and storage management for embedded programmable processors. Norwell, MA, USA: Kluwer Academic Publishers, 2002.
-
(2002)
Data Access and Storage Management for Embedded Programmable Processors
-
-
Catthoor, F.1
Danckaert, K.2
Kulkarni, K.3
Brockmeyer, E.4
Kjeldsberg, P.5
Achteren, T.V.6
Omnes, T.7
-
13
-
-
84863557512
-
Data reuse exploration techniques for loop-dominated applications
-
Mar.
-
T. Van Achteren, G. Deconinck, F. Catthoor, and R. Lauwereins, "Data reuse exploration techniques for loop-dominated applications," in DATE'02, pp. 428-435, Mar. 2002.
-
(2002)
DATE'02
, pp. 428-435
-
-
Van Achteren, T.1
Deconinck, G.2
Catthoor, F.3
Lauwereins, R.4
-
14
-
-
34248334635
-
DRDU: A data reuse analysis technique for efficient scratch-pad memory management
-
April
-
I. Issenin, E. Brockmeyer, M. Miranda, and N. Dutt, "DRDU: A data reuse analysis technique for efficient scratch-pad memory management," ACM Trans. Des. Autom. Electron. Syst., April 2007.
-
(2007)
ACM Trans. Des. Autom. Electron. Syst.
-
-
Issenin, I.1
Brockmeyer, E.2
Miranda, M.3
Dutt, N.4
-
15
-
-
0032303141
-
Formalized methodology for data reuse: Exploration for low-power hierarchical memory mappings
-
S. Wuytack, J.-P. Diguet, F. V. M. Catthoor, and H. J. De Man, "Formalized methodology for data reuse: exploration for low-power hierarchical memory mappings," IEEE Trans. on VLSI, 1998.
-
(1998)
IEEE Trans. on VLSI
-
-
Wuytack, S.1
Diguet, J.-P.2
Catthoor, F.V.M.3
De Man, H.J.4
-
16
-
-
0036053351
-
Compiler-directed scratch pad memory hierarchy design and management
-
June
-
M. Kandemir and A. Choudhary, "Compiler-directed scratch pad memory hierarchy design and management," in DAC'02, pp. 628-633, June 2002.
-
(2002)
DAC'02
, pp. 628-633
-
-
Kandemir, M.1
Choudhary, A.2
-
17
-
-
84893726637
-
Layer assignment techniques for low energy in multi-layered memory organisations
-
Mar.
-
E. Brockmeyer, M. Miranda, and F. Catthoor, "Layer assignment techniques for low energy in multi-layered memory organisations," in DATE'03, pp. 1070-1075, Mar. 2003.
-
(2003)
DATE'03
, pp. 1070-1075
-
-
Brockmeyer, E.1
Miranda, M.2
Catthoor, F.3
-
18
-
-
80052674346
-
A reuse-aware prefetching scheme for scratchpad memory
-
J. Cong, H. Huang, C. Liu, and Y. Zou, "A reuse-aware prefetching scheme for scratchpad memory," in DAC'11, pp. 960-965, 2011.
-
(2011)
DAC'11
, pp. 960-965
-
-
Cong, J.1
Huang, H.2
Liu, C.3
Zou, Y.4
-
19
-
-
47849112050
-
Data-reuse-driven energy-aware cosynthesis of scratch pad memory and hierarchical bus-based communication architecture for multiprocessor streaming applications
-
I. Issenin, E. Brockmeyer, B. Durinck, and N. D. Dutt, "Data-reuse-driven energy-aware cosynthesis of scratch pad memory and hierarchical bus-based communication architecture for multiprocessor streaming applications," IEEE trans. on CAD, vol. 27, no. 8, pp. 1439-1452, 2008.
-
(2008)
IEEE Trans. on CAD
, vol.27
, Issue.8
, pp. 1439-1452
-
-
Issenin, I.1
Brockmeyer, E.2
Durinck, B.3
Dutt, N.D.4
-
20
-
-
65849148709
-
Trade-offs in loop transformations
-
April
-
M. Palkovic, F. Catthoor, and H. Corporaal, "Trade-offs in loop transformations," ACM Trans. Des. Autom. Electron. Syst., vol. 14, pp. 22:1-22:30, April 2009.
-
(2009)
ACM Trans. Des. Autom. Electron. Syst.
, vol.14
-
-
Palkovic, M.1
Catthoor, F.2
Corporaal, H.3
-
21
-
-
0032738222
-
Local memory exploration and optimization in embedded systems
-
P. R. Panda, N. D. Dutt, and A. Nicolau, "Local memory exploration and optimization in embedded systems," IEEE Trans. on CAD, vol. 18, no. 1, pp. 3-13, 1999.
-
(1999)
IEEE Trans. on CAD
, vol.18
, Issue.1
, pp. 3-13
-
-
Panda, P.R.1
Dutt, N.D.2
Nicolau, A.3
-
22
-
-
35148879778
-
Incremental hierarchical memory size estimation for steering of loop transformations
-
September
-
Q. Hu, P. G. Kjeldsberg, A. Vandecappelle, M. Palkovic, and F. Catthoor, "Incremental hierarchical memory size estimation for steering of loop transformations," ACM Trans. Des. Autom. Electron. Syst., vol. 12, September 2007.
-
(2007)
ACM Trans. Des. Autom. Electron. Syst.
, vol.12
-
-
Hu, Q.1
Kjeldsberg, P.G.2
Vandecappelle, A.3
Palkovic, M.4
Catthoor, F.5
-
23
-
-
63549135938
-
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories
-
Feb.
-
M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan, "Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories," in PPoPP'08, pp. 1-10, Feb. 2008.
-
(2008)
PPoPP'08
, pp. 1-10
-
-
Baskaran, M.M.1
Bondhugula, U.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
24
-
-
77955224068
-
Combining data reuse with data-level parallelization for FPGA-targeted hardware compilation: A geometric programming framework
-
Q. Liu, G. A. Constantinides, K. Masselos, and P. Cheung, "Combining data reuse with data-level parallelization for FPGA-targeted hardware compilation: A geometric programming framework," IEEE Trans. on CAD, vol. 28, no. 3, 2009.
-
(2009)
IEEE Trans. on CAD
, vol.28
, Issue.3
-
-
Liu, Q.1
Constantinides, G.A.2
Masselos, K.3
Cheung, P.4
-
25
-
-
84855778017
-
Combined loop transformation and hierarchy allocation in data reuse optimization
-
Nov.
-
J. Cong, P. Zhang, and Y. Zou, "Combined loop transformation and hierarchy allocation in data reuse optimization," in ICCAD'11, pp. 185-192, Nov. 2011.
-
(2011)
ICCAD'11
, pp. 185-192
-
-
Cong, J.1
Zhang, P.2
Zou, Y.3
-
26
-
-
84863537938
-
-
Technical Report, Computer Science Department, UCLA, TR200019
-
J. Cong, P. Zhang, and Y. Zou, "Optimizing Memory Hierarchy Allocation with Loop Transformations for High-Level Synthesis," Technical Report, Computer Science Department, UCLA, TR200019, 2012.
-
(2012)
Optimizing Memory Hierarchy Allocation with Loop Transformations for High-Level Synthesis
-
-
Cong, J.1
Zhang, P.2
Zou, Y.3
-
28
-
-
33746593747
-
Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies
-
June
-
S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello, M. Sigler, and O. Temam, "Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies," IJPP, vol. 34, no. 3, June 2006.
-
(2006)
IJPP
, vol.34
, Issue.3
-
-
Girbal, S.1
Vasilache, N.2
Bastoul, C.3
Cohen, A.4
Parello, D.5
Sigler, M.6
Temam, O.7
-
29
-
-
84863537937
-
-
Barvinok library
-
Barvinok library. http://freshmeat.net/projects/barvinok
-
-
-
-
32
-
-
79953076698
-
High-level synthesis for FPGA: From prototyping to deployment
-
J. Cong, B. Liu, S. Neuendorffer, J. Noguera, K. Vissers, and Z. Zhang, "High-level synthesis for FPGA: From prototyping to deployment," IEEE Trans. on CAD, vol. 30, no. 4, 2011.
-
(2011)
IEEE Trans. on CAD
, vol.30
, Issue.4
-
-
Cong, J.1
Liu, B.2
Neuendorffer, S.3
Noguera, J.4
Vissers, K.5
Zhang, Z.6
-
33
-
-
84892282773
-
AutoPilot: A Platform-Based ESL Synthesis System
-
ed. P. Coussy and A. Morawiec, Springer Publishers
-
Z. Zhang, Y. Fan, W. Jiang, G. Han, C. Yang, and J. Cong, "AutoPilot: A Platform-Based ESL Synthesis System," High-Level Synthesis: From Algorithm to Digital Circuit, ed. P. Coussy and A. Morawiec, Springer Publishers, 2008.
-
(2008)
High-Level Synthesis: from Algorithm to Digital Circuit
-
-
Zhang, Z.1
Fan, Y.2
Jiang, W.3
Han, G.4
Yang, C.5
Cong, J.6
-
34
-
-
84863552043
-
-
Xilinx ISE Design Suite
-
Xilinx ISE Design Suite. http://www.xilinx.com/products/design-tools/ise- design-suite/.
-
-
-
-
35
-
-
84863540565
-
-
Polyhedral benchmark suite V3.1
-
Polyhedral benchmark suite v3.1. http://www.cse.ohio-state.edu/~pouchet/ software/polybench/ .
-
-
-
-
36
-
-
79953656477
-
Customizable Domain-Specific Computing
-
March/April
-
J. Cong, V. Sarkar, G. Reinman and A. Bui, "Customizable Domain-Specific Computing," IEEE Design and Test of Computers, vol. 28, no. 2, pp. 5-15, March/April 2011.
-
(2011)
IEEE Design and Test of Computers
, vol.28
, Issue.2
, pp. 5-15
-
-
Cong, J.1
Sarkar, V.2
Reinman, G.3
Bui, A.4
|