-
4
-
-
0012531957
-
Controlling code size of software-pipelined loops on the TMS320C6000 VLIW DSP architecture
-
Dec.
-
E. Granston, R. Scales, E. Stotzer, A. Ward, and J. Zbiciak. Controlling code size of software-pipelined loops on the TMS320C6000 VLIW DSP architecture. In Proceedings of the 3rd IEEE/ACM Workshop on Media and Streaming Processors, pages 29-38, Dec. 2001.
-
(2001)
Proceedings of the 3rd IEEE/ACM Workshop on Media and Streaming Processors
, pp. 29-38
-
-
Granston, E.1
Scales, R.2
Stotzer, E.3
Ward, A.4
Zbiciak, J.5
-
6
-
-
0001465739
-
Maximizing loop parallelism and improving data locality via loop fusion and distribution
-
K. Kennedy and K. S. Mckinley. Maximizing loop parallelism and improving data locality via loop fusion and distribution. In Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, Number 768, Springer-Verlag, Berlin, pages 301-320, 1993.
-
(1993)
Languages and Compilers for Parallel Computing, Lecture Notes in Computer Science, Number 768, Springer-Verlag, Berlin
, pp. 301-320
-
-
Kennedy, K.1
Mckinley, K.S.2
-
7
-
-
1242268977
-
Typed fusion with applications to parallel and sequential code generation
-
Center for Research on Parallel Computation, Rice University, Jan.
-
K. Kennedy and K. S. Mckinley. Typed fusion with applications to parallel and sequential code generation. Technical Report CRPC-TR94646, Center for Research on Parallel Computation, Rice University, Jan. 1994.
-
(1994)
Technical Report
, vol.CRPC-TR94646
-
-
Kennedy, K.1
Mckinley, K.S.2
-
8
-
-
0026005478
-
Retiming synchronous circuitry
-
June
-
C. E. Leiserson and J. B. Saxe. Retiming synchronous circuitry. Algortithmica, 6(1):5-35, June 1991.
-
(1991)
Algortithmica
, vol.6
, Issue.1
, pp. 5-35
-
-
Leiserson, C.E.1
Saxe, J.B.2
-
10
-
-
0030190854
-
Improving data locality with loop transformations
-
Jul.
-
K. S. McKinley, S. Carr, and C.-W. Tseng. Improving data locality with loop transformations. ACM Transactions on Programming Languages and Systems (TOPLAS), 18(4):424-453, Jul. 1996.
-
(1996)
ACM Transactions on Programming Languages and Systems (TOPLAS)
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
16
-
-
0036979933
-
Loop fusion for clustered VLIW architecture
-
Y. Qian, S. Carr, and P. Sweany. Loop fusion for clustered VLIW architecture. In Proceedings of the joint conference on Languages, compilers and tools for embedded systems: software and compilers for embedded systems, pages 112-119, 2002.
-
(2002)
Proceedings of the Joint Conference on Languages, Compilers and Tools for Embedded Systems: Software and Compilers for Embedded Systems
, pp. 112-119
-
-
Qian, Y.1
Carr, S.2
Sweany, P.3
-
17
-
-
29144495955
-
Efficient polynomial-time nested loop fusion with full parallelism
-
Mar.
-
E. H.-M. Sha, T. W. O'Neil, and N. L. Passos. Efficient polynomial-time nested loop fusion with full parallelism, International Journal of Computers and Their Applications, 10(1):9-24, Mar. 2003.
-
(2003)
International Journal of Computers and Their Applications
, vol.10
, Issue.1
, pp. 9-24
-
-
Sha, E.H.-M.1
O'Neil, T.W.2
Passos, N.L.3
-
18
-
-
0001505938
-
A parameterized loop fusion algorithm for improving parallelism and cache locality
-
June
-
S. K. Singhai and K. S. Mckinley. A parameterized loop fusion algorithm for improving parallelism and cache locality. The Computer Journal, 40(6):340-355, June 1997.
-
(1997)
The Computer Journal
, vol.40
, Issue.6
, pp. 340-355
-
-
Singhai, S.K.1
Mckinley, K.S.2
-
19
-
-
47849126837
-
Multi-dimensional incremental loop fusion for data locality
-
S. Verdoolaege, M. Bruynooghe, and F. Catthoor, Multi-dimensional incremental loop fusion for data locality. In Proceedings of the Application-Specific Systems, Architectures, and Processors, pages 14-24, 2003.
-
(2003)
Proceedings of the Application-specific Systems, Architectures, and Processors
, pp. 14-24
-
-
Verdoolaege, S.1
Bruynooghe, M.2
Catthoor, F.3
-
21
-
-
85008028251
-
Code size reduction technique and implementation for software-pipelined DSP applications
-
Nov.
-
Q. Zhuge, B. Xiao, and E.-M. Sha. Code size reduction technique and implementation for software-pipelined DSP applications. ACM Transactions on Embedded Computing Systems(TECS), 2(4):590-613, Nov. 2003.
-
(2003)
ACM Transactions on Embedded Computing Systems(TECS)
, vol.2
, Issue.4
, pp. 590-613
-
-
Zhuge, Q.1
Xiao, B.2
Sha, E.-M.3
|