-
3
-
-
84859677886
-
-
Trimaran Research Infrastructure, http://www.trimaran.org.
-
-
-
-
4
-
-
84859677890
-
-
VAST-C/AltiVec. http://www.crescentbaysoftware.com.
-
-
-
-
5
-
-
0035696746
-
Graph-partitioning based instruction scheduling for clustered processors
-
Austin, TX, December
-
A. Aletà, J. M. Codina, J. Sánchez, and A. Gonzalez. Graph-Partitioning Based Instruction Scheduling for Clustered Processors. In Proceedings of the 34th Annual International Symposium on Microarchitecture, pages 150-159, Austin, TX, December 2001.
-
(2001)
Proceedings of the 34th Annual International Symposium on Microarchitecture
, pp. 150-159
-
-
Aletà, A.1
Codina, J.M.2
Sánchez, J.3
Gonzalez, A.4
-
8
-
-
0035176849
-
A unified modulo scheduling and register allocation technique for clustered processors
-
Barcelona, Spain, September
-
J. M. Codina, J. Sánchez, and A. Gonzalez. A Unified Modulo Scheduling and Register Allocation Technique for Clustered Processors. In Proceedings of the 10th International Conference on Parallel Architectures and Compilation Techniques, pages 175-184, Barcelona, Spain, September 2001.
-
(2001)
Proceedings of the 10th International Conference on Parallel Architectures and Compilation Techniques
, pp. 175-184
-
-
Codina, J.M.1
Sánchez, J.2
Gonzalez, A.3
-
11
-
-
0033872689
-
AltiVec extension to powerPC accelerates media processing
-
March
-
K. Diefendorff, P. K. Dubey, R. Hochsprung, and H. Scales. AltiVec Extension to PowerPC Accelerates Media Processing. IEEE Micro, 20(2):85-95, March 2000.
-
(2000)
IEEE Micro
, vol.20
, Issue.2
, pp. 85-95
-
-
Diefendorff, K.1
Dubey, P.K.2
Hochsprung, R.3
Scales, H.4
-
13
-
-
8344245462
-
Vectorization for SIMD architectures with alignment constraints
-
Washington, DC, June
-
A. E. Eichenberger, P. Wu, and K. O'Brien. Vectorization for SIMD Architectures with Alignment Constraints. In Proceedings of the SIGPLAN '04 Conference on Programming Language Design and Implementation, pages 82-93, Washington, DC, June 2004.
-
(2004)
Proceedings of the SIGPLAN '04 Conference on Programming Language Design and Implementation
, pp. 82-93
-
-
Eichenberger, A.E.1
Wu, P.2
O'Brien, K.3
-
14
-
-
0033888003
-
The TigerSHARC DSP architecture
-
January
-
J. Fridman and Z. Greenfield. The TigerSHARC DSP Architecture. IEEE Micro, 20(1):66-76, January 2000.
-
(2000)
IEEE Micro
, vol.20
, Issue.1
, pp. 66-76
-
-
Fridman, J.1
Greenfield, Z.2
-
16
-
-
84990479742
-
An efficient heuristic procedure for partitioning graphs
-
February
-
B. Kernighan and S. Lin. An Efficient Heuristic Procedure for Partitioning Graphs. Bell System Technical Journal, 49:291-307, February 1970.
-
(1970)
Bell System Technical Journal
, vol.49
, pp. 291-307
-
-
Kernighan, B.1
Lin, S.2
-
18
-
-
33745189827
-
Generation of permutations for SIMD processors
-
Chicago, IL, June
-
A. Kudriavtsev and P. Kogge. Generation of Permutations for SIMD Processors. In Proceedings of the 2005 ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems, pages 147-156, Chicago, IL, June 2005.
-
(2005)
Proceedings of the 2005 ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems
, pp. 147-156
-
-
Kudriavtsev, A.1
Kogge, P.2
-
21
-
-
84948766393
-
Increasing and detecting memory address congruence
-
Charlottesville, VA, September
-
S. Larsen, E. Witchel, and S. Amarasinghe. Increasing and Detecting Memory Address Congruence. In Proceedings of the 11th International Conference on Parallel Architectures and Compilation Techniques, pages 18-29, Charlottesville, VA, September 2002.
-
(2002)
Proceedings of the 11th International Conference on Parallel Architectures and Compilation Techniques
, pp. 18-29
-
-
Larsen, S.1
Witchel, E.2
Amarasinghe, S.3
-
22
-
-
0030384118
-
Modulo scheduling of loops in control-intensive non-numeric programs
-
Paris, France, December
-
D. M. Lavery and W. mei W. Hwu. Modulo Scheduling of Loops in Control-Intensive Non-Numeric Programs. In Proceedings of the 29th Annual International Symposium on Microarchitecture, pages 126-137, Paris, France, December 1996.
-
(1996)
Proceedings of the 29th Annual International Symposium on Microarchitecture
, pp. 126-137
-
-
Lavery, D.M.1
Mei, W.2
Hwu, W.3
-
23
-
-
0002449750
-
Subword parallelism with MAX-2
-
August
-
R. Lee. Subword Parallelism with MAX-2. IEEE Micro, 16(4):51-59, August 1996.
-
(1996)
IEEE Micro
, vol.16
, Issue.4
, pp. 51-59
-
-
Lee, R.1
-
24
-
-
0038633609
-
Itanium 2 processor microarchitecture
-
March
-
C. McNairy and D. Soltis. Itanium 2 Processor Microarchitecture. IEEE Micro, 23(2):44-55, March 2003.
-
(2003)
IEEE Micro
, vol.23
, Issue.2
, pp. 44-55
-
-
McNairy, C.1
Soltis, D.2
-
26
-
-
4544372264
-
Vectorizing for a SIMdD DSP architecture
-
San Jose, CA, October
-
D. Naishlos, M. Biberstein, S. Ben-David, and A. Zaks. Vectorizing for a SIMdD DSP Architecture. In Proceedings of the 2003 International Conference on Compilers, Architecture and Synthesis for Embedded Systems, pages 2-11, San Jose, CA, October 2003.
-
(2003)
Proceedings of the 2003 International Conference on Compilers, Architecture and Synthesis for Embedded Systems
, pp. 2-11
-
-
Naishlos, D.1
Biberstein, M.2
Ben-David, S.3
Zaks, A.4
-
28
-
-
33745223764
-
Pointer alignment analysis for processors with SIMD instructions
-
San Diego, CA, December
-
I. Pryanishnikov, A. Krall, and N. Horspool. Pointer Alignment Analysis for Processors with SIMD Instructions. In Proceedings of the 5th Workshop on Media and Streaming Processors, pages 50-57, San Diego, CA, December 2003.
-
(2003)
Proceedings of the 5th Workshop on Media and Streaming Processors
, pp. 50-57
-
-
Pryanishnikov, I.1
Krall, A.2
Horspool, N.3
-
29
-
-
0034224812
-
Implementing streaming SIMD extensions on the pentium III processor
-
July
-
S. K. Raman, V. Pentkovski, and J. Keshava. Implementing Streaming SIMD Extensions on the Pentium III Processor. IEEE Micro, 20(4):47-57, July 2000.
-
(2000)
IEEE Micro
, vol.20
, Issue.4
, pp. 47-57
-
-
Raman, S.K.1
Pentkovski, V.2
Keshava, J.3
-
30
-
-
0026966702
-
Register allocation for software pipelined loops
-
San Francisco, CA, June
-
B. Rau, M. Lee, P. Tirumalai, and M. Schlansker. Register Allocation for Software Pipelined Loops. In Proceedings of the SIGPLAN '92 Conference on Programming Language Design and Implementation, pages 283-299, San Francisco, CA, June 1992.
-
(1992)
Proceedings of the SIGPLAN '92 Conference on Programming Language Design and Implementation
, pp. 283-299
-
-
Rau, B.1
Lee, M.2
Tirumalai, P.3
Schlansker, M.4
-
31
-
-
0009755242
-
Iterative modulo scheduling
-
Hewlett Packard Company, November
-
B. R. Rau. Iterative Modulo Scheduling. Technical Report HPL-94-115, Hewlett Packard Company, November 1995.
-
(1995)
Technical Report
, vol.HPL-94-115
-
-
Rau, B.R.1
-
32
-
-
0026976353
-
Code generation schema for modulo scheduled loops
-
Portland, OR, December
-
B. R. Rau, M. S. Schlansker, and P. Tirumalai. Code Generation Schema for Modulo Scheduled Loops. In Proceedings of the 25th Annual International Symposium on Microarchitecture, pages 158-169, Portland, OR, December 1992.
-
(1992)
Proceedings of the 25th Annual International Symposium on Microarchitecture
, pp. 158-169
-
-
Rau, B.R.1
Schlansker, M.S.2
Tirumalai, P.3
-
33
-
-
33646554301
-
Superword-level parallelism in the presence of control flow
-
San Jose, CA, March
-
J. Shin, M. Hall, and J. Chame. Superword-Level Parallelism in the Presence of Control Flow. In Proceedings of the International Symposium on Code Generation and Optimization, pages 165-175, San Jose, CA, March 2005.
-
(2005)
Proceedings of the International Symposium on Code Generation and Optimization
, pp. 165-175
-
-
Shin, J.1
Hall, M.2
Chame, J.3
-
34
-
-
84948740064
-
Compiler-controlled caching in superword register files for multimedia extension architecture
-
Charlottesville, VA, September
-
J. Shin, J. Chame, and M. Hall. Compiler-Controlled Caching in Superword Register Files for Multimedia Extension Architecture. In Proceedings of the 11th International Conference on Parallel Architectures and Compilation Techniques, pages 45-55, Charlottesville, VA, September 2002.
-
(2002)
Proceedings of the 11th International Conference on Parallel Architectures and Compilation Techniques
, pp. 45-55
-
-
Shin, J.1
Chame, J.2
Hall, M.3
-
36
-
-
0001790593
-
Depth first search and linear graph algorithms
-
June
-
R. E. Tarjan. Depth First Search and Linear Graph Algorithms. SIAM Journal of Computing, 1(2):146-160, June 1972.
-
(1972)
SIAM Journal of Computing
, vol.1
, Issue.2
, pp. 146-160
-
-
Tarjan, R.E.1
-
37
-
-
0041606016
-
VIS speeds new media processing
-
August
-
M. Tremblay, M. O'Connor, V. Narayanan, and L. He. VIS Speeds New Media Processing. IEEE Micro, 16(4): 10-20, August 1996.
-
(1996)
IEEE Micro
, vol.16
, Issue.4
, pp. 10-20
-
-
Tremblay, M.1
O'Connor, M.2
Narayanan, V.3
He, L.4
-
38
-
-
84976692695
-
SUIF: An infrastructure for research on parallelizing and optimizing compilers
-
December
-
R. P. Wilson, R. S. French, C. S. Wilson, S. P. Amarasinghe, J. M. Anderson, S. W. K. Tjiang, S.-W. Liao, C.-W. Tseng, M. W. Hall, M. S. Lam, and J. L. Hennessy. SUIF: An Infrastructure for Research on Parallelizing and Optimizing Compilers. ACM SIGPLAN Notices, 29(12):31-37, December 1994.
-
(1994)
ACM SIGPLAN Notices
, vol.29
, Issue.12
, pp. 31-37
-
-
Wilson, R.P.1
French, R.S.2
Wilson, C.S.3
Amarasinghe, S.P.4
Anderson, J.M.5
Tjiang, S.W.K.6
Liao, S.-W.7
Tseng, C.-W.8
Hall, M.W.9
Lam, M.S.10
Hennessy, J.L.11
-
40
-
-
33646833599
-
Efficient SIMD code generation for runtime alignment and length conversion
-
San Jose, CA, March
-
P. Wu, A. E. Eichenberger, and A. Wang. Efficient SIMD Code Generation for Runtime Alignment and Length Conversion. In Proceedings of the International Symposium on Code Generation and Optimization, pages 153-164, San Jose, CA, March 2005.
-
(2005)
Proceedings of the International Symposium on Code Generation and Optimization
, pp. 153-164
-
-
Wu, P.1
Eichenberger, A.E.2
Wang, A.3
-
41
-
-
32844466554
-
An integrated simdization framework using virtual vectors
-
Cambridge, MA, June
-
P. Wu, A. E. Eichenberger, A. Wang, and P. Zhao. An Integrated Simdization Framework Using Virtual Vectors. In Proceedings of the 19th ACM International Conference on Supercomputing, pages 169-178, Cambridge, MA, June 2005.
-
(2005)
Proceedings of the 19th ACM International Conference on Supercomputing
, pp. 169-178
-
-
Wu, P.1
Eichenberger, A.E.2
Wang, A.3
Zhao, P.4
-
42
-
-
0035691538
-
Modulo scheduling with integrated register spilling for clustered VLIW architectures
-
Austin, TX, December
-
J. Zalamea, J. Llosa, E. Ayguadé, and M. Valero. Modulo Scheduling with Integrated Register Spilling for Clustered VLIW Architectures. In Proceedings of the 34th Annual International Symposium on Microarchitecture, pages 160-169, Austin, TX, December 2001.
-
(2001)
Proceedings of the 34th Annual International Symposium on Microarchitecture
, pp. 160-169
-
-
Zalamea, J.1
Llosa, J.2
Ayguadé, E.3
Valero, M.4
|