-
2
-
-
0002921197
-
Efficient exploitation of parallelism on Pentium III and Pentium 4 processor-based systems
-
February
-
A. J. C. Bik, M. Girkar, P. M. Grey, and X. Tian. Efficient exploitation of parallelism on Pentium III and Pentium 4 processor-based systems. Intel Technology J., February 2001.
-
(2001)
Intel Technology J.
-
-
Bik, A.J.C.1
Girkar, M.2
Grey, P.M.3
Tian, X.4
-
3
-
-
0025447908
-
Improving register allocation for subscripted variables
-
June
-
David Callahan, S. Carr, and K. Kennedy. Improving register allocation for subscripted variables. In PLDI, pages 53-65, June 1990.
-
(1990)
PLDI
, pp. 53-65
-
-
Callahan, D.1
Carr, S.2
Kennedy, K.3
-
4
-
-
0026980850
-
An efficient architecture for loop based data preloading
-
William Y. Chen, R. Bringmann, S. A. Mahlke, R. E. Hank, and J. E. Sicolo. An efficient architecture for loop based data preloading. In Micro, 1992.
-
(1992)
Micro
-
-
Chen, W.Y.1
Bringmann, R.2
Mahlke, S.A.3
Hank, R.E.4
Sicolo, J.E.5
-
7
-
-
18844405317
-
StarCore SC140: A new DSP architecture for portable devices
-
Motorola, September
-
Paul D'Arcy and Scott Beach. StarCore SC140: A new DSP architecture for portable devices. In Wireless Symposium. Motorola, September 1999.
-
(1999)
Wireless Symposium
-
-
D'Arcy, P.1
Beach, S.2
-
8
-
-
0033872689
-
Altivec extension to PowerPC accelerates media processing
-
March-April
-
K. Diefendorff and P. K. Dubey et al. Altivec extension to PowerPC accelerates media processing. IEEE Micro, March-April 2000.
-
(2000)
IEEE Micro
-
-
Diefendorff, K.1
Dubey, P.K.2
-
9
-
-
0035182922
-
Optimizing software data prefetches with rotating registers
-
Gautam Dohsi, Rakesh Krishnaiyer, and Kalyan Muthukumar. Optimizing software data prefetches with rotating registers. In PACT, pages 257-267, 2001.
-
(2001)
PACT
, pp. 257-267
-
-
Dohsi, G.1
Krishnaiyer, R.2
Muthukumar, K.3
-
10
-
-
18844453872
-
-
Texas Instruments, www.ti.com/sc/c6x, 2000.
-
(2000)
-
-
-
11
-
-
18844390869
-
Optimizing inter-nest data locality
-
M. Kandemir, I. Kadayif, A. Choudhary, and J. A. Zambreno. Optimizing inter-nest data locality. In PACT, pages 127-135, 2002.
-
(2002)
PACT
, pp. 127-135
-
-
Kandemir, M.1
Kadayif, I.2
Choudhary, A.3
Zambreno, J.A.4
-
12
-
-
0034250996
-
Compilation techniques for multimedia processors
-
Andreas Krall and Sylvain Lelait. Compilation techniques for multimedia processors. Intl. J. of Parallel Programming, 28(4):347-361, 2000.
-
(2000)
Intl. J. of Parallel Programming
, vol.28
, Issue.4
, pp. 347-361
-
-
Krall, A.1
Lelait, S.2
-
13
-
-
18844382518
-
Techniques for increasing and detecting memory alignment
-
MIT LCS, November
-
Samuel Larsen, Emmett Witchel, and Saman Amarasinghe. Techniques for increasing and detecting memory alignment. Technical Memo 621, MIT LCS, November 2001.
-
(2001)
Technical Memo
, vol.621
-
-
Larsen, S.1
Witchel, E.2
Amarasinghe, S.3
-
14
-
-
0031141704
-
Simulation/evaluation environment for a VLIW processor architecture
-
May
-
J. H. Moreno, M. Moudgill, K. Ebcioglu, E. Altman, B. Hall, R. Miranda, S. K. Chen, and A. Polyak. Simulation/evaluation environment for a VLIW processor architecture. IBM Journal of Research and Development, 41(3):287-302, May 1997.
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.3
, pp. 287-302
-
-
Moreno, J.H.1
Moudgill, M.2
Ebcioglu, K.3
Altman, E.4
Hall, B.5
Miranda, R.6
Chen, S.K.7
Polyak, A.8
-
15
-
-
0037809797
-
An innovative low-power high-performance programmable signal processor for digital communications
-
March
-
Jaime H. Moreno, V. Zyuban, U. Shvadron, F. Neeser, J. Derby, M. Ware, K. Kailas, A. Zaks, A. Geva, S. Ben-David, S. Asaad, T. Fox, M. Biberstein, D. Naishlos, and H. Hunter. An innovative low-power high-performance programmable signal processor for digital communications. IBM Journal of Research and Development, March 2003.
-
(2003)
IBM Journal of Research and Development
-
-
Moreno, J.H.1
Zyuban, V.2
Shvadron, U.3
Neeser, F.4
Derby, J.5
Ware, M.6
Kailas, K.7
Zaks, A.8
Geva, A.9
Ben-David, S.10
Asaad, S.11
Fox, T.12
Biberstein, M.13
Naishlos, D.14
Hunter, H.15
-
17
-
-
0032684984
-
Exploiting SIMD parallelism in DSP and multimedia algorithms using the AltiVec technology
-
Huy Nguyen and Lizy Kurian John. Exploiting SIMD parallelism in DSP and multimedia algorithms using the AltiVec technology. In Intl. Conf., on Supercomputing, pages 11-20, 1999.
-
(1999)
Intl. Conf., on Supercomputing
, pp. 11-20
-
-
Nguyen, H.1
John, L.K.2
-
18
-
-
0030686025
-
Efficient utilization of scratch-pad memory in embedded processor applications
-
March
-
Preeti Ranjan Panda, Nikil D. Dutt, and Alexandru Nicolau. Efficient utilization of scratch-pad memory in embedded processor applications. In European Design and Test Conf., March 1997.
-
(1997)
European Design and Test Conf.
-
-
Panda, P.R.1
Dutt, N.D.2
Nicolau, A.3
-
19
-
-
0002517538
-
MMX technology extension to the Intel architecture
-
August
-
A. Peleg and U. Weiser. MMX technology extension to the Intel architecture. IEEE Micro, pages 43-45, August 1996.
-
(1996)
IEEE Micro
, pp. 43-45
-
-
Peleg, A.1
Weiser, U.2
-
20
-
-
84984058313
-
Dependence flow graphs: An algebraic approach to program dependencies
-
K. Pingali, M. Beck, R. Johnson, M. Moudgill, and P. Stodghill. Dependence flow graphs: an algebraic approach to program dependencies. In POPL, pages 67-78, 1991.
-
(1991)
POPL
, pp. 67-78
-
-
Pingali, K.1
Beck, M.2
Johnson, R.3
Moudgill, M.4
Stodghill, P.5
-
22
-
-
84948740064
-
Compiler-controlled caching in superword register files for multimedia extension architectures
-
Jaewook Shin, Jacqueline Chame, and Mary W. Hall. Compiler-controlled caching in superword register files for multimedia extension architectures. In PACT, 2002.
-
(2002)
PACT
-
-
Shin, J.1
Chame, J.2
Hall, M.W.3
|