SCOPUS Information Search Platform

CASES 2003: International Conference on Compilers, Architecture, and Synthesis for Embedded Systems

Volume , Issue , 2003, Pages 2-11

Vectorizing for a SIMdD DSP architecture

(4) Naishlos, Dorit ᵃ; Biberstein, Marina ᵃ; Ben David, Shay ᵃ; Zaks, Ayal ᵃ

ᵃ UNIVERSITY OF HAIFA (Israel)

Author keywords

Compiler controlled cache; Data reuse; Rotating register file; SIMD; Subword parallelism; Vectorization; Viterbi

Indexed keywords

CACHE MEMORY; CODES (SYMBOLS); COMPUTER ARCHITECTURE; DECODING; DIGITAL SIGNAL PROCESSING; MATHEMATICAL MODELS; OPTIMIZATION; PROGRAM COMPILERS; VECTORS;

COMPILER CONTROLLED CACHE; DATA ELEMENTS; SINGLE INSTRUCTION MULTIPLE DATA (SIMD); VECTORIZATION;

DATA STRUCTURES;

EID: 4544372264
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1145/951710.951714
Document Type: Conference Paper

Times cited (56)

References (23)

1
- 0343535748
- Torrent architecture manual
- ICSI
- K. Asanovic and D. Johnson. Torrent architecture manual. Technical report, ICSI, 1996.
- (1996) Technical Report
- Asanovic, K.¹ Johnson, D.²

2
- 0002921197
- Efficient exploitation of parallelism on Pentium III and Pentium 4 processor-based systems
- February
- A. J. C. Bik, M. Girkar, P. M. Grey, and X. Tian. Efficient exploitation of parallelism on Pentium III and Pentium 4 processor-based systems. Intel Technology J., February 2001.
- (2001) Intel Technology J.
- Bik, A.J.C.¹ Girkar, M.² Grey, P.M.³ Tian, X.⁴

3
- 0025447908
- Improving register allocation for subscripted variables
- June
- David Callahan, S. Carr, and K. Kennedy. Improving register allocation for subscripted variables. In PLDI, pages 53-65, June 1990.
- (1990) PLDI , pp. 53-65
- Callahan, D.¹ Carr, S.² Kennedy, K.³

4
- 0026980850
- An efficient architecture for loop based data preloading
- William Y. Chen, R. Bringmann, S. A. Mahlke, R. E. Hank, and J. E. Sicolo. An efficient architecture for loop based data preloading. In Micro, 1992.
- (1992) Micro
- Chen, W.Y.¹ Bringmann, R.² Mahlke, S.A.³ Hank, R.E.⁴ Sicolo, J.E.⁵

5
- 0347151974
- Cross-loop reuse analysis and its application to cache optimizations
- August
- Keith Cooper, Ken Kennedy, and Nathaniel McIntosh. Cross-loop reuse analysis and its application to cache optimizations. In Ninth Workshop on Languages and Compilers for Parallel Computing, August 1996.
- (1996) Ninth Workshop on Languages and Compilers for Parallel Computing
- Cooper, K.¹ Kennedy, K.² McIntosh, N.³

6
- 0033321339
- Exploiting a new level of DLP in multimedia applications
- Jesus Corbal, Roger Espasa, and Mateo Valero. Exploiting a new level of DLP in multimedia applications. In Intl. Symposium on Microarchitecture, pages 72-, 1999.
- (1999) Intl. Symposium on Microarchitecture , pp. 72
- Corbal, J.¹ Espasa, R.² Valero, M.³

7
- 18844405317
- StarCore SC140: A new DSP architecture for portable devices
- Motorola, September
- Paul D'Arcy and Scott Beach. StarCore SC140: A new DSP architecture for portable devices. In Wireless Symposium. Motorola, September 1999.
- (1999) Wireless Symposium
- D'Arcy, P.¹ Beach, S.²

8
- 0033872689
- Altivec extension to PowerPC accelerates media processing
- March-April
- K. Diefendorff and P. K. Dubey et al. Altivec extension to PowerPC accelerates media processing. IEEE Micro, March-April 2000.
- (2000) IEEE Micro
- Diefendorff, K.¹ Dubey, P.K.²

9
- 0035182922
- Optimizing software data prefetches with rotating registers
- Gautam Doshi, Rakesh Krishnaiyer, and Kalyan Muthukumar. Optimizing software data prefetches with rotating registers. In PACT, pages 257-267, 2001.
- (2001) PACT , pp. 257-267
- Doshi, G.¹ Krishnaiyer, R.² Muthukumar, K.³

10
- 18844453872
- Texas Instruments, www.ti.com/sc/c6x, 2000.
- (2000)

11
- 18844390869
- Optimizing inter-nest data locality
- M. Kandemir, I. Kadayif, A. Choudhary, and J. A. Zambreno. Optimizing inter-nest data locality. In PACT, pages 127-135, 2002.
- (2002) PACT , pp. 127-135
- Kandemir, M.¹ Kadayif, I.² Choudhary, A.³ Zambreno, J.A.⁴

12
- 0034250996
- Compilation techniques for multimedia processors
- Andreas Krall and Sylvain Lelait. Compilation techniques for multimedia processors. Intl. J. of Parallel Programming, 28(4):347-361, 2000.
- (2000) Intl. J. of Parallel Programming , vol.28 , Issue.4 , pp. 347-361
- Krall, A.¹ Lelait, S.²

13
- 18844382518
- Techniques for increasing and detecting memory alignment
- MIT LCS, November
- Samuel Larsen, Emmett Witchel, and Saman Amarasinghe. Techniques for increasing and detecting memory alignment. Technical Memo 621, MIT LCS, November 2001.
- (2001) Technical Memo , vol.621
- Larsen, S.¹ Witchel, E.² Amarasinghe, S.³

14
- 0031141704
- Simulation/evaluation environment for a VLIW processor architecture
- May
- J. H. Moreno, M. Moudgill, K. Ebcioglu, E. Altman, B. Hall, R. Miranda, S. K. Chen, and A. Polyak. Simulation/evaluation environment for a VLIW processor architecture. IBM Journal of Research and Development, 41(3):287-302, May 1997.
- (1997) IBM Journal of Research and Development , vol.41 , Issue.3 , pp. 287-302
- Moreno, J.H.¹ Moudgill, M.² Ebcioglu, K.³ Altman, E.⁴ Hall, B.⁵ Miranda, R.⁶ Chen, S.K.⁷ Polyak, A.⁸

15
- 0037809797
- An innovative low-power high-performance programmable signal processor for digital communications
- March
- Jaime H. Moreno, V. Zyuban, U. Shvadron, F. Neeser, J. Derby, M. Ware, K. Kailas, A. Zaks, A. Geva, S. Ben-David, S. Asaad, T. Fox, M. Biberstein, D. Naishlos, and H. Hunter. An innovative low-power high-performance programmable signal processor for digital communications. IBM Journal of Research and Development, March 2003.
- (2003) IBM Journal of Research and Development
- Moreno, J.H.¹ Zyuban, V.² Shvadron, U.³ Neeser, F.⁴ Derby, J.⁵ Ware, M.⁶ Kailas, K.⁷ Zaks, A.⁸ Geva, A.⁹ Ben-David, S.¹⁰ Asaad, S.¹¹ Fox, T.¹² Biberstein, M.¹³ Naishlos, D.¹⁴ Hunter, H.¹⁵

16
- 0003502903
- Morgan Kaufmann
- Steven S. Muchnick. Advanced Compiler Design and Implementation. Morgan Kaufmann, 1997.
- (1997) Advanced Compiler Design and Implementation
- Muchnick, S.S.¹

17
- 0032684984
- Exploiting SIMD parallelism in DSP and multimedia algorithms using the AltiVec technology
- Huy Nguyen and Lizy Kurian John. Exploiting SIMD parallelism in DSP and multimedia algorithms using the AltiVec technology. In Intl. Conf. on Supercomputing, pages 11-20, 1999.
- (1999) Intl. Conf. on Supercomputing , pp. 11-20
- Nguyen, H.¹ John, L.K.²

18
- 0030686025
- Efficient utilization of scratch-pad memory in embedded processor applications
- March
- Preeti Ranjan Panda, Nikil D. Dutt, and Alexandru Nicolau. Efficient utilization of scratch-pad memory in embedded processor applications. In European Design and Test Conf., March 1997.
- (1997) European Design and Test Conf.
- Panda, P.R.¹ Dutt, N.D.² Nicolau, A.³

19
- 0002517538
- MMX technology extension to the Intel architecture
- August
- A. Peleg and U. Weiser. MMX technology extension to the Intel architecture. IEEE Micro, pages 43-45, August 1996.
- (1996) IEEE Micro , pp. 43-45
- Peleg, A.¹ Weiser, U.²

20
- 84984058313
- Dependence flow graphs: An algebraic approach to program dependencies
- K. Pingali, M. Beck, R. Johnson, M. Moudgill, and P. Stodghill. Dependence flow graphs: an algebraic approach to program dependencies. In POPL, pages 67-78, 1991.
- (1991) POPL , pp. 67-78
- Pingali, K.¹ Beck, M.² Johnson, R.³ Moudgill, M.⁴ Stodghill, P.⁵

21
- 0003582437
- PhD thesis, U. of Michigan
- Matthew Postiff. Compiler and Microarchitecture Mechanisms for Exploiting Registers to Improve Memory Performance. PhD thesis, U. of Michigan, 2001.
- (2001) Compiler and Microarchitecture Mechanisms for Exploiting Registers to Improve Memory Performance
- Postiff, M.¹

22
- 84948740064
- Compiler-controlled caching in superword register files for multimedia extension architectures
- Jaewook Shin, Jacqueline Chame, and Mary W. Hall. Compiler-controlled caching in superword register files for multimedia extension architectures. In PACT, 2002.
- (2002) PACT
- Shin, J.¹ Chame, J.² Hall, M.W.³

23
- 0003927035
- Addison Wesley
- Michael Wolfe. High Performance Compilers for Parallel Computing. Addison Wesley, 1996.
- (1996) High Performance Compilers for Parallel Computing
- Wolfe, M.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.