SCOPUS Information Search Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 4420 LNCS, 2007, Pages 1-15

New algorithms for SIMD alignment

(3) Fireman, Liza a; Petrank, Erez b; Zaks, Ayal c

a TECHNION ISRAEL INSTITUTE OF TECHNOLOGY (Israel)

b MICROSOFT RESEARCH (United States)

c IBM HAIFA RESEARCH LAB (Israel)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION ALGORITHMS; DYNAMIC PROGRAMMING; MULTIPROCESSING SYSTEMS; OPTIMIZATION; PROBLEM SOLVING; VECTORS;

MULTIPROCESSORS; VECTOR PLATFORMS;

COMPUTER SOFTWARE;

EID: 37149053855 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-71229-9_1 Document Type: Conference Paper

Times cited : (11)

References (30)

1
- 0004115996
- Prentice Hall
- R. K. Ahuja, T. L. Magnanti, and J. B. Orlin. Network flows. Prentice Hall, 1993.
- (1993) Network flows
- Ahuja, R.K.¹ Magnanti, T.L.² Orlin, J.B.³

2
- 0037952146
- Morgan Kaufmann
- R. Allen and K. Kennedy. Optimizing Compilers for Modern Architectures. Morgan Kaufmann, 2001.
- (2001) Optimizing Compilers for Modern Architectures
- Allen, R.¹ Kennedy, K.²

3
- 24144474794
- Intel Press, June
- A. Bik. The Software Vectorization Handbook: Applying Multimedia Extensions for Maximum Performance. Intel Press, June 2004.
- (2004) The Software Vectorization Handbook: Applying Multimedia Extensions for Maximum Performance
- Bik, A.¹

4
- 0344908850
- Automatic intra-register vectorization for the intel architecture
- April
- A. Bik, M. Girkar, P. M. Grey, and X. Tian. Automatic intra-register vectorization for the intel architecture. International J. of Parallel Programming, 2:65-98, April 2002.
- (2002) International J. of Parallel Programming , vol.2 , pp. 65-98
- Bik, A.¹ Girkar, M.² Grey, P.M.³ Tian, X.⁴

5
- 37149024957
- V. Bouchitté, P. Boulet, A. Darte, and Y. Robert. Evaluating array expressions on massively parallel machines with communication/computation overlap, 1995.
- (1995) Evaluating array expressions on massively parallel machines with communication/computation overlap
- Bouchitté, V.¹ Boulet, P.² Darte, A.³ Robert, Y.⁴

6
- 0027311338
- Automatic array alignment in data-parallel programs
- ACM Press
- S. Chatterjee, J. R. Gilbert, R. Schreiber, and S.-H. Teng. Automatic array alignment in data-parallel programs. In Proceedings of POPL, pages 16-28. ACM Press, 1993.
- (1993) Proceedings of POPL , pp. 16-28
- Chatterjee, S.¹ Gilbert, J.R.² Schreiber, R.³ Teng, S.-H.⁴

7
- 0029238937
- Optimal evaluation of array expressions on massively parallel machines
- S. Chatterjee, J. R. Gilbert, R. Schreiber, and S.-H. Teng. Optimal evaluation of array expressions on massively parallel machines. ACM Trans. Program. Lang. Syst., 17(1):123-156, 1995.
- (1995) ACM Trans. Program. Lang. Syst , vol.17 , Issue.1 , pp. 123-156
- Chatterjee, S.¹ Gilbert, J.R.² Schreiber, R.³ Teng, S.-H.⁴

8
- 37149036736
- Second SUIF Compiler Workshop, August
- G. Cheong and M. S. Lam. An optimizer for multimedia instruction sets. In Second SUIF Compiler Workshop, August 1997.
- (1997) An optimizer for multimedia instruction sets
- Cheong, G.¹ Lam, M.S.²

9
- 0004116989
- McGraw-Hill Higher Education
- T. H. Cormen, C. Stein, R. L. Rivest, and C. E. Leiserson. Introduction to Algorithms. McGraw-Hill Higher Education, 2001.
- (2001) Introduction to Algorithms
- Cormen, T.H.¹ Stein, C.² Rivest, R.L.³ Leiserson, C.E.⁴

10
- 37149048617
- M. Corporation. Altivec technology programming interface manual. June 1999.

11
- 0026966832
- The complexity of multiway cuts (extended abstract)
- New York, NY, USA, ACM Press
- E. Dahlhaus, D. S. Johnson, C. H. Papadimitriou, P. D. Seymour, and M. Yannakakis. The complexity of multiway cuts (extended abstract). In Proceedings of the 24th ACM symposium on Theory of computing, pages 241-251, New York, NY, USA, 1992. ACM Press.
- (1992) Proceedings of the 24th ACM symposium on Theory of computing , pp. 241-251
- Dahlhaus, E.¹ Johnson, D.S.² Papadimitriou, C.H.³ Seymour, P.D.⁴ Yannakakis, M.⁵

12
- 0028512446
- On the alignment problem
- A. Darte and Y. Robert. On the alignment problem. Parallel Processing Letters, 4(3):259-270, 1994.
- (1994) Parallel Processing Letters , vol.4 , Issue.3 , pp. 259-270
- Darte, A.¹ Robert, Y.²

13
- 1642502420
- Improving effective bandwidth through compiler enhancement of global cache reuse
- C. Ding and K. Kennedy. Improving effective bandwidth through compiler enhancement of global cache reuse. J. Parallel Distrib. Comput., 64:108-134, 2004.
- (2004) J. Parallel Distrib. Comput , vol.64 , pp. 108-134
- Ding, C.¹ Kennedy, K.²

14
- 8344245462
- Vectorization for SIMD architectures with alignment constraints
- June
- A. E. Eichenberger, P. Wu, and K. O'Brien. Vectorization for SIMD architectures with alignment constraints. In Proceeding of PLDI, June 2004.
- (2004) Proceeding of PLDI
- Eichenberger, A.E.¹ Wu, P.² O'Brien, K.³

15
- 37149024346
- M.Sc. thesis, Technion, Israel Institute of Technology, Department of Computer Science, June
- L. Fireman. The complexity of SIMD alignment. M.Sc. thesis, Technion - Israel Institute of Technology, Department of Computer Science, June 2006. http://www.cs.technion.ac.il/users/wwwb/cgi-bin/tr-info.cgi/2006/MSC/ MSC-2006-17.
- (2006) The complexity of SIMD alignment
- Fireman, L.¹

16
- 84876653309
- Collective loop fusion for array contraction
- G. R. Gao, R. Olsen, V. Sarkar, and R. Thekkath. Collective loop fusion for array contraction. In Workshop on Languages and Compilers for Parallel Computing, pages 281-295, 1992.
- (1992) Workshop on Languages and Compilers for Parallel Computing , pp. 281-295
- Gao, G.R.¹ Olsen, R.² Sarkar, V.³ Thekkath, R.⁴

17
- 0026219468
- Optimal expression evaluation for data parallel architectures
- J. R. Gilbert and R. Schreiber. Optimal expression evaluation for data parallel architectures. J. Parallel Distrib. Comput., 13(1):58-64, 1991.
- (1991) J. Parallel Distrib. Comput , vol.13 , Issue.1 , pp. 58-64
- Gilbert, J.R.¹ Schreiber, R.²

18
- 0001465739
- Maximizing loop parallelism and improving data locality via loop fusion and distribution
- K. Kennedy and K. S. McKinley. Maximizing loop parallelism and improving data locality via loop fusion and distribution. In Workshop on Languages and Compilers for Parallel Computing, pages 301-320, 1993.
- (1993) Workshop on Languages and Compilers for Parallel Computing , pp. 301-320
- Kennedy, K.¹ McKinley, K.S.²

19
- 0034446825
- Exploiting superword level parallelism with multimedia instruction sets
- S. Larsen and S. Amarasinghe. Exploiting superword level parallelism with multimedia instruction sets. In Proceedings of PLDI, pages 145-156, 2000.
- (2000) Proceedings of PLDI , pp. 145-156
- Larsen, S.¹ Amarasinghe, S.²

20
- 84948766393
- Increasing and detecting memory address congruence
- S. Larsen, E. Witchel, and S. Amarasinghe. Increasing and detecting memory address congruence. In Proceedings of PACT, 2002.
- (2002) Proceedings of PACT
- Larsen, S.¹ Witchel, E.² Amarasinghe, S.³

21
- 37149021552
- D. Naishlos. Autovectorization in gcc. In Proceeding of GCC Developers Summit, pages 105-118, 2004.

22
- 4544372264
- Vectorizing for a SIMdD DSP Architecture
- D. Naishlos, M. Biberstein, S. Ben-David, and A. Zaks. Vectorizing for a SIMdD DSP Architecture. In Proceedings of CASES, pages 2-11, 2003.
- (2003) Proceedings of CASES , pp. 2-11
- Naishlos, D.¹ Biberstein, M.² Ben-David, S.³ Zaks, A.⁴

23
- 79953275887
- Multi-platform auto-vectorization
- D. Nuzman and R. Henderson. Multi-platform auto-vectorization. In Proceedings of CGO, pages 281-294, 2006.
- (2006) Proceedings of CGO , pp. 281-294
- Nuzman, D.¹ Henderson, R.²

24
- 37149019455
- Autovectorization in gcc - two years later
- D. Nuzman and A. Zaks. Autovectorization in gcc - two years later. In Proceedings of GCC Developers Summit, pages 145-158, 2006.
- (2006) Proceedings of GCC Developers Summit , pp. 145-158
- Nuzman, D.¹ Zaks, A.²

25
- 8344268421
- A preliminary study on the vectorization of multimedia applications for multimedia extensions
- October
- G. Ren, P. Wu, and D. Padua. A preliminary study on the vectorization of multimedia applications for multimedia extensions. In 16th International Workshop of Languages and Compilers for Parallel Computing, October 2003.
- (2003) 16th International Workshop of Languages and Compilers for Parallel Computing
- Ren, G.¹ Wu, P.² Padua, D.³

26
- 33745222449
- Optimizing data permutations for simd devices
- G. Ren, P. Wu, and D. A. Padua. Optimizing data permutations for simd devices. In Proceedings of PLDI, pages 118-131, 2006.
- (2006) Proceedings of PLDI , pp. 118-131
- Ren, G.¹ Wu, P.² Padua, D.A.³

27
- 33646554301
- Superword-level parallelism in the presence of control flow
- Washington, DC, USA, IEEE Computer Society
- J. Shin, M. Hall, and J. Chame. Superword-level parallelism in the presence of control flow. In Proceedings of CGO, pages 165-175, Washington, DC, USA, 2005. IEEE Computer Society.
- (2005) Proceedings of CGO , pp. 165-175
- Shin, J.¹ Hall, M.² Chame, J.³

28
- 37149001737
- C. B. Software. VAST-F/AltiVec: Automatic Fortran Vectorizer for PowerPC Vector Unit. http://www.psrv.com/vastaltivec.html, 2004.

29
- 0003422462
- Springer-Verlag, 1st edition
- V. V. Vazirani. Approximation Algorithms, pages 38-40,155-160. Springer-Verlag, 1st edition, 2001.
- (2001) Approximation Algorithms
- Vazirani, V.V.¹

30
- 33646833599
- Efficient simd code generation for runtime alignment and length conversion
- Washington, DC, USA, IEEE Computer Society
- P. Wu, A. E. Eichenberger, and A. Wang. Efficient simd code generation for runtime alignment and length conversion. In Proceedings of CGO, pages 153-164, Washington, DC, USA, 2005. IEEE Computer Society.
- (2005) Proceedings of CGO , pp. 153-164
- Wu, P.¹ Eichenberger, A.E.² Wang, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.