메뉴 건너뛰기




Volumn , Issue , 2012, Pages 55-64

Efficient SIMD code generation for irregular kernels

Author keywords

DFG based vectorization; Irregular kernels; Processors; SIMD

Indexed keywords

CODE GENERATION; DATA REORGANIZATION; IRREGULAR KERNELS; MEMORY REFERENCES; PERFORMANCE GAIN; SCIENTIFIC APPLICATIONS; SIMD; SINGLE INSTRUCTION , MULTIPLE DATUM; VECTORIZATION;

EID: 84863347581     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2145816.2145824     Document Type: Conference Paper
Times cited : (27)

References (26)
  • 4
    • 0001483604 scopus 로고
    • Communication optimizations for irregular scientific computations on distributed memory architectures
    • Sep.
    • R. Das, M. Uysal, J. Saltz, and Y.-S. Hwang. Communication optimizations for irregular scientific computations on distributed memory architectures. J. Parallel Distrib. Comput., 22:462-478, Sep. 1994.
    • (1994) J. Parallel Distrib. Comput. , vol.22 , pp. 462-478
    • Das, R.1    Uysal, M.2    Saltz, J.3    Hwang, Y.-S.4
  • 5
    • 0033872689 scopus 로고    scopus 로고
    • AltiVec extension to PowerPC accelerates media processing
    • DOI 10.1109/40.848475
    • K. Diefendorff, P. K. Dubey, R. Hochsprung, and H. Scales. AltiVec extension to PowerPC accelerates media processing. IEEE Micro, 20: 85-95, Mar./Apr. 2000. (Pubitemid 30585387)
    • (2000) IEEE Micro , vol.20 , Issue.2 , pp. 85-95
    • Diefendorff, K.1    Dubey, P.K.2    Hochsprung, R.3    Scales, H.4
  • 9
    • 36849034066 scopus 로고    scopus 로고
    • SPEC CPU2006 benchmark descriptions
    • Sep.
    • J. L. Henning. SPEC CPU2006 benchmark descriptions. SIGARCH Comput. Archit. News, 34:1-17, Sep. 2006.
    • (2006) SIGARCH Comput. Archit. News , vol.34 , pp. 1-17
    • Henning, J.L.1
  • 10
    • 0034250996 scopus 로고    scopus 로고
    • Compilation techniques for multimedia processors
    • Aug.
    • A. Krall and S. Lelait. Compilation techniques for multimedia processors. Int. J. Parallel Program., 28:347-361, Aug. 2000.
    • (2000) Int. J. Parallel Program. , vol.28 , pp. 347-361
    • Krall, A.1    Lelait, S.2
  • 13
    • 31844445061 scopus 로고    scopus 로고
    • PhD thesis, Computer Science Dept. University of Illinois at Urbana-Champaign, Urbana, IL, May, [online
    • C. Lattner. Macroscopic Data Structure Analysis and Optimization. PhD thesis, Computer Science Dept., University of Illinois at Urbana-Champaign, Urbana, IL, May 2005. [online] http://llvm.cs.uiuc.edu.
    • (2005) Macroscopic Data Structure Analysis and Optimization
    • Lattner, C.1
  • 23
    • 0034249157 scopus 로고    scopus 로고
    • A vectorizing compiler for multimedia extensions
    • Aug.
    • N. Sreraman and R. Govindarajan. A vectorizing compiler for multimedia extensions. Int. J. Parallel Program., 28:363-400, Aug. 2000.
    • (2000) Int. J. Parallel Program , vol.28 , pp. 363-400
    • Sreraman, N.1    Govindarajan, R.2
  • 24
    • 0001790593 scopus 로고
    • Depth-first search and linear graph algorithms
    • R. Tarjan. Depth-first search and linear graph algorithms. SIAM Journal on Computing, 1(2):146-160, 1972.
    • (1972) SIAM Journal on Computing , vol.1 , Issue.2 , pp. 146-160
    • Tarjan, R.1
  • 26
    • 32844466554 scopus 로고    scopus 로고
    • An integrated simdization framework using virtual vectors
    • ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
    • P. Wu, A. E. Eichenberger, A. Wang, and P. Zhao. An integrated simdization framework using virtual vectors. In Proceedings of the 19th annual International Conference on Supercomputing, ICS'05, pages 169-178, 2005. (Pubitemid 43251321)
    • (2005) Proceedings of the International Conference on Supercomputing , pp. 169-178
    • Wu, P.1    Eichenberger, A.E.2    Wang, A.3    Zhao, P.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.