메뉴 건너뛰기




Volumn , Issue , 2013, Pages 341-351

Breaking SIMD shackles with an exposed flexible microarchitecture and the access execute PDG

Author keywords

Accelerators; Access Execute Program Dependence Graph; DySER; SIMD; Vectorization

Indexed keywords

DATA-LEVEL PARALLELISM; DYSER; MICRO ARCHITECTURES; MODERN MICROPROCESSOR; PROGRAM DEPENDENCE GRAPH; SHORT VECTORS; SIMD; VECTORIZATION;

EID: 84887502088     PISSN: 1089795X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/PACT.2013.6618830     Document Type: Conference Paper
Times cited : (25)

References (38)
  • 6
    • 34547185000 scopus 로고    scopus 로고
    • Scalable subgraph mapping for acyclic computation accelerators
    • N. Clark, A. Hormati, S. Mahlke, and S. Yehia, "Scalable subgraph mapping for acyclic computation accelerators," in CASES'06.
    • CASES'06
    • Clark, N.1    Hormati, A.2    Mahlke, S.3    Yehia, S.4
  • 8
    • 77954724842 scopus 로고    scopus 로고
    • Sams multi-layout memory: Providing multiple views of data to boost simd performance
    • C. Gou, G. Kuzmanov, and G. Gaydadjiev, "Sams multi-layout memory: providing multiple views of data to boost simd performance," in ICS'10.
    • ICS'10
    • Gou, C.1    Kuzmanov, G.2    Gaydadjiev, G.3
  • 10
    • 79955890625 scopus 로고    scopus 로고
    • Dynamically specialized datapaths for energy efficient computing
    • V. Govindaraju, C.-H. Ho, and K. Sankaralingam, "Dynamically specialized datapaths for energy efficient computing," in HPCA 2011.
    • (2011) HPCA
    • Govindaraju, V.1    Ho, C.-H.2    Sankaralingam, K.3
  • 11
    • 84871291822 scopus 로고    scopus 로고
    • Bundled execution of recurring traces for energy-efficient general purpose processing
    • S. Gupta, S. Feng, A. Ansari, S. Mahlke, and D. August, "Bundled execution of recurring traces for energy-efficient general purpose processing," in MICRO-44.
    • MICRO-44
    • Gupta, S.1    Feng, S.2    Ansari, A.3    Mahlke, S.4    August, D.5
  • 12
    • 0031360911 scopus 로고    scopus 로고
    • Garp: A MIPS processor with a reconfigurable coprocessor
    • J. R. Hauser and J. Wawrzynek, "Garp: A MIPS Processor with a Reconfigurable Coprocessor," in FCCM'97.
    • FCCM'97
    • Hauser, J.R.1    Wawrzynek, J.2
  • 15
    • 0034446825 scopus 로고    scopus 로고
    • Exploiting superword level parallelism with multimedia instruction sets
    • S. Larsen and S. Amarasinghe, "Exploiting superword level parallelism with multimedia instruction sets," in PLDI'00.
    • PLDI'00
    • Larsen, S.1    Amarasinghe, S.2
  • 19
    • 77951154340 scopus 로고    scopus 로고
    • The gpu computing era
    • Mar.
    • J. Nickolls and W. J. Dally, "The gpu computing era," IEEE Micro, vol. 30, no. 2, Mar. 2010.
    • (2010) IEEE Micro , vol.30 , Issue.2
    • Nickolls, J.1    Dally, W.J.2
  • 21
    • 79953275887 scopus 로고    scopus 로고
    • Multi-platform auto-vectorization
    • D. Nuzman and R. Henderson, "Multi-platform auto-vectorization, " in CGO'06.
    • CGO'06
    • Nuzman, D.1    Henderson, R.2
  • 22
    • 33746034953 scopus 로고    scopus 로고
    • Auto-vectorization of interleaved data for simd
    • D. Nuzman, I. Rosen, and A. Zaks, "Auto-vectorization of interleaved data for simd," in PLDI'06.
    • PLDI'06
    • Nuzman, D.1    Rosen, I.2    Zaks, A.3
  • 23
    • 0022874874 scopus 로고
    • Advanced compiler optimizations for supercomputers
    • D. A. Padua and M. J. Wolfe, "Advanced compiler optimizations for supercomputers," Commun. ACM, 1986.
    • (1986) Commun. ACM
    • Padua, D.A.1    Wolfe, M.J.2
  • 25
    • 84876586321 scopus 로고    scopus 로고
    • Libra: Tailoring simd execution using heterogeneous hardware and dynamic configurability
    • Y. Park, J. J. K. Park, H. Park, and S. Mahlke, "Libra: Tailoring simd execution using heterogeneous hardware and dynamic configurability," in MICRO'12.
    • MICRO'12
    • Park, Y.1    Park, J.J.K.2    Park, H.3    Mahlke, S.4
  • 26
    • 84863353689 scopus 로고    scopus 로고
    • Simd defragmenter: Efficient ilp realization on data-parallel architectures
    • Y. Park, S. Seo, H. Park, H. K. Cho, and S. Mahlke, "Simd defragmenter: efficient ilp realization on data-parallel architectures," in ASPLOS'12.
    • ASPLOS'12
    • Park, Y.1    Seo, S.2    Park, H.3    Cho, H.K.4    Mahlke, S.5
  • 27
    • 84870653904 scopus 로고    scopus 로고
    • Ispc: A spmd compiler for highperformance cpu programming
    • M. Pharr and W. R. Mark, ""ispc: A spmd compiler for highperformance cpu programming"," in InPar 2012.
    • (2012) InPar
    • Pharr, M.1    Mark, W.R.2
  • 28
    • 33745222449 scopus 로고    scopus 로고
    • Optimizing data permutations for simd devices
    • G. Ren, P. Wu, and D. Padua, "Optimizing data permutations for simd devices," in PLDI'06.
    • PLDI'06
    • Ren, G.1    Wu, P.2    Padua, D.3
  • 33
    • 47849103500 scopus 로고    scopus 로고
    • Introducing control flow into vectorized code
    • J. Shin, "Introducing control flow into vectorized code," in PACT'07.
    • PACT'07
    • Shin, J.1
  • 34
    • 85088882721 scopus 로고    scopus 로고
    • Vector instruction set support for conditional operations
    • J. E. Smith, G. Faanes, and R. Sugumar, "Vector instruction set support for conditional operations," in ISCA'00.
    • ISCA'00
    • Smith, J.E.1    Faanes, G.2    Sugumar, R.3
  • 35
    • 84857819522 scopus 로고    scopus 로고
    • Using machine learning to improve automatic vectorization
    • K. Stock, L.-N. Pouchet, and P. Sadayappan, "Using machine learning to improve automatic vectorization," TACO 2012.
    • (2012) TACO
    • Stock, K.1    Pouchet, L.-N.2    Sadayappan, P.3
  • 37
    • 84887444440 scopus 로고    scopus 로고
    • Relaxing simd control flow constraints using loop transformations
    • R. v. Hanxleden and K. Kennedy, "Relaxing simd control flow constraints using loop transformations," in PLDI'92.
    • PLDI'92
    • Hanxleden, R.V.1    Kennedy, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.