메뉴 건너뛰기




Volumn , Issue , 2012, Pages 1233-1238

Optimizing memory hierarchy allocation with loop transformations for high-level synthesis

Author keywords

data reuse; high level synthesis; loop transformation; memory hierarchy optimization

Indexed keywords

APPLICATION SYSTEMS; BANDWIDTH CONSUMPTION; BRANCH AND BOUNDS; COMPUTATION COMPLEXITY; DATA REUSE; HIGH LEVEL SYNTHESIS; LOOP FUSION; LOOP TRANSFORMATION; MEMORY HIERARCHY; NESTED LOOPS; OFF-CHIP; OFF-CHIP MEMORIES; ON CHIP MEMORY; ON-CHIP BUFFERS; OPTIMAL SOLUTIONS;

EID: 84863555595     PISSN: 0738100X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2228360.2228586     Document Type: Conference Paper
Times cited : (50)

References (36)
  • 4
    • 3142688389 scopus 로고    scopus 로고
    • A data locality optimizing algorithm
    • New York, NY, USA
    • M. E. Wolf and M. S. Lam, "A data locality optimizing algorithm," in PLDI '91. New York, NY, USA.
    • PLDI '91
    • Wolf, M.E.1    Lam, M.S.2
  • 5
    • 0026232450 scopus 로고
    • A loop transformation theory and an algorithm to maximize parallelism
    • Oct.
    • M. E. Wolf and M. S. Lam, "A loop transformation theory and an algorithm to maximize parallelism," IEEE Trans. Parallel Distrib. Syst., vol. 2, pp. 452-471, Oct. 1991.
    • (1991) IEEE Trans. Parallel Distrib. Syst. , vol.2 , pp. 452-471
    • Wolf, M.E.1    Lam, M.S.2
  • 6
    • 0001448065 scopus 로고
    • Some efficient solutions to the affine scheduling problem: Part II. multidimensional time
    • P. Feautrier, "Some efficient solutions to the affine scheduling problem: Part II. multidimensional time," International Journal of Parallel Programming, vol. 21, pp. 389-420, 1992.
    • (1992) International Journal of Parallel Programming , vol.21 , pp. 389-420
    • Feautrier, P.1
  • 7
    • 0032662841 scopus 로고    scopus 로고
    • An affine partitioning algorithm to maximize parallelism and minimize communication
    • New York, Aug.
    • A. W. Lim, G. I. Cheong, and M. S. Lam, "An affine partitioning algorithm to maximize parallelism and minimize communication," in ICS '99. New York, Aug. 1999.
    • (1999) ICS '99
    • Lim, A.W.1    Cheong, G.I.2    Lam, M.S.3
  • 8
    • 10844259103 scopus 로고    scopus 로고
    • Synthesizing transformations for locality enhancement of imperfectly-nested loop nests
    • N. Ahmed, N. Mateev, and K. Pingali, "Synthesizing transformations for locality enhancement of imperfectly-nested loop nests," IJPP, vol. 29, pp. 493-544, Oct. 2001. (Pubitemid 33818443)
    • (2001) International Journal of Parallel Programming , vol.29 , Issue.5 , pp. 493-544
    • Ahmed, N.1    Mateev, N.2    Pingali, K.3
  • 9
    • 57349139452 scopus 로고    scopus 로고
    • A practical automatic polyhedral parallelizer and locality optimizer
    • New York, NY, USA
    • U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, "A practical automatic polyhedral parallelizer and locality optimizer," in PLDI '08, pp. 101-113, New York, NY, USA,
    • PLDI '08 , pp. 101-113
    • Bondhugula, U.1    Hartono, A.2    Ramanujam, J.3    Sadayappan, P.4
  • 10
    • 78149255973 scopus 로고    scopus 로고
    • A model for fusion and code motion in an automatic parallelizing compiler
    • Sept.
    • U. Bondhugula, O. Gunluk, S. Dash, and L. Renganarayanan, "A model for fusion and code motion in an automatic parallelizing compiler," in PACT '10, pp. 343-352, Sept. 2010,
    • (2010) PACT '10 , pp. 343-352
    • Bondhugula, U.1    Gunluk, O.2    Dash, S.3    Renganarayanan, L.4
  • 13
    • 84863557512 scopus 로고    scopus 로고
    • Data reuse exploration techniques for loop-dominated applications
    • Mar.
    • T. Van Achteren, G. Deconinck, F. Catthoor, and R. Lauwereins, "Data reuse exploration techniques for loop-dominated applications," in DATE'02, pp. 428-435, Mar. 2002.
    • (2002) DATE'02 , pp. 428-435
    • Van Achteren, T.1    Deconinck, G.2    Catthoor, F.3    Lauwereins, R.4
  • 15
    • 0032303141 scopus 로고    scopus 로고
    • Formalized methodology for data reuse: Exploration for low-power hierarchical memory mappings
    • S. Wuytack, J.-P. Diguet, F. V. M. Catthoor, and H. J. De Man, "Formalized methodology for data reuse: exploration for low-power hierarchical memory mappings," IEEE Trans. on VLSI, 1998.
    • (1998) IEEE Trans. on VLSI
    • Wuytack, S.1    Diguet, J.-P.2    Catthoor, F.V.M.3    De Man, H.J.4
  • 16
    • 0036053351 scopus 로고    scopus 로고
    • Compiler-directed scratch pad memory hierarchy design and management
    • June
    • M. Kandemir and A. Choudhary, "Compiler-directed scratch pad memory hierarchy design and management," in DAC'02, pp. 628-633, June 2002.
    • (2002) DAC'02 , pp. 628-633
    • Kandemir, M.1    Choudhary, A.2
  • 17
    • 84893726637 scopus 로고    scopus 로고
    • Layer assignment techniques for low energy in multi-layered memory organisations
    • Mar.
    • E. Brockmeyer, M. Miranda, and F. Catthoor, "Layer assignment techniques for low energy in multi-layered memory organisations," in DATE'03, pp. 1070-1075, Mar. 2003.
    • (2003) DATE'03 , pp. 1070-1075
    • Brockmeyer, E.1    Miranda, M.2    Catthoor, F.3
  • 18
    • 80052674346 scopus 로고    scopus 로고
    • A reuse-aware prefetching scheme for scratchpad memory
    • J. Cong, H. Huang, C. Liu, and Y. Zou, "A reuse-aware prefetching scheme for scratchpad memory," in DAC'11, pp. 960-965, 2011.
    • (2011) DAC'11 , pp. 960-965
    • Cong, J.1    Huang, H.2    Liu, C.3    Zou, Y.4
  • 19
    • 47849112050 scopus 로고    scopus 로고
    • Data-reuse-driven energy-aware cosynthesis of scratch pad memory and hierarchical bus-based communication architecture for multiprocessor streaming applications
    • I. Issenin, E. Brockmeyer, B. Durinck, and N. D. Dutt, "Data-reuse-driven energy-aware cosynthesis of scratch pad memory and hierarchical bus-based communication architecture for multiprocessor streaming applications," IEEE trans. on CAD, vol. 27, no. 8, pp. 1439-1452, 2008.
    • (2008) IEEE Trans. on CAD , vol.27 , Issue.8 , pp. 1439-1452
    • Issenin, I.1    Brockmeyer, E.2    Durinck, B.3    Dutt, N.D.4
  • 21
    • 0032738222 scopus 로고    scopus 로고
    • Local memory exploration and optimization in embedded systems
    • P. R. Panda, N. D. Dutt, and A. Nicolau, "Local memory exploration and optimization in embedded systems," IEEE Trans. on CAD, vol. 18, no. 1, pp. 3-13, 1999.
    • (1999) IEEE Trans. on CAD , vol.18 , Issue.1 , pp. 3-13
    • Panda, P.R.1    Dutt, N.D.2    Nicolau, A.3
  • 23
    • 63549135938 scopus 로고    scopus 로고
    • Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories
    • Feb.
    • M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan, "Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories," in PPoPP'08, pp. 1-10, Feb. 2008.
    • (2008) PPoPP'08 , pp. 1-10
    • Baskaran, M.M.1    Bondhugula, U.2    Krishnamoorthy, S.3    Ramanujam, J.4    Rountev, A.5    Sadayappan, P.6
  • 24
    • 77955224068 scopus 로고    scopus 로고
    • Combining data reuse with data-level parallelization for FPGA-targeted hardware compilation: A geometric programming framework
    • Q. Liu, G. A. Constantinides, K. Masselos, and P. Cheung, "Combining data reuse with data-level parallelization for FPGA-targeted hardware compilation: A geometric programming framework," IEEE Trans. on CAD, vol. 28, no. 3, 2009.
    • (2009) IEEE Trans. on CAD , vol.28 , Issue.3
    • Liu, Q.1    Constantinides, G.A.2    Masselos, K.3    Cheung, P.4
  • 25
    • 84855778017 scopus 로고    scopus 로고
    • Combined loop transformation and hierarchy allocation in data reuse optimization
    • Nov.
    • J. Cong, P. Zhang, and Y. Zou, "Combined loop transformation and hierarchy allocation in data reuse optimization," in ICCAD'11, pp. 185-192, Nov. 2011.
    • (2011) ICCAD'11 , pp. 185-192
    • Cong, J.1    Zhang, P.2    Zou, Y.3
  • 28
    • 33746593747 scopus 로고    scopus 로고
    • Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies
    • June
    • S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello, M. Sigler, and O. Temam, "Semi-Automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies," IJPP, vol. 34, no. 3, June 2006.
    • (2006) IJPP , vol.34 , Issue.3
    • Girbal, S.1    Vasilache, N.2    Bastoul, C.3    Cohen, A.4    Parello, D.5    Sigler, M.6    Temam, O.7
  • 29
    • 84863537937 scopus 로고    scopus 로고
    • Barvinok library
    • Barvinok library. http://freshmeat.net/projects/barvinok
  • 34
    • 84863552043 scopus 로고    scopus 로고
    • Xilinx ISE Design Suite
    • Xilinx ISE Design Suite. http://www.xilinx.com/products/design-tools/ise- design-suite/.
  • 35
    • 84863540565 scopus 로고    scopus 로고
    • Polyhedral benchmark suite V3.1
    • Polyhedral benchmark suite v3.1. http://www.cse.ohio-state.edu/~pouchet/ software/polybench/ .


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.