SCOPUS 정보 검색 플랫폼

Proceedings - International Symposium on Code Generation and Optimization, CGO 2011

Volumn , Issue , 2011, Pages 161-170

On-chip cache hierarchy-aware tile scheduling for multicore machines

(4) Liu, Jun a Zhang, Yuanrui a Ding, Wei a Kandemir, Mahmut a

a The Pennsylvania State University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ALTERNATE METHOD; APPLICATION PROGRAMS; CACHE MISS; COMPUTATION KERNEL; DATA REUSE; EMBEDDED APPLICATION; EXECUTION TIME; INTEL MACHINES; ITERATION SPACES; KEY COMPONENT; MULTI-CORE MACHINES; MULTI-PROCESSOR PLATFORMS; MULTITHREADED CODE GENERATION; ON-CHIP CACHE; PARALLELIZATIONS; SCHEDULING STRATEGIES; SOURCE-TO-SOURCE TRANSLATIONS; UNIPROCESSORS;

AUTOMATIC PROGRAMMING; NETWORK COMPONENTS; OPTIMIZATION; SCHEDULING ALGORITHMS;

MULTITASKING;

EID: 79957454903 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CGO.2011.5764684 Document Type: Conference Paper

Times cited : (20)

References (47)

1
- 78049338046
- "Single-chip cloud computer," http://techresearch.intel.com/ articles/Tera-Scale/1826.htm.
- Single-chip Cloud Computer

2
- 38149022809
- "Teraflops research chip," http://techresearch.intel.com/ articles/Tera-Scale/1449.htm.
- Teraflops Research Chip

3
- 79957477113
- "IBM Power7 - smarter systems for a smarter planet," http://www.ibm.com/.
- IBM Power7 - Smarter Systems for A Smarter Planet

4
- 79957464835
- "Amd magny-cours processors," http://www.amd.com/.
- Amd Magny-cours Processors

5
- 0003863228
- Springer-Verlag
- G. Golub and C. Reinsch, "Handbook for automatic computation ii, linear algebra," Springer-Verlag, 1971.
- (1971) Handbook for Automatic Computation Ii, Linear Algebra
- Golub, G.¹ Reinsch, C.²

6
- 57349110181
- Technical Report, The Ohio State University
- U. Bondhugula et al., "Affine transformations for communication minimal parallelization and locality optimization of arbitrarily-nested loop sequences," Technical Report, The Ohio State University, 2007.
- (2007) Affine Transformations for Communication Minimal Parallelization and Locality Optimization of Arbitrarily-nested Loop Sequences
- Bondhugula, U.¹

7
- 0032662841
- An affine partitioning algorithm to maximize parallelism and minimize communication
- A. W. Lim et al., "An affine partitioning algorithm to maximize parallelism and minimize communication," Proc. of ICS, 1999.
- Proc. of ICS, 1999
- Lim, A.W.¹

8
- 84947997685
- Efficient code generation for automatic parallelization and optimization
- C. Bastoul, "Efficient code generation for automatic parallelization and optimization," International Symposium on Parallel and Distributed Computing, 2003.
- International Symposium on Parallel and Distributed Computing, 2003
- Bastoul, C.¹

9
- 57349145904
- Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model
- U. Bondhugula et al., "Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model," Proc. of the Joint European Conferences on Theory and Practice of Software 17th international conference on Compiler construction, 2008.
- Proc. of the Joint European Conferences on Theory and Practice of Software 17th International Conference on Compiler Construction, 2008
- Bondhugula, U.¹

10
- 79957518643
- Maximizing parallelism and minimizing synchronization with affine transforms
- A. W. Lim and M. S. Lam, "Maximizing parallelism and minimizing synchronization with affine transforms," Proc. of POPL, 1997.
- Proc. of POPL, 1997
- Lim, A.W.¹ Lam, M.S.²

11
- 0000563616
- Compiler algorithms for optimizing locality and parallelism on shared and distributedmemory machines
- M. Kandemir et al., "Compiler algorithms for optimizing locality and parallelism on shared and distributedmemory machines," J. Parallel Distrib. Comput., 2000.
- (2000) J. Parallel Distrib. Comput.
- Kandemir, M.¹

12
- 0032315190
- Reuse-driven tiling for improving data locality
- J. Xue and C.-H. Huang, "Reuse-driven tiling for improving data locality," Int. J. Parallel Program., 1998.
- (1998) Int. J. Parallel Program.
- Xue, J.¹ Huang, C.-H.²

13
- 0024935630
- More iteration space tiling
- M. Wolfe, "More iteration space tiling," Proc. of SC, 1989.
- Proc. of SC, 1989
- Wolfe, M.¹

14
- 84976827033
- A data locality optimizing algorithm
- M. E. Wolf and M. S. Lam, "A data locality optimizing algorithm," SIGPLAN Not., 1991.
- (1991) SIGPLAN Not.
- Wolf, M.E.¹ Lam, M.S.²

15
- 79957502820
- Selecting tile shape for minimal execution time
- K. Högstedt et al., "Selecting tile shape for minimal execution time," Proc. of SPAA, 1999.
- Proc. of SPAA, 1999
- Högstedt, K.¹

16
- 0032635362
- New tiling techniques to improve cache temporal locality
- Y. Song and Z. Li, "New tiling techniques to improve cache temporal locality," Proc. of PLDI, 1999.
- Proc. of PLDI, 1999
- Song, Y.¹ Li, Z.²

17
- 79957459413
- Iteration space tiling for distributed memory machines
- J. Ramanujam and P. Sadayappan, "Iteration space tiling for distributed memory machines," Languages, compilers and run-time environments for distributed memory machines, 1992.
- (1992) Languages, Compilers and Run-time Environments for Distributed Memory Machines
- Ramanujam, J.¹ Sadayappan, P.²

18
- 79957448379
- Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors
- M. M. Baskaran et al., "Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors," Proc. of PPoPP, 2009.
- Proc. of PPoPP, 2009
- Baskaran, M.M.¹

19
- 79957497933
- Adaptive loop tiling for a multi-cluster cmp
- J. Zhao et al., "Adaptive loop tiling for a multi-cluster cmp," Proc. of International Conference on Algorithms and Architectures for Parallel Processing, 2008.
- Proc. of International Conference on Algorithms and Architectures for Parallel Processing, 2008
- Zhao, J.¹

20
- 0001592349
- Supernode partitioning
- F. Irigoin and R. Triolet, "Supernode partitioning," Proc. of POPL, 1988.
- Proc. of POPL, 1988
- Irigoin, F.¹ Triolet, R.²

21
- 38249009019
- Tiling multidimensional iteration spaces for multicomputers
- J. Ramanujam and P. Sadayappan, "Tiling multidimensional iteration spaces for multicomputers," J. Parallel and Distributed Computing, 1992.
- (1992) J. Parallel and Distributed Computing
- Ramanujam, J.¹ Sadayappan, P.²

22
- 79957459923
- Cache-aware partitioning of multidimensional iteration spaces
- A. Kejariwal et al., "Cache-aware partitioning of multidimensional iteration spaces," Proc. of SYSTOR, 2009.
- Proc. of SYSTOR, 2009
- Kejariwal, A.¹

23
- 33847108581
- Hierarchically tiled arrays for parallelism and locality
- J. Guo et al., "Hierarchically tiled arrays for parallelism and locality," Proc. of IPDPS, 2006.
- Proc. of IPDPS, 2006
- Guo, J.¹

24
- 79957443193
- Design and use of htalib: A library for hierarchically tiled arrays
- G. Bikshandi et al., "Design and use of htalib: a library for hierarchically tiled arrays," Proc. of LCPC, 2007.
- Proc. of LCPC, 2007
- Bikshandi, G.¹

25
- 79957506046
- Data-centric multi-level blocking
- I. Kodukula et al., "Data-centric multi-level blocking," Proc. of PLDI, 1997.
- Proc. of PLDI, 1997
- Kodukula, I.¹

26
- 33847150556
- Selecting the tile shape to reduce the total communication volume
- N. Drosinos et al., "Selecting the tile shape to reduce the total communication volume," Proc. of IPDPS, 2006.
- Proc. of IPDPS, 2006
- Drosinos, N.¹

27
- 85009352487
- Tile size selection using cache organization and data layout
- S. Coleman and K. S. McKinley, "Tile size selection using cache organization and data layout," Proc. of PLDI, 1995.
- Proc. of PLDI, 1995
- Coleman, S.¹ McKinley, K.S.²

28
- 33748307622
- An analytical model for loop tiling and its solution
- V. Sarkar and N. Megiddo, "An analytical model for loop tiling and its solution," Proc. of ISPASS, 2000.
- (2000) Proc. of ISPASS
- Sarkar, V.¹ Megiddo, N.²

29
- 70449702074
- Parametric multi-level tiling of imperfectly nested loops
- A. Hartono et al., "Parametric multi-level tiling of imperfectly nested loops," Proc. of ICS, 2009.
- Proc. of ICS, 2009
- Hartono, A.¹

30
- 79957480502
- Iterative optimization in the polyhedral model: Part i, one-dimensional time
- L.-N. Pouchet et al., "Iterative optimization in the polyhedral model: Part i, one-dimensional time," Proc. of CGO, 2007.
- Proc. of CGO, 2007
- Pouchet, L.-N.¹

31
- 85088886364
- Blocking and array contraction across arbitrarily nested loops using affine partitioning
- A. W. Lim et al., "Blocking and array contraction across arbitrarily nested loops using affine partitioning," Proc. of PPoPP, 2001.
- Proc. of PPoPP, 2001
- Lim, A.W.¹

32
- 74049164978
- A practical automatic polyhedral parallelizer and locality optimizer
- U. Bondhugula et al., "A practical automatic polyhedral parallelizer and locality optimizer," Proc. of PLDI, 2008.
- Proc. of PLDI, 2008
- Bondhugula, U.¹

33
- 79957460491
- "A polyhedral automatic parallelizer and locality optimizer for multicores," http://pluto-compiler.sourceforge.net.
- A Polyhedral Automatic Parallelizer and Locality Optimizer for Multicores

34
- 79957489791
- Optimizing shared cache behavior of chip multiprocessors
- M. Kandemir et al., "Optimizing shared cache behavior of chip multiprocessors," Proc. of Micro, 2009.
- Proc. of Micro, 2009
- Kandemir, M.¹

35
- 79960161840
- Cache topology aware computation mapping for multicores
- M. Kandemir, T. Yemliha et al., "Cache topology aware computation mapping for multicores," Proc. of PLDI, 2010.
- Proc. of PLDI, 2010
- Kandemir, M.¹ Yemliha, T.²

36
- 79957456798
- Compilation for explicitly managed memory hierarchies
- T. J. Knight et al., "Compilation for explicitly managed memory hierarchies," Proc. of PPoPP, 2007.
- Proc. of PPoPP, 2007
- Knight, T.J.¹

37
- 84988767938
- Modeling parallel computers as memory hierarchies
- B. Alpern et al., "Modeling parallel computers as memory hierarchies," Proc. of Programming Models for Massively Parallel Computers, 1993.
- Proc. of Programming Models for Massively Parallel Computers, 1993
- Alpern, B.¹

38
- 85087537552
- Facilitating the search for compositions of program transformations
- A. Cohen et al., "Facilitating the search for compositions of program transformations," Proc. of ICS, 2005.
- Proc. of ICS, 2005
- Cohen, A.¹

39
- 0346233054
- Data dependence and data-flow analysis of arrays
- D. E. Maydan et al., "Data dependence and data-flow analysis of arrays," Proc. of International Workshop on Languages and Compilers for Parallel Computing, 1993.
- Proc. of International Workshop on Languages and Compilers for Parallel Computing, 1993
- Maydan, D.E.¹

40
- 0003690189
- John Wiley & Sons, Chichester
- A. Schrijver, "Theory of linear and integer programming," John Wiley & Sons, Chichester, 1986.
- (1986) Theory of Linear and Integer Programming
- Schrijver, A.¹

41
- 79957501879
- Counting integer points in parametric polytopes using barvinok's rational functions
- S. Verdoolaege et al., "Counting integer points in parametric polytopes using barvinok's rational functions," Journal Algorithmica, 2007.
- (2007) Journal Algorithmica
- Verdoolaege, S.¹

42
- 1242331527
- Approximation algorithms for minimum k-cut
- N. Guttmann-Beck and R. Hassin, "Approximation algorithms for minimum k-cut," Algorithmica, 2000.
- (2000) Algorithmica
- Guttmann-Beck, N.¹ Hassin, R.²

43
- 10444289646
- Code generation in the polyhedral model is easier than you think
- C. Bastoul, "Code generation in the polyhedral model is easier than you think," Proc. of PACT, 2004.
- Proc. of PACT, 2004
- Bastoul, C.¹

44
- 84910075371
- "The chunky loop generator," http://www.cloog.org.
- The Chunky Loop Generator

45
- 77954024228
- Technical Report, LRI, Paris-Sud University
- C. Bastoul, "Extracting polyhedral representation from high level languages," Technical Report, LRI, Paris-Sud University, 2008.
- (2008) Extracting Polyhedral Representation from High Level Languages
- Bastoul, C.¹

46
- 79957441795
- "Chunky analyzer for dependencies in loops," http://www.lri.fr/ bastoul/development/candl/.
- Chunky Analyzer for Dependencies in Loops

47
- 79957451369
- "Hardware-based performance monitoring interface for linux," http://perfmon2.sourceforge.net/.
- Hardware-based Performance Monitoring Interface for Linux

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.