메뉴 건너뛰기




Volumn , Issue , 2011, Pages 243-252

Enhancing data locality for dynamic simulations through asynchronous data transformations and adaptive control

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE CONTROL; ASYNCHRONOUS DATA; CRITICAL PATHS; DATA LOCALITY; DATA REORDERING; DATA TRANSFORMATION; DYNAMIC ADAPTATIONS; HETEROGENEOUS CHIP MULTIPROCESSOR; MEMORY REFERENCES; PERFORMANCE IMPROVEMENTS; PROGRAM STATE; RUNTIME OPTIMIZATION; TRADITIONAL TECHNIQUES;

EID: 84856544146     PISSN: 1089795X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/PACT.2011.56     Document Type: Conference Paper
Times cited : (16)

References (30)
  • 1
    • 84856515925 scopus 로고    scopus 로고
    • libpfm4
    • libpfm4. http://perfmon2.sourceforge.net/docs.html.
  • 2
    • 84856519788 scopus 로고    scopus 로고
    • NVIDIA CUDA. http://www.nvidia.com/cuda.
  • 4
    • 0023346636 scopus 로고
    • Partitioning strategy for nonuniform problems on multiprocessors
    • M. Berger and S. Bokhari. A partitioning strategy for non-uniform problems on multiprocessors. IEEE Trans. Computers, 37(12):570-580, 1987. (Pubitemid 17582501)
    • (1987) IEEE Transactions on Computers , vol.C-36 , Issue.5 , pp. 570-580
    • Berger Marsha, J.1    Bokhari Shahid, H.2
  • 7
    • 0001483604 scopus 로고
    • Communication optimizations for irregular scientific computations on distributioned memory architectures
    • R. Das, M. Uysal, J. Saltz, and Y.-S. Hwang. Communication optimizations for irregular scientific computations on distributioned memory architectures. Journal of Parallel and Distributed Computing, 22(3):462-479, 1994.
    • (1994) Journal of Parallel and Distributed Computing , vol.22 , Issue.3 , pp. 462-479
    • Das, R.1    Uysal, M.2    Saltz, J.3    Hwang, Y.-S.4
  • 8
    • 1642502420 scopus 로고    scopus 로고
    • Improving effective bandwidth through compiler enhancement of global cache reuse
    • DOI 10.1016/j.jpdc.2003.09.005
    • C. Ding and K. Kennedy. Improving effective bandwidth through compiler enhancement of global cache reuse. Journal of Parallel and Distributed Computing, 64(1):108-134, 2004. (Pubitemid 38117742)
    • (2004) Journal of Parallel and Distributed Computing , vol.64 , Issue.1 , pp. 108-134
    • Ding, C.1    Kennedy, K.2
  • 12
    • 33745715056 scopus 로고    scopus 로고
    • Exploiting locality for irregular scientific codes
    • DOI 10.1109/TPDS.2006.88
    • H. Han and C.-W. Tseng. Exploiting locality for irregular scientific codes. IEEE Transactions on Parallel Distributed Systems, 17(7):606-618, 2006. (Pubitemid 43997184)
    • (2006) IEEE Transactions on Parallel and Distributed Systems , vol.17 , Issue.7 , pp. 606-618
    • Han, H.1    Tseng, C.-W.2
  • 14
    • 0009406160 scopus 로고
    • A fast and high quality multilevel scheme for partitioning irregular graphs
    • G. Karypis and V. Kumar. A fast and high quality multilevel scheme for partitioning irregular graphs. In Proceedings of ICPP, 1995.
    • (1995) Proceedings of ICPP
    • Karypis, G.1    Kumar, V.2
  • 15
    • 79958785075 scopus 로고    scopus 로고
    • Region-based parallelization of irregular reductions onexplicitly managed memory hierarchies
    • S. Kim, H. Han, and K. Choe. Region-based parallelization of irregular reductions onexplicitly managed memory hierarchies. Journal of Supercomputing, 2009.
    • (2009) Journal of Supercomputing
    • Kim, S.1    Han, H.2    Choe, K.3
  • 17
    • 67650081010 scopus 로고    scopus 로고
    • OpenMP to GPGPU: A compiler framework for automatic translation and optimization
    • S. Lee, S. Min, and R. Eigenmann. OpenMP to GPGPU: A compiler framework for automatic translation and optimization. In Proceedings of PPoPP, 2009.
    • (2009) Proceedings of PPoPP
    • Lee, S.1    Min, S.2    Eigenmann, R.3
  • 18
    • 0016940739 scopus 로고
    • Comparative analysis of the cuthill-mckee and the reverse cuthill-mckee ordering algorithms for sparse matrices
    • April
    • W. Liu and A. Sherman. Comparative analysis of the cuthill-mckee and the reverse cuthill-mckee ordering algorithms for sparse matrices. SIAM J. Numerical Analysis, 13(2), April 1976.
    • (1976) SIAM J. Numerical Analysis , vol.13 , pp. 2
    • Liu, W.1    Sherman, A.2
  • 21
  • 24
    • 77954709868 scopus 로고    scopus 로고
    • Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations
    • V. Ravi, W. Ma, D. Chiu, and G. Agrawal. compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations. In Proceedings of ICS, 2010.
    • (2010) Proceedings of ICS
    • Ravi, V.1    Ma, W.2    Chiu, D.3    Agrawal, G.4
  • 27
    • 77954691442 scopus 로고    scopus 로고
    • A gpgpu compiler for memory optimization and parallelism management
    • Y. Yang, P. Xiang, J. Kong, and H. Zhou. A gpgpu compiler for memory optimization and parallelism management. In PLDI, 2010.
    • (2010) PLDI
    • Yang, Y.1    Xiang, P.2    Kong, J.3    Zhou, H.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.