메뉴 건너뛰기




Volumn , Issue , 2009, Pages 309-316

Mapping the FDTD application to many-core chip architectures

Author keywords

Bandwidth Reduction; Code Optimization; Parallel Tiling; Stencil Computations

Indexed keywords

BANDWIDTH REDUCTIONS; CHIP ARCHITECTURE; CODE OPTIMIZATION; CODE OPTIMIZATION TECHNIQUE; COMPILER OPTIMIZATIONS; CURRENT TRENDS; DATA CACHES; FINE GRAINS; FINITE DIFFERENCE TIME DOMAINS; MANY-CORE; MANY-CORE ARCHITECTURE; MULTI-THREADING; ON CHIP MEMORY; ON CHIPS; ON-CHIP PARALLELISM; PERFORMANCE IMPROVEMENTS; STENCIL COMPUTATIONS; TIME SKEWING;

EID: 77951447129     PISSN: 01903918     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICPP.2009.44     Document Type: Conference Paper
Times cited : (41)

References (24)
  • 1
    • 35348812496 scopus 로고    scopus 로고
    • Synchronization state buffer: Supporting efficient fine-grain synchronization on many-core architectures
    • W. Zhu, "Synchronization state buffer: Supporting efficient fine-grain synchronization on many-core architectures," in In The 34th International Symposium on Computer Architecture, 2007.
    • (2007) The 34th International Symposium on Computer Architecture
    • Zhu, W.1
  • 2
    • 84894021661 scopus 로고
    • Numerical solution of inital boundary value problems involving maxwell's equations in isotropic media
    • May
    • K. Yee, "Numerical solution of inital boundary value problems involving maxwell's equations in isotropic media," Antennas and Propagation, IEEE Transactions on, vol. 14, no. 3, pp. 302-307, May 1966.
    • (1966) Antennas and Propagation, IEEE Transactions on , vol.14 , Issue.3 , pp. 302-307
    • Yee, K.1
  • 6
    • 0035301805 scopus 로고    scopus 로고
    • A parallel fdtd algorithm using the MPI library
    • Apr., IEEE
    • C. Guiffaut and K. Mahdjoubi, "A parallel fdtd algorithm using the MPI library," Antennas and Propagation Magazine, IEEE, vol. 43, no. 2, pp. 94-103, Apr 2001.
    • (2001) Antennas and Propagation Magazine , vol.43 , Issue.2 , pp. 94-103
    • Guiffaut, C.1    Mahdjoubi, K.2
  • 7
    • 84976827033 scopus 로고
    • A data locality optimizing algorithm
    • M. E. Wolf and M. S. Lam, "A data locality optimizing algorithm," SIGPLAN Not., vol. 26, no. 6, pp. 30-44, 1991.
    • (1991) SIGPLAN Not. , vol.26 , Issue.6 , pp. 30-44
    • Wolf, M.E.1    Lam, M.S.2
  • 10
    • 0000881430 scopus 로고
    • Solution of the firstorder form of the 3d discrete ordinates equation on a massively parallel processor
    • K. R. Koch, R. S. Baker, and R. E. Alcouffe, "Solution of the firstorder form of the 3d discrete ordinates equation on a massively parallel processor." Transactions of the American Nuclear Society, pp. 65:198-199, 1992.
    • (1992) Transactions of the American Nuclear Society , vol.65 , pp. 198-199
    • Koch, K.R.1    Baker, R.S.2    Alcouffe, R.E.3
  • 11
    • 0032635362 scopus 로고    scopus 로고
    • New tiling techniques to improve cache temporal locality
    • Y. Song and Z. Li, "New tiling techniques to improve cache temporal locality," SIGPLAN Not., vol. 34, no. 5, pp. 215-228, 1999.
    • (1999) SIGPLAN Not. , vol.34 , Issue.5 , pp. 215-228
    • Song, Y.1    Li, Z.2
  • 14
    • 33947307610 scopus 로고    scopus 로고
    • The memory behavior of cache oblivious stencil computations
    • M. Frigo and V. Strumpen, "The memory behavior of cache oblivious stencil computations," J. Supercomput., vol. 39, no. 2, pp. 93-112, 2007.
    • (2007) J. Supercomput. , vol.39 , Issue.2 , pp. 93-112
    • Frigo, M.1    Strumpen, V.2
  • 15
    • 0032403014 scopus 로고    scopus 로고
    • Two-dimensional fdtd analysisof a pulsed microwave confocal system for breast cancer detection:Ixed-focus and antenna-array sensors
    • Dec.
    • S. Hagness, A. Taflove, and J. Bridges, "Two-dimensional fdtd analysisof a pulsed microwave confocal system for breast cancer detection:ixed-focus and antenna-array sensors," Biomedical Engineering, IEEE Transactions on, vol. 45, no. 12, pp. 1470-1479, Dec. 1998.
    • (1998) Biomedical Engineering, IEEE Transactions on , vol.45 , Issue.12 , pp. 1470-1479
    • Hagness, S.1    Taflove, A.2    Bridges, J.3
  • 16
    • 0032184569 scopus 로고    scopus 로고
    • A complete electromagnetic simulation of the separated-aperture sensor for detecting buried land mines
    • Oct.
    • J. Bourgeois and G. Smith, "A complete electromagnetic simulation of the separated-aperture sensor for detecting buried land mines," Antennas and Propagation, IEEE Transactions on, vol. 46, no. 10, pp. 1419-1426, Oct 1998.
    • (1998) Antennas and Propagation, IEEE Transactions on , vol.46 , Issue.10 , pp. 1419-1426
    • Bourgeois, J.1    Smith, G.2
  • 18
    • 48849096436 scopus 로고    scopus 로고
    • A new direction in computational electromagnetics: Solving large problems using the parallel fdtd on the bluegene/l supercomputer providing teraflop-level performance
    • April
    • W. Yu, X. Yang, Y. Liu, L. ching Ma, T. Sul, N.-T. Huang, R. Mittral, R. Maaskane, Y. Lu, Q. Che, R. Lu, and Z. Su, "A new direction in computational electromagnetics: Solving large problems using the parallel fdtd on the bluegene/l supercomputer providing teraflop-level performance," Antennas and Propagation Magazine, IEEE, vol. 50, no. 2, pp. 26-44, April 2008.
    • (2008) Antennas and Propagation Magazine, IEEE , vol.50 , Issue.2 , pp. 26-44
    • Yu, W.1    Yang, X.2    Liu, Y.3    Ma, L.C.4    Sul, T.5    Huang, N.-T.6    Mittral, R.7    Maaskane, R.8    Lu, Y.9    Che, Q.10    Lu, R.11    Su, Z.12
  • 21
    • 77951455496 scopus 로고    scopus 로고
    • Tiling techniques to map applications to multi-core systems
    • [Online]. Available
    • D. Orozco and G. Gao, "Tiling techniques to map applications to multi-core systems," CAPSL Technical Memo Number 87, 2009. [Online]. Available: Http://www.capsl.udel.edu/publications.shtml
    • (2009) CAPSL Technical Memo Number 87
    • Orozco, D.1    Gao, G.2
  • 23
    • 84877019178 scopus 로고    scopus 로고
    • The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of asci q
    • F. Petrini, D. Kerbyson, and S. Pakin, "The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of asci q," Supercomputing, 2003 ACM/IEEE Conference, pp. 55-55, Nov. 2003.
    • (2003) Supercomputing, 2003 ACM/IEEE Conference , pp. 55
    • Petrini, F.1    Kerbyson, D.2    Pakin, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.