메뉴 건너뛰기




Volumn , Issue , 2016, Pages 583-595

LaPerm: Locality Aware Scheduler for Dynamic Parallelism on GPUs

Author keywords

dynamic parallelism; GPU; irregular applications; memory locality; thread block scheduler

Indexed keywords

COMPUTER ARCHITECTURE; PROGRAM PROCESSORS; ROUTERS; SCHEDULING;

EID: 84988443467     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISCA.2016.57     Document Type: Conference Paper
Times cited : (49)

References (41)
  • 1
    • 41249087856 scopus 로고    scopus 로고
    • General purpose molecular dynamics simulations fully implemented on graphics processing units
    • J. A. Anderson, C. D. Lorenz, and A. Travesset, "General purpose molecular dynamics simulations fully implemented on graphics processing units," Journal of Computational Physics, vol. 227, no. 10, 2008.
    • (2008) Journal of Computational Physics , vol.227 , Issue.10
    • Anderson, J.A.1    Lorenz, C.D.2    Travesset, A.3
  • 2
    • 36849056785 scopus 로고    scopus 로고
    • Real-time deformation of detailed geometry based on mappings to a less detailed physical simulation on the GPU
    • Eurographics Association
    • J. Mosegaard and T. S. Sorensen, "Real-time deformation of detailed geometry based on mappings to a less detailed physical simulation on the GPU," in Proceedings of the 11th Eurographics Conference on Virtual Environments, pp. 105-111, Eurographics Association, 2005.
    • (2005) Proceedings of the 11th Eurographics Conference on Virtual Environments , pp. 105-111
    • Mosegaard, J.1    Sorensen, T.S.2
  • 27
    • 84960162410 scopus 로고    scopus 로고
    • Thermodynamic states in explosion fields
    • Coeur d'Alene Resort, ID, USA
    • A. Kuhl, "Thermodynamic states in explosion fields," in 14th International Symposium on Detonation, Coeur d'Alene Resort, ID, USA, 2010.
    • (2010) 14th International Symposium on Detonation
    • Kuhl, A.1
  • 28
    • 84858427151 scopus 로고    scopus 로고
    • An efficient CUDA implementation of the tree-based barnes hut n-body algorithm
    • M. Burtscher and K. Pingali, "An efficient cuda implementation of the tree-based barnes hut n-body algorithm," GPU computing Gems Emerald edition, p. 75, 2011.
    • (2011) GPU Computing Gems Emerald Edition , pp. 75
    • Burtscher, M.1    Pingali, K.2
  • 33
    • 85019691440 scopus 로고    scopus 로고
    • Testing intrusion detection systems: A critique of the 1998 and 1999 DARPA intrusion detection system evaluations as performed by lincoln laboratory
    • J. McHugh, "Testing intrusion detection systems: a critique of the 1998 and 1999 darpa intrusion detection system evaluations as performed by lincoln laboratory," ACM Transactions on Information and System Security, vol. 3, no. 4, pp. 262-294, 2000.
    • (2000) ACM Transactions on Information and System Security , vol.3 , Issue.4 , pp. 262-294
    • McHugh, J.1
  • 39
    • 84870410502 scopus 로고    scopus 로고
    • Nested data-parallelism on the GPU
    • ACM
    • L. Bergstrom and J. Reppy, "Nested data-parallelism on the GPU," in ACM SIGPLAN Notices, vol. 47, pp. 247-258, ACM, 2012.
    • (2012) ACM SIGPLAN Notices , vol.47 , pp. 247-258
    • Bergstrom, L.1    Reppy, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.