메뉴 건너뛰기




Volumn , Issue , 2013, Pages 219-228

Scheduling parallel programs by work stealing with private deques

Author keywords

dynamic load balancing; nested parallelism; work stealing

Indexed keywords

DEPTH FIRST SEARCH; DESIGN AND IMPLEMENTATIONS; DISTRIBUTION STRATEGIES; DIVIDE AND CONQUER; NESTED PARALLELISM; PROBABILISTIC MODELS; THEORETICAL GUARANTEES; WORK STEALING;

EID: 84875141794     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2442516.2442538     Document Type: Conference Paper
Times cited : (80)

References (44)
  • 3
    • 0031628001 scopus 로고    scopus 로고
    • Thread scheduling for multiprogrammed multiprocessors
    • ACM Press
    • Nimar S. Arora, Robert D. Blumofe, and C. Greg Plaxton. Thread scheduling for multiprogrammed multiprocessors. In SPAA '98, pages 119-129. ACM Press, 1998.
    • (1998) SPAA '98 , pp. 119-129
    • Arora, N.S.1    Blumofe, R.D.2    Greg Plaxton, C.3
  • 4
    • 0344584867 scopus 로고    scopus 로고
    • The natural work-stealing algorithm is stable
    • May
    • Petra Berenbrink, Tom Friedetzky, and Leslie Ann Goldberg. The natural work-stealing algorithm is stable. SIAM J. Comput., 32:1260-1279, May 2003.
    • (2003) SIAM J. Comput. , vol.32 , pp. 1260-1279
    • Berenbrink, P.1    Friedetzky, T.2    Goldberg, L.A.3
  • 6
    • 84858427811 scopus 로고    scopus 로고
    • Internally deterministic parallel algorithms can be fast
    • NY, USA, ACM
    • Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons, and Julian Shun. Internally deterministic parallel algorithms can be fast. In PPoPP '12, pages 181-192, NY, USA, 2012. ACM.
    • (2012) PPoPP '12 , pp. 181-192
    • Blelloch, G.E.1    Fineman, J.T.2    Gibbons, P.B.3    Shun, J.4
  • 7
    • 0029696091 scopus 로고    scopus 로고
    • A provable time and space efficient implementation of NESL
    • ACM
    • Guy E. Blelloch and John Greiner. A provable time and space efficient implementation of NESL. In ICFP '96, pages 213-225. ACM, 1996.
    • (1996) ICFP '96 , pp. 213-225
    • Blelloch, G.E.1    Greiner, J.2
  • 10
    • 0000269759 scopus 로고    scopus 로고
    • Scheduling multithreaded computations by work stealing
    • September
    • Robert D. Blumofe and Charles E. Leiserson. Scheduling multithreaded computations by work stealing. J. ACM, 46:720-748, September 1999.
    • (1999) J. ACM , vol.46 , pp. 720-748
    • Blumofe, R.D.1    Leiserson, C.E.2
  • 12
    • 32144435090 scopus 로고    scopus 로고
    • Dynamic circular work-stealing deque
    • David Chase and Yossi Lev. Dynamic circular work-stealing deque. In SPAA '05, pages 21-28, 2005.
    • (2005) SPAA '05 , pp. 21-28
    • Chase, D.1    Lev, Y.2
  • 14
    • 55849100059 scopus 로고    scopus 로고
    • Solving large, irregular graph problems using adaptive work-stealing
    • Guojing Cong, Sreedhar B. Kodali, Sriram Krishnamoorthy, Doug Lea, Vijay A. Saraswat, and Tong Wen. Solving large, irregular graph problems using adaptive work-stealing. In ICPP, pages 536-545, 2008.
    • (2008) ICPP , pp. 536-545
    • Cong, G.1    Kodali, S.B.2    Krishnamoorthy, S.3    Lea, D.4    Saraswat, V.A.5    Wen, T.6
  • 18
    • 0022676728 scopus 로고
    • COMPARISON of RECEIVER-INITIATED and SENDER-INITIATED ADAPTIVE LOAD SHARING
    • DOI 10.1016/0166-5316(86)90008-8
    • Derek L. Eager, Edward D. Lazowska, and John Zahorjan. A comparison of receiver-initiated and sender-initiated adaptive load sharing. Perform. Eval., 6(1):53-68, 1986. (Pubitemid 16538292)
    • (1986) Performance Evaluation , vol.6 , Issue.1 , pp. 53-68
    • Eager, D.L.1    Lazowska, E.D.2    Zahorjan, J.3
  • 19
    • 85028891596 scopus 로고
    • A message passing implementation of lazy task creation
    • Marc Feeley. A message passing implementation of lazy task creation. In Parallel Symbolic Computing, pages 94-107, 1992.
    • (1992) Parallel Symbolic Computing , pp. 94-107
    • Feeley, M.1
  • 23
    • 0347507496 scopus 로고    scopus 로고
    • The implementation of the Cilk-5 multithreaded language
    • Matteo Frigo, Charles E. Leiserson, and Keith H. Randall. The implementation of the Cilk-5 multithreaded language. In PLDI, pages 212-223, 1998.
    • (1998) PLDI , pp. 212-223
    • Frigo, M.1    Leiserson, C.E.2    Randall, K.H.3
  • 26
    • 32844466488 scopus 로고    scopus 로고
    • A dynamic-sized nonblocking work stealing deque
    • February
    • Danny Hendler, Yossi Lev, Mark Moir, and Nir Shavit. A dynamic-sized nonblocking work stealing deque. Distrib. Comput., 18:189-207, February 2006.
    • (2006) Distrib. Comput. , vol.18 , pp. 189-207
    • Hendler, D.1    Lev, Y.2    Moir, M.3    Shavit, N.4
  • 27
    • 0036954275 scopus 로고    scopus 로고
    • Non-blocking steal-half work queues
    • Danny Hendler and Nir Shavit. Non-blocking steal-half work queues. In PODC, pages 280-289, 2002.
    • (2002) PODC , pp. 280-289
    • Hendler, D.1    Shavit, N.2
  • 28
    • 0036954486 scopus 로고    scopus 로고
    • Work dealing
    • ACM
    • Danny Hendler and Nir Shavit. Work dealing. In SPAA '02, pages 164-172. ACM, 2002.
    • (2002) SPAA '02 , pp. 164-172
    • Hendler, D.1    Shavit, N.2
  • 29
    • 67650093461 scopus 로고    scopus 로고
    • Backtracking-based load balancing
    • ACM
    • Tasuku Hiraishi, Masahiro Yasugi, Seiji Umatani, and Taiichi Yuasa. Backtracking-based load balancing. In PPoPP '09, pages 55-64. ACM, 2009.
    • (2009) PPoPP '09 , pp. 55-64
    • Hiraishi, T.1    Yasugi, M.2    Umatani, S.3    Yuasa, T.4
  • 30
    • 84875186606 scopus 로고    scopus 로고
    • Intel. Cilk Plus. http://software.intel.com/en-us/articles/intel-cilk- plus/.
    • Cilk Plus
  • 31
    • 84875155436 scopus 로고    scopus 로고
    • Specifications at
    • Intel. Intel Xeon Processor X7550. Specifications at http://ark.intel. com/products/46498/Intel-Xeon-Processor-X7550-(18M-Cache-2-00-GHz-6-40-GTs- Intel- QPI).
    • Intel Xeon Processor X7550
  • 33
    • 35348855586 scopus 로고    scopus 로고
    • Carbon: Architectural support for fine-grained parallelism on chip multiprocessors
    • June
    • Sanjeev Kumar, Christopher J. Hughes, and Anthony Nguyen. Carbon: architectural support for fine-grained parallelism on chip multiprocessors. SIGARCH Computer Architecture News, 35:162-173, June 2007.
    • (2007) SIGARCH Computer Architecture News , vol.35 , pp. 162-173
    • Kumar, S.1    Hughes, C.J.2    Nguyen, A.3
  • 36
    • 0031635830 scopus 로고    scopus 로고
    • Analyses of load stealing models based on differential equations
    • NY, USA, ACM
    • Michael Mitzenmacher. Analyses of load stealing models based on differential equations. In SPAA '98, pages 212-221, NY, USA, 1998. ACM.
    • (1998) SPAA '98 , pp. 212-221
    • Mitzenmacher, M.1
  • 37
    • 84987792525 scopus 로고
    • A simple load balancing scheme for task allocation in parallel machines
    • NY, USA, ACM
    • Larry Rudolph, Miriam Slivkin-Allalouf, and Eli Upfal. A simple load balancing scheme for task allocation in parallel machines. In SPAA '91, pages 237-245, NY, USA, 1991. ACM.
    • (1991) SPAA '91 , pp. 237-245
    • Rudolph, L.1    Slivkin-Allalouf, M.2    Upfal, E.3
  • 39
    • 77952259532 scopus 로고    scopus 로고
    • Flexible architectural support for fine-grain scheduling
    • NY, USA, ACM
    • Daniel Sanchez, Richard M. Yoo, and Christos Kozyrakis. Flexible architectural support for fine-grain scheduling. In ASPLOS '10, pages 311-322, NY, USA, 2010. ACM.
    • (2010) ASPLOS '10 , pp. 311-322
    • Sanchez, D.1    Yoo, R.M.2    Kozyrakis, C.3
  • 40
    • 0036395865 scopus 로고    scopus 로고
    • Randomized receiver initiated load-balancing algorithms for tree-shaped computations
    • Peter Sanders. Randomized receiver initiated load-balancing algorithms for tree-shaped computations. Comput. J., 45(5):561-573, 2002.
    • (2002) Comput. J. , vol.45 , Issue.5 , pp. 561-573
    • Sanders, P.1
  • 41
    • 84875158398 scopus 로고    scopus 로고
    • Miser - A dynamically loadable memory allocator for multi-threaded applications
    • Barry Tannenbaum. Miser - a dynamically loadable memory allocator for multi-threaded applications. Intel Software Network, 2009.
    • (2009) Intel Software Network
    • Tannenbaum, B.1
  • 42
    • 78650866403 scopus 로고    scopus 로고
    • A tighter analysis of work stealing
    • Algorithms and Computation - 21st International Symposium, ISAAC 2010, Springer
    • Marc Tchiboukdjian, Nicolas Gast, Denis Trystram, Jean-Louis Roch, and Julien Bernard. A tighter analysis of work stealing. In Algorithms and Computation - 21st International Symposium, ISAAC 2010, volume 6507 of LNCS, pages 291-302. Springer, 2010.
    • (2010) LNCS , vol.6507 , pp. 291-302
    • Tchiboukdjian, M.1    Gast, N.2    Trystram, D.3    Roch, J.-L.4    Bernard, J.5
  • 44
    • 0242276122 scopus 로고    scopus 로고
    • Pursuing laziness for efficient implementation of modern multithreaded languages
    • Seiji Umatani, Masahiro Yasugi, Tsuneyasu Komiya, and Taiichi Yuasa. Pursuing laziness for efficient implementation of modern multithreaded languages. In ISHPC, pages 174-188, 2003.
    • (2003) ISHPC , pp. 174-188
    • Umatani, S.1    Yasugi, M.2    Komiya, T.3    Yuasa, T.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.