메뉴 건너뛰기




Volumn 44, Issue 3, 2009, Pages 253-264

Accelerating Critical Section Execution with asymmetric multi-core architectures

Author keywords

CMP; Critical Sections; Heterogeneous Cores; Locks; Multi core; Parallel Programming

Indexed keywords

LOCKS (FASTENERS); MULTICORE PROGRAMMING; MULTIPROCESSING SYSTEMS; PARALLEL PROGRAMMING; SCALABILITY;

EID: 67650796120     PISSN: 15232867     EISSN: None     Source Type: Journal    
DOI: 10.1145/1508284.1508274     Document Type: Conference Paper
Times cited : (59)

References (52)
  • 2
    • 84869546473 scopus 로고    scopus 로고
    • Opening Tables scalability in MySQL
    • Opening Tables scalability in MySQL. MySQL Performance Blog. http://www.mysqlperformanceblog.com/2006/11/21/-opening-tables-scalability, 2006.
    • (2006) MySQL Performance Blog
  • 3
    • 67650799807 scopus 로고    scopus 로고
    • SQLite database engine version 3.5.8. 2008.
    • SQLite database engine version 3.5.8. 2008.
  • 4
    • 84869513543 scopus 로고    scopus 로고
    • SysBench: a system performance benchmark version 0.4.8, 2008
    • SysBench: a system performance benchmark version 0.4.8. http://sysbench.sourceforge.net, 2008.
  • 5
    • 2842544791 scopus 로고
    • Replacing locks by higher-level primitives
    • Technical Report TR94-237, Rice University
    • S. Adve et al. Replacing locks by higher-level primitives. Technical Report TR94-237, Rice University, 1994.
    • (1994)
    • Adve, S.1
  • 6
    • 85060036181 scopus 로고
    • Validity of the single processor approach to achieving large scale computing capabilities
    • G. M. Amdahl. Validity of the single processor approach to achieving large scale computing capabilities. In AFIPS, 1967.
    • (1967) AFIPS
    • Amdahl, G.M.1
  • 7
    • 0003605996 scopus 로고
    • NAS parallel benchmarks
    • RNR-94-007, NASA Ames Research Center
    • D. H. Bailey et al. NAS parallel benchmarks. Technical Report Tech. Rep. RNR-94-007, NASA Ames Research Center, 1994.
    • (1994) Technical Report Tech. Rep
    • Bailey, D.H.1
  • 8
    • 84976783312 scopus 로고
    • Implementing remote procedure calls
    • A. D. Birrell and B. J. Nelson. Implementing remote procedure calls. ACM Trans. Comput. Syst., 2(1):39-59, 1984.
    • (1984) ACM Trans. Comput. Syst , vol.2 , Issue.1 , pp. 39-59
    • Birrell, A.D.1    Nelson, B.J.2
  • 9
    • 0034292014 scopus 로고    scopus 로고
    • OdinMP/CCp - a portable implementation of OpenMP for C
    • C. Brunschen et al. OdinMP/CCp - a portable implementation of OpenMP for C. Concurrency: Prac. and Exp., 2000.
    • (2000) Concurrency: Prac. and Exp
    • Brunschen, C.1
  • 11
    • 33646144623 scopus 로고    scopus 로고
    • The OpenMP source code repository
    • A. J. Dorta et al. The OpenMP source code repository. In Euromicro, 2005.
    • (2005) Euromicro
    • Dorta, A.J.1
  • 13
    • 67650783856 scopus 로고    scopus 로고
    • Distinguished Engineer, Sun Microsystems. Personal communication, November
    • G. Grohoski. Distinguished Engineer, Sun Microsystems. Personal communication, November 2007.
    • (2007)
    • Grohoski, G.1
  • 14
    • 0027262011 scopus 로고
    • Transactional memory: Architectural support for lock-free data structures
    • M. Herlihy and J. Moss. Transactional memory: architectural support for lock-free data structures. In ISCA-20, 1993.
    • (1993) ISCA-20
    • Herlihy, M.1    Moss, J.2
  • 15
    • 48249118853 scopus 로고    scopus 로고
    • Amdahl's law in the multicore era
    • M. Hill and M. Marty. Amdahl's law in the multicore era. IEEE Computer, 41(7), 2008.
    • (2008) IEEE Computer , vol.41 , Issue.7
    • Hill, M.1    Marty, M.2
  • 16
    • 10044290701 scopus 로고    scopus 로고
    • Using hardware operations to reduce the synchronization overhead of task pools
    • R. Hoffmann et al. Using hardware operations to reduce the synchronization overhead of task pools. ICPP, 2004.
    • (2004) ICPP
    • Hoffmann, R.1
  • 17
    • 67650828762 scopus 로고    scopus 로고
    • Intel. Prescott New Instructions Software Dev. Guide. 2004.
    • Intel. Prescott New Instructions Software Dev. Guide. 2004.
  • 18
    • 67650809647 scopus 로고    scopus 로고
    • Intel. Source code for Intel threading building blocks
    • Intel. Source code for Intel threading building blocks.
  • 20
    • 67650835556 scopus 로고    scopus 로고
    • Intel. IA-32 Intel Architecture Software Dev. Guide, 2008.
    • Intel. IA-32 Intel Architecture Software Dev. Guide, 2008.
  • 21
    • 35348921111 scopus 로고    scopus 로고
    • Core fusion: Accommodating software diversity in chip multiprocessors
    • E. Ipek et al. Core fusion: accommodating software diversity in chip multiprocessors. In ISCA-34, 2007.
    • (2007) ISCA-34
    • Ipek, E.1
  • 22
    • 20344374162 scopus 로고    scopus 로고
    • Niagara: A 32-Way Multithreaded SPARC Processor
    • P. Kongetira et al. Niagara: A 32-Way Multithreaded SPARC Processor. IEEE Micro, 25(2):21-29, 2005.
    • (2005) IEEE Micro , vol.25 , Issue.2 , pp. 21-29
    • Kongetira, P.1
  • 24
    • 28244437702 scopus 로고    scopus 로고
    • Heterogeneous chip multiprocessors
    • R. Kumar et al. Heterogeneous chip multiprocessors. IEEE Computer, 38(11), 2005.
    • (2005) IEEE Computer , vol.38 , Issue.11
    • Kumar, R.1
  • 25
    • 0016090932 scopus 로고
    • A new solution of Dijkstra's concurrent programming problem
    • August
    • L. Lamport. A new solution of Dijkstra's concurrent programming problem. CACM, 17(8):453-455, August 1974.
    • (1974) CACM , vol.17 , Issue.8 , pp. 453-455
    • Lamport, L.1
  • 26
    • 0030685588 scopus 로고    scopus 로고
    • The SGI Origin: A ccNUMA Highly Scalable Server
    • J. Laudon and D. Lenoski. The SGI Origin: A ccNUMA Highly Scalable Server. In ISCA, pages 241-251, 1997.
    • (1997) ISCA , pp. 241-251
    • Laudon, J.1    Lenoski, D.2
  • 27
    • 0000988422 scopus 로고
    • Branch-and-bound methods: A survey
    • E. L. Lawler and D. E. Wood. Branch-and-bound methods: A survey. Operations Research, 14(4):699-719, 1966.
    • (1966) Operations Research , vol.14 , Issue.4 , pp. 699-719
    • Lawler, E.L.1    Wood, D.E.2
  • 28
    • 36148968813 scopus 로고    scopus 로고
    • OpenUH: An optimizing, portable OpenMP compiler
    • C. Liao et al. OpenUH: an optimizing, portable OpenMP compiler. Concurr. Comput. : Pract. Exper., 2007.
    • (2007) Concurr. Comput. : Pract. Exper
    • Liao, C.1
  • 29
    • 0036949529 scopus 로고    scopus 로고
    • Speculative synchronization: Applying thread-level speculation to explicitly parallel applications
    • J. F. Mart́inez and J. Torrellas. Speculative synchronization: applying thread-level speculation to explicitly parallel applications. In ASPLOS-X, 2002.
    • (2002) ASPLOS-X
    • Mart́inez, J.F.1    Torrellas, J.2
  • 30
    • 33947328378 scopus 로고    scopus 로고
    • T. Morad et al. Performance, power efficiency and scalability of asymmetric cluster chip multiprocessors. CAL, 2006.
    • T. Morad et al. Performance, power efficiency and scalability of asymmetric cluster chip multiprocessors. CAL, 2006.
  • 31
    • 47349098275 scopus 로고    scopus 로고
    • MineBench: A Benchmark Suite for Data Mining Workloads
    • R. Narayanan et al. MineBench: A Benchmark Suite for Data Mining Workloads. In IISWC, 2006.
    • (2006) IISWC
    • Narayanan, R.1
  • 32
    • 42549083083 scopus 로고    scopus 로고
    • Implementation and evaluation of OpenMP for Hitachi SR8000
    • Y. Nishitani et al. Implementation and evaluation of OpenMP for Hitachi SR8000. In ISHPC-3, 2000.
    • (2000) ISHPC-3
    • Nishitani, Y.1
  • 33
    • 0012528554 scopus 로고    scopus 로고
    • Speculative lock elision: Enabling highly concurrent multithreaded execution
    • R. Rajwar and J. Goodman. Speculative lock elision: Enabling highly concurrent multithreaded execution. In MICRO-34, 2001.
    • (2001) MICRO-34
    • Rajwar, R.1    Goodman, J.2
  • 34
    • 0036949284 scopus 로고    scopus 로고
    • Transactional lock-free execution of lock-based programs
    • R. Rajwar and J. R. Goodman. Transactional lock-free execution of lock-based programs. In ASPLOS-X, 2002.
    • (2002) ASPLOS-X
    • Rajwar, R.1    Goodman, J.R.2
  • 35
    • 0030672607 scopus 로고    scopus 로고
    • The interaction of software prefetching with ILP processors in shared-memory systems
    • P. Ranganathan et al. The interaction of software prefetching with ILP processors in shared-memory systems. In ISCA-24, 1997.
    • (1997) ISCA-24
    • Ranganathan, P.1
  • 36
    • 41349084540 scopus 로고    scopus 로고
    • TxLinux: Using and managing hardware transactional memory in an operating system
    • C. Rossbach et al. TxLinux: using and managing hardware transactional memory in an operating system. In SOSP'07, 2007.
    • (2007) SOSP'07
    • Rossbach, C.1
  • 37
    • 0003023157 scopus 로고    scopus 로고
    • Design of OpenMP compiler for an SMP cluster
    • Sept
    • M. Sato et al. Design of OpenMP compiler for an SMP cluster. In EWOMP, Sept. 1999.
    • (1999) EWOMP
    • Sato, M.1
  • 38
    • 49249086142 scopus 로고    scopus 로고
    • Larrabee: A many-core x86 architecture for visual computing
    • L. Seiler et al. Larrabee: a many-core x86 architecture for visual computing. ACM Trans. Graph., 2008.
    • (2008) ACM Trans. Graph
    • Seiler, L.1
  • 39
    • 34548020216 scopus 로고    scopus 로고
    • Thread migration to improve synchronization performance
    • S. Sridharan et al. Thread migration to improve synchronization performance. In Workshop on OSIHPA, 2006.
    • (2006) Workshop on OSIHPA
    • Sridharan, S.1
  • 40
    • 84869518407 scopus 로고    scopus 로고
    • The Standard Performance Evaluation Corporation
    • The Standard Performance Evaluation Corporation. Welcome to SPEC. http://www.specbench.org/.
    • Welcome to SPEC
  • 41
    • 48249131157 scopus 로고    scopus 로고
    • ACMP: Balancing Hardware Efficiency and Programmer Efficiency
    • Technical Report TR-HPS-2007-001
    • M. Suleman et al. ACMP: Balancing Hardware Efficiency and Programmer Efficiency. Technical Report TR-HPS-2007-001, 2007.
    • (2007)
    • Suleman, M.1
  • 42
    • 67650085373 scopus 로고    scopus 로고
    • An Asymmetric Multi-core Architecture for Accelerating Critical Sections
    • Technical Report TR-HPS-2008-003
    • M. Suleman et al. An Asymmetric Multi-core Architecture for Accelerating Critical Sections. Technical Report TR-HPS-2008-003, 2008.
    • (2008)
    • Suleman, M.1
  • 43
    • 70549085458 scopus 로고    scopus 로고
    • Feedback-driven threading: Power-efficient and high-performance execution of multi-threaded workloads on CMPs
    • M. Suleman et al. Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on CMPs. In ASPLOS XIII, 2008.
    • (2008) ASPLOS , vol.13
    • Suleman, M.1
  • 46
    • 84957872108 scopus 로고    scopus 로고
    • The impact of speeding up critical sections with data prefetching and forwarding
    • P. Trancoso and J. Torrellas. The impact of speeding up critical sections with data prefetching and forwarding. In ICPP, 1996.
    • (1996) ICPP
    • Trancoso, P.1    Torrellas, J.2
  • 47
    • 49549084422 scopus 로고    scopus 로고
    • A Third-Generation 65nm 16-Core 32-Thread Plus 32-Scout-Thread CMT SPARC Processor
    • M. Tremblay et al. A Third-Generation 65nm 16-Core 32-Thread Plus 32-Scout-Thread CMT SPARC Processor. In ISSCC, 2008.
    • (2008) ISSCC
    • Tremblay, M.1
  • 48
    • 0029200683 scopus 로고
    • Simultaneous multithreading: Maximizing on-chip parallelism
    • D. M. Tullsen et al. Simultaneous multithreading: Maximizing on-chip parallelism. In ISCA-22, 1995.
    • (1995) ISCA-22
    • Tullsen, D.M.1
  • 50
    • 84869528164 scopus 로고    scopus 로고
    • Wikipedia. Fifteen puzzle. http://en.wikipedia.org/wiki/-Fifteenpuzzle.
    • Fifteen puzzle
  • 51
    • 0029179077 scopus 로고
    • The SPLASH-2 programs: Characterization and methodological considerations
    • S. C. Woo et al. The SPLASH-2 programs: Characterization and methodological considerations. In ISCA-22, 1995.
    • (1995) ISCA-22
    • Woo, S.C.1
  • 52
    • 34247133517 scopus 로고    scopus 로고
    • Ablego: A function outlining and partial inlining framework
    • P. Zhao and J. N. Amaral. Ablego: a function outlining and partial inlining framework. Softw. Pract. Exper., 37(5):465-491, 2007.
    • (2007) Softw. Pract. Exper , vol.37 , Issue.5 , pp. 465-491
    • Zhao, P.1    Amaral, J.N.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.