메뉴 건너뛰기




Volumn 2003-January, Issue , 2003, Pages 46-55

Instruction replication: Reducing delays due to inter-PE communication latency

Author keywords

Clustered processors; Instruction Distribution; Instruction Replication; Instructions per Cycle; Inter PE communication; Load Imbalance

Indexed keywords

PARALLEL PROCESSING SYSTEMS;

EID: 33746287193     PISSN: 1089795X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/PACT.2003.1238001     Document Type: Conference Paper
Times cited : (10)

References (23)
  • 1
    • 0033717865 scopus 로고    scopus 로고
    • Clock Rate versus IPC: The End of the Road for Conventional Microarchitectures
    • V. Agarwal, M. S. Hrishikesh, S. W. Keckler and D. Burger, "Clock Rate versus IPC: the End of the Road for Conventional Microarchitectures," Proc. ISCA-27, 2000.
    • (2000) Proc. ISCA-27
    • Agarwal, V.1    Hrishikesh, M.S.2    Keckler, S.W.3    Burger, D.4
  • 2
    • 84962184017 scopus 로고    scopus 로고
    • An Empirical Study of the Scalability Aspects of Instruction Distribution Algorithms for Clustered Processors
    • A. Aggarwal and M. Franklin, "An Empirical Study of the Scalability Aspects of Instruction Distribution Algorithms for Clustered Processors," Proc. ISPASS, 2001.
    • (2001) Proc. ISPASS
    • Aggarwal, A.1    Franklin, M.2
  • 3
    • 0038346226 scopus 로고    scopus 로고
    • Dynamically Managing the Communication-Parallelism Trade-off in Future Clustered Processors
    • R. Balasubramonian, S. Dwarkadas, and D.H. Albonesi, "Dynamically Managing the Communication-Parallelism Trade-off in Future Clustered Processors," Proc. ISCA, 2003.
    • (2003) Proc. ISCA
    • Balasubramonian, R.1    Dwarkadas, S.2    Albonesi, D.H.3
  • 4
    • 0000503873 scopus 로고    scopus 로고
    • Instruction Distribution Heuristics for Quad-Cluster, Dynamically-Scheduled, Superscalar Processors
    • A. Baniasadi and A. Moshovos, "Instruction Distribution Heuristics for Quad-Cluster, Dynamically-Scheduled, Superscalar Processors," Proc. MICRO-33, 2000.
    • (2000) Proc. MICRO-33
    • Baniasadi, A.1    Moshovos, A.2
  • 5
    • 0002640853 scopus 로고    scopus 로고
    • Guest Editors Introduction: Billion-Transistor Architectures
    • September
    • D. Burger and J. R. Goodman, "Guest Editors Introduction: Billion-Transistor Architectures," COMPUTER, Vol. 30, No. 9, pp. 46-49, September 1997.
    • (1997) Computer , vol.30 , Issue.9 , pp. 46-49
    • Burger, D.1    Goodman, J.R.2
  • 6
    • 0003465202 scopus 로고    scopus 로고
    • Technical Report, Computer Sciences Department, University of Wisconsin, June
    • D. Burger and T. Austin, "The Simplescalar Tool Set," Technical Report, Computer Sciences Department, University of Wisconsin, June 1997.
    • (1997) The Simplescalar Tool Set
    • Burger, D.1    Austin, T.2
  • 10
    • 0003007259 scopus 로고    scopus 로고
    • The Multicluster Architecture: Reducing Cycle Time Through Partitioning
    • K. Farkas, P. Chow, N. Jouppi, and Z. Vranesic, "The Multicluster Architecture: Reducing Cycle Time Through Partitioning," Proc. Micro-30, 1997.
    • (1997) Proc. Micro-30
    • Farkas, K.1    Chow, P.2    Jouppi, N.3    Vranesic, Z.4
  • 12
    • 84944936630 scopus 로고    scopus 로고
    • PEWS: A Decentralized Dynamic Scheduler for ILP Processing
    • G. A. Kemp and M. Franklin, "PEWS: A Decentralized Dynamic Scheduler for ILP Processing," Proc. ICPP,Vol.I, pp. 239-246, 1996.
    • (1996) Proc. ICPP , vol.1 , pp. 239-246
    • Kemp, G.A.1    Franklin, M.2
  • 13
    • 0034836754 scopus 로고    scopus 로고
    • CARS: A New Code Generation Framework for Clustered ILP Processors
    • K. Kailas, K. Ebcioglu, and A. Agrawala, "CARS: A New Code Generation Framework for Clustered ILP Processors," Proc. HPCA-7, pp.133-143, 2001.
    • (2001) Proc. HPCA-7 , pp. 133-143
    • Kailas, K.1    Ebcioglu, K.2    Agrawala, A.3
  • 15
    • 0030685021 scopus 로고    scopus 로고
    • The Alpha 21264: A 500 MHz Out-of-Order Execution Microprocessor
    • D. Leibholz and R. Razdan, "The Alpha 21264: A 500 MHz Out-of-Order Execution Microprocessor," Proc. Compcon, pp. 28-36, 1997.
    • (1997) Proc. Compcon , pp. 28-36
    • Leibholz, D.1    Razdan, R.2
  • 16
    • 0030676681 scopus 로고    scopus 로고
    • Complexity-Effective Superscalar Processors
    • S. Palacharla, N. P. Jouppi, and J. E. Smith, "Complexity-Effective Superscalar Processors," Proc. ISCA-24, pp. 206-218, 1997.
    • (1997) Proc. ISCA-24 , pp. 206-218
    • Palacharla, S.1    Jouppi, N.P.2    Smith, J.E.3
  • 17
    • 84968856056 scopus 로고    scopus 로고
    • Reducing Wire Delay Penalty through Value Prediction
    • J. M. Parcerisa and A. Gonzalez, "Reducing Wire Delay Penalty through Value Prediction," Proc. Micro-33, 2000.
    • (2000) Proc. Micro-33
    • Parcerisa, J.M.1    Gonzalez, A.2
  • 20
    • 0031628484 scopus 로고    scopus 로고
    • Exploiting Idle Floating-Point Resources For Integer Execution
    • S. S. Sastry, S. Palacharla, and J. E. Smith, "Exploiting Idle Floating-Point Resources For Integer Execution," Proc. PLDI, 1998.
    • (1998) Proc. PLDI
    • Sastry, S.S.1    Palacharla, S.2    Smith, J.E.3
  • 21
    • 0030129806 scopus 로고    scopus 로고
    • The MIPS R10000 Superscalar Microprocessor
    • April
    • K. C. Yeager, "The MIPS R10000 Superscalar Microprocessor," IEEE Micro, pp. 28-40, April 1996.
    • (1996) IEEE Micro , pp. 28-40
    • Yeager, K.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.