메뉴 건너뛰기




Volumn 48, Issue 2, 1999, Pages 111-120

A trace cache microarchitecture and evaluation

Author keywords

Instruction cache; Instruction fetching; Multiple branch prediction; Superscalar processors; Trace cache

Indexed keywords

BANDWIDTH; COMPUTER ARCHITECTURE; MICROPROCESSOR CHIPS; RESPONSE TIME (COMPUTER SYSTEMS);

EID: 0033077095     PISSN: 00189340     EISSN: None     Source Type: Journal    
DOI: 10.1109/12.752652     Document Type: Review
Times cited : (68)

References (30)
  • 1
    • 0030407614 scopus 로고    scopus 로고
    • Integrating a Misprediction Recovery Cache (MRC) into a Superscalar Pipeline
    • Dec.
    • J. Bondi, A. Nanda, and S. Dutta, "Integrating a Misprediction Recovery Cache (MRC) into a Superscalar Pipeline," Proc. 29th Int'l Symp. Microarchitecture, pp. 14-23, Dec. 1996.
    • (1996) Proc. 29th Int'l Symp. Microarchitecture , pp. 14-23
    • Bondi, J.1    Nanda, A.2    Dutta, S.3
  • 2
    • 0003510233 scopus 로고    scopus 로고
    • Evaluating Future Microprocessors: The Simplescalar Toolset
    • Computer Sciences Dept., Univ. of Wisconsin-Madison, July
    • D. Burger, T. Austin, and S. Bennett, "Evaluating Future Microprocessors: The Simplescalar Toolset," Technical Report CS-TR-96-1308, Computer Sciences Dept., Univ. of Wisconsin-Madison, July 1996.
    • (1996) Technical Report CS-TR-96-1308
    • Burger, D.1    Austin, T.2    Bennett, S.3
  • 4
    • 0029529601 scopus 로고
    • Control Flow Prediction with Tree-Like Subgraphs for Superscalar Processors
    • Nov.
    • S. Dutta and M. Franklin, "Control Flow Prediction with Tree-Like Subgraphs for Superscalar Processors," Proc. 28th Int'l Symp. Microarchitecture, pp. 258-263, Nov. 1995.
    • (1995) Proc. 28th Int'l Symp. Microarchitecture , pp. 258-263
    • Dutta, S.1    Franklin, M.2
  • 5
    • 0019596071 scopus 로고
    • Trace Scheduling: A Technique for Global Microcode Compaction
    • July
    • J. Fisher, "Trace Scheduling: A Technique for Global Microcode Compaction," IEEE Trans. Computers, vol. 30, no. 7, pp. 478-490, July 1981.
    • (1981) IEEE Trans. Computers , vol.30 , Issue.7 , pp. 478-490
    • Fisher, J.1
  • 7
    • 0031334458 scopus 로고    scopus 로고
    • Alternative Fetch and Issue Policies for the Trace Cache Fetch Mechanism
    • Dec.
    • D. Friendly, S. Patel, and Y. Patt, "Alternative Fetch and Issue Policies for the Trace Cache Fetch Mechanism," Proc. 30th Int'l Symp. Microarchitecture, pp. 24-33, Dec. 1997.
    • (1997) Proc. 30th Int'l Symp. Microarchitecture , pp. 24-33
    • Friendly, D.1    Patel, S.2    Patt, Y.3
  • 8
    • 0032312214 scopus 로고    scopus 로고
    • Putting the Fill Unit to Work: Dynamic Optimizations for Trace Cache Microprocessors
    • Dec.
    • D. Friendly, S. Patel, and Y. Patt, "Putting the Fill Unit to Work: Dynamic Optimizations for Trace Cache Microprocessors," Proc. 31st Int'l Symp. Microarchitecture, Dec. 1998.
    • (1998) Proc. 31st Int'l Symp. Microarchitecture
    • Friendly, D.1    Patel, S.2    Patt, Y.3
  • 10
    • 0030379515 scopus 로고    scopus 로고
    • Increasing the Instruction Fetch Rate Via Block-Structured Instruction Set Architectures
    • Dec.
    • E. Hao, P.-Y. Chang, M. Evers, and Y. Patt, "Increasing the Instruction Fetch Rate Via Block-Structured Instruction Set Architectures," Proc. 29th Int'l Symp. Microarchitecture, pp. 191-200, Dec. 1996.
    • (1996) Proc. 29th Int'l Symp. Microarchitecture , pp. 191-200
    • Hao, E.1    Chang, P.-Y.2    Evers, M.3    Patt, Y.4
  • 11
    • 3242747062 scopus 로고
    • Trace Selection for Compiling Large C Application Programs to Microcode
    • Dec.
    • W. Hwu and P. Chang, "Trace Selection for Compiling Large C Application Programs to Microcode," Proc. 21st Int'l Symp. Microarchitecture, Dec. 1988.
    • (1988) Proc. 21st Int'l Symp. Microarchitecture
    • Hwu, W.1    Chang, P.2
  • 12
    • 0027595384 scopus 로고
    • The Superblock: An Effective Technique for VLIW and Superscalar Compilation
    • Jan.
    • W. Hwu et al. , "The Superblock: An Effective Technique for VLIW and Superscalar Compilation," J. Supercomputing, vol. 7, no. 1, pp. 229-248, Jan. 1993.
    • (1993) J. Supercomputing , vol.7 , Issue.1 , pp. 229-248
    • Hwu, W.1
  • 15
    • 24244474144 scopus 로고
    • Expansion Caches for Superscalar Processors
    • Computer Science Laboratory, Stanford Univ., June
    • J. Johnson, "Expansion Caches for Superscalar Processors," Technical Report CSL-TR-94-630, Computer Science Laboratory, Stanford Univ., June 1994.
    • (1994) Technical Report CSL-TR-94-630
    • Johnson, J.1
  • 16
    • 0006639334 scopus 로고
    • Performance Benefits of Large Execution Atomic Units in Dynamically Scheduled Machines
    • June
    • S. Melvin and Y. Patt, "Performance Benefits of Large Execution Atomic Units in Dynamically Scheduled Machines," Proc. Third Int'l Conf. Supercomputing, pp. 427-432, June 1989.
    • (1989) Proc. Third Int'l Conf. Supercomputing , pp. 427-432
    • Melvin, S.1    Patt, Y.2
  • 17
    • 0026153014 scopus 로고
    • Exploiting Fine-Grained Parallelism Through a Combination of Hardware and Software Techniques
    • May
    • S. Melvin and Y. Patt, "Exploiting Fine-Grained Parallelism Through a Combination of Hardware and Software Techniques," Proc. 18th Int'l Symp. Computer Architecture, pp. 287-296, May 1991.
    • (1991) Proc. 18th Int'l Symp. Computer Architecture , pp. 287-296
    • Melvin, S.1    Patt, Y.2
  • 18
    • 0024169229 scopus 로고
    • Hardware Support for Large Atomic Units in Dynamically Scheduled Machines
    • Dec.
    • S. Melvin, M. Shebanow, and Y. Patt, "Hardware Support for Large Atomic Units in Dynamically Scheduled Machines," Proc. 21st Int'l Symp. Microarchitecture, pp. 60-66, Dec. 1988.
    • (1988) Proc. 21st Int'l Symp. Microarchitecture , pp. 60-66
    • Melvin, S.1    Shebanow, M.2    Patt, Y.3
  • 19
    • 0030674213 scopus 로고    scopus 로고
    • Exploiting Instruction Level Parallelism in Processors by Caching Scheduled Groups
    • June
    • R. Nair and M. Hopkins, "Exploiting Instruction Level Parallelism in Processors by Caching Scheduled Groups," Proc. 24th Int'l Symp. Computer Architecture, pp. 13-25, June 1997.
    • (1997) Proc. 24th Int'l Symp. Computer Architecture , pp. 13-25
    • Nair, R.1    Hopkins, M.2
  • 20
    • 0031594002 scopus 로고    scopus 로고
    • Improving Trace Cache Effectiveness with Branch Promotion and Trace Packing
    • June
    • S. Patel, M. Evers, and Y. Patt, "Improving Trace Cache Effectiveness with Branch Promotion and Trace Packing," Proc. 25th Int'l Symp. Computer Architecture, pp. 262-271, June 1998.
    • (1998) Proc. 25th Int'l Symp. Computer Architecture , pp. 262-271
    • Patel, S.1    Evers, M.2    Patt, Y.3
  • 21
    • 0003696135 scopus 로고    scopus 로고
    • Critical Issues Regarding the Trace Cache Fetch Mechanism
    • Electrical Eng. and Computer Science Dept., Univ. of Michigan
    • S. Patel, D. Friendly, and Y. Patt, "Critical Issues Regarding the Trace Cache Fetch Mechanism," Technical Report CSE-TR-335-97, Electrical Eng. and Computer Science Dept., Univ. of Michigan, 1997.
    • (1997) Technical Report CSE-TR-335-97
    • Patel, S.1    Friendly, D.2    Patt, Y.3
  • 22
    • 33747407947 scopus 로고    scopus 로고
    • "Dynamic Flow Instruction Cache Memory Organized Around Trace Segments Independent of Virtual Address Line," U.S. Patent Number 5,381,533, Jan. 1995
    • A. Peleg and U. Weiser, "Dynamic Flow Instruction Cache Memory Organized Around Trace Segments Independent of Virtual Address Line," U.S. Patent Number 5,381,533, Jan. 1995.
    • Peleg, A.1    Weiser, U.2
  • 23
    • 0003869390 scopus 로고    scopus 로고
    • Trace Cache: A Low Latency Approach to High Bandwidth Instruction Fetching
    • Computer Sciences Dept., Univ. of Wisconsin-Madison, Apr.
    • E. Rotenberg, S. Bennett, and J. Smith, "Trace Cache: A Low Latency Approach to High Bandwidth Instruction Fetching," Technical Report 1310, Computer Sciences Dept., Univ. of Wisconsin-Madison, Apr. 1996.
    • (1996) Technical Report 1310
    • Rotenberg, E.1    Bennett, S.2    Smith, J.3
  • 24
  • 27
    • 0029509983 scopus 로고
    • Improving CISC Instruction Decoding Performance Using a Fill Unit
    • Nov.
    • M. Smotherman and M. Franklin, "Improving CISC Instruction Decoding Performance Using a Fill Unit," Proc. 28th Int'l Symp. Microarchitecture, pp. 219-229, Nov. 1995.
    • (1995) Proc. 28th Int'l Symp. Microarchitecture , pp. 219-229
    • Smotherman, M.1    Franklin, M.2
  • 28
    • 0030684575 scopus 로고    scopus 로고
    • Multiscalar Execution Along a Single Flow of Control
    • Aug.
    • K. Sundararaman and M. Franklin, "Multiscalar Execution Along a Single Flow of Control," Proc. ICPP '97, Aug. 1997.
    • (1997) Proc. ICPP '97
    • Sundararaman, K.1    Franklin, M.2
  • 29
    • 0030644743 scopus 로고    scopus 로고
    • Improving Superscalar Instruction Dispatch and Issue by Exploiting Dynamic Code Sequences
    • June
    • S. Vajapeyam and T. Mitra, "Improving Superscalar Instruction Dispatch and Issue by Exploiting Dynamic Code Sequences," Proc. 24th Int'l Symp. Computer Architecture, pp. 1-12, June 1997.
    • (1997) Proc. 24th Int'l Symp. Computer Architecture , pp. 1-12
    • Vajapeyam, S.1    Mitra, T.2
  • 30
    • 84969344997 scopus 로고
    • Increasing the Instruction Fetch Rate via Multiple Branch Prediction and a Branch Address Cache
    • July
    • T.-Y. Yeh, D.T. Marr, and Y.N. Patt, "Increasing the Instruction Fetch Rate via Multiple Branch Prediction and a Branch Address Cache," Proc. Seventh Int'l Conf. Supercomputing, pp. 67-76, July 1993.
    • (1993) Proc. Seventh Int'l Conf. Supercomputing , pp. 67-76
    • Yeh, T.-Y.1    Marr, D.T.2    Patt, Y.N.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.