메뉴 건너뛰기




Volumn 42, Issue 4, 1993, Pages 396-412

Branch Target Buffer Design and Optimization

Author keywords

Branch; branch problem; branch target buffer; cache memory; CPU design; CPU performance evaluation; pipeline; pipelining

Indexed keywords

COMPUTATIONAL COMPLEXITY; COMPUTER SYSTEMS; DESIGN; OPTIMIZATION; PERFORMANCE; PIPELINE PROCESSING SYSTEMS;

EID: 0027578544     PISSN: 00189340     EISSN: None     Source Type: Journal    
DOI: 10.1109/12.214687     Document Type: Article
Times cited : (86)

References (60)
  • 1
    • 84941500168 scopus 로고
    • Am29000 Streamlined Instruction Processor User Manual
    • Advanced Micro DevicesAdvanced Micro Devices
    • Advanced Micro Devices, Am29000 Streamlined Instruction Processor User Manual, Advanced Micro Devices, 1988.
    • (1988)
  • 2
    • 84941512148 scopus 로고
    • Amdahl 470 V/6 Machine Reference Manual
    • AmdahlAmdahl, Sunny-vale, CA
    • Amdahl, Amdahl 470 V/6 Machine Reference Manual, Amdahl, Sunny-vale, CA, 1976.
    • (1976)
  • 3
    • 2842517957 scopus 로고
    • The IBM System/360 Model 91: Machine philosophy and instruction-handling
    • Jan.
    • D. W. Anderson, F. J. Sparacio, and R.M. Tomasulo, “The IBM System/360 Model 91: Machine philosophy and instruction-handling,” IBM J. Res. Develop., 8–24 Jan. 1967.
    • (1967) IBM J. Res. Develop. , pp. 8-24
    • Anderson, D.W.1    Sparacio, F.J.2    Tomasulo, R.M.3
  • 4
    • 0025240268 scopus 로고
    • The IBM RISC System 6000 Processor: Hardware overview
    • H. B. Bakoglu, G. F. Grohoski, and R.K. Montoye, “The IBM RISC System 6000 Processor: Hardware overview,” IBM J. Res. Develop., vol. 34, no. 1, pp. 12–22, Jan. 1990.
    • (1990) IBM J. Res. Develop. , vol.34 , Issue.1 , pp. 12-22
    • Bakoglu, H.B.1    Grohoski, G.F.2    Montoye, R.K.3
  • 8
    • 3142767739 scopus 로고
    • VAX 11/780 Architecture Handbook
    • Digital Equipment Corp.Digital Equipment Corp.
    • Digital Equipment Corp., VAX 11/780 Architecture Handbook, Digital Equipment Corp., 1977
    • (1977)
  • 9
    • 0023209052 scopus 로고
    • Branch folding in the CRISP Microprocessor: Reducing branch delay to zero
    • D. R. Ditzel and H. R. McLellan, “Branch folding in the CRISP Microprocessor: Reducing branch delay to zero,” in Proc. 14th Annu. Symp. Comput. Architecture, 1987, pp. 2–9.
    • (1987) Proc. 14th Annu. Symp. Comput. Architecture , pp. 2-9
    • Ditzel, D.R.1    McLellan, H.R.2
  • 10
    • 1542556730 scopus 로고
    • Fortran execution time benchmark
    • Mar.
    • N. Doduc, “Fortran execution time benchmark,” unpublished report, V. 20, Mar. 1989.
    • (1989) , vol.20
    • Doduc, N.1
  • 11
    • 0026244261 scopus 로고
    • Branch strategies: Modeling and opti-mization
    • Oct.
    • P. Dubey and M. J. Flynn, “Branch strategies: Modeling and opti-mization,” IEEE Trans. Comput., vol. 40, no. 10, pp. 1159–1167, Oct. 1991.
    • (1991) IEEE Trans. Comput. , vol.40 , Issue.10 , pp. 1159-1167
    • Dubey, P.1    Flynn, M.J.2
  • 12
    • 84937650914 scopus 로고
    • The CDC 7600 and SCOPE 76
    • T. H. Elrod, “The CDC 7600 and SCOPE 76,” Datamation, pp. 80–85, 1970.
    • (1970) Datamation , pp. 80-85
    • Elrod, T.H.1
  • 13
    • 84941488516 scopus 로고
    • Computer architecture - Designing for speed
    • D. FolgeT and E. Basart, “Computer architecture - Designing for speed,” in Proc. Spring COMPCON 1983, pp. 25–31.
    • (1983) Proc. Spring COMPCON. , pp. 25-31
    • FolgeT, D.1    Basart, E.2
  • 14
    • 84941518307 scopus 로고
    • Fujitsu Microelectronics
    • Fujitsu Microelectronics, Inc.
    • Fujitsu Microelectronics, Inc., MB86900 RISC Processor Architecture Manual, Fujitsu Microelectronics, Inc., 1987.
    • (1987) MB86900 RISC Processor Architecture Manual
  • 15
    • 0016595187 scopus 로고
    • Optimization of scalar instructions for the advanced scientific computer
    • S. N. Gaulding and D. P. Madison, Jr., “Optimization of scalar instructions for the advanced scientific computer,” in Proc. Spring COMPCON 1975, pp. 189–193.
    • (1975) Proc. Spring COMPCON 1975 , pp. 189-193
    • Gaulding, S.N.1    Madison, D.P.2
  • 18
    • 84941506066 scopus 로고
    • VAX address and instruction traces
    • R. R. Henry, “VAX address and instruction traces,” unpublished report, 1983.
    • (1983) unpublished report
    • Henry, R.R.1
  • 19
    • 0003789873 scopus 로고
    • Aspects of cache memory and instruction buffer performance
    • U.C. Berkeley, Tech. Rep. UCB/CSD 87/381, Nov.
    • M. D. Hill, “Aspects of cache memory and instruction buffer performance,” U.C. Berkeley Tech. Rep. UCB/CSD 87/381, Nov. 1987.
    • (1987)
    • Hill, M.D.1
  • 21
    • 0024607203 scopus 로고
    • The CLIPPER processor: Instruction set architecture and implementation
    • Feb.
    • W. Hollingsworth, H. Sachs, and A.J. Smith, “The CLIPPER processor: Instruction set architecture and implementation,” Commun. ACM, pp. 200–219, Feb. 1989.
    • (1989) Commun. ACM , pp. 200-219
    • Hollingsworth, W.1    Sachs, H.2    Smith, A.J.3
  • 22
    • 0003419929 scopus 로고
    • Computer Architecture and Parallel Processing
    • New York: McGraw-Hill
    • K. Hwang and F. A. Briggs, Computer Architecture and Parallel Processing. New York: McGraw-Hill, 1984, 714–729.
    • (1984) , pp. 714-729
    • Hwang, K.1    Briggs, F.A.2
  • 23
    • 84941497365 scopus 로고
    • IBM Maintenance Library System/370 Model 168 Theory of Operation/Diagrams Manual
    • IBM, Poughkeepsie, NY
    • IBM, “IBM Maintenance Library System/370 Model 168 Theory of Operation/Diagrams Manual,” vol. 2, IBM, Poughkeepsie, NY, 1973.
    • (1973) , vol.2
  • 24
    • 84941491890 scopus 로고
    • IBM Maintenance Library 3033 Processor Complex Theory of Operation/Diagrams Manual
    • IBM, Poughkeepsie, NY, Jan.
    • K. Hwang and F. A. Briggs, “IBM Maintenance Library 3033 Processor Complex Theory of Operation/Diagrams Manual,” vols. 1–3, IBM, Poughkeepsie, NY, Jan. 1978.
    • (1978) , vol.1-3
    • Hwang, K.1    Briggs, F.A.2
  • 25
    • 0020504403 scopus 로고
    • Performance measurements on HEP - A pipelined MIMD computer
    • H. F. Jordan, “Performance measurements on HEP - A pipelined MIMD computer,” in Proc. 10th Annu. Symp. Comput. Architecture, 1983, pp. 207–212.
    • (1983) Proc. 10th Annu. Symp. Comput. Architecture , pp. 207-212
    • Jordan, H.F.1
  • 26
    • 0026156263 scopus 로고
    • Branch history table prediction of moving target branches due to subroutine returns
    • May
    • D. Kaeli and P. Emma, “Branch history table prediction of moving target branches due to subroutine returns,” in Proc. 18th ISCA, and Comput. Architecture News, vol. 19, no. 3, pp. 34–41 May 1991.
    • (1991) Proc. 18th ISCA, and Comput. Architecture News , vol.19 , Issue.3 , pp. 34-41
    • Kaeli, D.1    Emma, P.2
  • 27
    • 0004088729 scopus 로고
    • MIPS RISC Architecture
    • Englewood Cliffs, NJ: Prentice-Hall
    • G. Kane, MIPS RISC Architecture. Englewood Cliffs, NJ: Prentice-Hall, 1989.
    • (1989)
    • Kane, G.1
  • 28
    • 9944240529 scopus 로고
    • Supercomputing Systems
    • New York: Van Nostrand Reinhold
    • S. P. Kartashev and S. I Kartashev, Supercomputing Systems. New York: Van Nostrand Reinhold, 1990, 106–153.
    • (1990) , pp. 106-153
    • Kartashev, S.P.1    Kartashev, S.I.2
  • 29
    • 0004172748 scopus 로고
    • The Architecture of Pipelined Computers
    • New York: McGraw-Hill
    • P. M. Kogge, The Architecture of Pipelined Computers. New York: McGraw-Hill, 1981.
    • (1981)
    • Kogge, P.M.1
  • 30
    • 0021204160 scopus 로고
    • Branch prediction strategies and branch target buffer design
    • Jan.
    • J. K. F. Lee and A. J. Smith, “Branch prediction strategies and branch target buffer design,” IEEE Comput. Mag., 6–22, Jan. 1984.
    • (1984) IEEE Comput. Mag. , pp. 6-22
    • Lee, J.K.F.1    Smith, A.J.2
  • 32
    • 84941506378 scopus 로고
    • Reducing the branch penalty in pipelined processors
    • D. J. Lilja, “Reducing the branch penalty in pipelined processors,” IEEE 1988.
    • (1988) IEEE
    • Lilja, D.J.1
  • 33
    • 0023643282 scopus 로고
    • Getting mainframe power out of a CISC Supermicro
    • Sept.
    • T. Manuel, “Getting mainframe power out of a CISC Supermicro,” Electronics, 66–69, Sept. 3, 1987.
    • (1987) Electronics , pp. 66-69
    • Manuel, T.1
  • 35
    • 84941502759 scopus 로고
    • M68000 8 - 116 -132-Bit Microprocessors User’s Manual
    • MotorolaEnglewood Cliffs, NJ: Prentice-Hall
    • Motorola, M68000 8-116-132-Bit Microprocessors User’s Manual. Englewood Cliffs, NJ: Prentice-Hall, 1989.
    • (1989)
  • 36
    • 28044467872 scopus 로고
    • The IBM 360/195
    • Apr.
    • J. O. Murphey and R. M. Wade, “The IBM 360/195,” Datamation, pp. 72–79, Apr. 1970.
    • (1970) Datamation , pp. 72-79
    • Murphey, J.O.1    Wade, R.M.2
  • 37
    • 0026918390 scopus 로고
    • Improving the accuracy of dynamic branch prediction using branch correlation
    • Boston, MA, Oct.
    • S.-T. Pan, K. So, and J. T. Rahmeh, “Improving the accuracy of dynamic branch prediction using branch correlation,” in Proc. ASPLOS V, Boston, MA, Oct. 1992.
    • (1992) Proc. ASPLOS V
    • Pan, S.T.1    So, K.2    Rahmeh, J.T.3
  • 39
    • 84941521519 scopus 로고
    • Branch target buffer design
    • U.C. Berkeley, Comput. Sci. Division Technical UCB/CSD, Dec.
    • C. H. Perleberg, “Branch target buffer design,” U.C. Berkeley Comput. Sci. Division Technical UCB/CSD Dec. 1989.
    • (1989)
    • Perleberg, C.H.1
  • 41
    • 84941488878 scopus 로고
    • Apparatus for branch prediction for computer instructions
    • U.S. Patent 4 914 579 April 3
    • M. Putrino, S. Vassiliadis, A. Huffman, and A. Ngai, “Apparatus for branch prediction for computer instructions,” U.S. Patent 4 914 579, April 3, 1990.
    • (1990)
    • Putrino, M.1    Vassiliadis, S.2    Huffman, A.3    Ngai, A.4
  • 42
  • 43
    • 0017428356 scopus 로고
    • The effect of instruction fetch strategies upon the performance of pipelined instruction units
    • B. R. Rau and G. E. Rossman, “The effect of instruction fetch strategies upon the performance of pipelined instruction units,” in Proc. 4th Annu. Symp. Comput. Architecture, 1977, pp. 80–87.
    • (1977) Proc. 4th Annu. Symp. Comput. Architecture , pp. 80-87
    • Rau, B.R.1    Rossman, G.E.2
  • 44
    • 0017922490 scopus 로고
    • The CRAY-1 computer system
    • Jan.
    • R. M. Russell, “The CRAY-1 computer system,” Commun. ACM, pp. 63–72, Jan. 1978.
    • (1978) Commun. ACM , pp. 63-72
    • Russell, R.M.1
  • 45
    • 84912761651 scopus 로고
    • TRON Project 1987
    • Berlin, Germany: Springer-Verlag
    • K. Sakamura, TRON Project 1987. Berlin, Germany: Springer-Verlag, 1987.
    • (1987)
    • Sakamura, K.1
  • 46
    • 84937647458 scopus 로고
    • A multiminiprocessor system implemented through pipelining
    • Feb.
    • L. E. Shar and E. S. Davidson, “A multiminiprocessor system implemented through pipelining,” IEEE Comput. Mag., pp. 42–51, Feb. 1974.
    • (1974) IEEE Comput. Mag. , pp. 42-51
    • Shar, L.E.1    Davidson, E.S.2
  • 47
    • 0018106484 scopus 로고
    • Sequential program prefetching in memory hierarchies
    • A. J. Smith, “Sequential program prefetching in memory hierarchies,” IEEE Comput. Mag., vol. 11, no. 12, pp. 7–21, Dec. 1978.
    • (1978) IEEE Comput. Mag. , vol.11 , Issue.12 , pp. 7-21
    • Smith, A.J.1
  • 48
    • 0022252426 scopus 로고
    • Cache evaluation and the impact of workload choice
    • June
    • A. J. Smith, “Cache evaluation and the impact of workload choice,” in Proc. 12th Symp. Comput Architecture, June 1985, pp. 64–74.
    • (1985) Proc. 12th Symp. Comput Architecture , pp. 64-74
    • Smith, A.J.1
  • 49
    • 84939323181 scopus 로고
    • Line (block) size choice for CPU cache memories
    • Sept.
    • A. J. Smith, “Line (block) size choice for CPU cache memories,” IEEE Trans. Comput., pp. 1063–1075, Sept. 1987.
    • (1987) IEEE Trans. Comput. , pp. 1063-1075
    • Smith, A.J.1
  • 50
  • 51
    • 0020564767 scopus 로고
    • A study of instruction cache organizations and replacement policies
    • June
    • J. E. Smith and J. R. Goodman, “A study of instruction cache organizations and replacement policies,” in Proc. 10th Symp. Comput. Architecture, June 1983, pp. 132–137.
    • (1983) Proc. 10th Symp. Comput. Architecture , pp. 132-137
    • Smith, J.E.1    Goodman, J.R.2
  • 52
    • 0024611720 scopus 로고
    • Pipeline control for a single cycle VLSI implementation of a complex instruction set computer
    • D. R. Stiles and H. L. McFarland, “Pipeline control for a single cycle VLSI implementation of a complex instruction set computer,” in Proc. Spring COMPCON 1989, pp. 504–508.
    • (1989) Proc. Spring COMPCON 1989 , pp. 504-508
    • Stiles, D.R.1    McFarland, H.L.2
  • 53
    • 84941507309 scopus 로고
    • (of NexGen microsystems), personal interview concerning branch prediction cache of NexGen processor, Sept. 25
    • D. R. Stiles (of NexGen microsystems), personal interview concerning branch prediction cache of NexGen processor, Sept. 25, 1989.
    • (1989)
    • Stiles, D.R.1
  • 54
    • 85042605140 scopus 로고
    • Parallel operation in the Control Data 6600
    • J. E. Thornton, “Parallel operation in the Control Data 6600,” in Proc. AF1PS Pall Joint Comp. Conf., 1964, pp. 33–40.
    • (1964) Proc. AF1PS Pall Joint Comp. Conf. , pp. 33-40
    • Thornton, J.E.1
  • 55
    • 1542661726 scopus 로고
    • SPICE2G.6
    • U.C. Berkeley CAD/IC Group, Mar.
    • U. C. Berkeley CAD/IC Group, “SPICE2G.6,” Mar. 1987.
    • (1987)
  • 56
    • 85034094146 scopus 로고
    • Two Level Adaptive Training Branch Prediction
    • Nov.
    • T.-Y. Yeh and Y. Patt, “Two Level Adaptive Training Branch Prediction,” in Proc. MICRO-24, Nov. 1991, pp. 51–61.
    • (1991) Proc. MICRO-24 , pp. 51-61
    • Yeh, T.Y.1    Patt, Y.2
  • 57
    • 0026867221 scopus 로고
    • Alternative implementations of two-level adaptive branch prediction
    • May
    • T.-Y. Yeh and Y. Patt, “Alternative implementations of two-level adaptive branch prediction,” in Proc. ISCA-19 and Comput. Architecture News, vol. 20, no. 2, pp. 124–135. May 1992.
    • (1992) Proc. ISCA-19 and Comput. Architecture News , vol.20 , Issue.2 , pp. 124-135
    • Yeh, T.Y.1    Patt, Y.2
  • 58
    • 84941512889 scopus 로고
    • of Edgecore), personal interview concerning branch cache in Edge 2000, Sept. 20
    • S. Walter (of Edgecore), personal interview concerning branch cache in Edge 2000, Sept. 20, 1989.
    • (1989)
    • Walter, S.1
  • 59
    • 84941511759 scopus 로고
    • Jump prediction
    • unpublished draft, Feb.
    • L. C. Widdoes, Jr., “Jump prediction,” Stanford Univ., unpublished draft, Feb. 1977.
    • (1977)
    • Widdoes, L.C.1
  • 60
    • 33747448202 scopus 로고
    • The Mitsubishi VLSI CPU in the TRON Project
    • Apr.
    • T. Yoshida and T. Enomoto, “The Mitsubishi VLSI CPU in the TRON Project,” IEEE Micro, p. 24, Apr. 1987.
    • (1987) IEEE Micro , pp. 24
    • Yoshida, T.1    Enomoto, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.