메뉴 건너뛰기




Volumn 80, Issue 9, 2004, Pages 451-468

Speeding up architectural simulations for high-performance processors

Author keywords

Architectural design; Computer architecture; Reduced input sets; Statistical simulation; Trace sampling

Indexed keywords

COMPUTER ARCHITECTURE; COMPUTER SOFTWARE; MICROPROCESSOR CHIPS; PERFORMANCE; PROBABILITY; STATISTICAL METHODS;

EID: 11844256347     PISSN: 00375497     EISSN: None     Source Type: Journal    
DOI: 10.1177/0037549704044326     Document Type: Article
Times cited : (5)

References (86)
  • 2
    • 0032070245 scopus 로고    scopus 로고
    • Performance analysis and its impact on design
    • Bose, P., and T. M. Conte. 1998. Performance analysis and its impact on design. IEEE Computer 31 (5): 41-49.
    • (1998) IEEE Computer , vol.31 , Issue.5 , pp. 41-49
    • Bose, P.1    Conte, T.M.2
  • 3
    • 0032658099 scopus 로고    scopus 로고
    • Challenges in processor modeling and validation
    • Bose, P., T. M. Conte, and T. M. Austin. 1999. Challenges in processor modeling and validation. IEEE Micro 19 (3): 9-14.
    • (1999) IEEE Micro , vol.19 , Issue.3 , pp. 9-14
    • Bose, P.1    Conte, T.M.2    Austin, T.M.3
  • 4
    • 0032626114 scopus 로고    scopus 로고
    • Designing an Alpha microprocessor
    • Reilly, M. 1999. Designing an Alpha microprocessor. IEEE Computer 32 (7): 27-34.
    • (1999) IEEE Computer , vol.32 , Issue.7 , pp. 27-34
    • Reilly, M.1
  • 7
    • 0034226001 scopus 로고    scopus 로고
    • SPEC CPU2000: Measuring CPU performance in the new millennium
    • Henning, J. L. 2000. SPEC CPU2000: Measuring CPU performance in the new millennium. IEEE Computer 33 (7): 28-35.
    • (2000) IEEE Computer , vol.33 , Issue.7 , pp. 28-35
    • Henning, J.L.1
  • 9
    • 0036469652 scopus 로고    scopus 로고
    • SimpleScalar: An infrastructure for computer system modeling
    • Austin, T., E. Larson, and D. Ernst. 2002. SimpleScalar: An infrastructure for computer system modeling. IEEE Computer 35 (2): 59-67.
    • (2002) IEEE Computer , vol.35 , Issue.2 , pp. 59-67
    • Austin, T.1    Larson, E.2    Ernst, D.3
  • 10
    • 0242567636 scopus 로고    scopus 로고
    • Accurate statistical workload modeling
    • Ph.D. diss., Ghent University, Belgium
    • Eeckhout, L. 2002. Accurate statistical workload modeling. Ph.D. diss., Ghent University, Belgium.
    • (2002)
    • Eeckhout, L.1
  • 15
    • 3042722196 scopus 로고    scopus 로고
    • Efficient microprocessor design space exploration through statistical simulation
    • April. Accepted for publication
    • Eeckhout, L., D. Stroobandt, and K. De Bosschere. 2003. Efficient microprocessor design space exploration through statistical simulation. In 36th Annual Simulation Symposium, April. Accepted for publication.
    • (2003) 36th Annual Simulation Symposium
    • Eeckhout, L.1    Stroobandt, D.2    De Bosschere, K.3
  • 23
    • 0030129806 scopus 로고    scopus 로고
    • MIPS R10000 superscalar microprocessor
    • Yeager, K. C. 1996. MIPS R10000 superscalar microprocessor, IEEE Micro 16 (2): 28-40.
    • (1996) IEEE Micro , vol.16 , Issue.2 , pp. 28-40
    • Yeager, K.C.1
  • 25
    • 0028416719 scopus 로고
    • Instruction window size trade-offs and characterization of program parallelism
    • Dubey, P. K.,G.B.Adams III, and M. J. Flynn. 1994. Instruction window size trade-offs and characterization of program parallelism. IEEE Transactions on Computers 43 (4): 431-42.
    • (1994) IEEE Transactions on Computers , vol.43 , Issue.4 , pp. 431-442
    • Dubey, P.K.1    Adams III, G.B.2    Flynn, M.J.3
  • 26
    • 0028404528 scopus 로고
    • Dynamic trace analysis for analytic modeling of superscalar performance
    • Kamin, R. A., III, G. B. Adams III, and P. K. Dubey. 1994. Dynamic trace analysis for analytic modeling of superscalar performance. Performance Evaluation 19 (2-3): 259-76.
    • (1994) Performance Evaluation , vol.19 , Issue.2-3 , pp. 259-276
    • Kamin III, R.A.1    Adams III, G.B.2    Dubey, P.K.3
  • 30
    • 0031140923 scopus 로고    scopus 로고
    • Understanding some simple processor-performance limits
    • Emma, P. G. 1997. Understanding some simple processor-performance limits. IBM Journal of Research and Development 41 (3): 215-32.
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.3 , pp. 215-232
    • Emma, P.G.1
  • 31
    • 0032592098 scopus 로고    scopus 로고
    • Deep-submicron microprocessor design issues
    • Flynn, M. J., P. Hung, and K. W. Rudd. 1999. Deep-submicron microprocessor design issues. IEEE Micro 19 (4): 11-22.
    • (1999) IEEE Micro , vol.19 , Issue.4 , pp. 11-22
    • Flynn, M.J.1    Hung, P.2    Rudd, K.W.3
  • 35
    • 0842286394 scopus 로고
    • Systematic computer architecture prototyping
    • Ph.D. diss., University of Illinois at Urbana-Champaign
    • Conte, T. M. 1992. Systematic computer architecture prototyping. Ph.D. diss., University of Illinois at Urbana-Champaign.
    • (1992)
    • Conte, T.M.1
  • 36
    • 3042767937 scopus 로고    scopus 로고
    • Abstraction via separable components: An empirical study of absolute and relative accuracy in processor performance modeling
    • Technical Report RC 21909, IBM Research Division, T. J. Watson Research Center
    • Brooks, D., M. Martonosi, and P. Bose. 2000. Abstraction via separable components: An empirical study of absolute and relative accuracy in processor performance modeling. Technical Report RC 21909, IBM Research Division, T. J. Watson Research Center.
    • (2000)
    • Brooks, D.1    Martonosi, M.2    Bose, P.3
  • 40
    • 0030285193 scopus 로고    scopus 로고
    • Analysis of benchmark characteristics and benchmark performance prediction
    • Saavedra, R. H., and A. J. Smith. 1996. Analysis of benchmark characteristics and benchmark performance prediction. ACM Transactions on Computer Systems 14 (4): 344-84.
    • (1996) ACM Transactions on Computer Systems , vol.14 , Issue.4 , pp. 344-384
    • Saavedra, R.H.1    Smith, A.J.2
  • 43
    • 0033220924 scopus 로고    scopus 로고
    • Branch prediction, instruction-window size, and cache size: Perforrnance tradeoffs and simulation techniques
    • Skadron, K., P. S. Ahuja, M. Martonosi, and D. W. Clark. 1999. Branch prediction, instruction-window size, and cache size: Perforrnance tradeoffs and simulation techniques. IEEE Transactions on Computers 48 (11): 1260-81.
    • (1999) IEEE Transactions on Computers , vol.48 , Issue.11 , pp. 1260-1281
    • Skadron, K.1    Ahuja, P.S.2    Martonosi, M.3    Clark, D.W.4
  • 44
    • 11844260997 scopus 로고
    • Profile-driven sampled trace generation
    • Technical Report RE 20041, IBM Research Division, T. J. Watson Research Center
    • Debbie, P. K., and R. Nair. 1995. Profile-driven sampled trace generation. Technical Report RE 20041, IBM Research Division, T. J. Watson Research Center.
    • (1995)
    • Debbie, P.K.1    Nair, R.2
  • 45
    • 0006637419 scopus 로고    scopus 로고
    • Evaluation and generation of reduced traces for benchmarks
    • Technical Report RC 20610, IBM Research Division, T. J. Watson Research Center
    • Iyengar, V. S., and L. H. Trevillyan. 1996. Evaluation and generation of reduced traces for benchmarks. Technical Report RC 20610, IBM Research Division, T. J. Watson Research Center.
    • (1996)
    • Iyengar, V.S.1    Trevillyan, L.H.2
  • 47
    • 0034317861 scopus 로고    scopus 로고
    • Validating trace-driven microarchitectural simulations
    • Khalid, H. 2000. Validating trace-driven microarchitectural simulations. IEEE Micro 20 (6): 76-82.
    • (2000) IEEE Micro , vol.20 , Issue.6 , pp. 76-82
    • Khalid, H.1
  • 49
    • 0024107186 scopus 로고
    • Accurate low-cost methods for performance evaluation of cache memory systems
    • Laha, S., J. H. Patel, and R. K. Iyer. 1988. Accurate low-cost methods for performance evaluation of cache memory systems. IEEE Transactions on Computers 37 (11): 1325-36.
    • (1988) IEEE Transactions on Computers , vol.37 , Issue.11 , pp. 1325-1336
    • Laha, S.1    Patel, J.H.2    Iyer, R.K.3
  • 50
    • 1842871851 scopus 로고
    • Accelerating architectural simulation by parallel execution of trace samples
    • Technical Report SMLI TR-93-22, Sun Microsystems Laboratories, Inc
    • Lauterbach, G. 1993. Accelerating architectural simulation by parallel execution of trace samples. Technical Report SMLI TR-93-22, Sun Microsystems Laboratories, Inc.
    • (1993)
    • Lauterbach, G.1
  • 52
    • 0028449945 scopus 로고
    • The PowerPC performance modeling methodology
    • Poursepanj, A. 1994. The PowerPC performance modeling methodology. Communications of the ACM 37 (6): 47-55.
    • (1994) Communications of the ACM , vol.37 , Issue.6 , pp. 47-55
    • Poursepanj, A.1
  • 53
    • 0003557978 scopus 로고    scopus 로고
    • Time varying behavior of programs
    • Technical Report UCSD CS99-630, Department of Computer Science and Engineering, University of California, San Diego
    • Sherwood, T., and B. Calder. 1999. Time varying behavior of programs. Technical Report UCSD CS99-630, Department of Computer Science and Engineering, University of California, San Diego.
    • (1999)
    • Sherwood, T.1    Calder, B.2
  • 57
    • 0028445155 scopus 로고
    • A comparison of trace-sampling techniques for multi-megabyte caches
    • Kessler, R. E., M. D. Hill, and D. A. Wood. 1994. A comparison of trace-sampling techniques for multi-megabyte caches. IEEE Transactions on Computers 43 (6): 664-75.
    • (1994) IEEE Transactions on Computers , vol.43 , Issue.6 , pp. 664-675
    • Kessler, R.E.1    Hill, M.D.2    Wood, D.A.3
  • 61
    • 0012580979 scopus 로고    scopus 로고
    • Memory reference reuse latency: Accelerated sampled microarchitecture simulation
    • Technical Report CS-2002-19, Department of Computer Science, University of Virginia
    • Haskins, J. W., Jr., and K. Skadron. 2002. Memory reference reuse latency: Accelerated sampled microarchitecture simulation. Technical Report CS-2002-19, Department of Computer Science, University of Virginia.
    • (2002)
    • Haskins Jr., J.W.1    Skadron, K.2
  • 64
    • 85008031236 scopus 로고    scopus 로고
    • MinneSPEC: A new SPEC benchmark workload for simulation-based computer architecture research
    • June
    • Kleinosowski, A. J., and D. J. Lilja. 2002. MinneSPEC: A new SPEC benchmark workload for simulation-based computer architecture research. Computer Architecture Letters, June.
    • (2002) Computer Architecture Letters
    • Kleinosowski, A.J.1    Lilja, D.J.2
  • 65
    • 0037325558 scopus 로고    scopus 로고
    • Designing workloads for computer architecture research
    • Eeckhout, L., H. Vandierendonck, and K. De Bosschere. 2003. Designing workloads for computer architecture research. IEEE Computer 36 (2): 65-71.
    • (2003) IEEE Computer , vol.36 , Issue.2 , pp. 65-71
    • Eeckhout, L.1    Vandierendonck, H.2    De Bosschere, K.3
  • 66
    • 2942744016 scopus 로고    scopus 로고
    • Techniques for accurate, accelerated processor simulation: An analysis of reduced inputs and sampling
    • Technical Report CS-2002-01, University of Virginia, Department of Computer Science
    • Haskins, J. W., Jr., K. Skadron, A. J. KleinOsowski, and D. J. Lilja. 2002. Techniques for accurate, accelerated processor simulation: An analysis of reduced inputs and sampling. Technical Report CS-2002-01, University of Virginia, Department of Computer Science.
    • (2002)
    • Haskins Jr., J.W.1    Skadron, K.2    Kleinosowski, A.J.3    Lilja, D.J.4
  • 68
    • 0032075553 scopus 로고    scopus 로고
    • Performance simulation of an Alpha microprocessor
    • Reilly, M., and J. Edmondson. 1998. Performance simulation of an Alpha microprocessor. IEEE Computer 31 (5): 50-58.
    • (1998) IEEE Computer , vol.31 , Issue.5 , pp. 50-58
    • Reilly, M.1    Edmondson, J.2
  • 70
    • 0032683935 scopus 로고    scopus 로고
    • Environment for PowerPC microarchitecture exploration
    • Moudgill, M., J.-D. Wellman, and J. H. Moreno. 1999. Environment for PowerPC microarchitecture exploration. IEEE Micro 19 (3): 15-25.
    • (1999) IEEE Micro , vol.19 , Issue.3 , pp. 15-25
    • Moudgill, M.1    Wellman, J.-D.2    Moreno, J.H.3
  • 72
    • 0002986475 scopus 로고    scopus 로고
    • The SimpleScalar tool set
    • for more information
    • Burger, D. C., and T. M. Austin. 1997. The SimpleScalar tool set. Computer Architecmre News, 1997. See also http:/ /www.simplescalar.com for more information
    • (1997) Computer Architecmre News
    • Burger, D.C.1    Austin, T.M.2
  • 73
    • 0036470602 scopus 로고    scopus 로고
    • Rsim: Simulating shared-memory multiprocessors with ILP processors
    • Hughes, C. J., V. S. Pai, P. Ranganathan, and S. V. Adve. 2002. Rsim: Simulating shared-memory multiprocessors with ILP processors. IEEE Computer 35 (2): 40-49.
    • (2002) IEEE Computer , vol.35 , Issue.2 , pp. 40-49
    • Hughes, C.J.1    Pai, V.S.2    Ranganathan, P.3    Adve, S.V.4
  • 77
    • 0032069891 scopus 로고    scopus 로고
    • Calibration of microprocessor performance models
    • Black, B., and J. P. Shen. 1998. Calibration of microprocessor performance models. IEEE Computer 31 (5): 59-65.
    • (1998) IEEE Computer , vol.31 , Issue.5 , pp. 59-65
    • Black, B.1    Shen, J.P.2
  • 83
    • 11844260345 scopus 로고    scopus 로고
    • Performance evaluation of processor operation using trace pre-processing
    • Bose, P. 2000. Performance evaluation of processor operation using trace pre-processing. U.S. Patent 6,059,835.
    • (2000) U.S. Patent 6,059,835
    • Bose, P.1
  • 86
    • 0039771963 scopus 로고    scopus 로고
    • Efficient performance prediction for modern microprocessors
    • Ph.D. diss., Stanford University
    • Ofelt, D. J. 1999. Efficient performance prediction for modern microprocessors. Ph.D. diss., Stanford University.
    • (1999)
    • Ofelt, D.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.