메뉴 건너뛰기




Volumn , Issue , 2014, Pages 97-108

Aladdin: A pre-RTL, power-performance accelerator simulator enabling large design space exploration of customized architectures

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPLICATION SPECIFIC INTEGRATED CIRCUITS; COMPUTER ARCHITECTURE; SYSTEM-ON-CHIP;

EID: 84905487457     PISSN: 10636897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISCA.2014.6853196     Document Type: Conference Paper
Times cited : (265)

References (56)
  • 3
    • 0036469652 scopus 로고    scopus 로고
    • Simplescalar: An infrastructure for computer system modeling
    • T. M. Austin, E. Larson, and D. Ernst, "Simplescalar: An infrastructure for computer system modeling," IEEE Computer, 2002.
    • (2002) IEEE Computer
    • Austin, T.M.1    Larson, E.2    Ernst, D.3
  • 4
    • 0026867085 scopus 로고
    • Dynamic dependency analysis of ordinary programs
    • T. M. Austin and G. S. Sohi, "Dynamic dependency analysis of ordinary programs," in ISCA, 1992.
    • (1992) ISCA
    • Austin, T.M.1    Sohi, G.S.2
  • 6
    • 84881175680 scopus 로고    scopus 로고
    • Continuous real-world inputs can open up alternative accelerator designs
    • B. Belhadj, A. Joubert, Z. Li, R. Héliot, and O. Temam, "Continuous real-world inputs can open up alternative accelerator designs," in ISCA, 2013.
    • (2013) ISCA
    • Belhadj, B.1    Joubert, A.2    Li, Z.3    Héliot, R.4    Temam, O.5
  • 8
    • 0033719421 scopus 로고    scopus 로고
    • Wattch: A framework for architectural-level power analysis and optimizations
    • D. Brooks, V. Tiwari, and M. Martonosi, "Wattch: A framework for architectural-level power analysis and optimizations," in ISCA, 2000.
    • (2000) ISCA
    • Brooks, D.1    Tiwari, V.2    Martonosi, M.3
  • 9
    • 0029666646 scopus 로고    scopus 로고
    • Memory bandwidth limitations of future microprocessors
    • D. Burger, J. R. Goodman, and A. Kagi, "Memory bandwidth limitations of future microprocessors," in ISCA, 1996.
    • (1996) ISCA
    • Burger, D.1    Goodman, J.R.2    Kagi, A.3
  • 12
    • 84881142714 scopus 로고    scopus 로고
    • Linqits: Big data on little clients
    • E. S. Chung, J. D. Davis, and J. Lee, "Linqits: big data on little clients," ISCA, 2013.
    • (2013) ISCA
    • Chung, E.S.1    Davis, J.D.2    Lee, J.3
  • 13
    • 79951696448 scopus 로고    scopus 로고
    • Single-chip heterogeneous computing: Does the future include custom logic, fpgas, and gpgpus?
    • E. S. Chung, P. A. Milder, J. C. Hoe, and K. Mai, "Single-chip heterogeneous computing: Does the future include custom logic, fpgas, and gpgpus?" in MICRO, 2010.
    • (2010) MICRO
    • Chung, E.S.1    Milder, P.A.2    Hoe, J.C.3    Mai, K.4
  • 14
    • 52649095061 scopus 로고    scopus 로고
    • Veal: Virtualized execution accelerator for loops
    • N. Clark, A. Hormati, and S. A. Mahlke, "Veal: Virtualized execution accelerator for loops," in ISCA, 2008.
    • (2008) ISCA
    • Clark, N.1    Hormati, A.2    Mahlke, S.A.3
  • 15
    • 2442428419 scopus 로고    scopus 로고
    • Application-specific instruction generation for configurable processor architectures
    • J. Cong, Y. Fan, G. Han, and Z. Zhang, "Application-specific instruction generation for configurable processor architectures," in FPGA, 2004.
    • (2004) FPGA
    • Cong, J.1    Fan, Y.2    Han, G.3    Zhang, Z.4
  • 16
    • 67650692183 scopus 로고    scopus 로고
    • Synthesis of reconfigurable highperformance multicore systems
    • J. Cong, K. Gururaj, and G. Han, "Synthesis of reconfigurable highperformance multicore systems," in FPGA, 2009.
    • (2009) FPGA
    • Cong, J.1    Gururaj, K.2    Han, G.3
  • 19
    • 84876591853 scopus 로고    scopus 로고
    • Neural acceleration for general-purpose approximate programs
    • H. Esmaeilzadeh, A. Sampson, L. Ceze, and D. Burger, "Neural acceleration for general-purpose approximate programs," in MICRO, 2012.
    • (2012) MICRO
    • Esmaeilzadeh, H.1    Sampson, A.2    Ceze, L.3    Burger, D.4
  • 20
    • 80052679438 scopus 로고    scopus 로고
    • Buffer-integrated-cache: A cost-effective sram architecture for handheld and embedded platforms
    • C. F. Fajardo, Z. Fang, R. Iyer, G. F. Garcia, S. E. Lee, and L. Zhao, "Buffer-integrated-cache: A cost-effective sram architecture for handheld and embedded platforms," in DAC, 2011.
    • (2011) DAC
    • Fajardo, C.F.1    Fang, Z.2    Iyer, R.3    Garcia, G.F.4    Lee, S.E.5    Zhao, L.6
  • 22
    • 0036296821 scopus 로고    scopus 로고
    • Slack: Maximizing performance under technological constraints
    • B. A. Fields, R. Bodk, and M. D. Hill, "Slack: Maximizing performance under technological constraints," in ISCA, 2002.
    • (2002) ISCA
    • Fields, B.A.1    Bodk, R.2    Hill, M.D.3
  • 24
    • 79959906704 scopus 로고    scopus 로고
    • Kremlin: Rethinking and rebooting gprof for the multicore age
    • S. Garcia, D. Jeon, C. M. Louie, and M. B. Taylor, "Kremlin: rethinking and rebooting gprof for the multicore age," in PLDI, 2011.
    • (2011) PLDI
    • Garcia, S.1    Jeon, D.2    Louie, C.M.3    Taylor, M.B.4
  • 26
    • 84887502088 scopus 로고    scopus 로고
    • Breaking simd shackles with an exposed flexible microarchitecture and the access execute pdg
    • V. Govindaraju, T. Nowatzki, and K. Sankaralingam, "Breaking simd shackles with an exposed flexible microarchitecture and the access execute pdg," in PACT, 2013.
    • (2013) PACT
    • Govindaraju, V.1    Nowatzki, T.2    Sankaralingam, K.3
  • 27
    • 84863374615 scopus 로고    scopus 로고
    • Bundled execution of recurring traces for energy-efficient general purpose processing
    • S. Gupta, S. Feng, A. Ansari, S. Mahlke, and D. August, "Bundled execution of recurring traces for energy-efficient general purpose processing," in MICRO, 2011.
    • (2011) MICRO
    • Gupta, S.1    Feng, S.2    Ansari, A.3    Mahlke, S.4    August, D.5
  • 29
    • 84905475765 scopus 로고    scopus 로고
    • Optimal huffman tree-height reduction for instruction-level parallelism
    • Department of Computer Sciences The University of Texas at Austin
    • W. Hunt, B. A. Maher, D. Burger, and K. S. Mckinley, "Optimal huffman tree-height reduction for instruction-level parallelism," Technical Report TR-08-34, Department of Computer Sciences The University of Texas at Austin, 2008.
    • (2008) Technical Report TR-08-34
    • Hunt, W.1    Maher, B.A.2    Burger, D.3    McKinley, K.S.4
  • 30
    • 77952985184 scopus 로고    scopus 로고
    • Code coverage and input variability: Effects on architecture and compiler research
    • H. C. Hunter andW. meiW. Hwu, "Code coverage and input variability: effects on architecture and compiler research," in CASES, 2002.
    • (2002) CASES
    • Hunter, H.C.1    Mei, W.2    Hwu, W.3
  • 31
    • 81455154902 scopus 로고    scopus 로고
    • Kismet: Parallel speedup estimates for serial programs
    • D. Jeon, S. Garcia, C. M. Louie, and M. B. Taylor, "Kismet: parallel speedup estimates for serial programs," in OOPSLA, 2011.
    • (2011) OOPSLA
    • Jeon, D.1    Garcia, S.2    Louie, C.M.3    Taylor, M.B.4
  • 32
    • 79951696651 scopus 로고    scopus 로고
    • Sd3: A scalable approach to dynamic data-dependence profiling
    • M. Kim, H. Kim, and C.-K. Luk, "Sd3: A scalable approach to dynamic data-dependence profiling," in MICRO, 2010.
    • (2010) MICRO
    • Kim, M.1    Kim, H.2    Luk, C.-K.3
  • 33
    • 0024068822 scopus 로고
    • Measuring parallelism in computation-intensive scientific/engineering applications
    • M. Kumar, "Measuring parallelism in computation-intensive scientific/engineering applications," IEEE Trans. Computers, 1988.
    • (1988) IEEE Trans. Computers
    • Kumar, M.1
  • 34
    • 0026867146 scopus 로고
    • Limits of control flow on parallelism
    • M. S. Lam and R. P. Wilson, "Limits of control flow on parallelism," in ISCA, 1992.
    • (1992) ISCA
    • Lam, M.S.1    Wilson, R.P.2
  • 37
    • 76749146060 scopus 로고    scopus 로고
    • Mcpat: An integrated power, area, and timing modeling framework for multicore and manycore architectures
    • S. Li, J. H. Ahn, R. D. Strong, J. B. Brockman, D. M. Tullsen, and N. P. Jouppi, "Mcpat: an integrated power, area, and timing modeling framework for multicore and manycore architectures," in MICRO, 2009.
    • (2009) MICRO
    • Li, S.1    Ahn, J.H.2    Strong, R.D.3    Brockman, J.B.4    Tullsen, D.M.5    Jouppi, N.P.6
  • 38
    • 84881144734 scopus 로고    scopus 로고
    • Thin servers with smart pipes: Designing soc accelerators for memcached
    • K. T. Lim, D. Meisner, A. G. Saidi, P. Ranganathan, and T. F. Wenisch, "Thin servers with smart pipes: designing soc accelerators for memcached," in ISCA, 2013.
    • (2013) ISCA
    • Lim, K.T.1    Meisner, D.2    Saidi, A.G.3    Ranganathan, P.4    Wenisch, T.F.5
  • 39
    • 84879851819 scopus 로고    scopus 로고
    • On learning-based methods for designspace exploration with high-level synthesis
    • H.-Y. Liu and L. P. Carloni, "On learning-based methods for designspace exploration with high-level synthesis," in DAC, 2013.
    • (2013) DAC
    • Liu, H.-Y.1    Carloni, L.P.2
  • 40
    • 84862058364 scopus 로고    scopus 로고
    • Compositional system-level design exploration with planning of high-level synthesis
    • H.-Y. Liu, M. Petracca, and L. P. Carloni, "Compositional system-level design exploration with planning of high-level synthesis," in DATE, 2012.
    • (2012) DATE
    • Liu, H.-Y.1    Petracca, M.2    Carloni, L.P.3
  • 41
    • 40349109005 scopus 로고    scopus 로고
    • Pathexpander: Architectural support for increasing the path coverage of dynamic bug detection
    • S. Lu, P. Zhou, W. Liu, Y. Zhou, and J. Torrellas, "Pathexpander: Architectural support for increasing the path coverage of dynamic bug detection," in MICRO, 2006.
    • (2006) MICRO
    • Lu, S.1    Zhou, P.2    Liu, W.3    Zhou, Y.4    Torrellas, J.5
  • 44
    • 84863430504 scopus 로고
    • Measuring limits of parallelism and characterizing its vulnerability to resource constraints
    • L. Rauchwerger, P. K. Dubey, and R. Nair, "Measuring limits of parallelism and characterizing its vulnerability to resource constraints," in MICRO, 1993.
    • (1993) MICRO
    • Rauchwerger, L.1    Dubey, P.K.2    Nair, R.3
  • 45
    • 84889594827 scopus 로고    scopus 로고
    • Quantifying acceleration: Power/performance trade-offs of application kernels in hardware
    • B. Reagen, Y. S. Shao, G.-Y. Wei, and D. Brooks, "Quantifying acceleration: Power/performance trade-offs of application kernels in hardware," in ISLPED, 2013.
    • (2013) ISLPED
    • Reagen, B.1    Shao, Y.S.2    Wei, G.-Y.3    Brooks, D.4
  • 47
    • 84880285819 scopus 로고    scopus 로고
    • Sonic millip3de: A massively parallel 3d-stacked accelerator for 3d ultrasound
    • R. Sampson, M. Yang, S. Wei, C. Chakrabarti, and T. F. Wenisch, "Sonic millip3de: A massively parallel 3d-stacked accelerator for 3d ultrasound," in HPCA, 2013.
    • (2013) HPCA
    • Sampson, R.1    Yang, M.2    Wei, S.3    Chakrabarti, C.4    Wenisch, T.F.5
  • 48
    • 34249810603 scopus 로고    scopus 로고
    • Nosq: Store-load communication without a store queue
    • T. Sha, M. M. K. Martin, and A. Roth, "Nosq: Store-load communication without a store queue," in MICRO, 2006.
    • (2006) MICRO
    • Sha, T.1    Martin, M.M.K.2    Roth, A.3
  • 49
    • 84881437667 scopus 로고    scopus 로고
    • Isa-independent workload characterization and its implications for specialized architectures
    • Y. S. Shao and D. Brooks, "Isa-independent workload characterization and its implications for specialized architectures," in ISPASS, 2013.
    • (2013) ISPASS
    • Shao, Y.S.1    Brooks, D.2
  • 50
    • 84864858301 scopus 로고    scopus 로고
    • A defect-tolerant accelerator for emerging highperformance applications
    • O. Temam, "A defect-tolerant accelerator for emerging highperformance applications," in ISCA, 2012.
    • (2012) ISCA
    • Temam, O.1
  • 51
    • 0026989702 scopus 로고
    • On the limits of program parallelism and its smoothability
    • K. B. Theobald, G. R. Gao, and L. J. Hendren, "On the limits of program parallelism and its smoothability," in MICRO, 1992.
    • (1992) MICRO
    • Theobald, K.B.1    Gao, G.R.2    Hendren, L.J.3
  • 53
    • 0026137115 scopus 로고
    • Limits of instruction-level parallelism
    • D.W. Wall, "Limits of instruction-level parallelism," in ASPLOS, 1991.
    • (1991) ASPLOS
    • Wall, D.W.1
  • 55
    • 84881185269 scopus 로고    scopus 로고
    • Navigating big data with high-throughput, energy-efficient data partitioning
    • L. Wu, R. J. Barker, M. A. Kim, and K. A. Ross, "Navigating big data with high-throughput, energy-efficient data partitioning," in ISCA, 2013.
    • (2013) ISCA
    • Wu, L.1    Barker, R.J.2    Kim, M.A.3    Ross, K.A.4
  • 56
    • 84893898462 scopus 로고    scopus 로고
    • A 3d-stacked logic-in-memory accelerator for application-specific data intensive computing
    • Q. Zhu, B. Akin, H. E. Sumbul, F. Sadi, J. Hoe, L. Pileggi, and F. Franchetti, "A 3d-stacked logic-in-memory accelerator for application-specific data intensive computing," in 3DIC, 2013.
    • (2013) 3DIC
    • Zhu, Q.1    Akin, B.2    Sumbul, H.E.3    Sadi, F.4    Hoe, J.5    Pileggi, L.6    Franchetti, F.7


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.