HybDTM: A coordinated hardware-software approach for dynamic thermal management

Amit Kumar, Li Shang, Li Shiuan Peh, Niraj K. Jha

Research output: Chapter in Book/Report/Conference proceedingConference contribution

100 Scopus citations

Abstract

With ever-increasing power density and cooling costs in modern high-performance systems, dynamic thermal management (DTM) has emerged as an effective technique for guaranteeing thermal safety at run-time. While past works on DTM have focused on different techniques in isolation, they fail to consider a synergistic mechanism using both hardware and software support and hence lead to a significant execution time overhead.In this paper, we propose HybDTM, a methodology for fine-grained, coordinated thermal management using a hybrid of hardware techniques, such as clock gating, and software techniques, such as thermal-aware process scheduling, synergistically leveraging the advantages of both approaches. We show that while hardware techniques can be used reactively to manage thermal emergencies, proactive use of low-overhead software techniques can rely on application-specific thermal profiles to lower system temperature. Our technique involves a novel regression-based thermal model which provides fast and accurate temperature estimates for run-time thermal characterization of applications running on the system, using hardware performance counters, while considering system-level thermal issues. We evaluate HybDTM on an actual desktop system running a number of SPEC2000 benchmarks, in both uniprocessor and simultaneous multi-threading (SMT) environments, and show that it is able to successfully manage the overall temperature with an average execution time overhead of only 9.9% (16.3% maximum) compared to the case without any DTM, as opposed to 20.4% (29.5% maximum) overhead for purely hardware-based DTM.

Original languageEnglish (US)
Title of host publication2006 43rd ACM/IEEE Design Automation Conference, DAC'06
Pages548-553
Number of pages6
DOIs
StatePublished - Dec 1 2006

Publication series

NameProceedings - Design Automation Conference
ISSN (Print)0738-100X

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
  • Control and Systems Engineering

Keywords

  • Dynamic thermal management
  • Hybrid hardware-software management
  • Thermal model

Fingerprint Dive into the research topics of 'HybDTM: A coordinated hardware-software approach for dynamic thermal management'. Together they form a unique fingerprint.

  • Cite this

    Kumar, A., Shang, L., Peh, L. S., & Jha, N. K. (2006). HybDTM: A coordinated hardware-software approach for dynamic thermal management. In 2006 43rd ACM/IEEE Design Automation Conference, DAC'06 (pp. 548-553). (Proceedings - Design Automation Conference). https://doi.org/10.1145/1146909.1147052