TY - GEN
T1 - HybDTM
T2 - 43rd Annual Design Automation Conference, DAC 2006
AU - Kumar, Amit
AU - Shang, Li
AU - Peh, Li Shiuan
AU - Jha, Niraj K.
PY - 2006
Y1 - 2006
N2 - With ever-increasing power density and cooling costs in modern high-performance systems, dynamic thermal management (DTM) has emerged as an effective technique for guaranteeing thermal safety at run-time. While past works on DTM have focused on different techniques in isolation, they fail to consider a synergistic mechanism using both hardware and software support and hence lead to a significant execution time overhead.In this paper, we propose HybDTM, a methodology for fine-grained, coordinated thermal management using a hybrid of hardware techniques, such as clock gating, and software techniques, such as thermal-aware process scheduling, synergistically leveraging the advantages of both approaches. We show that while hardware techniques can be used reactively to manage thermal emergencies, proactive use of low-overhead software techniques can rely on application-specific thermal profiles to lower system temperature. Our technique involves a novel regression-based thermal model which provides fast and accurate temperature estimates for run-time thermal characterization of applications running on the system, using hardware performance counters, while considering system-level thermal issues. We evaluate HybDTM on an actual desktop system running a number of SPEC2000 benchmarks, in both uniprocessor and simultaneous multi-threading (SMT) environments, and show that it is able to successfully manage the overall temperature with an average execution time overhead of only 9.9% (16.3% maximum) compared to the case without any DTM, as opposed to 20.4% (29.5% maximum) overhead for purely hardware-based DTM.
AB - With ever-increasing power density and cooling costs in modern high-performance systems, dynamic thermal management (DTM) has emerged as an effective technique for guaranteeing thermal safety at run-time. While past works on DTM have focused on different techniques in isolation, they fail to consider a synergistic mechanism using both hardware and software support and hence lead to a significant execution time overhead.In this paper, we propose HybDTM, a methodology for fine-grained, coordinated thermal management using a hybrid of hardware techniques, such as clock gating, and software techniques, such as thermal-aware process scheduling, synergistically leveraging the advantages of both approaches. We show that while hardware techniques can be used reactively to manage thermal emergencies, proactive use of low-overhead software techniques can rely on application-specific thermal profiles to lower system temperature. Our technique involves a novel regression-based thermal model which provides fast and accurate temperature estimates for run-time thermal characterization of applications running on the system, using hardware performance counters, while considering system-level thermal issues. We evaluate HybDTM on an actual desktop system running a number of SPEC2000 benchmarks, in both uniprocessor and simultaneous multi-threading (SMT) environments, and show that it is able to successfully manage the overall temperature with an average execution time overhead of only 9.9% (16.3% maximum) compared to the case without any DTM, as opposed to 20.4% (29.5% maximum) overhead for purely hardware-based DTM.
KW - Dynamic thermal management
KW - Hybrid hardware-software management
KW - Thermal model
UR - http://www.scopus.com/inward/record.url?scp=34547143358&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547143358&partnerID=8YFLogxK
U2 - 10.1145/1146909.1147052
DO - 10.1145/1146909.1147052
M3 - Conference contribution
AN - SCOPUS:34547143358
SN - 1595933816
SN - 1595933816
SN - 9781595933812
T3 - Proceedings - Design Automation Conference
SP - 548
EP - 553
BT - 2006 43rd ACM/IEEE Design Automation Conference, DAC'06
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 July 2006 through 28 July 2006
ER -