A Strategy for Queuing Theory-Based Performance and Energy Management in Heterogeneous Data Centers

SUN Jian; LIAO Dan; LI Ke; GONG Yu; SUN Gang

doi:10.3969/j.issn.1001-0548.2018.02.001

With the increasing popularity of Internet services and cloud computing, both energy efficiency and performance guarantees are major concerns for data center operators. However, most of the previous works only address this problem in homogeneous data center environments. This paper investigates how to guarantee performance requirement in terms of waiting time so as to minimize power consumption among multiple types of servers with different processing capabilities in a data center. We address the problem of minimizing the energy consumption of a data center by means of dynamic management policies that switch on/off a certain quantity of servers necessary under given performance requirements in terms of waiting time. We develop a mathematical model using queuing theory to determine the number of servers that should be used i.e., powered on. Experiment results show that our scheme can reduce power consumption compared with the non-group division strategy. Our scheme provides an analytical approach for studying the power-performance tradeoff in the heterogeneous data center.

HTML

对于互联网服务与云计算不断增长的需求使数据中心内运行的服务器数量也由几百台迅速增至几万台，因此数据中心的电量开销巨大。例如，2011年，美国数据中心的能耗就达到了每年1 100万kW/h^[1]，同时其低负载的数据中心能耗是最大能耗的65%左右^[2]。这是因为所有的数据中心都是根据高峰时刻的负载设计的，即一个数据中心的平均负载水平只有最高负载的60%^[3]。

有很多关于数据中心节能的研究^[4-8]，寻找有效的方法在低工作负载时期关闭暂时无用的服务器非常重要^[9-10]。当数据中心的负载增加时，这些服务器可再次开启。然而，这一项操作可能会使一些服务延迟，且对等待时间、业务阻塞率、系统平均业务数量等性能参数产生影响^[9]。当服务器减少时，一些业务可能会遇到不可忽视的延迟，甚至可能堵塞。因此，在限制能耗的同时保持良好的性能有着重要的意义。

许多关于数据中心的研究都建立在所有服务器均有相同处理能力的同构环境假设上^[9-10]。而实际上，数据中心会根据容量的增加以及规格的变化不断升级服务器^[11]。随着时间的推移，最终数据中心将由一系列不同处理能力的异构服务器组成。在异构环境里分配业务时，不同的分配策略会导致功率效率的明显不同。譬如，一些业务在某个特定服务节点上要比在其他节点上运营得要快，这就意味着更少的处理时间以及更低的能耗。因此，在异构环境下选择需要被关闭的服务器又是一大挑战。

基于以上两点建立数据中心模型，其中排队系统为任务单独到达、任务缓存容量有限的M/M/n+ m₁+m₂数学模型。系统中所有的服务器根据其处理能力被分为3组。假设任意发送到数据中心的业务都在合适的服务器节点中接受服务，一旦服务结束，业务离开数据中心。基于实时的负载，根据在队列长度上设置的阈值来开启备用的服务器，根据系统中的总业务数量来关闭备用服务器，从而达到动态调整服务器数量以节约能源的目的。此外，采用优化策略来寻找最佳阈值。

1. 相关工作

近年来大型数据中心中的能源利用率以及性能管理获得了大量的研究关注。这些研究工作可分为两类：动态资源管理和静态资源管理。静态资源管理的实施吸引了不少研究。文献[12]阐述了一种最优化问题得到一个服务器簇中已知工作负载的最佳资源规划策略。然而，由于这种策略通过过度资源供给来满足高负载下的性能要求，该策略将消耗过量的能源。因此，近期更多的工作都围绕根据实时负载开启或关闭服务器的动态情形展开^{[10-11, 13]}。文献[10]也提出了一种动态调整运行中的服务器的方案，以寻找消耗能量与服务质量之间的平衡。文献[13]则通过建立数学模型来权衡数据中心能耗成本最小化与互联网服务利益最大化。

一些关于衡量能源与性能的研究都采取了排队论。文献[11]探讨了单服务器与多服务两种模型，其中建立时间是常值，服务时间和到达时间间隔都服从负指数分布。同时还讨论了时变到达的解决方法；文献[14]讨论了类似的问题，但它提出使用马尔科夫链来预测未来服务需求以判定需要激活的服务器数量。文献[15]提出了一种评估等待时间与功率折中的理论模型。

最领先的研究工作中已经考虑到了异构性，包括异构多核结构的评估。文献[16]探讨了在数据中心内将有限的能量分配给多个异构的服务器从而最大化性能的方法。文献[17]假设数据中心内所有服务器相同，重点研究了能耗与性能的多目标优化。本文将要探索多台异构服务器的数据中心内，性能限定情况下的能耗最优化。

5. 结束语

本文研究了多台异构服务器的数据中心的动态调度机制。首先，使用排队论寻找多台异构服务器的数据中心内功率消耗与平均服务等待时间之间的关系，在保证一定的平均服务等待时间的系统性能前提下，给出了最小化能耗的最优化问题。最后，使用仿真实验证明了本文提出的调度机制可显著改善系统能耗，该方法提供了一种在数据中心级别上权衡功率与性能的有效分析方案。接下来的工作将会使用现实的记录来证明本文的仿真结果，并改进本文的算法来寻找适用于可容纳上百或上千台服务器的大型数据中心的最优解。

Reference (18)

[1]	BROWN R E, BROWN R, MASANET E, et al. Report to congress on server and data center energy efficiency: Public law 109-431[R]. Ernest Orlando Lawrence Berkeley National Laboratory. Berkeley, USA: [s. n. ], 2007.
[2]	GREENBERG A, HAMILTON J, MALTZ D A. The cost of a cloud:Research problems in data center networks[J]. ACM SIGCOMM Computer Communication Review, 2008, 39(1): 68-73. doi: 10.1145/1496091
[3]	BENSON T, AKELLA A, MALTZ D. Network traffic characteristics of data centers in the wild[C]//Internet Measurement Conference (IMC). Melbourne, Australia: [s. n. ], 2010.
[4]	DENG X, WU D, SHEN J. Eco-aware online power management and load scheduling for green cloud datacenters[J]. IEEE Systems Journal, 2016, 10(1): 78-87. doi: 10.1109/JSYST.2014.2344028
[5]	ZHANG Q, SHI W. Energy-efficient workload placement in enterprise datacenters[J]. Computer, 2016, 49(2): 46-52. doi: 10.1109/MC.2016.58
[6]	ROSSI F, XAVIER M, ROSE C. E-eco:Performance-aware energy-efficient cloud data center orchestration[J]. Journal of Network and Computer Applications, 2017, 78(): 83-96. doi: 10.1016/j.jnca.2016.10.024
[7]	XU G, DAI B, HUANG B. Bandwidth-aware energy efficient flow scheduling with SDN in data center networks[J]. Future Generation Computer Systems, 2017, 68(): 163-174. doi: 10.1016/j.future.2016.08.024
[8]	KIM J, CHANG H, JUNG Y. Energy conservation effects of a multi-stage outdoor air enabled cooling system in a data center[J]. Energy and Buildings, 2017, 138(): 257-270. doi: 10.1016/j.enbuild.2016.12.057
[9]	MAZZUCCO M, DYACHUK D, DETERS R. Maximizing cloud providers revenues via energy aware allocation policies[C]//IEEE International Conference on Cloud Computing. Miami, Florida: IEEE, 2010.
[10]	DYACHUK D, MAZZUCCO M. On allocation policies for power and performance[C]//11th ACM/IEEE International Conference on Grid Computing (Grid 2010)-Energy Efficient Grids, Clouds and Clusters Workshop. Brussels, Belgium: IEEE, 2010.
[11]	GANDHI A, GUPTA V, HARCHOL-BALTER M. Optimality analysis of energy-performance trade-off for server farm management[J]. Performance Evaluation, 2010, 67(11): 1155-71. doi: 10.1016/j.peva.2010.08.009
[12]	ZHENG X, CAI Y. Optimal server provisioning and frequency adjustment in server clusters[C]//IEEE 39th International Conference on Parallel Processing Workshops. [S. l. ]: IEEE, 2010.
[13]	GHAMKHARI M, MOHSENIAN-RAD H. Energy and performance management of green data centers:a profit maximization approach[J]. IEEE Transactions on Smart Grid, 2013, 4(2): 1017-1025. doi: 10.1109/TSG.2013.2237929
[14]	GUENTER B, JAIN N, WILLIAMS C. Managing cost, performance and reliability tradeoffs for energy-aware serverprovisioning[C]//IEEE International Conference on Computer Communications. [S. l. ]: IEEE, 2011.
[15]	SCHWARTZ C, PRIES R, TRAN-GIA P. A queuing analysis of an energy-saving mechanism in data centers[C]//IEEE International Conference on Information Networking (ICOIN). [S. l. ]: IEEE, 2012.
[16]	WANG W, LUO J, SONG A. Heterogeneity aware optimal power allocationin data center environments[J]. Pervasive Computing and the Networked World, 2013, 7719(): 513-528.
[17]	LI K. Optimal power allocation among multiple heterogeneous servers in a data center[J]. Sustainable Computing:Informatics and Systems, 2012, 2(1): 13-22. doi: 10.1016/j.suscom.2011.11.002
[18]	INTEL CORP. Intel math kernel library 10. 0-linpack[EB/OL]. [2016-05-10]. http://www.intel.com/cd/software/products/asmo-na/eng/266857.htm,2007.

N	α/s	P_busy¹/W	P_idle¹/ W	P_busy²/W	P_idle²/ W	P_busy³/W	P_idle³/ W
150	250	210	120	240	150	270	180

A Strategy for Queuing Theory-Based Performance and Energy Management in Heterogeneous Data Centers

doi: 10.3969/j.issn.1001-0548.2018.02.001

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Related

Proportional views