Resource management for power-constrained HEVC transcoding using reinforcement learning



Downloads per month over past year

Costero Valero, Luis María and Iranfar, Arman and Zapater, Marina and Atienza, David and Olcoz Herrero, Katzalin (2020) Resource management for power-constrained HEVC transcoding using reinforcement learning. IEEE transactions on parallel and distributed systems, 31 (12). pp. 2834-2850. ISSN 1045-9219

[thumbnail of olcoz25 postprint.pdf]

Official URL:


The advent of online video streaming applications and services along with the users' demand for high-quality contents require High Efficiency Video Coding (HEVC), which provides higher video quality and more compression at the cost of increased complexity. On one hand, HEVC exposes a set of dynamically tunable parameters to provide trade-offs among Quality-of-Service (QoS), performance, and power consumption of multi-core servers on the video providers' data center. On the other hand, resource management of modern multi-core servers is in charge of adapting system-level parameters, such as operating frequency and multithreading, to deal with concurrent applications and their requirements. Therefore, efficient multi-user HEVC streaming necessitates joint adaptation of application- and system-level parameters. Nonetheless, dealing with such a large and dynamic design space is challenging and difficult to address through conventional resource management strategies. Thus, in this work, we develop a multi-agent Reinforcement Learning framework to jointly adjust application- and system-level parameters at runtime to satisfy the QoS of multi-user HEVC streaming in power-constrained servers. In particular, the design space, composed of all design parameters, is split into smaller independent sub-spaces. Each design sub-space is assigned to a particular agent so that it can explore it faster, yet accurately. The benefits of our approach are revealed in terms of adaptability and quality (with up to to 4x improvements in terms of QoS when compared to a static resource management scheme), and learning time (6 x faster than an equivalent mono-agent implementation). Finally, we show that the power-capping techniques formulated outperform the hardware-based power capping with respect to quality.

Item Type:Article
Additional Information:

©2020 IEEE Computer Society
This work was supported by the EU (FEDER) and Spanish MINECO (RTI2018-093684-B-I00), MECD (FPU15/02050), CM(S2018/TCS-4423), and UCM (PR65/19-22445), the ERC Consolidator Grant COMPUSAPIEN (GA No. 725657), the H2020 RECIPE project (GA No. 801137), and the H2020 DeepHealth project (GA No. 825111)

Uncontrolled Keywords:Resource management; DVFS; Power capping; Reinforcement learning; Q-learning; HEVC; Self-adaptation
Subjects:Sciences > Computer science > Artificial intelligence
ID Code:62156
Deposited On:28 Sep 2020 17:43
Last Modified:04 Apr 2022 14:58

Origin of downloads

Repository Staff Only: item control page