TY - GEN
T1 - QACO
T2 - 2013 ACM International Conference on Cloud and Autonomic Computing, CAC 2013
AU - Kim, Jinhan
AU - Elnikety, Sameh
AU - He, Yuxiong
AU - Hwang, Seung Won
AU - Ren, Shaolei
PY - 2013
Y1 - 2013
N2 - Web servers provide content to users, with the requirement of providing high response quality within a short response time. Meeting these requirements is challenging, especially in the event of load spikes. Meanwhile, we observe that a response to a request can be adapted or partially executed depending on current resource availability at the server. For example, a web server can choose to send a low or medium resolution image instead of sending the original high resolution image under resource contention. In this paper, we exploit partial execution to expose a trade off between resource consumption and service quality. We show how to manage server resources to improve service quality and responsiveness. Specifically, we develop a framework, called Quota-based Control Optimization (QACO). The quota represents the total amount of resources available for all pending requests. QACO consists of two modules: (1) A control module adjusts the quota to meet the response time target. (2) An optimization module exploits partial execution and allocates the quota to pending requests in a manner that improves total response quality. We evaluate the framework using a system implementation in the Apache Web server, and using a simulation study of a Video-on-Demand server. The results show that under a response time target, QACO achieves a higher response quality than traditional techniques that admit or reject requests without exploiting partial execution.
AB - Web servers provide content to users, with the requirement of providing high response quality within a short response time. Meeting these requirements is challenging, especially in the event of load spikes. Meanwhile, we observe that a response to a request can be adapted or partially executed depending on current resource availability at the server. For example, a web server can choose to send a low or medium resolution image instead of sending the original high resolution image under resource contention. In this paper, we exploit partial execution to expose a trade off between resource consumption and service quality. We show how to manage server resources to improve service quality and responsiveness. Specifically, we develop a framework, called Quota-based Control Optimization (QACO). The quota represents the total amount of resources available for all pending requests. QACO consists of two modules: (1) A control module adjusts the quota to meet the response time target. (2) An optimization module exploits partial execution and allocates the quota to pending requests in a manner that improves total response quality. We evaluate the framework using a system implementation in the Apache Web server, and using a simulation study of a Video-on-Demand server. The results show that under a response time target, QACO achieves a higher response quality than traditional techniques that admit or reject requests without exploiting partial execution.
UR - http://www.scopus.com/inward/record.url?scp=84883738488&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883738488&partnerID=8YFLogxK
U2 - 10.1145/2494621.2494636
DO - 10.1145/2494621.2494636
M3 - Conference contribution
AN - SCOPUS:84883738488
SN - 9781450321723
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference, CAC 2013
Y2 - 5 August 2013 through 9 August 2013
ER -