ML p(r)ior | WiSeDB: A Learning-based Workload Management Advisor for Cloud Databases

WiSeDB: A Learning-based Workload Management Advisor for Cloud Databases

2016-01-29
1601.08221 | cs.DB
Workload management for cloud databases must deal with the tasks of resource provisioning, query placement and query scheduling in a manner that meets the application's performance goals while minimizing the cost of using cloud resources. Existing solutions have approached these three challenges in isolation, and with only a particular type of performance goal in mind. In this paper, we introduce WiSeDB, a learning-based framework for generating holistic workload management solutions customized to application-defined performance metrics and workload characteristics. Our approach relies on supervised learning to train cost-effective decision tree models for guiding query placement, scheduling, and resource provisioning decisions. Applications can use these models for both batch and online scheduling of incoming workloads. A unique feature of our system is that it can adapt its offline model to stricter/looser performance goals with minimal re-training. This allows us to present alternative workload management strategies that address the typical performance vs. cost trade-off of cloud services. Experimental results show that our approach has very low training overhead while offering low cost strategies for a variety of performance goals and workload characteristics.
PDF

Highlights - Most important sentences from the article

Login to like/save this paper, take notes and configure your recommendations

Related Articles

2018-08-23

Reinforcement learning approaches have long appealed to the data management community due to their a… show more
PDF

Highlights - Most important sentences from the article

2017-11-22
1711.08330 | cs.DB

In this paper we address cardinality estimation problem which is an important subproblem in query op… show more
PDF

Highlights - Most important sentences from the article

2019-05-24

The growing popularity of workflows in the cloud domain promoted the development of sophisticated au… show more
PDF

Highlights - Most important sentences from the article

2019-05-02

Predicting the execution time of queries is an important problem with applications in scheduling, se… show more
PDF

Highlights - Most important sentences from the article

2018-09-03

We describe a new deep learning approach to cardinality estimation. MSCN is a multi-set convolutiona… show more
PDF

Highlights - Most important sentences from the article

2018-08-09

Exhaustive enumeration of all possible join orders is often avoided, and most optimizers leverage he… show more
PDF

Highlights - Most important sentences from the article

2019-04-02

Pre-trained deep learning models are increasingly being used to offer a variety of compute-intensive… show more
PDF

Highlights - Most important sentences from the article

2018-09-01

Cloud-based data analysis is nowadays common practice because of the lower system management overhea… show more
PDF

Highlights - Most important sentences from the article

2019-04-07

Query optimization is one of the most challenging problems in database systems. Despite the progress… show more
PDF

Highlights - Most important sentences from the article

2019-04-23
1904.10125 | cs.DC

Cloud computing has grown to become a popular distributed computing service offered by commercial pr… show more
PDF

Highlights - Most important sentences from the article

2019-04-04

Interactive tools make data analysis more efficient and more accessible to end-users by hiding the u… show more
PDF

Highlights - Most important sentences from the article

2019-01-21
1901.06899 | cs.DC

The NP-hard problem of task scheduling with communication delays (P|prec,c_{ij}|C_{\mathrm{max}}) is… show more
PDF

Highlights - Most important sentences from the article

2019-04-02

Commercial data analytics products such as Microsoft Azure SQL Data Warehouse or Amazon Redshift pro… show more
PDF

Highlights - Most important sentences from the article

2016-11-21

We address a cost optimization problem faced by a user who runs instances of applications in a remot… show more
PDF

Highlights - Most important sentences from the article

2018-10-02

Cassandra is a popular structured storage system with high-performance, scalability and high availab… show more
PDF

Highlights - Most important sentences from the article