Machine learning-based load distribution and balancing in heterogeneous database management systems

Abdennebi, Anes and Elakaş, Anıl and Taşyaran, Fatih and Öztürk, Erdinç and Kaya, Kamer and Yıldırım, Sinan (2021) Machine learning-based load distribution and balancing in heterogeneous database management systems. Concurrency and Computation: Practice and Experience . ISSN 1532-0626 (Print) 1532-0634 (Online) Published Online First http://dx.doi.org/10.1002/cpe.6641

Warning
There is a more recent version of this item available.
Full text not available from this repository. (Request a copy)

Abstract

For dynamic and continuous data analysis, conventional OLTP systems are slow in performance. Today's cutting-edge high-performance computing hardware, such as GPUs, has been used as accelerators for data analysis tasks, which traditionally leverage CPUs on classical database management systems (DBMS). When CPUs and GPUs are used together, the architectural heterogeneity, that is, leveraging hardware with different performance characteristics jointly, creates complex problems that need careful treatment for performance optimization. Load distribution and balancing are crucial problems for DBMSs working on heterogeneous architectures. In this work, focusing on a hybrid, CPU-GPU database management system to process users' queries, we propose heuristical and machine-learning-based (ML-based) load distribution and balancing models. In more detail, we employ multiple linear regression (MLR), random forest (RF), and Adaboost (Ada) models to dynamically decide the processing unit for each incoming query based on the response time predictions on both CPU and GPU. The ML-based models outperformed the other algorithms, as well as the CPU and GPU-only running modes with up to 27%, 29%, and 40%, respectively, in overall performance (response time) while answering intense real-life working scenarios. Finally, we propose to use a hybrid load-balancing model that would be more efficient than the models we tested in this work.
Item Type: Article
Uncontrolled Keywords: CPUs; data analytics; GPUs; high-performance computing; load balancing; load distribution; machine learning
Subjects: Q Science > Q Science (General)
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.
Faculty of Engineering and Natural Sciences
Depositing User: Sinan Yıldırım
Date Deposited: 07 Jan 2022 15:49
Last Modified: 07 Jan 2022 15:49
URI: https://research.sabanciuniv.edu/id/eprint/42575

Available Versions of this Item

Actions (login required)

View Item
View Item