Optimizing SQL Query Execution Time: A Hybrid Approach Using Machine Learning and Deep Learning Technique

Authors

  • Bethineni Saritha, M. Sadanandam

Keywords:

Sql, Query optimization, machine learning, deep learning.

Abstract

The escalating volume of global data in recent years has posed significant challenges to data management and analysis, particularly regarding query and processing speeds. In response to these challenges, the present research endeavors to advance large-scale data analytics by accelerating query processing and data retrieval by applying machine learning approaches. The proposed innovative machine learning model aims to improve data retrieval speeds and enhance analytical accuracy. By leveraging the estimated execution time as a guiding metric, the research provides a compass for optimizing query performance. This enables informed decision-making to meet performance requirements and ensures efficient resource utilization within real-time database systems. Notably, the hybrid method introduced in this study demonstrates a reduction in processing time and memory usage, signifying a comprehensive approach to enhancing the efficiency of data management and analysis in the face of burgeoning data volumes.

Downloads

Download data is not yet available.

References

Vaidya, K., Dutt, A., Narasayya, V., & Chaudhuri, S. (2021). Leveraging query logs and machine learning for parametric query optimization. Proceedings of the VLDB Endowment, 15(3), 401-413.

Marcus, R., Negi, P., Mao, H., Tatbul, N., Alizadeh, M., & Kraska, T. (2021, June). Bao: Making learned query optimization practical. In Proceedings of the 2021 International Conference on Management of Data (pp. 1275-1288).

Kumar, A., Boehm, M., & Yang, J. (2017, May). Data management in machine learning: Challenges, techniques, and systems. In Proceedings of the 2017 ACM International Conference on Management of Data (pp. 1717-1722).

Vu, T. (2019, June). Deep query optimization. In Proceedings of the 2019 International Conference on Management of Data (pp. 1856-1858).

Azhir, E., Navimipour, N. J., Hosseinzadeh, M., Sharifi, A., & Darwesh, A. (2019). Query optimization mechanisms in the cloud environments: a systematic study. International Journal of Communication Systems, 32(8), e3940.

Park, K., Saur, K., Banda, D., Sen, R., Interlandi, M., &Karanasos, K. (2022, June). End-to-end optimization of machine learning prediction queries. In Proceedings of the 2022 International Conference on Management of Data (pp. 587-601).

Krishnan, S., Yang, Z., Goldberg, K., Hellerstein, J., & Stoica, I. (2018). Learning to optimize join queries with deep reinforcement learning. arXiv preprint arXiv:1808.03196.

Kaoudi, Z., Quiané-Ruiz, J. A., Contreras-Rojas, B., Pardo-Meza, R., Troudi, A., & Chawla, S. (2020, April). ML-based cross-platform query optimization. In 2020 IEEE 36th International Conference on Data Engineering (ICDE) (pp. 1489-1500). IEEE.

Sikdar, S. (2021). Applying Machine Learning to Query Optimization (Doctoral dissertation, Rice University).

Yang, Z., Wang, Z., Huang, Y., Lu, Y., Li, C., & Wang, X. S. (2022). Optimizing machine learning inference queries with correlative proxy models. arXiv preprint arXiv:2201.00309.

Hasan, R., & Gandon, F. (2014, August). A machine learning approach to sparql query performance prediction. In 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) (Vol. 1, pp. 266-273). IEEE.

Doshi, L., Zhuang, V., Jain, G., Marcus, R., Huang, H., Altinbüken, D., ... & Fraser, C. (2023). Kepler: Robust Learning for Parametric Query Optimization. Proceedings of the ACM on Management of Data, 1(1), 1-25.

Ma, Q., & Triantafillou, P. (2019, June). Dbest: Revisiting approximate query processing engines with machine learning models. In Proceedings of the 2019 International Conference on Management of Data (pp. 1553-1570).

Wu, W., Chi, Y., Zhu, S., Tatemura, J., Hacigümüs, H., & Naughton, J. F. (2013, April). Predicting query execution time: Are optimizer cost models really unusable?. In 2013 IEEE 29th International Conference on Data Engineering (ICDE) (pp. 1081-1092). IEEE.

Bzdok, D., Nichols, T. E., & Smith, S. M. (2019). Towards algorithmic analytics for large-scale datasets. Nature Machine Intelligence, 1(7), 296-306.

Hazen, B. T., Boone, C. A., Ezell, J. D., & Jones-Farmer, L. A. (2014). Data quality for data science, predictive analytics, and big data in supply chain management: An introduction to the problem and suggestions for research and applications. International Journal of Production Economics, 154, 72-80.

Li, Q., Chen, Y., Wang, J., Chen, Y., & Chen, H. (2017). Web media and stock markets: A survey and future directions from a big data perspective. IEEE Transactions on Knowledge and Data Engineering, 30(2), 381-399.

Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., ... & Zheng, X. (2016). {TensorFlow}: a system for {Large-Scale} machine learning. In 12th USENIX symposium on operating systems design and implementation (OSDI 16) (pp. 265-283).

Downloads

Published

12.06.2024

How to Cite

Bethineni Saritha. (2024). Optimizing SQL Query Execution Time: A Hybrid Approach Using Machine Learning and Deep Learning Technique. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 110–116. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6179

Issue

Section

Research Article