Automating Machine Learning Workflows with Cloud-Based Pipelines

Authors

  • Pradeep Etikani, Vijaya Venkata Sri Rama Bhaskar, Savita Nuguri, Rahul Saoji, Krishnateja Shiva

Keywords:

inefficiency, accomplishing, organization, enhancement

Abstract

The paper is aimed at discussing cloud-based pipelines for automating machine learning processes. The paper also discusses how these types of systems overcome fundamental issues, which are associated with ML processes such as, including inefficiency, scalability problems and convolutions of collaborating among other similar systems. Cloud-based pipelines use distributed computation and storage to automate the whole ML pipeline right from data processing to model deploying. The study identifies advantages including more efficient process organization, managing resources as well as better integration of employees. Techniques that have been examined are automated data pipeline creation, large-scale model building and training and methods on service deployment and maintenance. Major findings show that the use of this framework leads to a reduction of the time required for accomplishing ML projects and enhancement in the quality of the models developed, in addition to facilitating effective replication of experiments.

Downloads

Download data is not yet available.

References

Alarcon, M.L., Oruche, R., Pandey, A. and Calyam, P., 2022. Cloud-based data pipeline orchestration platform for COVID-19 evidence-based analytics. In Novel AI and Data Science Advancements for Sustainability in the Era of COVID-19 (pp. 159-180). Academic Press.

Bartezzaghi, A., Giurgiu, I., Marchiori, C., Rigotti, M., Sebastian, R. and Malossi, C., 2022, June. Design of a Cloud-Based Data Platform for Standardized Machine Learning Workflows with Applications to Transport Infrastructure. In 2022 IEEE 21st Mediterranean Electrotechnical Conference (MELECON) (pp. 764-769). IEEE.

Bustamante, A.L., Patricio, M.A., Berlanga, A. and Molina, J.M., 2023. Seamless transition from machine learning on the cloud to industrial edge devices with thinger. io. IEEE Internet of Things Journal, 10(18), pp.16548-16563.

Chowdhury, K., Lamacchia, D., Frenk Feldman, V., Mallik, A., Rahman, I. and Alam, Z., 2020, November. A Cloud–Based Smart Engineering and Predictive Computation System for Pipeline Design and Operation Cost Reduction. In Abu Dhabi International Petroleum Exhibition and Conference (p. D012S116R200). SPE.

Colonnelli, I., Cantalupo, B., Spampinato, C., Pennisi, M. and Aldinucci, M., 2021. Bringing AI pipelines onto cloud-HPC: setting a baseline for accuracy of COVID-19 AI diagnosis. arXiv preprint arXiv:2108.01033.

García, Á.L., De Lucas, J.M., Antonacci, M., Zu Castell, W., David, M., Hardt, M., Iglesias, L.L., Moltó, G., Plociennik, M., Tran, V. and Alic, A.S., 2020. A cloud-based framework for machine learning workloads and applications. IEEE access, 8, pp.18681-18692.

Goh, P.J., Hoe, Z.Y., Low, C.Y., Koh, C.T., Mohammad, U., Lee, K. and Tan, C.F., 2021, November. Conceptual design of cloud-based data pipeline for smart factory. In Symposium on Intelligent Manufacturing and Mechatronics (pp. 29-39). Singapore: Springer Nature Singapore.

Mani, D.R., Maynard, M., Kothadia, R., Krug, K., Christianson, K.E., Heiman, D., Clauser, K.R., Birger, C., Getz, G. and Carr, S.A., 2021. PANOPLY: a cloud-based platform for automated and reproducible proteogenomic data analysis. Nature methods, 18(6), pp.580-582.

Quaranta, L., Calefato, F. and Lanubile, F., 2021. A taxonomy of tools for reproducible machine learning experiments. AIxIA 2021.

Spjuth, O., Frid, J. and Hellander, A., 2021. The machine learning life cycle and the cloud: implications for drug discovery. Expert opinion on drug discovery, 16(9), pp.1071-1079.

Xin, D., Miao, H., Parameswaran, A. and Polyzotis, N., 2021, June. Production machine learning pipelines: Empirical analysis and optimization opportunities. In Proceedings of the 2021 International Conference on Management of Data (pp. 2639-2652).

Xin, D., Wu, E.Y., Lee, D.J.L., Salehi, N. and Parameswaran, A., 2021, May. Whither automl? understanding the role of automation in machine learning workflows. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (pp. 1-16).

Chenchala, P. K., Choppadandi, A., Kaur, J., Nakra, V., & Pandian, P. K. G. (2020). Predictive Maintenance and Resource Optimization in Inventory Identification Tool Using ML. International Journal of Open Publication and Exploration, 8(2), 43-50. https://ijope.com/index.php/home/article/view/127

Predictive Maintenance and Resource Optimization in Inventory Identification Tool Using ML. International Journal of Open Publication and Exploration, 8(2), 43-50. https://ijope.com/index.php/home/article/view/127

Fadnavis, N. S., Patil, G. B., Padyana, U. K., Rai, H. P., & Ogeti, P. (2020). Machine learning applications in climate modeling and weather forecasting. NeuroQuantology, 18(6), 135-145. https://doi.org/10.48047/nq.2020.18.6.NQ20194

Tilala, Mitul, and Abhip Dilip Chawda. "Evaluation of Compliance Requirements for Annual Reports in Pharmaceutical Industries." NeuroQuantology 18, no. 11 (November 2020): 138-145. https://doi.org/10.48047/nq.2020.18.11.NQ20244.

AI-Driven Customer Relationship Management in PK Salon Management System. (2019). International Journal of Open Publication and Exploration, ISSN: 3006-2853, 7(2), 28-35. https://ijope.com/index.php/home/article/view/128

Mitul Tilala, Abhip Dilip Chawda, Abhishek Pandurang Benke, Akshay Agarwal. (2022). Regulatory Intelligence: Leveraging Data Analytics for Regulatory Decision-Making. International Journal of Multidisciplinary Innovation and Research Methodology, ISSN: 2960-2068, 1(1), 78–83. Retrieved from https://ijmirm.com/index.php/ijmirm/article/view/77

Tilala, Mitul, and Abhip Dilip Chawda. "Evaluation of Compliance Requirements for Annual Reports in Pharmaceutical Industries." NeuroQuantology 18, no. 11 (November 2020): 138-145. https://doi.org/10.48047/nq.2020.18.11.NQ20244.

Kamuni, Navin, Suresh Dodda, Venkata Sai Mahesh Vuppalapati, Jyothi Swaroop Arlagadda, and Preetham Vemasani. "Advancements in Reinforcement Learning Techniques for Robotics." Journal of Basic Science and Engineering 19, no. 1 (2022): 101-111. ISSN: 1005-0930.

Narukulla, Narendra, Joel Lopes, Venudhar Rao Hajari, Nitin Prasad, and Hemanth Swamy. "Real-Time Data Processing and Predictive Analytics Using Cloud-Based Machine Learning." Tuijin Jishu/Journal of Propulsion Technology 42, no. 4 (2021): 91-102.

Nitin Prasad. (2022). Security Challenges and Solutions in Cloud-Based Artificial Intelligence and Machine Learning Systems. International Journal on Recent and Innovation Trends in Computing and Communication, 10(12), 286–292. Retrieved from https://www.ijritcc.org/index.php/ijritcc/article/view/10750

Big Data Analytics using Machine Learning Techniques on Cloud Platforms. (2019). International Journal of Business Management and Visuals, ISSN: 3006-2705, 2(2), 54-58. https://ijbmv.com/index.php/home/article/view/76

Shah, J., Prasad, N., Narukulla, N., Hajari, V. R., & Paripati, L. (2019). Big Data Analytics using Machine Learning Techniques on Cloud Platforms. International Journal of Business Management and Visuals, 2(2), 54-58. https://ijbmv.com/index.php/home/article/view/76

Downloads

Published

16.01.2023

How to Cite

Pradeep Etikani. (2023). Automating Machine Learning Workflows with Cloud-Based Pipelines. International Journal of Intelligent Systems and Applications in Engineering, 11(1), 375 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6722

Issue

Section

Research Article