"A Hybrid CatBoost–NGBoost Model for Predicting Asthma Attack Risk and Severity from Environmental Triggers"
Keywords:
Asthma prediction, Environmental triggers,Air quality index,CatBoost ,NGBoostAbstract
Asthma is a chronic respiratory condition that is highly sensitive to environmental changes such as air pollution, humidity, temperature, and allergen levels. Early prediction of asthma attacks using environmental data can play a critical role in preventing life-threatening episodes and improving patient quality of life. This study explores the application of machine learning techniques—CatBoost and NGBoost—to predict the likelihood and severity of asthma attacks triggered by environmental factors. CatBoost, a gradient boosting algorithm optimized for categorical features, is used to classify the risk of an asthma attack based on real-time environmental inputs such as air quality index (AQI), pollen levels, humidity, and temperature. Its ability to handle categorical data and missing values makes it ideal for health-related tabular datasets. In parallel, NGBoost (Natural Gradient Boosting) is employed to model the uncertainty and probabilistic distribution of asthma attack severity, providing not just point estimates but also confidence intervals for better risk assessment. The integration of these models enables a robust and interpretable asthma risk prediction system capable of supporting real-time alerts for at-risk individuals. The results demonstrate high predictive accuracy with CatBoost and meaningful probabilistic forecasts using NGBoost, validating their effectiveness in clinical and environmental health monitoring scenarios. This approach offers a promising step toward AI-assisted asthma management, potentially empowering both patients and healthcare providers with timely, data-driven insights.
Downloads
References
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A.V., & Gulin, A. (2018). CatBoost: unbiased boosting with categorical features. Advances in Neural Information Processing Systems (NeurIPS).
Duan, T., Avati, A., Ding, D., Thai, K., Basu, S., Ng, A. Y., & Schuler, A. (2020). NGBoost: Natural Gradient Boosting for Probabilistic Prediction. International Conference on Machine Learning (ICML).
World Health Organization. (2023). Asthma: Key Factshttps://www.who.int/news-room/fact-sheets /detail/asthma
AQI India Portal. (2024). Air Quality Index Real-Time Data.https://aqicn.org/map/india
CDC. (2022). Asthma and Environmental Triggers.https://www.cdc.gov/asthma/triggers.html
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. (2018).CatBoost: Unbiased Boosting with Categorical Features.Advances in Neural Information Processing Systems (NeurIPS), 31.
https://arxiv.org/abs/1706.09516
Duan, T., Avati, A., Kraft, P., & Ng, A. (2020).NGBoost: Natural Gradient Boosting for Probabilistic Prediction.Proceedings of the 36th International Conference on Machine Learning (ICML).https://arxiv.org/abs/1910.03225
World Health Organization (WHO). (2023).Ambient (outdoor) air pollution.https://www.who.int/news-room/fact-sheets/detail/ambient-(outdoor)-air-quality-and-health
Kumar, P., & Goyal, P. (2017).Forecasting air quality index using artificial intelligence techniques.
In Environmental Research, 150, 386–393.https://doi.org/10.1016/j.envres.2016.06.011
National Asthma Education and Prevention Program (NAEPP). (2022).Expert Panel Report: Guidelines for the Diagnosis and Management of Asthma.U.S. Department of Health and Human Services.https://www.nhlbi.nih.gov/health-topics/guidelines-for-diagnosis-management-of-asthma
UCI Machine Learning Repository – Air Quality Data Set.Environmental data for air quality prediction tasks.https://archive.ics.uci.edu/ml/ datasets/air+quality
Scikit-learn documentation (2023).Machine Learning in Python.https://scikit-learn.org/stable/documentation.html
Chakraborty, T., & Dey, N. (2019).Predictive Analytics in Healthcare Using Machine Learning and Data Mining.Springer Nature.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.