Mortality risk prediction for COVID-19 patients using XGBoost models

Using demographic and lab test data received from the HM Hospitales in Spain, I built an XGBoost binary classifier using binary logistic regression that runs on a simple web app using the streamlit module and predicts the mortality risk of a COVID-19 patient. The user has to pass in the appropriate data as shown in the web app, then click the "Make prediction" button to receive the mortality risk score in a scale of 0-100%.

The "Mortality Risk Model Build" folder contains all of the main files used to construct the final xgboost model chosen that runs on the web app. It conaints scripts for the construction of the training datasets, data preprocessing, data storage, data plotting/visualization, different approaches of xgboost model creation, hyperparameter tuning, validation processes, xgboost model performance visualization, etc.

The "Mortality Risk Web App" folder contains the scripts required to run the web-app. In order to run the web-app, do the following:

Open the "predict_page.py" file and in the load_model() function define the data_path where you've stored the "xgboost_model_225.pkl" file.
Go to your IDE's terminal, change directory to the one that contains the web app files and type "streamlit run web_app.py".

WARNING: The specific model is not up to date with the current COVID-19 data and its results should not be taken seriously. A machine learning model is as good as the data it's trained on.

Mortality risk prediction for COVID-19 patients using XGBoost models

Related tags

Overview

Mortality risk prediction for COVID-19 patients using XGBoost models

Owner

Automated machine learning: Review of the state-of-the-art and opportunities for healthcare

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

A Python toolbox to churn out organic alkalinity calculations with minimal brain engagement.

About Solve CTF offline disconnection problem - based on python3's small crawler

Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification

PyPOTS - A Python Toolbox for Data Mining on Partially-Observed Time Series

Skforecast is a python library that eases using scikit-learn regressors as multi-step forecasters

Machine Learning Algorithms

The Fuzzy Labs guide to the universe of open source MLOps

A Streamlit demo to interactively visualize Uber pickups in New York City

A library to generate synthetic time series data by easy-to-use factors and generator

Case studies with Bayesian methods

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Climin is a Python package for optimization, heavily biased to machine learning scenarios

50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster

Practical Time-Series Analysis, published by Packt

Made in collaboration with Chris George for Art + ML Spring 2019.

Skoot is a lightweight python library of machine learning transformer classes that interact with scikit-learn and pandas.

MegFlow - Efficient ML solutions for long-tailed demands.

Both social media sentiment and stock market data are crucial for stock price prediction