Predicting Hospital Stay Length Using Explainable Machine Learning

Project Code :TCPGPY2051

Objective

The objective of this study is to develop and evaluate predictive models for hospital stay length using machine learning algorithms, including Logistic Regression, MLP, Random Forest, Gradient Boosting, and XGBoost. Additionally, the study aims to utilize explainability tools to interpret model predictions and identify the key determinants of hospital stay durations.

Abstract

Predicting the length of hospital stay (LOS) is critical for improving healthcare resource management and patient care. This study investigates the application of explainable machine learning techniques to forecast hospital stay duration using a dataset from Kaggle, comprising various patient and hospital-related features. The primary goal is to develop accurate predictive models and elucidate the underlying factors influencing hospital stay lengths. The study employs multiple machine learning algorithms, including Logistic Regression, Multi-Layer Perceptron (MLP), Random Forest, Gradient Boosting, and XGBoost. Each model's performance is evaluated using standard metrics such as accuracy, precision, recall, and F1-score. Additionally, explainability tools such as SHapley Additive exPlanations (SHAP) are utilized to interpret model predictions and identify the most significant predictors of LOS. The findings demonstrate that advanced machine learning models, particularly ensemble methods, achieve superior predictive accuracy. Moreover, the explainability analysis provides valuable insights into the critical factors influencing hospital stays, thereby enabling healthcare practitioners to make informed decisions and optimize hospital resource allocation. This research underscores the potential of integrating explainable machine learning into healthcare analytics to enhance operational efficiency and patient outcomes.

 

Keywords: Hospital Stay Length, Machine Learning, Logistic Regression, MLP, Random Forest, Gradient Boosting, XGBoost, Explainability, SHAP, Healthcare Analytics.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

H/W SPECIFICATIONS:

Β·         Processor                     : I5/Intel Processor

Β·         RAM                           : 8GB (min)

Β·         Hard Disk                    : 128 GB

Β·         Key Board                  : Standard Windows Keyboard

Β·         Mouse                         : Two or Three Button Mouse

Β·         Monitor                       : Any

S/W SPECIFICATIONS:

β€’      Operating System                   : Windows 7+            

β€’      Server-side Script                   : Python 3.6+

β€’      IDE                                         : PyCharm /  VSCode

β€’      Libraries Used                       : Pandas, Numpy, Matplotlib, OS.

Demo Video