Enhancing Medicare Fraud Detection Through Machine Learning Addressing Class Imbalance With Smote-Enn

Project Code :TCMAPY1308

Objective

To Develop a classification system for Medicare claims into Fraud and Non-Fraud categories by addressing class imbalance using the Synthetic Minority Over-sampling Technique (SMOTE) combined with Edited Nearest Neighbours (ENN), to enhance the detection accuracy of fraudulent claims within the dataset

Abstract

This project presents a novel approach to enhance Medicare fraud detection by addressing the issue of class imbalance using Synthetic Minority Over-sampling Technique combined with Edited Nearest Neighbors (SMOTE-ENN). Traditional fraud detection methods often struggle with imbalanced datasets, where legitimate claims vastly outnumber fraudulent ones, leading to high false negative rates. Our methodology integrates SMOTE-ENN with machine learning algorithms, including Extreme Gradient Boosting (XGBoost), Adaptive Boosting (Adaboost), Light Gradient Boosting Machine (LGBM), Decision Trees (DT), Logistic Regression (LR), and Random Forest Classifiers. The project involves comprehensive data preprocessing, application of SMOTE-ENN to balance the dataset, and training of the models. Performance is evaluated using metrics such as accuracy, precision, recall, F1-score, and AUC-ROC. Experimental results demonstrate that the proposed approach significantly improves the models' ability to detect fraudulent claims, with Decision Trees achieving the highest performance. This study highlights the importance of addressing class imbalance in healthcare fraud detection and provides a robust framework for enhancing detection accuracy.


Keywords: Medicare fraud detection, Machine learning, Class imbalance, SMOTE-ENN, Synthetic Minority Over-sampling Technique.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

H/W CONFIGURATION:

Processor                         - I3/Intel Processor

Hard Disk                               - 160GB

Key Board                              - Standard Windows Keyboard

Mouse                                     - Two or Three Button Mouse

Monitor                                   - SVGA

RAM                                       - 8GB

 

S/W CONFIGURATION:


β€’      Operating System                   :  Windows 7/8/10

β€’      Server side Script                    :  HTML, CSS, Bootstrap & JS

β€’      Programming Language          :  Python

β€’      Libraries                                  :  Flask, Pandas, Mysql.connector, Os, Scikit-learn, Numpy

β€’      IDE/Workbench                      :  PyCharm

β€’      Technology                             :  Python 3.6+

β€’      Server Deployment                 :  Xampp Server

Demo Video