Artificial Flora Algorithm Based Feature Selection With Support Vector Machine for Cardiovascular Disease Classification

Project Code :TCMAPY1743

Objective

This project aims to develop a high-accuracy ensemble model for heart disease prediction by combining AdaBoost, Random Forest, and a Stacking Classifier. Particle Swarm Optimization (PSO) is used to select the most relevant features and fine-tune model parameters, enhancing predictive performance. The system is trained on real-world heart disease datasets with proper preprocessing and class balancing to ensure robustness and generalization. Interpretability is improved by analyzing feature importance from AdaBoost and Random Forest. A user-friendly interface is developed for real-time predictions, making the system suitable for deployment in clinical decision support tools or mobile health applications.

Abstract

Cardiovascular diseases (CVDs) remain the leading cause of mortality globally, contributing to approximately 18 million deaths every year, according to the World Health Organization (WHO). Despite advancements in healthcare technologies, the early diagnosis and accurate prediction of heart diseases continue to pose significant challenges due to the multifactorial nature and complexity of the disease. Traditional diagnostic methods such as clinical assessments, invasive procedures, and physician expertise are often limited by resource constraints, especially in underdeveloped regions. With the rise of artificial intelligence and machine learning in healthcare, data-driven models have shown great potential to assist in the early detection of heart disease by analyzing clinical and physiological data. Machine learning models, especially classification algorithms, can uncover complex patterns within medical datasets that traditional statistical methods might miss. This motivates the development of robust and intelligent predictive systems capable of supporting healthcare professionals in clinical decision-making.

In this study, an automated system for cardiovascular disease classification is proposed, leveraging Support Vector Machine (SVM) as the primary classifier. To enhance the feature selection process and improve the prediction accuracy, the Artificial Flora Algorithm (AFA) is utilized. AFA is a powerful optimization technique inspired by the natural propagation behavior of plants, which enables the selection of the most relevant features from patient data, reducing noise and improving model efficiency. Further improvements are achieved through the incorporation of ensemble methods such as the Stacking Classifier, Adaboost, combining Random Forest models tuned with Particle Swarm Optimization (PSO), to enhance robustness and generalization.

Keyword: Cardiovascular Disease Classification, Heart Disease Prediction, Artificial Flora Algorithm, Support Vector Machine, Feature Selection, Ensemble Learning, Random Forest, Particle Swarm Optimization, Stacking Classifier, Clinical Data Analysis.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

Hardware Requirements

Processor                                 - I3/Intel Processor

Hard Disk                                - 160GB

Key Board                              - Standard Windows Keyboard

Mouse                                     - Two or Three Button Mouse

Monitor                                   - SVGA

RAM                                       - 8GB

Software Requirements:

Operating System                   :  Windows 7/8/10

Server side Script                    :  HTML, CSS, Bootstrap & JS

Programming Language         :  Python

Libraries                                  :  Flask/Django, Pandas, Mysql.connector, Os, Smtplib, Numpy

IDE/Workbench                      :  PyCharm

Technology                             :  Python 3.6+

Server Deployment                 :  Xampp Server

Database                                 :  MySQL

Demo Video