This project develops a web-based application for detecting COVID-19 vaccine hesitancy in South African Twitter data using machine learning. It employs a Random Forest classifier (achieving ~99% accuracy) trained on tweet features such as user metadata, engagement metrics, and content attributes to classify sentiments as negative (hesitancy), neutral, or positive. The Flask-powered platform includes user registration/login, CSV upload for data preview, model performance comparison (Random Forest, Decision Tree, XGBoost, MLP), and an interactive prediction interface. It provides public health insights by analyzing social media sentiment to monitor and address vaccine hesitancy in South Africa.
The COVID-19 vaccination campaign has faced significant challenges due to vaccine hesitancy, and social media platforms, especially Twitter, provide valuable insights into public sentiment. This project aims to detect COVID-19 vaccination hesitancy by analyzing South African Twitter data using machine learning algorithms. The system classifies tweets into three categories: negative sentiment (indicating vaccine hesitancy), neutral sentiment (representing factual updates or questions), and positive sentiment (supporting vaccination). Several machine learning models, including Decision Tree, Random Forest, and XGBoost, are employed for sentiment analysis. Each model is trained on a labeled dataset of tweets, and the system processes uploaded CSV files containing tweet data through a Flask-based web application. The application allows users to upload data and provides real-time sentiment classification with confidence scores. This system offers an effective approach for monitoring and addressing vaccine hesitancy by leveraging social media sentiment. The insights gained can aid health organizations and policymakers in creating targeted campaigns to mitigate vaccine hesitancy and improve public health efforts.
Keywords:
COVID-19, Vaccine Hesitancy, Sentiment Analysis, Machine Learning, Decision Tree, Random Forest, XGBoost, Social Media, Public Health, Public Opinion.
NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Operating System : Windows 7/8/10
Server side Script : HTML, CSS, Bootstrap & JS
Programming Language : Python
Libraries : Flask, Pandas, Torch, Keras, Sklearn, Numpy , Seaborn
IDE/Workbench : VSCODE
Server Deployment : Xampp Server
Database : MySQL
Processor - I3/Intel Processor
RAM - 8GB (min)
Hard Disk - 128 GB
Key Board - Standard Windows Keyboard
Mouse - Two or Three Button Mouse
Monitor - Any