Detecting the Presence of COVID-19 Vaccination Hesitancy From South African Twitter Data Using Machine Learning

Project Code :TCMAPY2073

Objective

This project develops a web-based application for detecting COVID-19 vaccine hesitancy in South African Twitter data using machine learning. It employs a Random Forest classifier (achieving ~99% accuracy) trained on tweet features such as user metadata, engagement metrics, and content attributes to classify sentiments as negative (hesitancy), neutral, or positive. The Flask-powered platform includes user registration/login, CSV upload for data preview, model performance comparison (Random Forest, Decision Tree, XGBoost, MLP), and an interactive prediction interface. It provides public health insights by analyzing social media sentiment to monitor and address vaccine hesitancy in South Africa.

Abstract

The COVID-19 vaccination campaign has faced significant challenges due to vaccine hesitancy, and social media platforms, especially Twitter, provide valuable insights into public sentiment. This project aims to detect COVID-19 vaccination hesitancy by analyzing South African Twitter data using machine learning algorithms. The system classifies tweets into three categories: negative sentiment (indicating vaccine hesitancy), neutral sentiment (representing factual updates or questions), and positive sentiment (supporting vaccination). Several machine learning models, including Decision Tree, Random Forest, and XGBoost, are employed for sentiment analysis. Each model is trained on a labeled dataset of tweets, and the system processes uploaded CSV files containing tweet data through a Flask-based web application. The application allows users to upload data and provides real-time sentiment classification with confidence scores. This system offers an effective approach for monitoring and addressing vaccine hesitancy by leveraging social media sentiment. The insights gained can aid health organizations and policymakers in creating targeted campaigns to mitigate vaccine hesitancy and improve public health efforts.

Keywords:

COVID-19, Vaccine Hesitancy, Sentiment Analysis, Machine Learning, Decision Tree, Random Forest, XGBoost, Social Media, Public Health, Public Opinion.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

SOFTWARE REQUIREMENS

 

Operating System                               :  Windows 7/8/10

Server side Script                                :  HTML, CSS, Bootstrap & JS

Programming Language                     :  Python

Libraries                                              : Flask, Pandas, Torch, Keras, Sklearn,                                                                                     Numpy , Seaborn

IDE/Workbench                                  :  VSCODE

Server Deployment                             :  Xampp Server

Database                                             :  MySQL    

 

HARDWARE REQUIREMENTS

 

Processor                                   - I3/Intel Processor

RAM                                       - 8GB (min)

Hard Disk                                - 128 GB

Key Board                               - Standard Windows Keyboard

Mouse                                      - Two or Three Button Mouse

Monitor                                    - Any

Demo Video

mail-banner
call-banner
contact-banner
Request Video