Category-Based Sentiment Analysis of Sindhi News Headlines Using Machine Learning Deep Learning and Transformer Models

Project Code :TCMAPY1883

Objective

This project focuses on analyzing Sindhi news headlines to determine their sentiment and categorize them as positive, negative, or neutral. Sindhi, being a low-resource language, presents challenges for automated sentiment analysis. By leveraging machine learning, deep learning, and transformer models, the system can effectively capture linguistic nuances and contextual meanings. The resulting tool provides accurate insights into public opinion, media trends, and societal reactions, enabling researchers, policymakers, and analysts to make informed decisions and understand emerging patterns in news reporting efficiently.

Abstract

The increasing reliance on digital journalism in regional languages has accelerated the demand for automated sentiment analysis tools to evaluate public opinion. This study presents a system for Category-Based Sentiment Analysis of Sindhi News Headlines using a combination of Machine Learning (ML), Deep Learning (DL), and Transformer-based models. The web-based framework enables users to register, upload datasets, and experiment with multiple algorithms for sentiment classification into Positive, Neutral, and Negative categories.

Among ML approaches, Random Forest demonstrated high accuracy and stability, outperforming several traditional classifiers such as Decision Tree and LightGBM. However, the sequential nature of language data proved more suitable for DL models. In particular, the Recurrent Neural Network (RNN) achieved the highest accuracy of 98.87%, establishing it as the most effective model for capturing linguistic dependencies in Sindhi text. The system employs a saved tokenizer with sequence padding, ensuring consistent preprocessing of textual inputs.

Keywords

Sentiment Analysis, Sindhi News Headlines, Random Forest, Recurrent Neural Network (RNN), Machine Learning, Deep Learning, Transformer Models, Natural Language Processing, Text Classification, Public Opinion Mining.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

SOFTWARE REQUIREMENS

 

Operating System                               :  Windows 7/8/10

Server side Script                                :  HTML, CSS, Bootstrap & JS

Programming Language                     :  Python

Libraries                                              : Flask, Pandas, Torch, Keras, Sklearn,Numpy , Seaborn

IDE/Workbench                                  :  VSCODE

Server Deployment                             :  Xampp Server

Database                                             :  MySQL    

 

HARDWARE REQUIREMENTS

 

Processor                                   - I3/Intel Processor

RAM                                       - 8GB (min)

Hard Disk                                - 128 GB

Key Board                               - Standard Windows Keyboard

Mouse                                      - Two or Three Button Mouse

Monitor                                    - Any

Demo Video

mail-banner
call-banner
contact-banner
Request Video