The objective of this project is to develop an efficient and scalable framework for news topic classification that leverages deep learning techniques and Natural Language Processing (NLP). The system aims to enhance classification accuracy using machine learning models such as Logistic Regression, Random Forest, Ridge Classifier, XGBoost, and the transformer-based DistilBERT model. An attention mechanism is incorporated to prioritize the most relevant textual features and improve contextual understanding. A key goal of this project is to reduce data volume by extracting important sentences from news articles, thereby minimizing input size while preserving semantic meaning. This approach results in approximately a 50% reduction in inference time without compromising classification accuracy. The framework is designed to improve scalability, computational efficiency, and performance for real-time news categorization applications
The rapid growth of digital content has made information retrieval and classification more challenging than ever. This paper presents a deep learning framework for news topic classification that integrates Natural Language Processing (NLP) techniques with machine learning models such as Logistic Regression, Random Forest, Ridge Classifier, and XGBoost, combined with an attention mechanism for improved feature selection. To address the issue of large data volumes, we propose a novel data reduction methodology that extracts key sentences from news articles, effectively reducing the data size and leading to a 50% reduction in inference time. Our framework maintains the accuracy of the classification model, achieving results comparable to existing approaches. Experimental evaluations using popular datasets like AG News demonstrate the superior performance of our system in terms of classification accuracy and efficiency, offering a scalable solution for real-time news classification.
Keywords: News Topic Classification, Natural Language Processing, Deep Learning, Logistic Regression, Random Forest, Ridge Classifier, XGBoost, Attention Mechanism, Data Volume Reduction, Inference Time Optimization, AG News Dataset, Model Efficiency.
NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Hardware Requirements
Hard Disk - 160GB
Key Board - Standard Windows Keyboard
Mouse - Two or Three Button Mouse
Monitor - SVGA
RAM - 8GB
Software Requirements:
Operating System : Windows 7/8/10
Server side Script : HTML, CSS, Bootstrap & JS
Programming Language : Python
Libraries : Django, Pandas, Numpy, Tensorflow, Scikit-learn.
IDE/Workbench : VS Code
Technology : Python 3.10
Database : SQLite