The objective of this project is to develop an effective news recommendation system for websites that enhances personalization and scalability by combining content-based and collaborative filtering techniques. It aims to address challenges such as the cold-start problem and recommendation diversity by leveraging advanced text embeddings (TF-IDF and BERT) and clustering algorithms. By grouping users and news articles based on interaction vectors, the system optimizes recommendations within similar clusters, reducing noise and improving relevance. The project uses the large-scale MIND dataset to validate the approach, ultimately providing users with timely, accurate, and diverse news tailored to their interests.
This project presents a novel approach for news recommendation on websites by leveraging a clustered-vectors optimization algorithm to enhance personalization and scalability. The proposed system integrates two key models: a Content-Based Deep Recommender (CB-DR) that utilizes TF-IDF or BERT embeddings of news articles to recommend content similar to what users have previously engaged with, effectively addressing the cold-start problem by relying directly on article semantics; and a Clustering-Based Hybrid Recommender (Cluster-HybridRec) that groups users and articles based on interaction vectors, applying collaborative filtering within these clusters to reduce noise and improve recommendation diversity. The combined approach balances content-based and collaborative filtering methods to provide accurate, scalable, and diverse news recommendations. Experiments are conducted on the MIND dataset, a large-scale English news recommendation benchmark, demonstrating that the proposed clustered-vectors optimization algorithm significantly enhances recommendation relevance and user satisfaction compared to traditional methods.
Keywords: News Recommendation, Content-Based Recommender, Collaborative Filtering, Clustering, Hybrid Recommender System, TF-IDF, BERT Embeddings, Clustered-Vectors Optimization, Cold-Start Problem, MIND Dataset, Personalization, Scalability, Recommendation Diversity.
NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

SOFTWARE REQUIREMENS
Operating System : Windows 7/8/10
Server side Script : HTML, CSS, Bootstrap & JS
Programming Language : Python
Libraries Flask, Pandas, Torch, Keras, Sklearn, Numpy , Seaborn
IDE/Workbench : VSCode
Server Deployment : Xampp Server
Database : MySQL
HARDWARE REQUIREMENTS
Processor - I3/Intel Processor
RAM - 8GB (min)
Hard Disk - 128 GB
Key Board - Standard Windows Keyboard
Mouse - Two or Three Button Mouse
Monitor - Any