A Novel Method for News Recommendation on Websites Using the Clustered-Vectors Optimization Algorithm

Project Code :TCPGPY1932

Objective

The objective of this project is to develop an effective news recommendation system for websites that enhances personalization and scalability by combining content-based and collaborative filtering techniques. It aims to address challenges such as the cold-start problem and recommendation diversity by leveraging advanced text embeddings (TF-IDF and BERT) and clustering algorithms. By grouping users and news articles based on interaction vectors, the system optimizes recommendations within similar clusters, reducing noise and improving relevance. The project uses the large-scale MIND dataset to validate the approach, ultimately providing users with timely, accurate, and diverse news tailored to their interests.

Abstract

This project presents a novel approach for news recommendation on websites by leveraging a clustered-vectors optimization algorithm to enhance personalization and scalability. The proposed system integrates two key models: a Content-Based Deep Recommender (CB-DR) that utilizes TF-IDF or BERT embeddings of news articles to recommend content similar to what users have previously engaged with, effectively addressing the cold-start problem by relying directly on article semantics; and a Clustering-Based Hybrid Recommender (Cluster-HybridRec) that groups users and articles based on interaction vectors, applying collaborative filtering within these clusters to reduce noise and improve recommendation diversity. The combined approach balances content-based and collaborative filtering methods to provide accurate, scalable, and diverse news recommendations. Experiments are conducted on the MIND dataset, a large-scale English news recommendation benchmark, demonstrating that the proposed clustered-vectors optimization algorithm significantly enhances recommendation relevance and user satisfaction compared to traditional methods.

Keywords: News Recommendation, Content-Based Recommender, Collaborative Filtering, Clustering, Hybrid Recommender System, TF-IDF, BERT Embeddings, Clustered-Vectors Optimization, Cold-Start Problem, MIND Dataset, Personalization, Scalability, Recommendation Diversity.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

SOFTWARE REQUIREMENS

Operating System                               :  Windows 7/8/10

Server side Script                                :  HTML, CSS, Bootstrap & JS

Programming Language                     :  Python

Libraries                                              Flask, Pandas, Torch, Keras, Sklearn,                                                                                        Numpy , Seaborn

IDE/Workbench                                  :  VSCode

Server Deployment                             :  Xampp Server

Database                                             :  MySQL    

HARDWARE REQUIREMENTS

Processor                                   - I3/Intel Processor

RAM                                       - 8GB (min)

Hard Disk                                - 128 GB

Key Board                               - Standard Windows Keyboard

Mouse                                      - Two or Three Button Mouse

Monitor                                    - Any

Demo Video