Enhancing Phishing Detection A Machine Learning Approach with Feature Selection and Deep Learning Models

Project Code :TCPGPY1813

Objective

The primary objective of this project is to design and implement a robust phishing detection framework that uses advanced machine learning and deep learning models to distinguish between legitimate and phishing websites. The key goals are preprocess and clean the dataset containing labelled website features and apply feature selection techniques for reducing dimensionality and improving model performance. Develop and compare the effectiveness of various models, including GCN, Tab Transformer, Autoencoder, FNN, and DNN

Abstract

Phishing attacks have evolved as a major cybersecurity threat, exploiting user trust and compromising sensitive information. This study proposes an advanced phishing detection framework combining feature selection techniques with machine learning and deep learning models. Using a labeled dataset with the status field indicating legitimate or phishing websites, we evaluate and compare the performance of various models including Graph Convolutional Network (GCN), TabTransformer, Autoencoder, Feedforward Neural Network (FNN), and Deep Neural Network (DNN). By applying optimal feature selection, we enhance model performance, reduce computational complexity, and improve generalization. The system is implemented using Python and deployed with a Flask web interface styled with HTML and CSS, ensuring user-friendly interaction. Our results demonstrate that the integration of deep learning architectures with feature engineering significantly boosts phishing detection accuracy and robustness. This approach offers a scalable and effective solution to safeguard users against phishing threats in real-world applications.

Keywords: Phishing Detection, Feature Selection, GCN, TabTransformer, Autoencoder, FNN, DNN, Flask, Cybersecurity.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

SOFTWARE REQUIREMENS


Operating System :  Windows 7/8/10

Server side Script :  HTML, CSS, Bootstrap & JS

Programming Language :  Python

Libraries Flask, Pandas, Torch, Keras, Sklearn,Numpy , Seaborn

IDE/Workbench :  VSCode

Server Deployment :  Xampp Server

Database :  MySQL     

HARDWARE REQUIREMENTS


Processor                          - I3/Intel Processor

RAM                                       - 8GB (min)

Hard Disk                                - 128 GB

Key Board                               - Standard Windows Keyboard

Mouse                                      - Two or Three Button Mouse

Monitor                                    - Any


Demo Video