PHISHING WEBSITE DETECTION USING MACHINE LEARNING

Project Code :TCMAPY1823

Objective

The primary motive of the Phishing Website Detection Using Machine Learning project is to enhance cybersecurity by identifying and preventing phishing attacks, which are increasingly used to steal sensitive personal and financial information. By leveraging machine learning algorithms, the system aims to automatically analyze website characteristics, such as URL structure, domain features, and suspicious patterns, to distinguish legitimate websites from malicious ones. This proactive approach reduces the risk of identity theft, financial fraud, and data breaches. The project also seeks to provide a scalable, accurate, and real-time detection mechanism that can assist individuals, organizations, and internet users in safe online navigation.

Abstract

The Phishing Website Detection Using Machine Learning project offers a robust and intelligent solution to counter the growing threat of phishing attacks, which continue to compromise online security worldwide. The system focuses on extracting and analyzing multiple URL-based features that help differentiate between legitimate and malicious websites. Key indicators such as domain age, URL length, presence of an IP address, HTTPS usage, redirection patterns, prefix-suffix in domains, URL depth, and the detection of URL shortening services are systematically evaluated. These features are extracted through methods including whois lookups, regular expression matching, and HTML/JavaScript content analysis, enabling the system to capture subtle anomalies often used by phishing sites to deceive users. For example, unusually long URLs, the presence of β€œ@” or β€œ-” symbols, or excessive redirections are flagged as suspicious attributes.

Keywords:

Phishing Detection, Machine Learning, URL Feature Extraction, Random Forest Classifier, Cybersecurity, Web Application Security, Malicious Website Classification.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

Hardware Requirements

Processor                                 - I3/Intel Processor

Hard Disk                                - 160GB

Key Board                               - Standard Windows Keyboard

Mouse                                     - Two or Three Button Mouse

Monitor                                   - SVGA

RAM                                       - 8GB

 

Software Requirements

β€’      Operating System                    :  Windows 7/8/10

β€’      Programming Language         :  Python

β€’      Libraries                                  :  Pandas, Numpy, scikit-learn.

β€’      IDE/Workbench                      :  Visual Studio Code.

Demo Video