Ad Click Fraud Detection Using Machine Learning and Deep Learning Algorithms

Project Code :TCPGPY1973

Objective

This project focuses on detecting fraudulent ad clicks in mobile advertising, a growing concern causing major financial losses. Using a dataset with features like IP address, app ID, device type, and timestamps, the system predicts whether a click results in an actual app download. A wide range of machine learning models—such as Logistic Regression, Random Forest, SVM, XGBoost, and LightGBM—alongside deep learning models including ANN, CNN, LSTM, and GRU were implemented. A Stacking Classifier further improves performance by combining multiple models. The system is deployed via a Flask web application, enabling users to input click data and receive real-time fraud predictions.

Abstract

With the exponential growth of mobile advertising, fraudulent ad clicks have become a major concern, leading to substantial financial losses for advertisers. This project, Ad Click Fraud Detection Using Machine Learning and Deep Learning Algorithms, aims to develop an intelligent and scalable detection system capable of distinguishing between genuine and fraudulent clicks. The dataset contains crucial features such as IP address, application ID, device type, operating system version, publisher channel ID, click timestamp, and attribution time (if the app was downloaded post-click). The target variable is_attributed indicates whether a click led to an actual app download.

A wide range of models were implemented, including traditional Machine Learning algorithms like Logistic Regression, Decision Tree, Random Forest, K-Nearest Neighbors, Naive Bayes, SVM, Gradient Boosting, LightGBM, and XGBoost. Additionally, Deep Learning architectures such as Artificial Neural Networks (ANN), Convolutional Neural Networks (CNN), Deep Neural Networks (DNN), Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU) were explored. A Stacking Classifier was used to combine model strengths and optimize performance. The backend integrates all models to compare results, while the Flask-based frontend allows users to input click features and receive real-time fraud detection outputs.

 

Keywords:
Ad Click Fraud, Machine Learning, Deep Learning, Flask App, Stacking Classifier, LSTM, XGBoost, Click Prediction, Mobile Advertising, Classification.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

SOFTWARE REQUIREMENS

Operating System                               :  Windows 7/8/10

Server side Script                                :  HTML, CSS, Bootstrap & JS

Programming Language                     :  Python

Libraries                                              Flask, Pandas,Tensorflow, Sklearn, Numpy , Seaborn

IDE/Workbench                                  :  VSCode

Technology                                         :  Python 3.8+

Server Deployment                             :  Xampp Server

Database                                             :  MySQL    

 

HARDWARE REQUIREMENTS

Processor                                  - I3/Intel Processor

RAM                                       - 8GB (min)

Hard Disk                                - 128 GB

Key Board                               - Standard Windows Keyboard

Mouse                                      - Two or Three Button Mouse

Monitor                                    - Any

Demo Video

mail-banner
call-banner
contact-banner
Request Video