The project aims to detect fraudulent payment transactions using machine learning techniques. The dataset used is the Banksim1 dataset from Kaggle, containing simulated banking transactions. Current models such as GMB, random forests, decision trees, and logistic regression will serve as benchmarks. The proposed models for this project include HistGradientBoosting, CatBoost, KNN, LGBM, and logistic regression. These algorithms will be evaluated based on their accuracy and efficiency in identifying fraudulent transactions. The goal is to determine which model provides the most reliable and scalable solution for fraud detection.
Fraud detection in financial transactions is crucial for securing payment systems and preventing illegal activities. This project focuses on utilizing machine learning algorithms to develop a fraud detection system for payment transactions. The dataset used is the "Banksim1" dataset from Kaggle, which simulates banking transactions with both legitimate and fraudulent activities. To detect fraudulent transactions, several machine learning algorithms were implemented and evaluated, including K-Nearest Neighbors (KNN), Logistic Regression, CatBoost, HistGradientBoosting, and LightGBM. The models were trained on the dataset and tested for accuracy in identifying fraudulent transactions based on features such as transaction amount, customer ID, and transaction type. The system was built using Python with Flask for the backend and HTML, CSS, and JavaScript for the frontend, allowing users to interact with the fraud detection model. The results show that the machine learning-based approach significantly outperforms traditional rule-based systems, offering high accuracy and efficiency in fraud detection. The CatBoost model demonstrated the best performance in terms of accuracy and speed, while the HistGradientBoosting model showed a strong balance between precision and recall. The project concludes that machine learning models, when applied to payment transaction data, can enhance the detection and prevention of fraudulent activities in payment systems. Future work could involve improving model performance and implementing real-time transaction monitoring.
Keywords:
Fraud detection, machine learning, KNN, Logistic Regression, CatBoost,
HistGradientBoosting, LightGBM, payment transactions, dataset, Flask.
NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

H/W CONFIGURATION:
To ensure optimal performance and scalability of the fraud detection system, the following hardware configuration is recommended:
1. Server:
o Processor: Multi-core processor (Intel Xeon or AMD Ryzen 7 and above) for efficient parallel processing and model inference.
o RAM: Minimum of 16 GB of RAM, with 32 GB recommended for handling large datasets and simultaneous user requests.
o Storage: SSD storage with at least 500 GB for faster data access and efficient model saving/loading.
o GPU (optional): For deep learning or high-performance models (if used), a GPU like NVIDIA Tesla or GTX series would significantly accelerate training and inference times.
2. Database Server:
o Processor: Multi-core processor for handling high transaction loads.
o RAM: Minimum 16 GB for smooth database operations.
o Storage: 500 GB+ SSD for faster database querying and efficient transaction record storage.
3. Network:
o Bandwidth: High-speed internet connection (at least 1 Gbps) to handle large data uploads and multiple simultaneous user interactions without delays.
o Firewall/Security: Hardware firewall and load balancing for ensuring security and high availability.
4. Backup and Redundancy:
o Backup Server: Separate backup systems with redundant storage for data protection and system resilience.
S/W CONFIGURATION:
To ensure smooth operation and efficient performance of the fraud detection system, the following software configuration is recommended: