Toward a Secure Future An Integrated Framework for PII Detection Using NLP and Visual Analysis

Project Code :TCMAPY2402

Objective

This project aims to develop a deep learning framework for detecting Personally Identifiable Information (PII) from text and images using models like GRU, LSTM, T5Encoder, and multi-kernel CNN. It includes thorough preprocessing, dataset preparation, and training on diverse PII types to ensure accurate detection. A user-friendly interface and a scalable, robust architecture allow easy access and high-performance identification across multiple data formats.

Abstract

This project presents an integrated framework for detecting Personally Identifiable Information (PII) using advanced techniques in Natural Language Processing (NLP) and visual analysis. The primary goal is to develop a system capable of identifying PII from both text and visual data, ensuring privacy and security. The system leverages a combination of deep learning models, including Gated Recurrent Units (GRU), Long Short-Term Memory (LSTM), Coordinate Attention, Self-Attention, T5Encoder, and Multi-kernel Convolutional Neural Networks (CNN). These models work together to accurately classify and identify sensitive data across various forms of input. The dataset used for this research includes diverse PII examples to train the model effectively. The framework is designed to handle both textual and visual inputs, making it versatile and robust. The system's architecture includes key components such as user registration, prediction, and classification modules, making it user-friendly. The performance of the system is evaluated using standard metrics like accuracy, precision, recall, and F1 score. This project aims to contribute to enhancing data privacy and security practices by enabling automated detection of sensitive information.

Keywords: PII detection, Natural Language Processing, visual analysis, deep learning, GRU, LSTM, Coordinate Attention, Self-Attention, T5Encoder, Multi-kernel CNN.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

HARDWARE REQUIREMENTS

β€’        Processor                                - I5/Intel Processor

β€’        RAM                                       - 8GB (min)

β€’        Hard Disk                                - 160 GB

β€’        Key Board                               - Standard Windows Keyboard

β€’        Mouse                                      - Two or Three Button Mouse

β€’        Monitor                                    - Any

SOFTWARE REQUIREMENS

β€’        Operating System                   :  Windows 7/8/10

β€’        Server side Script                   :  HTML, CSS, Bootstrap & JS

β€’        Programming Language         :  Python

β€’        Libraries                                 :  Flask, Pandas, Mysql. connector, Os, Numpy, Scikit- learn, sklearn, Preprocessor, feature_extraction.text, tensor flow, keras                                                    

β€’         IDE/Workbench                     :  VS-Code

β€’        Technology                             :  Python 3.10+

β€’        Server Deployment                 :  Xampp Server

β€’        Database                                 :  MySQL

Demo Video

mail-banner
call-banner
contact-banner
Request Video