Pyramid Vision Transformer-Based Multi-Scale Feature Fusion and CNN Decoder for Accurate Colon Polyp Segmentation

Project Code :TCMAPY2377

Objective

This project aims to design a deep learning framework combining Pyramid Vision Transformer (PVT) and CNN Decoder for accurate colon polyp segmentation using the PolypDB dataset. It enhances feature extraction and multi-scale fusion for improved accuracy, precision, and recall in polyp detection. The system seeks to automate the process, reducing clinician workload and improving diagnostic efficiency.

Abstract

Colon polyp detection plays a crucial role in early cancer diagnosis and prevention. This research proposes a novel framework for accurate colon polyp segmentation using a Pyramid Vision Transformer (PVT) and CNN Decoder. The system integrates multi-scale feature fusion with the transformer model to capture fine-grained features at various scales. A CNN decoder is then applied to refine the segmentation output. The framework is trained and evaluated on the PolypDB dataset, which contains colonoscopy images and annotated polyps. The Pyramid Vision Transformer excels in handling diverse image scales, while the CNN Decoder enhances the final segmentation by incorporating high-resolution details. This approach aims to improve segmentation accuracy and provide a reliable solution for automated polyp detection, reducing the workload for medical professionals and increasing the chances of early detection. Experimental results show promising segmentation performance with improved accuracy compared to conventional methods. The proposed system is an essential tool for aiding in the diagnosis of colon polyps, facilitating timely medical interventions.


Keywords: Colon Polyp Detection, Pyramid Vision Transformer, CNN Decoder, Image Segmentation, Multi-Scale Feature Fusion, Colonoscopy Images, Medical Image Analysis, Deep Learning, Dataset, Automated Diagnosis

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

HARDWARE REQUIREMENTS

β€’        Processor                                - I5/Intel Processor

β€’        RAM                                       - 8GB (min)

β€’        Hard Disk                                - 160 GB

β€’        Key Board                               - Standard Windows Keyboard

β€’        Mouse                                      - Two or Three Button Mouse

β€’        Monitor                                    - Any

SOFTWARE REQUIREMENS

β€’        Operating System                   :  Windows 7/8/10

β€’        Server side Script                   :  HTML, CSS, Bootstrap & JS

β€’        Programming Language         :  Python

β€’        Libraries                                 :  Flask, Pandas, Mysql. connector, Os, Numpy, Scikit- learn, sklearn.ensemble, MLPRegressor, SVR                                                     

β€’         IDE/Workbench                     :  VS-Code

β€’        Technology                             :  Python 3.10+

β€’        Server Deployment                 :  Xampp Server

β€’        Database                                 :  MySQL

Demo Video