LightGBM: A Leading Force in Breast Cancer Diagnosis Through Machine Learning and Image Processing

Project Code :TMMAAI316

Objective

Utilizing LightGBM, this method improves breast cancer diagnosis with advanced image processing, including color histograms, contours, and HU moments.

Abstract

Leveraging the power of LightGBM, a leading machine learning model, this approach enhances breast cancer diagnosis through advanced image processing techniques. The process begins with image resizing and augmentation via rotation to increase the dataset's diversity. The first step extracts color histogram features by converting images to the HSV color space, calculating and normalizing the histogram of the HUE channel, resulting in a flattened 1D array. Next, contour features are extracted by converting images to grayscale, finding the largest contours, and calculating properties such as the number of contours, maximum area, and maximum perimeter, yielding another 1D array. Subsequently, HU moments are derived by converting images to grayscale, calculating moments, and computing the seven HU moments, forming a flattened array. Additionally, Haralick features are extracted from grayscale images by calculating the Gray-Level Co-occurrence Matrix (GLCM) and deriving contrast, energy, homogeneity, and correlation metrics, creating a final 1D array. These combined extracted features feed into the LightGBM classifier, which outputs classifications across various breast cancer types, including benign (Adenosis, Fibroadenoma, Phyllodes Tumor, Tubular Adenoma) and malignant (Ductal Carcinoma, Lobular Carcinoma, Mucinous Carcinoma, Papillary Carcinoma) categories, culminating in a high-accuracy diagnosis.

Keywords: Breast Cancer Dataset, Features Extraction, Image Processing Techniques, LightGBM Classifier, Machine Learning Algorithm and Accuracy.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

Software: Matlab 2020a or above

Hardware:

Operating Systems:

  • Windows 10
  • Windows 7 Service Pack 1
  • Windows Server 2019
  • Windows Server 2016

Processors:

Minimum: Any Intel or AMD x86-64 processor

Recommended: Any Intel or AMD x86-64 processor with four logical cores and AVX2 instruction set support

Disk:

Minimum: 2.9 GB of HDD space for MATLAB only, 5-8 GB for a typical installation

Recommended: An SSD is recommended A full installation of all MathWorks products may take up to 29 GB of disk space

RAM:

Minimum: 4 GB

Recommended: 8 GB

Learning Outcomes

·   Introduction to Matlab

·   What is EISPACK & LINPACK

·   How to start with MATLAB

·   About Matlab language

·   Matlab coding skills

·   About tools & libraries

·   Application Program Interface in Matlab

·   About Matlab desktop

·   How to use Matlab editor to create M-Files

·   Features of Matlab

·   Basics on Matlab

·   What is an Image/pixel?

·   About image formats

·   Introduction to Image Processing

·   How digital image is formed

·   Importing the image via image acquisition tools

·   Analyzing and manipulation of image.

·   Phases of image processing:

               o  Acquisition

               o  Image enhancement

               o  Image restoration

               o   Color image processing

               o  Image compression

               o   Morphological processing

               o   Segmentation etc.,

·   How to extend our work to another real time applications

·   Project development Skills

               o   Problem analyzing skills

               o   Problem solving skills

               o   Creativity and imaginary skills

               o   Programming skills

               o   Deployment

               o   Testing skills

               o   Debugging skills

               o   Project presentation skills

               o   Thesis writing skills

Demo Video