The objective of this project is to develop an efficient and accurate predictive model for the early detection of ovarian cancer using clinical and biomarker data. The study aims to compare machine learning (ML) and deep learning (DL) techniques, assessing their effectiveness in classifying cancerous and non-cancerous cases. The project focuses on optimizing data preprocessing, including handling missing values, outlier removal, and normalization, while utilizing feature selection methods like SelectKBest, and autoencoders. By evaluating classifiers such as LightGBM, CatBoost, hybrid models, and deep networks, the project seeks to identify the most reliable model for early diagnosis
Ovarian cancer is one of the most challenging gynecological cancers to diagnose early, which significantly impacts survival rates. This study presents a comparative analysis of machine learning (ML) and deep learning (DL) models for the early prediction of ovarian cancer using clinical and biomarker data. The dataset undergoes comprehensive preprocessing, including handling missing values, outlier removal, normalization, and dimensionality reduction via SelectKBest. Feature selection methods such as Feature Importance, and autoencoder-based techniques are employed to enhance model performance. A variety of classifiers, including LightGBM, CatBoost, and a hybrid model (Random Forest + Gradient Boosting), along with deep learning models like Multi-layer Perceptron (MLP), are evaluated. Ensemble models, including the Voting Classifier, are also implemented. The results demonstrate that the CatBoost model, combined with optimized feature selection techniques, achieved the highest accuracy of 88%, showcasing its potential as a reliable tool for the early prediction of ovarian cancer. This study emphasizes the importance of integrating optimized preprocessing, feature engineering, and appropriate model selection to achieve effective early diagnosis.
Keywords: Ovarian cancer, early prediction, machine learning, deep learning, feature selection, CatBoost, LightGBM, hybrid model, ensemble learning, predictive modeling, clinical data, biomarker data, Recursive Feature Elimination, SelectKBest, Multi-layer Perceptron, Voting Classifier.
NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Hardware Requirements
Hard Disk - 160GB
Key Board - Standard Windows Keyboard
Mouse - Two or Three Button Mouse
Monitor - SVGA
RAM - 8GB
Software Requirements:
Operating System : Windows 7/8/10
Server side Script : HTML, CSS, Bootstrap & JS
Programming Language : Python
Libraries : Django, Pandas, Numpy, Tensorflow, Scikit-learn.
IDE/Workbench : VS Code
Technology : Python 3.10
Database : SQLite