In this project, we verify the effectiveness of XGBoost Algorithm in detecting bugs in software, and compared with the other traditional machine learning algorithms like logistic regression, decision trees, random forest and AdaBoost.
Software bug prediction becomes the vital activity during software development and maintenance. Defect prediction at early stages of software development life cycle is a crucial activity of quality assurance process and has been broadly studied in the last two decades.
The early prediction of defective modules in developing software can help the development team to utilize the available resources efficiently and effectively to deliver high quality software product in limited time. Machine learning approach is an effective way to identify the defective modules, which works by extracting the hidden patterns among software attributes.
In this project , several machine learning classification techniques are used to predict the software defects in NASA datasets JM1, CM1, KC2 and PC3. New model was proposed based on tuning the existing XGBoost model by changing its parameter namely n_estimator, learning rate, max depth, and subsample. The results achieved were compared with state-of the art models and our model outperformed them for all datasets.
Keywords: Machine Learning, Dataset, Supervised Learning, Random Forest, XgBoost, Ada Boost, Decision Tree.
NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.
HARDWARE SPECIFICATIONS:
SOFTWARE SPECIFICATIONS: