Integration of Vision Transformer Networks in YOLO for Object Detection: Comparative Study on Plant Disease Detection

Project Code :TCMAPY2395

Objective

The objective of this project is to develop an efficient system for plant disease detection using integrated Vision Transformer (ViT) networks in YOLOv26 and RT-DETR for object detection. By leveraging state-of-the-art deep learning algorithms, the project aims to accurately detect and classify a variety of plant diseases in agricultural images, such as Beans Angular LeafSpot, Strawberry Leaf Spot, and Tomato Blight. The primary goal is to create a real-time, automated system that can assist farmers in identifying plant diseases with high precision, enabling faster interventions, improved crop management, and increased agricultural productivity.

Abstract

This paper presents a comparative study on plant disease detection using Vision Transformer Networks (ViT) integrated with YOLOv26 and RT-DETR for object detection, aiming to improve the accuracy and efficiency of plant disease classification. We analyze the performance of these models in detecting a wide variety of plant diseases, including Beans Angular LeafSpot, Strawberry Leaf Spot, and Tomato Blight, among others. The models are evaluated based on their precision, recall, and mAP scores, with particular focus on detecting complex multi-class diseases in agricultural images. The ViT-based integration with YOLOv26 and RT-DETR demonstrates a significant improvement in detection accuracy over traditional object detection models, supported by advanced post-processing techniques like Soft-NMS and Grad-CAM for interpretability. The comparative analysis highlights the strengths and weaknesses of each model, providing insights into their applicability for real-time plant disease detection systems. Furthermore, the study explores the potential for deploying these models in agricultural environments to aid farmers in quick and reliable disease diagnosis.

Keywords: Plant Disease Detection, Vision Transformer, YOLOv26, RT-DETR, Soft-NMS, Grad-CAM, Object Detection, Deep Learning, Agricultural Imaging, Precision Agriculture

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

SOFTWARE REQUIREMENS

Operating System                               :  Windows 7/8/10

Server side Script                               :  html,css,js

Programming Language                     :  Python

Libraries                                             : Flask, Pandas, pytorch                                                                                                           Numpy , Seaborn

IDE/Workbench                                  :  VSCode

Database                                             :  SQLite  

HARDWARE REQUIREMENTS

Processor                                  - I3/Intel Processor

RAM                                       - 8GB (min)

Hard Disk                                - 128 GB

Key Board                               - Standard Windows Keyboard

Mouse                                      - Two or Three Button Mouse

Monitor                                    - Any

Demo Video

mail-banner
call-banner
contact-banner
Request Video