Conversational Image Recognition Chatbot

Project Code: TCMAPY1915

Objective

The objective of the Conversational Image Recognition Chatbot project is to develop an interactive system in which users hold conversations about images they upload. Using the Gemini AI API, the system analyzes each image and generates contextual text responses, so users can ask questions about the image and receive relevant answers. The backend is built with Django, and the frontend uses HTML, CSS, and JavaScript. By combining image recognition with conversational AI, the system lets users draw insights from visual inputs through dialogue.
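The upload-then-converse flow described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the names `ImageChatSession`, `ModelFn`, and `fake_model` are hypothetical, and the model call is injected as a plain function so the sketch runs without network access. In the real system, that function would wrap the Gemini AI API request.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical signature: takes image bytes and a text prompt, returns the reply text.
ModelFn = Callable[[bytes, str], str]


@dataclass
class ImageChatSession:
    """Holds one uploaded image and the running question/answer history."""

    image: bytes
    model_fn: ModelFn
    history: List[Tuple[str, str]] = field(default_factory=list)

    def build_prompt(self, question: str) -> str:
        # Replay earlier turns so follow-up questions keep their context.
        lines = [f"User: {q}\nAssistant: {a}" for q, a in self.history]
        lines.append(f"User: {question}\nAssistant:")
        return "\n".join(lines)

    def ask(self, question: str) -> str:
        question = question.strip()
        if not question:
            raise ValueError("question must not be empty")
        # The same image is sent with every turn; only the prompt grows.
        answer = self.model_fn(self.image, self.build_prompt(question))
        self.history.append((question, answer))
        return answer


def fake_model(image: bytes, prompt: str) -> str:
    # Stand-in for the real API call (assumption); reports what it was given.
    return f"{len(image)} bytes, {prompt.count('User:')} user turn(s)"


if __name__ == "__main__":
    session = ImageChatSession(image=b"\x89PNG", model_fn=fake_model)
    print(session.ask("What is in this picture?"))
    print(session.ask("What colour is it?"))
```

Injecting the model function keeps the conversation logic testable on its own and leaves the choice of Gemini client and model name to the backend configuration.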

Abstract

The Conversational Image Recognition Chatbot uses the Gemini AI API to enable image-based conversations. Users upload an image and interact with a chatbot that provides insights and answers based on the image content. The backend is built in Python with the Django framework, which gives the application a robust and scalable structure. The Gemini AI API powers both the image recognition and the conversational capabilities, providing an intelligent interface for users to engage with.

Keywords: Conversational AI, Image Recognition, Gemini AI, Django, Chatbot, Image Upload, Text Output, Python, Frontend, Backend, User Interaction, Natural Language Processing.

NOTE: Please do not submit this to the college without our team's consent. This abstract varies based on student requirements.

Block Diagram

Specifications

Hardware Requirements

Processor                                 - Intel Core i3

Hard Disk                                - 160GB

Keyboard                                - Standard Windows Keyboard

Mouse                                     - Two or Three Button Mouse

Monitor                                   - SVGA

RAM                                       - 8GB

 

Software Requirements

Operating System                   :  Windows 7/8/10

Frontend                               :  HTML, CSS, Bootstrap & JavaScript

Programming Language         :  Python

Libraries                                  :  Django, Pandas, NumPy, NLP, and Gemini API

IDE/Workbench                      :  VS Code

Technology                             :  Python 3.10

Database                                 :  SQLite
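As an illustration of how SQLite might back the chatbot, the sketch below stores and reloads conversation turns using Python's standard `sqlite3` module. The source does not say what the database holds, so the `chat_message` table, its columns, and the helper names are all assumptions. In a Django project the same data would more likely be defined as an ORM model; plain `sqlite3` is used here only to keep the example self-contained.

```python
import sqlite3


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    # Hypothetical schema: one row per chat turn, grouped by session.
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS chat_message (
            id         INTEGER PRIMARY KEY AUTOINCREMENT,
            session_id TEXT NOT NULL,
            role       TEXT NOT NULL CHECK (role IN ('user', 'assistant')),
            content    TEXT NOT NULL,
            created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
        )
        """
    )
    return conn


def add_message(conn: sqlite3.Connection, session_id: str, role: str, content: str) -> None:
    # "with conn" commits on success and rolls back if the insert fails.
    with conn:
        conn.execute(
            "INSERT INTO chat_message (session_id, role, content) VALUES (?, ?, ?)",
            (session_id, role, content),
        )


def load_history(conn: sqlite3.Connection, session_id: str) -> list:
    """Return (role, content) pairs for one session in insertion order."""
    cursor = conn.execute(
        "SELECT role, content FROM chat_message WHERE session_id = ? ORDER BY id",
        (session_id,),
    )
    return cursor.fetchall()


if __name__ == "__main__":
    conn = open_store()
    add_message(conn, "demo", "user", "What is in this picture?")
    add_message(conn, "demo", "assistant", "A red bicycle leaning on a wall.")
    print(load_history(conn, "demo"))
```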

Demo Video