The "Vision Voice" project develops a Raspberry Pi-based text-to-audio converter that assists visually impaired individuals by converting written text into audible speech, using optical character recognition (OCR) and speech synthesis to improve access to information.
This project presents a real-time language translation system that combines Optical Character Recognition (OCR) and speech-to-text technology on a Raspberry Pi single-board computer. The system translates both spoken language and text captured from images into multiple languages and delivers the result as audio through earphones. Push-button switches select between two modes of operation. In speech-to-text mode, spoken words are captured by a USB microphone, transcribed, translated into three languages, and played back as speech. In OCR detection mode, an image containing English text is captured with a USB web camera, converted to text, translated, and output in multiple languages through text-to-speech. Connectors and additional push buttons facilitate interaction between components. The translator aims to provide an accessible solution for real-time multilingual communication across text and speech inputs, benefiting users in various language contexts.
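The two-mode flow described above can be sketched in Python as a small dispatcher. This is only an illustrative outline under stated assumptions, not the project's actual code: the names `Pipeline`, `SPEECH_MODE`, `OCR_MODE`, and `run_once` are hypothetical, and the capture, recognition, translation, and playback stages are passed in as plain functions because the abstract does not name the libraries used. The target languages are also left as a parameter, since the abstract does not say which three are supported.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# Hypothetical mode names; the abstract only says push buttons select the mode.
SPEECH_MODE = "speech"
OCR_MODE = "ocr"


@dataclass
class Pipeline:
    """Wires the stages together. Every callable is a stand-in (assumption)
    for the real microphone, camera, OCR, translation, and speech components."""
    listen: Callable[[], str]              # USB microphone -> transcribed text
    read_image_text: Callable[[], str]     # USB camera frame -> OCR text
    translate: Callable[[str, str], str]   # (text, language code) -> translation
    speak: Callable[[str, str], None]      # (text, language code) -> earphone audio
    target_languages: Sequence[str]        # not specified in the abstract

    def run_once(self, mode: str) -> List[Tuple[str, str]]:
        """Handle one button press: acquire text, then translate and speak it
        once per target language. Returns the (language, translation) pairs."""
        if mode == SPEECH_MODE:
            text = self.listen()
        elif mode == OCR_MODE:
            text = self.read_image_text()
        else:
            raise ValueError(f"unknown mode: {mode!r}")

        text = text.strip()
        if not text:
            return []  # nothing was recognized, so there is nothing to translate

        results = []
        for lang in self.target_languages:
            translated = self.translate(text, lang)
            self.speak(translated, lang)
            results.append((lang, translated))
        return results
```

On the device, each stage would be replaced by a real implementation, and a button handler would call `run_once` with the mode tied to the pressed switch. Keeping the stages as injected functions also lets the control flow be exercised on a desktop machine without the camera, microphone, or earphones attached.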
