Video summarization using deep learning

Project Code :TCMAPY1548

Objective

This project presents a deep learning-based system for automatic video summarization. It extracts audio, transcribes speech using Wav2Vec2, summarizes text with BERT, and translates outputs into Indian languages.Users can upload videos or provide YouTube links to quickly access condensed summaries.The system improves video content accessibility, saves time, and bridges language barriers efficiently.

Abstract

In today’s digital age, video content is rapidly increasing across platforms, making it challenging to consume and analyze lengthy videos efficiently. This project presents a Video Summarisation system using Deep Learning techniques to address this challenge. The system allows users to either upload a video or provide a URL link. For uploaded videos, the audio is extracted and transcribed using the Wav2Vec2ForCTC model combined with Wav2Vec2Processor, enabling accurate speech-to-text conversion. For YouTube links, the YouTube Transcript API is utilized to directly fetch the textual transcript. The resulting transcription undergoes summarization using a BERT-based algorithm, providing a condensed, meaningful summary. Furthermore, the summarized content is translated into Kannada and other Indian languages using Google Translate API, making it accessible to a wider audience. The system is developed with a front-end based on HTML, CSS, and JavaScript, and a Python-powered backend. This tool offers an efficient, multilingual, and user-friendly solution for fast video content consumption.  

Keywords: Video Summarization, Deep Learning, Wav2Vec2ForCTC, BERT, YouTube Transcript API, Speech Recognition, Natural Language Processing, Summarization, Multilingual Translation, Google Translator.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

 H/W CONFIGURATION

 Processor                                 - I3/Intel Processor 

Hard Disk                                - 160GB 

  Key Board                              - Standard Windows Keyboard 

  Mouse                                     - Two or Three Button Mouse

Monitor                                   - SVGA 

  RAM                                       - 8GB 

  S/W CONFIGURATION: 

  β€’      Operating System                   :  Windows 7/8/10 

  β€’      Server side Script                    :  HTML, CSS, Bootstrap & JS

β€’      Programming Language         :  Python 

  β€’      Libraries                                  :  Flask, Pandas, MySQL. Connector, Tensor flow, Keras

β€’      IDE/Workbench                      :  VS Code 

  β€’      Technology                             :  Python 3.8+ 

  β€’      Server Deployment                 :  Xampp Server

Demo Video

mail-banner
call-banner
contact-banner
Request Video