Text Summarization

Project Code :TCPGPY1882

Objective

This project uses deep learning models—LSTM, Llama, and BART—on the CNN/DailyMail dataset. The goal is to generate concise, readable summaries that maintain key information and improve accessibility across different domains.

Abstract

Text summarization is an essential task in natural language processing that condenses large volumes of text into concise summaries, helping users grasp critical information efficiently. This project aims to leverage deep learning models—LSTM, Llama, and BART—on the CNN/DailyMail dataset to generate high-quality summaries that capture key elements from news articles. By combining these models, we explore both extractive and abstractive summarization methods, optimizing them to produce coherent, human-like summaries. The LSTM model enables sequential understanding of text, while Llama and BART bring transformer-based approaches for handling complex language structures. This ensemble approach seeks to balance summarization accuracy with semantic preservation, ensuring readability and information retention. The project outcomes are expected to improve information accessibility in various applications, from news aggregation to academic and industry research.

 

Keywords: Text Summarization, LSTM, BART, Llama, NLP, Abstractive Summarization, Extractive Summarization, Deep Learning, CNN/DailyMail, Information Retrieval

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

H/W CONFIGURATION:

Processor                                 - I3/Intel Processor

Hard Disk                                - 160GB

Key Board                              - Standard Windows Keyboard

Mouse                                     - Two or Three Button Mouse

Monitor                                   - SVGA

RAM                                       - 8GB

S/W CONFIGURATION:

•      Operating System                   :  Windows 7/8/10

•      Server side Script                    :  HTML, CSS, Bootstrap & JS

•      Programming Language         :  Python

•      Libraries                                  :  Flask, Pandas, MySQL. Connector, Tensor flow, Keras

•      IDE/Workbench                      :  VS Code

•      Technology                             :  Python 3.8+

•      Server Deployment                 :  Xampp Server

Demo Video