Harnessing Mixture of Experts for Abstractive Text Summarization

Project Code :TCMAPY1592

Objective

The objective of this project is to design and develop an advanced text summarization system that leverages a Mixture of Experts (MoE) approach using transformer-based models—PEGASUS, BART, and XLNet—for generating high-quality abstractive summaries. The aim is to intelligently combine the strengths of these models to produce concise, coherent, and contextually accurate summaries from lengthy documents. The system will be trained on well-established datasets like CNN/Daily Mail and movie summaries to ensure it can generalize across various content types.

Abstract

This project aims to develop an advanced text summarization system using a Mixture of Experts (MoE) approach combined with state-of-the-art transformer models. The system is designed to process and summarize long-form textual content such as movie summaries or articles from the CNN/Daily Mail dataset. To achieve high-quality and contextually rich summaries, the project leverages three powerful pre-trained models: PEGASUS, BART, and XLNet. Each of these models brings unique strengths to the summarization task—PEGASUS excels at generating abstractive summaries by pre-training on gap-sentence prediction n, BART is effective due to its bidirectional encoder and autoregressive decoder, and XLNet offers strong generalization by modeling language autoregressively while capturing bidirectional context. These models are integrated within a Mixture of Experts framework, where input data is intelligently routed through the most suitable model or combination of models. This dynamic selection mechanism enhances the model's ability to adapt to different writing styles, tones, and complexities found in diverse text sources. The use of the CNN/Daily Mail or movie summary datasets provides a comprehensive training ground, exposing the system to real-world textual variance and enabling it to learn the structure and semantics required for high-quality summarization. Overall, this project aims to build a robust and efficient summarization system that outperforms single-model baselines by harnessing the collective strengths of multiple transformer experts through MoE. Keywords: PEGASUS, BART, XLNet, CNN/Daily Mail.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

HARDWARE & SOFTWARE REQUIREMENTS

 SOFTWARE REQUIREMENS

Operating System                               :  Windows 7/8/10

Server side Script                                :  HTML, CSS, Bootstrap & JS

Programming Language                     :  Python

Libraries                                              :Flask, Torch, Tensorflow, Pandas, Mysql.connector

IDE/Workbench                                  :  VSCode

Server Deployment                             :  Xampp Server

Database                                             :  MySQL    

 

HARDWARE REQUIREMENTS

Processor                                   - I3/Intel Processor

RAM                                       - 8GB (min)

Hard Disk                                - 128 GB

Key Board                               - Standard Windows Keyboard

Mouse                                      - Two or Three Button Mouse

Monitor                                    - Any



Demo Video