Text-Summarization

Finetuning Flan-T5 for Text Summarization

The dataset used for finetuning is CNN DailyMail dataset. Dataset Link : https://www.kaggle.com/datasets/gowrishankarp/newspaper-text-summarization-cnn-dailymail

Install the following packages from requirements.txt ( run the following command in the terminal )

pip install -r requirements.txt

Overview

In this Project, we used PyTorch, the Transformers library, and Hugging Face's FlanT5 model for fine-tuning on a text summarization task. The training and evaluation loops were written in PyTorch, optimizing for GPU acceleration during the fine-tuning process. The primary focus of the project was text summarization, a challenging natural language processing task. The FlanT5 model, part of the Transformers library, was fine-tuned to generate concise and coherent summaries from given text inputs. The meric used for evaluation process is rouge score.

Dependencies

PyTorch
Transformers Library
Hugging Face FlauBERT Model
GPU for accelerated fine-tuning

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
LICENSE		LICENSE
README.md		README.md
config.py		config.py
inference.py		inference.py
main.py		main.py
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
trainer.py		trainer.py
visualisation.py		visualisation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text-Summarization

About

Releases

Packages

Languages

License

UmerrAhsan/Text-Summarization

Folders and files

Latest commit

History

Repository files navigation

Text-Summarization

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages