# Text Summarization with Transformer Models
This repository contains code and resources for text summarization with Transformer-based models, including LLaMA 2, BERT, DistilBERT, Pegasus, T5, and RoBERTa. The project demonstrates fine-tuning these models on the CNN/Daily Mail dataset to generate concise, coherent summaries of news articles.
## Features

- Fine-tuning of various Transformer models, including LLaMA 2, for text summarization tasks.
- Memory-efficient training with LoRA and QLoRA, a quantized variant of LoRA.
- Evaluation metrics to measure summarization performance (see the evaluation sketch after this list).
- Instructions for replicating the experiments with custom datasets.
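As a concrete example of the evaluation step, the snippet below scores generated summaries with ROUGE via the Hugging Face `evaluate` library. This is a minimal sketch, not the repository's actual evaluation script; the predictions and references shown are placeholders.

```python
# Minimal ROUGE evaluation sketch (assumes `pip install evaluate rouge_score`).
# The predictions/references below are placeholders, not outputs of this repo.
import evaluate

rouge = evaluate.load("rouge")

predictions = ["the cat sat on the mat"]       # model-generated summaries
references = ["a cat was sitting on the mat"]  # human-written summaries

scores = rouge.compute(predictions=predictions, references=references)
print(scores)  # e.g. {'rouge1': ..., 'rouge2': ..., 'rougeL': ..., 'rougeLsum': ...}
```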
## Models

- LLaMA 2 7B (Large Language Model Meta AI)
- BERT (Bidirectional Encoder Representations from Transformers)
- DistilBERT (a smaller, faster, cheaper version of BERT)
- Pegasus (Pre-training with Extracted Gap-sentences for Abstractive Summarization)
- T5 (Text-To-Text Transfer Transformer)
- RoBERTa (Robustly Optimized BERT Pretraining Approach)
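For a quick sanity check before fine-tuning, a seq2seq checkpoint can be exercised through the Hugging Face `transformers` summarization pipeline. A minimal sketch, assuming `t5-small` as the checkpoint (an illustrative choice, not necessarily what this repository fine-tunes):

```python
# Quick summarization sanity check with an off-the-shelf checkpoint.
# `t5-small` is illustrative; swap in any seq2seq checkpoint listed above.
from transformers import pipeline

summarizer = pipeline("summarization", model="t5-small")

article = (
    "The quick brown fox jumped over the lazy dog. "
    "Witnesses said the jump set a new record for foxes worldwide."
)
print(summarizer(article, max_length=40, min_length=10, do_sample=False)[0]["summary_text"])
```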
## Dataset

The models are fine-tuned on the CNN/Daily Mail dataset, a widely used benchmark for text summarization that pairs news articles with human-written summaries.

- Articles: long-form news articles.
- Summaries: short, human-written summaries (highlights) corresponding to each article.
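The dataset can be pulled directly from the Hugging Face Hub with the `datasets` library. The snippet below is a sketch of that loading step; `3.0.0` is the commonly used configuration and an assumption here, not a version pinned by this repository.

```python
# Load CNN/Daily Mail from the Hugging Face Hub.
# "3.0.0" is the standard configuration; adjust if the repo pins another version.
from datasets import load_dataset

dataset = load_dataset("cnn_dailymail", "3.0.0")

example = dataset["train"][0]
print(example["article"][:300])  # long-form news article
print(example["highlights"])     # human-written reference summary
```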
## Quantization Techniques

To make training memory-efficient, the following techniques are employed:

- LoRA (Low-Rank Adaptation): freezes the base model and trains small low-rank adapter matrices, drastically reducing the number of trainable parameters.
- QLoRA (Quantized LoRA): loads the frozen base model in 4-bit precision and trains LoRA adapters on top, cutting memory usage while largely preserving performance (see the configuration sketch after this list).
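To make the setup concrete, the sketch below shows how QLoRA is typically wired up with the `transformers`, `bitsandbytes`, and `peft` libraries: the base model is loaded in 4-bit precision and LoRA adapters are attached on top. The checkpoint name and hyperparameters (rank, alpha, target modules) are illustrative assumptions, not values taken from this repository.

```python
# QLoRA sketch: 4-bit base model + LoRA adapters.
# Checkpoint name and hyperparameters are illustrative, not this repo's settings.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize the frozen base weights
    bnb_4bit_quant_type="nf4",              # NormalFloat4, as in the QLoRA paper
    bnb_4bit_use_double_quant=True,         # also quantize the quantization constants
    bnb_4bit_compute_dtype=torch.bfloat16,  # dtype for matmuls during training
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",             # assumed checkpoint; gated, requires access approval
    quantization_config=bnb_config,
    device_map="auto",
)

lora_config = LoraConfig(
    r=16,                                   # adapter rank
    lora_alpha=32,                          # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],    # attention projections to adapt
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()          # only the LoRA adapters are trainable
```

With this setup, only the adapter weights receive gradients, so the 7B base model can be fine-tuned on a single consumer GPU.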