Text summarization is a core Natural Language Processing (NLP) task with applications ranging from information retrieval to content generation, and Large Language Models (LLMs) have shown remarkable promise in improving summarization quality. This repository documents the experiments conducted as part of research under the KaggleX BIPOC Mentorship Program 2023, Cohort-3.
This repository includes experiment files for the CNN/DailyMail and XSum datasets, using three different Large Language Models (LLMs):
- MPT-7b-instruct
- Falcon-7b-instruct
- text-davinci-003
The accompanying technical paper is available at https://arxiv.org/abs/2310.10449
This repository also contains a notebook for summarizing YouTube video comments with OpenAI's text-davinci-003 model. I experimented with different temperature values to assess the quality of the generated summaries. The results are in the "Application" folder, in the "youtube-comment-summarizer.ipynb" file.
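The temperature sweep can be sketched as below. Note that the prompt template, the temperature values, and the `complete` callback are illustrative assumptions for this README, not the notebook's exact code; in the notebook the completion call goes to the OpenAI API with model "text-davinci-003".

```python
# Illustrative sketch of a temperature-sweep summarization experiment.
# The prompt wording and temperature values are assumptions, not the
# exact settings used in youtube-comment-summarizer.ipynb.

def build_prompt(comments, max_words=100):
    """Join raw YouTube comments into a single summarization prompt."""
    joined = "\n".join(f"- {c.strip()}" for c in comments)
    return (
        f"Summarize the following YouTube comments in at most "
        f"{max_words} words:\n{joined}\nSummary:"
    )

# Example values: lower temperatures favor deterministic summaries,
# higher ones favor more varied phrasing.
TEMPERATURES = [0.2, 0.5, 0.8]

def run_experiment(comments, complete):
    """Generate one summary per temperature setting.

    `complete(prompt, temperature=...)` stands in for an API call
    (e.g., an OpenAI completion request with text-davinci-003).
    Returns a dict mapping temperature -> generated summary.
    """
    prompt = build_prompt(comments)
    return {t: complete(prompt, temperature=t) for t in TEMPERATURES}
```

Comparing the returned summaries side by side makes it easy to judge how temperature trades off consistency against diversity.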
I encourage contributions and collaboration from the community. Feel free to clone this repository and experiment with different word-length and temperature settings to generate your own summaries. If you have any questions, suggestions, or insights to share, please don't hesitate to reach out; I value your feedback and would love to hear about your experiences.
I hope this work enhances your learning and research endeavors. Have a great time experimenting and exploring the possibilities.