GitHub - Delight-In/Text-Summarizer-Project

Text Summarizer Project

The Text Summarizer project is focused on building and training a machine learning model to automatically generate concise summaries of text. The key components of the project include:

Model Selection: The project uses a pre-trained sequence-to-sequence model, such as PEGASUS, which is fine-tuned for text summarization tasks. The model is adapted to work with specific datasets, such as the Samsum dataset, which is commonly used for summarizing dialogues.
Dataset Handling: The project involves loading datasets (potentially from disk) to be used for training the model. These datasets include text and corresponding summaries, enabling the model to learn how to generate summaries from input text.
Training Pipeline: The system leverages the Hugging Face Trainer class, which simplifies the process of training models. The training configuration (such as learning rate, batch size, warmup steps, etc.) is customizable through a configuration class, ModelTrainerConfig.
Evaluation and Optimization: The project supports the evaluation of the model at regular intervals during training, optimizing the model's performance based on specific evaluation metrics.
Saving the Model: After training, the fine-tuned model and tokenizer are saved for future use, enabling the model to generate summaries on new, unseen text.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
config		config
src/TextSummarizer		src/TextSummarizer
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
main.py		main.py
params.yaml		params.yaml
requirements.txt		requirements.txt
setup.py		setup.py
template.py		template.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Text Summarizer Project

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Delight-In/Text-Summarizer-Project

Folders and files

Latest commit

History

Repository files navigation

Text Summarizer Project

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages