Skip to content

Utilizes parallel deep learning techniques to enhance image captioning capabilities using the PyTorch framework and the COCO dataset

License

Notifications You must be signed in to change notification settings

deveshcode/parallelized-image-captioner-pytorch

Repository files navigation

Parallel Deep Learning for Image Captioning Using PyTorch

Screenshot 2024-04-16 at 10 24 02 AM

Introduction

This project utilizes parallel deep learning techniques to enhance image captioning capabilities using the PyTorch framework and the COCO dataset. By leveraging the power of parallel computing, we aim to improve processing speed and efficiency, enabling more sophisticated image understanding in real-time applications.

Table of Contents

Features

  • Parallel data loading with PyTorch DataLoader.
  • Advanced preprocessing techniques for handling large image datasets.
  • Implementation of a deep learning model using multi-GPU training.
  • Utilization of mixed precision training to optimize memory usage and computational speed.

Requirements

Python 3.8+ PyTorch 1.7+ CUDA Toolkit 11.0+

Installation

Clone the repository and install the required packages:

git clone https://github.com/yourusername/yourprojectname.git
cd yourprojectname
pip install -r requirements.txt

DataLoader

  • Parallel data loading with PyTorch DataLoader.
Screenshot 2024-04-16 at 10 30 04 AM

Preprocessing

  • Advanced preprocessing techniques for handling large image datasets.
Screenshot 2024-04-16 at 10 30 17 AM

Model

  • Implementation of a deep learning model using multi-GPU training.
Screenshot 2024-04-16 at 10 30 30 AM

DataParallelism

  • Utilization of mixed precision training to optimize memory usage and computational speed.
Screenshot 2024-04-16 at 10 30 45 AM

Contributions

If you would like to contribute to this project, you can follow these steps:

  1. Fork the repository on GitHub.
  2. Create a new branch with a descriptive name for your contribution.
  3. Make your changes and commit them to your branch.
  4. Push your branch to your forked repository.
  5. Open a pull request on the original repository, describing your changes and why they should be merged.

We appreciate any contributions to this project and will review and merge them if they align with the project's goals and guidelines.

Acknowledgments

We would like to acknowledge the contributions of the following individuals and organizations to this project:

  • Prof. Handan Liu for her guidance and insights.
  • Abhishek Shankar for his contributions to the project.

License

This project is licensed under the MIT License.

About

Utilizes parallel deep learning techniques to enhance image captioning capabilities using the PyTorch framework and the COCO dataset

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published