This project is part of the semester-long, research-based Mini Project under Prof. Mr. Vikas Kumar Singh. It returns a textual description or annotation for an input image.

ArnabKumarRoy02/Image-Caption-Generator

Image-Caption-Generator

This repository contains code and resources for an image caption generator built with deep learning models. The project aims to develop accurate and efficient models that describe an image in words.

Introduction

The Image Caption Generator is a deep-learning-based project that automatically generates captions for images by combining computer vision techniques with natural language processing models.
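To make the idea of generating a caption concrete, here is a minimal sketch of greedy decoding, one common way to turn a trained captioning model into text. It is not taken from this repository: the `startseq`/`endseq` markers, the `next_word_scores` callback, and the `toy_model` stand-in are all hypothetical names used only for illustration.

```python
from typing import Callable, Dict, List

# Hypothetical sentinel tokens marking the start and end of a caption.
START, END = "startseq", "endseq"


def greedy_caption(
    next_word_scores: Callable[[object, List[str]], Dict[str, float]],
    image_features: object,
    max_len: int = 20,
) -> str:
    """Build a caption word by word, always taking the top-scoring next word."""
    words = [START]
    for _ in range(max_len):
        scores = next_word_scores(image_features, words)
        best = max(scores, key=scores.get)
        if best == END:
            break
        words.append(best)
    # Drop the start marker before returning the caption text.
    return " ".join(words[1:])


def toy_model(image_features: object, words: List[str]) -> Dict[str, float]:
    """Stand-in for a trained model: ignores the image and follows a fixed script."""
    script = ["a", "dog", "runs", END]
    target = script[min(len(words) - 1, len(script) - 1)]
    return {w: (1.0 if w == target else 0.0) for w in script}


caption = greedy_caption(toy_model, image_features=None)
print(caption)
```

In a real system the callback would wrap an image encoder and a language decoder; the loop itself would stay the same.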

Features

  • Automatic image captioning using deep learning models.
  • Pre-trained models for caption generation.
  • Support for various image formats.

Installation

  1. Clone the repository to your local machine:
git clone https://github.com/ArnabKumarRoy02/Image-Caption-Generator.git
  2. Install the required dependencies:
pip install -r requirements.txt

Usage

  1. Make sure you have the required dependencies installed.
  2. Start the Flask app defined in app.py to test it:
flask run
  3. Optionally, you can fine-tune or train your own models using the provided dataset and scripts.

Dataset

The Image Caption Generator is trained on the Flickr8k dataset. Download the dataset before training, or substitute your own.
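As a rough sketch of preparing the captions for training, the snippet below groups captions by image. It assumes the tab-separated `image.jpg#index<TAB>caption` layout used by the `Flickr8k.token.txt` file in the original dataset distribution; other copies of the dataset (for example CSV exports) are laid out differently, and the file names in `sample` are made up for illustration.

```python
from collections import defaultdict
from typing import Dict, List


def parse_captions(text: str) -> Dict[str, List[str]]:
    """Group captions by image file name.

    Assumes each line looks like 'image.jpg#0<TAB>caption text'.
    """
    captions = defaultdict(list)
    for line in text.splitlines():
        line = line.strip()
        if not line or "\t" not in line:
            continue  # skip blank or malformed lines
        image_ref, caption = line.split("\t", 1)
        image_id = image_ref.split("#", 1)[0]  # drop the '#index' suffix
        captions[image_id].append(caption.strip().lower())
    return dict(captions)


# Made-up sample in the assumed format.
sample = (
    "dog.jpg#0\tA dog runs on the grass .\n"
    "dog.jpg#1\tA brown dog is running .\n"
    "cat.jpg#0\tA cat sits on a sofa .\n"
)
captions = parse_captions(sample)
print(captions["dog.jpg"])
```

To use it on a real file, read the file contents with `open(path, encoding="utf-8").read()` and pass the resulting string to `parse_captions`.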

Pre-trained Models

We provide pre-trained models that can be used to generate captions without any training. You can download the pre-trained models from the following links:

Contributing

Contributions to this project are welcome! If you find any issues or want to add new features, feel free to open an issue or submit a pull request.

License

This project is licensed under the MIT License.

Acknowledgments

  • The Image Caption Generator project builds on the work of several open-source libraries and research papers. We acknowledge the contributions of the research community and the creators of the datasets used in this project.
