Image-Captioning-Web-App-with-Gemini-Pro-Vision

Introduction

Image captioning has become an essential tool for making digital content accessible and interactive. With multimodal models such as Google’s Gemini Pro Vision, generated captions have become more accurate and contextually relevant. In this blog, we will explore how to build a simple web application with Streamlit and Google’s Gemini Pro Vision that generates captions for uploaded images.
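To make the idea concrete, here is a minimal sketch of how such an app could be wired together. It is not the code in this repository's main.py: the function names (build_model, caption_image, run_app), the default prompt, and the way the API key is passed in are all assumptions made for illustration. The model name is taken from the project title and may no longer be served by Google.

```python
def build_model(api_key, model_name="gemini-pro-vision"):
    """Configure the SDK and return a Gemini vision model.

    The import is deferred so the helpers below can be used without the SDK.
    Assumes the google-generativeai package is installed.
    """
    import google.generativeai as genai

    genai.configure(api_key=api_key)
    return genai.GenerativeModel(model_name)


def caption_image(model, image, prompt="Write a short caption for this image."):
    """Send the prompt and the image to the model and return the caption text."""
    response = model.generate_content([prompt, image])
    return response.text.strip()


def run_app(api_key):
    """Streamlit page: upload an image, press a button, show the caption."""
    import streamlit as st
    from PIL import Image

    st.title("Image Captioning with Gemini Pro Vision")
    uploaded = st.file_uploader("Upload an image", type=["png", "jpg", "jpeg"])
    if uploaded is not None:
        image = Image.open(uploaded)
        st.image(image, caption="Uploaded image")
        if st.button("Generate caption"):
            st.write(caption_image(build_model(api_key), image))
```

To try the sketch, one option is to call run_app(...) with your own API key at the bottom of a script and launch it with streamlit run. Keeping caption_image separate from the UI code makes it possible to test the captioning step with a stand-in model object.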

STEPS to run the project:

STEP 01- Clone the repository

Project repo: https://github.com/riad5089/Image-Captioning-Web-App-with-Gemini-Pro-Vision.git

STEP 02- Create a virtual environment after opening the repository (the activation command below is for Windows)

python -m venv env

env\Scripts\activate

STEP 03- Install the requirements

pip install -r requirements.txt
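After the install finishes, a quick check like the following can confirm that the environment is usable. This is only a sketch: the module names in ASSUMED_MODULES are guesses at what a Streamlit and Gemini app would need, not the actual contents of requirements.txt, and missing_modules is a hypothetical helper.

```python
import importlib.util

# Assumed import names; adjust to match what requirements.txt actually installs.
ASSUMED_MODULES = ["streamlit", "google.generativeai", "PIL"]


def missing_modules(names):
    """Return the names from `names` that cannot be located in this environment."""
    missing = []
    for name in names:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised when a parent package (for example `google`) is itself absent.
            found = False
        if not found:
            missing.append(name)
    return missing
```

Running print(missing_modules(ASSUMED_MODULES)) inside the activated environment prints an empty list when every listed module can be found.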

Project Demo

Deployment

I built this web application with the Streamlit framework. It is hosted on share.streamlit; you can check out the app here.
