Multimodal Computer Vision and NLP Project: Creating Unlimited Graphic Novels from a Single Prompt

This is a GitHub repository for a multimodal computer vision and natural language processing (NLP) project that aims to generate unlimited graphic novels from a single prompt. The project uses multiple openAI APIs like the ChatGPT API, the Dall-E 2 API, and Vercel edge functions.

Project Overview

The goal of this project is to create an AI-powered system that can generate graphic novels from a single prompt. The system uses a combination of computer vision and NLP techniques to analyze the prompt and generate a story with corresponding images. The system is powered by multiple openAI APIs, including the ChatGPT API, which provides the NLP capabilities, and the Dall-E 2 API, which provides the image generation capabilities. Vercel edge functions are used to connect these APIs and create a seamless experience for the user.

Requirements

To run this project, you will need to have the following:

A valid API key for the openAI ChatGPT API and Dall-E 2 API Node.js (version 14 or later) NPM (version 6 or later)

Usage

Start the server and the system will be available at http://localhost:3000.

To generate a graphic novel, enter a prompt in the input field and click the "Generate" button. The system will then use the openAI ChatGPT API to generate a story and the Dall-E 2 API to generate corresponding images. The resulting graphic novel will be displayed on the screen.

Contributing

Contributions to this project are welcome. To contribute, fork the repository and submit a pull request. Please include a detailed description of your changes.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

If you have any questions or feedback, please feel free to contact the project contributors at [gupta[dot]saksham[at]gmail[dot]com].

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
components		components
pages		pages
public		public
styles		styles
utils		utils
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json

sakshamio/comic-create

Folders and files

Latest commit

History

Repository files navigation

Multimodal Computer Vision and NLP Project: Creating Unlimited Graphic Novels from a Single Prompt

Project Overview

Requirements

Usage

Contributing

License

Contact

About

Resources

Stars

Watchers

Forks

Languages