CogKit

Introduction

CogKit is an open-source project that provides a user-friendly interface for researchers and developers to utilize ZhipuAI's CogView (image generation) and CogVideoX (video generation) models. It streamlines multimodal tasks such as text-to-image (T2I), text-to-video (T2V), and image-to-video (I2V). Users must comply with legal and ethical guidelines to ensure responsible implementation.

Visit our Docs to get started.

Features

Fine-tuning Methods: Supports LoRA and full-parameter fine-tuning across various setups, including single-machine single-GPU, single-machine multi-GPU, and multi-machine multi-GPU configurations.
Inference: Provides an OpenAI-style API (T2I only) and a command-line interface for model deployment.
Embed Cache: Optimizes GPU memory usage to enhance efficiency during inference.
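
As a rough illustration of what "OpenAI-style" could mean for the T2I API, the sketch below builds (but does not send) an image generation request shaped like OpenAI's `images/generations` endpoint. The server address, the endpoint path, the model identifier, and the helper name `build_t2i_request` are all assumptions made for illustration and are not taken from the CogKit documentation; check the Docs for the actual values.

```python
import json
from urllib.request import Request

# Assumed values for illustration only; not taken from the CogKit docs.
BASE_URL = "http://localhost:8000/v1"  # hypothetical server address
MODEL_NAME = "cogview4"                # hypothetical model identifier


def build_t2i_request(prompt: str, size: str = "1024x1024", n: int = 1) -> Request:
    """Build (but do not send) an OpenAI-style image generation request."""
    payload = {"model": MODEL_NAME, "prompt": prompt, "size": size, "n": n}
    return Request(
        url=f"{BASE_URL}/images/generations",  # path follows OpenAI's convention (assumed)
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_t2i_request("a watercolor painting of a lighthouse at dawn")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # To send it to a running server, pass req to urllib.request.urlopen.
```

Building the request separately from sending it makes it easy to inspect the payload before pointing it at a real deployment.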

Roadmap

Add support for CogView4 ControlNet model
Docker for easy deployment

License

This project is licensed under the Apache 2.0 License.

