Skip to content

mdsrqbl/omnihuman

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OmniHuman

AI model that understands humanoids and text.


PyPi Documentation Status python

GitHub Repo stars Downloads

Support Us ❤️ PayPal

  1. OmniHuman
    1. Installation
    2. Usage
    3. Citation & License

Installation

pip install omnihuman

or install editable from source

git clone https://github.com/mdsrqbl/omnihuman.git
cd omnihuman
pip install -e .

Usage

import omnihuman
import PIL.Image

text = "Raise both hands and clap overhead."
frames = omnihuman.read_frames("path/to/image.jpg")  # (1, channels, height, width)
# frames = omnihuman.read_frames("path/to/video.mp4")  # (n_frames, channels, height, width)

# model = omnihuman.OmniHuman()
# frames = model.generate_video(text, frames)

PIL.Image.fromarray(frames[-1].permute(1,2,0).numpy()).show()

Full documentation is available at omnihuman.readTheDocs.io.

Citation & License

@misc{mdsr2024omnihuman,
  author = {Mudassar Iqbal},
  title = {OmniHuman: AI model that understands text and humanoids.},
  year = {2024},
  publisher = {GitHub},
  howpublished = {\url{https://github.com/mdsrqbl/omnihuman}}
}

This project is licensed under Apache License 2.0 - see the LICENSE file for details.

You are permitted to use the library & models, create modified versions, or incorporate pieces of the code into your own work. Your product or research, whether commercial or non-commercial, must provide appropriate credit to the original author(s) by citing this repository & research papers.

Stay tuned for research papers!