pip install omnihuman
or install editable from source
git clone https://github.com/mdsrqbl/omnihuman.git
cd omnihuman
pip install -e .
import omnihuman
import PIL.Image
text = "Raise both hands and clap overhead."
frames = omnihuman.read_frames("path/to/image.jpg") # (1, channels, height, width)
# frames = omnihuman.read_frames("path/to/video.mp4") # (n_frames, channels, height, width)
# model = omnihuman.OmniHuman()
# frames = model.generate_video(text, frames)
PIL.Image.fromarray(frames[-1].permute(1,2,0).numpy()).show()
Full documentation is available at omnihuman.readTheDocs.io.
@misc{mdsr2024omnihuman,
author = {Mudassar Iqbal},
title = {OmniHuman: AI model that understands text and humanoids.},
year = {2024},
publisher = {GitHub},
howpublished = {\url{https://github.com/mdsrqbl/omnihuman}}
}
This project is licensed under Apache License 2.0 - see the LICENSE file for details.
You are permitted to use the library & models, create modified versions, or incorporate pieces of the code into your own work. Your product or research, whether commercial or non-commercial, must provide appropriate credit to the original author(s) by citing this repository & research papers.
Stay tuned for research papers!