Distribute pre-generated datasets + parametrized GitHub workflows for generating a few frames samples on GitHub CI without having to install Blender/dependencies locally #80

Closed
vadimkantorov opened this issue Jan 31, 2021 · 3 comments
Labels
duplicate This issue or pull request already exists

Comments


vadimkantorov commented Jan 31, 2021

It would be very helpful if Kubric published some example datasets that it is able to produce. This would let users get familiar with the file structure (even if Kubric still requires some polishing and is not ready for mass usage yet).

Another question: how much compute does dataset generation require, and how long does it take? If it's hours of single-core CPU time, it may be possible to just use GitHub Actions for generating simple CATER-like datasets.

Collaborator

Qwlouse commented Feb 2, 2021

Perfectly timed question! I am currently writing some getting started docs + accompanying examples. Expect to see an update on this within a week.

Compute requirements depend heavily on the desired resolution, quality, and complexity of the scene. But rendering a single frame like the ones in the CATER dataset would probably take about a second on a single CPU. For 5500 videos with 240 frames each, this would unfortunately already add up to about 2 weeks of CPU time.
But we will be hosting a few generated datasets soonish and plan to expand on this front as the project continues.
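
For reference, the arithmetic behind the "about 2 weeks" figure can be sketched as follows. This is only a back-of-the-envelope calculation: the one second per frame is the rough guess quoted above, not a measured Kubric benchmark, and the variable names are purely illustrative.

```python
# Back-of-the-envelope CPU-time estimate for a CATER-sized dataset.
# SECONDS_PER_FRAME is the rough "about a second" guess from the comment
# above (an assumption, not a benchmark).
SECONDS_PER_FRAME = 1.0
NUM_VIDEOS = 5500
FRAMES_PER_VIDEO = 240

total_frames = NUM_VIDEOS * FRAMES_PER_VIDEO      # 1,320,000 frames
cpu_seconds = total_frames * SECONDS_PER_FRAME    # single-core seconds
cpu_days = cpu_seconds / (24 * 60 * 60)           # convert seconds to days

print(f"{total_frames} frames -> {cpu_days:.1f} CPU-days")
# prints "1320000 frames -> 15.3 CPU-days"
```

Roughly 15 days of single-core time, i.e. a bit over two weeks, so the estimate holds as long as the per-frame cost really is near one second.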

Author

vadimkantorov commented Feb 2, 2021

A parametrized GitHub Action that renders a mini-dataset and publishes it to GitHub Releases would still be a very nice way to showcase the different options. This matters especially in academic environments, where getting root/Docker permissions takes quite some time or is impossible (if Kubric gets packaged for conda, that's another story). It would also let users fork the Kubric repo, experiment with rendering options, and maybe even generate small dataset versions (e.g. 50 videos of 100 frames each) without installing anything or figuring out Google AI Platform registration, payments, or tools.

Thanks!

I've heard that GH runners have 2 CPU cores, so maybe it's not even that bad.
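
As a rough illustration of what that would mean for the mini-dataset mentioned above (50 videos of 100 frames each), here is a small sketch. It assumes about one second per frame per core, 2 cores on the runner, and perfect scaling across cores; all three are guesses rather than measurements, and `wall_clock_minutes` is a hypothetical helper, not part of Kubric.

```python
def wall_clock_minutes(num_videos, frames_per_video,
                       seconds_per_frame=1.0, cores=2):
    """Estimate wall-clock rendering time in minutes.

    Assumes rendering parallelizes perfectly across `cores`
    (an optimistic assumption) and a fixed per-frame cost.
    """
    cpu_seconds = num_videos * frames_per_video * seconds_per_frame
    return cpu_seconds / cores / 60


# Mini-dataset: 50 videos of 100 frames each on a 2-core runner.
mini = wall_clock_minutes(50, 100)
print(f"mini-dataset: about {mini:.0f} minutes")
# prints "mini-dataset: about 42 minutes"
```

Under these assumptions a mini-dataset would render in well under an hour of runner time, before counting any time spent installing Blender and other dependencies in the workflow.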

Collaborator

taiya commented Jun 7, 2021

Merged with the issue for the arXiv paper.

@taiya taiya closed this as completed Jun 7, 2021
@taiya taiya added the duplicate This issue or pull request already exists label Jun 7, 2021
@taiya taiya mentioned this issue Jun 7, 2021
3 tasks
@vadimkantorov vadimkantorov changed the title from "Distribute pre-generated datasets" to "Distribute pre-generated datasets + GitHub workflows for generating a few frames samples on GitHub CI without having to install Blender/dependencies locally" Jun 7, 2021
@vadimkantorov vadimkantorov changed the title from "Distribute pre-generated datasets + GitHub workflows for generating a few frames samples on GitHub CI without having to install Blender/dependencies locally" to "Distribute pre-generated datasets + parametrized GitHub workflows for generating a few frames samples on GitHub CI without having to install Blender/dependencies locally" Jun 7, 2021

3 participants