-
Notifications
You must be signed in to change notification settings - Fork 110
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
option to prefetch the data #968
Comments
Yes, all CM scripts are modular and so we can do this. For example, the below command will download the full imagenet validation set and exports the downloaded paths.
|
Thank you !
|
Ok, the space before But I am trying this example and nothing is happening, do you know why or how I can debug further ?
Thanks ! |
Hi @jdesfossez, CM scripts installs artifacts to the CM cache and make them available to other CM scripts via API and/or ENV variables. You can see the cache with all artifacts including above model as follows:
You can find your model and extra CM meta files as follows:
Basically CM is a database of objects connected by tags, UIDs and ENV variables ... Please check these 2 tutorials that may give you more ideas behind CM:
That's how we reuse individual CM scripts (and workflows assembled from those scripts) for reproducibility initiatives at conferences and other initiatives to make it easier to run AI on different platforms ... We are interested to know your use cases and how CM can help - please feel free to talk to us via Discord server or we can set up a conf-call ... Thank you for your interest and feedback! |
Hi ! My current goal is to automate performance testing of GPUs in a public cloud environment. I need to easily and quickly compare the impact of various hypervisor-level changes, so this project seems perfect for that purpose. Eventually I will use it as well to submit results. |
Sorry @jdesfossez for the typo -- I was typing on mobile :( "is there a clean way for me to specify at run-time the location of the data " I believe you want to use a private URL here right? Currently we are supporting multiple downloaded sources like this but not custom URLs - we can do this by next release. But for most of the large datasets, there is an option to provide the |
This solution can work for using custom URLs. |
ah perfect, thank you so much ! |
Is there an option to only run the steps that downloads the data set without actually running the benchmarks ?
This would be useful to prepare images.
Thanks !
The text was updated successfully, but these errors were encountered: