-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data split fails #2
Comments
Hi, could you please make sure that the |
It's empty but I followed the instructions. |
Could you please provide the structure of the |
it contains around 10K models that end with .pth. |
But what is the structure of the |
datasets/mnist_classifiers/model_xx.pth
Like this with many files
בתאריך יום ד׳, 10 ביולי 2024 ב-23:59 מאת Aviv Navon <
***@***.***>:
… But what is the structure of the datasets/mnist_classifiers folder? Are
there other folders inside? Or just the *.pth files (e.g.,
datasets/mnist_classifiers/model_xx.pth)
—
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVTIJUDOOOJ7HV77UHWMYPDZLWOCPAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRRGQ2TGMZYGE>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Are you sure you pass the correct path as from pathlib import Path
data_root = "datasets/mnist_models"
data_root = Path(data_root)
all_files = [p.as_posix() for p in data_root.glob("**/*.pth")]
all_files[:10] The output should look like this:
|
Thanks, I'll check it out.
I ran python experiments/utils/data/generate_splits.py --data-root
datasets/mnist_classifiers --save-path datasets/splits.json without change.
Should I have done any changes inside the command?
בתאריך יום ה׳, 11 ביולי 2024 ב-0:16 מאת Aviv Navon <
***@***.***>:
… Are you sure you pass the correct path as --data-root?
Try this please:
from pathlib import Pathdata_root = "datasets/mnist_models"data_root = Path(data_root)all_files = [p.as_posix() for p in data_root.glob("**/*.pth")]all_files[:10]
The output should look like this:
['datasets/mnist_models/model_899.pth', 'datasets/mnist_models/model_3082.pth', 'datasets/mnist_models/model_641.pth', 'datasets/mnist_models/model_4935.pth', 'datasets/mnist_models/model_1695.pth', 'datasets/mnist_models/model_7582.pth', 'datasets/mnist_models/model_6844.pth', 'datasets/mnist_models/model_8869.pth', 'datasets/mnist_models/model_5395.pth', 'datasets/mnist_models/model_127.pth']
—
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVTIJUG54ZRZ64CEQWPEEADZLWQBRAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRRGQ4TCOJTGY>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Try providing a full path to |
['datasets/mnist_models/model_3302.pth', 'datasets/mnist_models/model_2930.pth', 'datasets/mnist_models/model_2542.pth', 'datasets/mnist_models/model_1457.pth', 'datasets/mnist_models/model_1825.pth', 'datasets/mnist_models/model_4309.pth', 'datasets/mnist_models/model_5549.pth', 'datasets/mnist_models/model_9289.pth', 'datasets/mnist_models/model_2123.pth', 'datasets/mnist_models/model_2684.pth'] |
Try running the generate_splits.py command with full path (and maybe provide test/val sizes) |
tried but fails
בתאריך יום ה׳, 11 ביולי 2024 ב-0:28 מאת Aviv Navon <
***@***.***>:
… Try running the generate_splits.py command with full path (and maybe
provide test/val sizes)
—
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVTIJUH4CYI72TXSLVGKUFTZLWRPXAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRRGUZDOMZSHE>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
I tried running what you asked explicitly and got all_files = [ ]. Although
as I showed before, there are paths.
בתאריך יום ה׳, 11 ביולי 2024 ב-0:30 מאת יונתן סברדלוב <
***@***.***>:
… tried but fails
בתאריך יום ה׳, 11 ביולי 2024 ב-0:28 מאת Aviv Navon <
***@***.***>:
> Try running the generate_splits.py command with full path (and maybe
> provide test/val sizes)
>
> —
> Reply to this email directly, view it on GitHub
> <#2 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/AVTIJUH4CYI72TXSLVGKUFTZLWRPXAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRRGUZDOMZSHE>
> .
> You are receiving this because you authored the thread.Message ID:
> ***@***.***>
>
|
Could you share the exact command you are using and the full trace? |
Regarding commands:
mkdir datasets
wget "https://www.dropbox.com/s/sv85hrjswaspok4/mnist_classifiers.zip"
unzip -q mnist_classifiers.zip -d datasets
python experiments/utils/data/generate_splits.py --data-root
datasets/mnist_classifiers --save-path datasets/splits.json
בתאריך יום ה׳, 11 ביולי 2024 ב-9:43 מאת Aviv Navon <
***@***.***>:
… Could you share the exact command you are using and the full trace?
Also, could you please try to debug to understand why the file structure
does not fit data_root.glob("**/*.pth") ?
—
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVTIJUADXC7GN2NEBWYONLDZLYSPZAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRSGE2TGNBYGA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
I think I see the problem, the subfolder is called |
Ok so what is the fix?
בתאריך יום ה׳, 11 ביולי 2024 ב-9:53 מאת Aviv Navon <
***@***.***>:
… I think I see the problem, the subfolder is called mnist_models and not mnist_classifiers
—
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVTIJUE277DDT3WPL4KFKJTZLYTVDAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRSGE3TCNZXGI>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
python experiments/utils/data/generate_splits.py --data-root datasets/mnist_models --save-path datasets/splits.json |
Also, I suggest providing exact sizes for the test/val splits using |
Worked but inside generate.py what val/test splited you took in the paper?
בתאריך יום ה׳, 11 ביולי 2024 ב-9:59 מאת Aviv Navon <
***@***.***>:
… python experiments/utils/data/generate_splits.py --data-root datasets/mnist_models --save-path datasets/splits.json
—
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVTIJUGJPKKCLFOQ2BZKTYDZLYUOVAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRSGE4DCNJXHE>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
I believe we provide the full experimental details in the Appendix of the paper |
And what about other datasets?
בתאריך יום ה׳, 11 ביולי 2024 ב-10:07 מאת יונתן סברדלוב <
***@***.***>:
… Worked but inside generate.py what val/test splited you took in the paper?
בתאריך יום ה׳, 11 ביולי 2024 ב-9:59 מאת Aviv Navon <
***@***.***>:
> python experiments/utils/data/generate_splits.py --data-root datasets/mnist_models --save-path datasets/splits.json
>
> —
> Reply to this email directly, view it on GitHub
> <#2 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/AVTIJUGJPKKCLFOQ2BZKTYDZLYUOVAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDEMRSGE4DCNJXHE>
> .
> You are receiving this because you authored the thread.Message ID:
> ***@***.***>
>
|
We will make the effort to release other datasets and the supporting code in the future |
We've released the code for the CNNs experiments |
Awesome, thanks for the update!
Does it include also CIFAR exps?
…On Tue, Jul 30, 2024, 21:56 Aviv Navon ***@***.***> wrote:
We've released the code for the CNNs experiments
—
Reply to this email directly, view it on GitHub
<#2 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AVTIJUAEXEHJOBOVYQWKFU3ZO7OUXAVCNFSM6AAAAABKUUSTGWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDENJZGAYDKMRRGA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Yes. |
Hi I have two issues:
First, I run python experiments/utils/data/generate_splits.py --data-root datasets/mnist_classifiers --save-path datasets/splits.json to create the splits and have the following error: raise ValueError(
ValueError: With n_samples=0, test_size=0.25 and train_size=None, the resulting train set will be empty. Adjust any of the aforementioned parameters.
Second, can you add the networks of all other datasets like CIFAR10, LST?
Thanks!
The text was updated successfully, but these errors were encountered: