Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to run our own dataset #3

Open
HDareYR opened this issue Apr 12, 2024 · 2 comments
Open

How to run our own dataset #3

HDareYR opened this issue Apr 12, 2024 · 2 comments

Comments

@HDareYR
Copy link

HDareYR commented Apr 12, 2024

Thanks for your great work, I want to konw if the work could been used with unpaired dataset ,and if I use my own dataset, is there anything I need to change like load_data() in DDBM-main\ datassets _init_.py

@harveymannering
Copy link

I have tried using this repo on the AFHQ dataset (unpaired animal faces) and it did require changing the load_data() function, yeah. You will also need to create a new dataloader for your own dataset.

I trained the model to convert dogs to cats. However, when I ran inference from the following dog images, I got the corresponding cat images:

image

As you can see, there aren't really strong similarities between the input and output images like you would expect in something like CycleGAN. It's possible I just didn't train the model long enough, or the architecture was too small, or I used the wrong parameters. Either way, I haven't been able to get it to work for unpaired data, but I would be interested to hear if anyone has been able to get this to work, especially the authors.

@alexzhou907
Copy link
Owner

Sorry for the late reply. For unpaired translation this work would not work because it does not solve the Schrodinger bridge problem where you have a coupled forward and backward process to be learned jointly (https://arxiv.org/abs/2110.11291, https://arxiv.org/pdf/2106.01357). But these algorithms are difficult to scale due to some kinds of iterative sampling algorithms during training, so we opt to fix the forward process while learning the reverse, which makes learning simulation-free. But this does deviate from solving optimal transport. Instead, this only learns how to start from arbitrary prior given by data, and no guarantee on the generated samples have optimal coupling with your starting sample.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants