Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

about real dataset (labeled or unlabeled) #4

Open
lx7555 opened this issue Aug 20, 2022 · 7 comments
Open

about real dataset (labeled or unlabeled) #4

lx7555 opened this issue Aug 20, 2022 · 7 comments

Comments

@lx7555
Copy link

lx7555 commented Aug 20, 2022

Thanks for your work, I noticed that you validated the importance of using unlabeled real datasets.
So how do I control in code to implement data states ((labeled or unlabeled)) for training.

Thank you!

@WangWenhao0716
Copy link
Owner

They should be controlled by train.py:

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
                                            args.batch_size, args.workers, args.num_instances, iters,
                                            trainset=pseudo_labeled_dataset_relabel + dataset_syne.train)

pseudo_labeled_dataset_relabel denotes using the unlabeled dataset, and dataset_syne.train denotes using the labels.

@lx7555
Copy link
Author

lx7555 commented Aug 20, 2022

Thank you. As your suggestion,

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
args.batch_size, args.workers, args.num_instances, iters,
trainset=pseudo_labeled_dataset_relabel)

mean real data using the unlabeled dataset (pseudo_labeled)

while
train_loader = get_train_loader(args, dataset_real, args.height, args.width,
args.batch_size, args.workers, args.num_instances, iters,
trainset=pseudo_labeled_dataset_relabel + dataset_syne.train)
mean real data using the labeled dataset (real_labeled)

@WangWenhao0716
Copy link
Owner

No.

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
args.batch_size, args.workers, args.num_instances, iters,
trainset=pseudo_labeled_dataset_relabel)

means real data using the unlabeled dataset (pseudo_labeled)

train_loader = get_train_loader(args, dataset_real, args.height, args.width,
args.batch_size, args.workers, args.num_instances, iters,
trainset=pseudo_labeled_dataset_relabel + dataset_syne.train)

means real data using the unlabeled dataset (pseudo_labeled) while syne data using the labeled dataset.

@lx7555
Copy link
Author

lx7555 commented Aug 22, 2022

Thank you!
So as you mentioned, how to use "adding labeled real-world data to synthetic data" in your paper?
I mean how to change the label state for real-world data?
Best wishes

@lx7555
Copy link
Author

lx7555 commented Aug 22, 2022

Maybe?
train_loader = get_train_loader(args, dataset_real, args.height, args.width,
args.batch_size, args.workers, args.num_instances, iters,
trainset=dataset_real.train + dataset_syne.train

means real data using the labeled dataset (real-labeled) while syne data using the labeled dataset.

@WangWenhao0716
Copy link
Owner

Yes.

@lx7555
Copy link
Author

lx7555 commented Aug 22, 2022

Yes.

Thank you

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants