Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training on original Criterio Dataset. #76

Closed
Karoljv opened this issue Dec 3, 2023 · 2 comments
Closed

Training on original Criterio Dataset. #76

Karoljv opened this issue Dec 3, 2023 · 2 comments

Comments

@Karoljv
Copy link

Karoljv commented Dec 3, 2023

Hey, I have a question about training DeepFM on originial dataset Criterio dataset. Is this possible with the code provided in the repository? Dataset presented in demo version has 19 colummns. I mean train_sample.csv, test_sample.csv etc. Are these columns from Criterio dataset? How can I use original Criterio dataset when the data from this dataset is numerical and the caterogical columns are hashed. How to handle no labels in test set?

@zhujiem
Copy link
Contributor

zhujiem commented Dec 6, 2023

Please follow the benchmark steps in BARS: say https://github.com/reczoo/BARS/tree/main/ctr_prediction/benchmarks/DeepFM/DeepFM_criteo_x1

@zhujiem
Copy link
Contributor

zhujiem commented Dec 6, 2023

If you have test set without labels, you can call model.predict() at https://github.com/reczoo/FuxiCTR/blob/main/fuxictr/pytorch/models/rank_model.py#L244

@zhujiem zhujiem closed this as completed Dec 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants