Don't populate labels if label column is not specified in csv parser #679

Merged: hcho3 merged 2 commits into dmlc:main on Mar 10, 2023

rongou (Contributor) commented on Mar 6, 2023

Right now, the CSV parser sets every label to 0 when the label column is not specified (or is set to -1). This is surprising to users and leads to cryptic error messages. It's probably better to leave the labels empty when no label column is specified.

For vertical federated learning, some workers may not have access to the labels, so this change would let them parse CSV shards without erroneously setting labels to 0.
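To illustrate the intended behavior, here is a minimal Python sketch. It is not the project's parser or API: the function name `parse_csv` and the `label_column` parameter are hypothetical stand-ins that mirror the description above, where a negative label column means "no labels".

```python
import csv
import io
from typing import List, Tuple


def parse_csv(text: str, label_column: int = -1) -> Tuple[List[List[float]], List[float]]:
    """Parse CSV text into (features, labels).

    Hypothetical helper for illustration only. A negative label_column
    means no label column was specified, so labels is left empty
    instead of being filled with zeros.
    """
    features: List[List[float]] = []
    labels: List[float] = []
    for row in csv.reader(io.StringIO(text)):
        if not row:
            continue  # skip blank lines
        values = [float(cell) for cell in row]
        if label_column >= 0:
            # Pull the label out of the row; the rest are features.
            labels.append(values[label_column])
            values = values[:label_column] + values[label_column + 1:]
        features.append(values)
    return features, labels


# A worker without labels parses its shard and gets an empty label list.
features, labels = parse_csv("1,2,3\n4,5,6\n")
print(len(features), len(labels))
```

With this shape, a caller can tell "no labels available" (an empty list) apart from "every label is 0", which is the distinction a label-less worker in vertical federated learning needs.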

rongou (Contributor, Author) commented on Mar 6, 2023

@hcho3 @trivialfis

hcho3 merged commit ea21135 into dmlc:main on Mar 10, 2023

hcho3 (Contributor) commented on Mar 10, 2023

Merging for now. I'll try to make time to fix the CI.
