Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle different in-memory data formats for the same input data #22

Open
wynot12 opened this issue Jan 8, 2018 · 0 comments
Open

Handle different in-memory data formats for the same input data #22

wynot12 opened this issue Jan 8, 2018 · 0 comments

Comments

@wynot12
Copy link
Contributor

wynot12 commented Jan 8, 2018

Different apps (e.g., MLR, GBT, Lasso) may use the same input data.
But in some case, they use different in-memory format for the exactly same data.

  • MLR, which is for classification task, maintains values in integer type.
  • Lasso, which is for regression task, maintains values in float type.
  • GBT, which is for both classification and regression, maintains value in float.

This becomes problem in #21, which makes jobs share the input table for the same input file.

We may fix them all to store data in one same type (integer or float) and transform it on use.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant