Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data splitter #33

Open
2 of 3 tasks
raghukiran1224 opened this issue Jul 22, 2021 · 1 comment
Open
2 of 3 tasks

Data splitter #33

raghukiran1224 opened this issue Jul 22, 2021 · 1 comment
Assignees
Labels
cfp-datamodel Codeflare pipelines datamodel good first issue Good for newcomers help wanted Extra attention is needed Prio1 Medium priority, needs attention, order of 2-3 weeks user-story User story

Comments

@raghukiran1224
Copy link
Contributor

raghukiran1224 commented Jul 22, 2021

Overview

As a CFP user, I would like to split a dataset (e.g., np array, pandas dataframe) into smaller objects that can then be fed into other nodes/pipeline. This is especially useful when we have compute intensive tasks and would like to parallelize it easily.

Acceptance Criteria

  • Design for splitter, should be simple and intuitive
  • Implementation as an extension to the Node construct
  • Tests

Questions

  • What type of semantics does the splitter node define?

Assumptions

Reference

@raghukiran1224 raghukiran1224 added good first issue Good for newcomers help wanted Extra attention is needed cfp-datamodel Codeflare pipelines datamodel user-story User story Prio1 Medium priority, needs attention, order of 2-3 weeks labels Jul 22, 2021
@raghukiran1224
Copy link
Contributor Author

The basic utility has been added, putting it as an actual node needs more work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cfp-datamodel Codeflare pipelines datamodel good first issue Good for newcomers help wanted Extra attention is needed Prio1 Medium priority, needs attention, order of 2-3 weeks user-story User story
Projects
None yet
Development

No branches or pull requests

4 participants