[data][train] Update "Data Loading and Preprocessing" doc #44165

Merged: raulchen merged 24 commits into ray-project:master from the ray-train-data-doc branch on Mar 26, 2024

Conversation

raulchen (Contributor)

Why are these changes needed?

Related issue number

Checks

  • I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests; see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: Hao Chen <chenh1024@gmail.com> (5 commits)

@scottjlee (Contributor) left a comment:

Pending a fix for the docs/premerge failures.

@angelinalg (Contributor) left a comment:

Please correct any subjects that I added to convert passive voice sentences to active voice.

Also, consider using Vale to catch spelling and style errors. (go/vale)

@@ -71,6 +72,9 @@ Data ingestion can be set up with four basic steps:
# Step 4: Access the dataset shard for the training worker via

Contributor: The indentation and step order look wrong. Should this be step 3?

raulchen (Contributor, Author): From the runtime's perspective, this happens after step 3 below.

raulchen and others added 14 commits March 21, 2024 12:49
Co-authored-by: Scott Lee <scottjlee@users.noreply.github.com> (3 commits)
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> (11 commits)
Signed-off-by: Hao Chen <chenh1024@gmail.com> (14 commits)
raulchen and others added 4 commits March 21, 2024 14:12
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> (1 commit)
Signed-off-by: Hao Chen <chenh1024@gmail.com> (4 commits)
@raulchen raulchen merged commit 3cb72ac into ray-project:master Mar 26, 2024
5 checks passed
@raulchen raulchen deleted the ray-train-data-doc branch March 26, 2024 18:39
raulchen added a commit to raulchen/ray that referenced this pull request Mar 27, 2024
…t#44165)

---------

Signed-off-by: Hao Chen <chenh1024@gmail.com>
Co-authored-by: Scott Lee <scottjlee@users.noreply.github.com>
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
Signed-off-by: Hao Chen <chenh1024@gmail.com>
stephanie-wang pushed a commit to stephanie-wang/ray that referenced this pull request Mar 27, 2024
…t#44165)


---------

Signed-off-by: Hao Chen <chenh1024@gmail.com>
Co-authored-by: Scott Lee <scottjlee@users.noreply.github.com>
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
khluu pushed a commit that referenced this pull request Mar 29, 2024
…44316)

Pick #44165 (doc changes only) to 2.10 branch
---------

Signed-off-by: Hao Chen <chenh1024@gmail.com>
Co-authored-by: Scott Lee <scottjlee@users.noreply.github.com>
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
ryanaoleary pushed a commit to ryanaoleary/ray that referenced this pull request Jun 7, 2024
…t#44165)


---------

Signed-off-by: Hao Chen <chenh1024@gmail.com>
Co-authored-by: Scott Lee <scottjlee@users.noreply.github.com>
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
7 participants