cudf 22.04 changed the df.sample method to accept an 'ignore_index' parameter instead of 'keep_index'. Fix by choosing which parameter to pass based on the installed cudf version.
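A minimal sketch of the version check described above, not the code from this PR. The helper names `parse_major_minor` and `sample_index_kwargs` are hypothetical, and mapping `keep_index=False` to `ignore_index=True` is an assumption about how the two parameters relate.

```python
import re


def parse_major_minor(version: str) -> tuple:
    """Extract (major, minor) from a version string such as '22.04.00'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def sample_index_kwargs(cudf_version: str, keep_index: bool = False) -> dict:
    """Build the index-related keyword argument for df.sample.

    Hypothetical helper: from cudf 22.04 on, pass 'ignore_index';
    for earlier releases, pass 'keep_index'. The two flags are
    assumed here to be logical opposites.
    """
    if parse_major_minor(cudf_version) >= (22, 4):
        return {"ignore_index": not keep_index}
    return {"keep_index": keep_index}
```

With cudf installed, a call along the lines of `df.sample(n=len(df), **sample_index_kwargs(cudf.__version__))` would then pick the matching keyword for whichever release is present.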
GitHub pull request #82 of commit cde760dbd70d6b4a36133d65272f5f55ad7f7449, no merge conflicts.
Running as SYSTEM
Setting status of cde760dbd70d6b4a36133d65272f5f55ad7f7449 to PENDING with url https://10.20.13.93:8080/job/merlin_core/46/console and message: 'Pending'
Using context: Jenkins
Building on master in workspace /var/jenkins_home/workspace/merlin_core
using credential ce87ff3c-94f0-400a-8303-cb4acb4918b5
> git rev-parse --is-inside-work-tree # timeout=10
Fetching changes from the remote Git repository
> git config remote.origin.url https://github.com/NVIDIA-Merlin/core # timeout=10
Fetching upstream changes from https://github.com/NVIDIA-Merlin/core
> git --version # timeout=10
using GIT_ASKPASS to set credentials login for merlin-systems username and pass
> git fetch --tags --force --progress -- https://github.com/NVIDIA-Merlin/core +refs/pull/82/*:refs/remotes/origin/pr/82/* # timeout=10
> git rev-parse cde760dbd70d6b4a36133d65272f5f55ad7f7449^{commit} # timeout=10
Checking out Revision cde760dbd70d6b4a36133d65272f5f55ad7f7449 (detached)
> git config core.sparsecheckout # timeout=10
> git checkout -f cde760dbd70d6b4a36133d65272f5f55ad7f7449 # timeout=10
Commit message: "Fix for cudf 22.04"
> git rev-list --no-walk f7e89cc177414b232546a67b665592f33c347fcf # timeout=10
[merlin_core] $ /bin/bash /tmp/jenkins7837108209131990970.sh
Looking in indexes: https://pypi.org/simple, https://pypi.ngc.nvidia.com
Requirement already satisfied: setuptools in /usr/local/lib/python3.8/dist-packages (62.1.0)
============================= test session starts ==============================
platform linux -- Python 3.8.10, pytest-7.1.2, pluggy-1.0.0
rootdir: /var/jenkins_home/workspace/merlin_core/core, configfile: pyproject.toml
plugins: xdist-2.5.0, forked-1.4.0, cov-3.0.0
collected 337 items / 1 skipped
The dataloader included its own copy of the shuffle code that is now hosted in merlin-core. This change switches to the merlin-core version rather than redefining it here. The shuffle_df code had an issue with cudf 22.04 that is fixed in merlin-core by NVIDIA-Merlin/core#82.
GitHub pull request #82 of commit cb5157148aa2ec6872698ede29cd960cc9f44aae, no merge conflicts.
Running as SYSTEM
Setting status of cb5157148aa2ec6872698ede29cd960cc9f44aae to PENDING with url https://10.20.13.93:8080/job/merlin_core/47/console and message: 'Pending'
Using context: Jenkins
Building on master in workspace /var/jenkins_home/workspace/merlin_core
using credential ce87ff3c-94f0-400a-8303-cb4acb4918b5
> git rev-parse --is-inside-work-tree # timeout=10
Fetching changes from the remote Git repository
> git config remote.origin.url https://github.com/NVIDIA-Merlin/core # timeout=10
Fetching upstream changes from https://github.com/NVIDIA-Merlin/core
> git --version # timeout=10
using GIT_ASKPASS to set credentials login for merlin-systems username and pass
> git fetch --tags --force --progress -- https://github.com/NVIDIA-Merlin/core +refs/pull/82/*:refs/remotes/origin/pr/82/* # timeout=10
> git rev-parse cb5157148aa2ec6872698ede29cd960cc9f44aae^{commit} # timeout=10
Checking out Revision cb5157148aa2ec6872698ede29cd960cc9f44aae (detached)
> git config core.sparsecheckout # timeout=10
> git checkout -f cb5157148aa2ec6872698ede29cd960cc9f44aae # timeout=10
Commit message: "Merge branch 'main' into cudf_2204_fix"
> git rev-list --no-walk cde760dbd70d6b4a36133d65272f5f55ad7f7449 # timeout=10
[merlin_core] $ /bin/bash /tmp/jenkins816322271622302664.sh
Looking in indexes: https://pypi.org/simple, https://pypi.ngc.nvidia.com
Requirement already satisfied: setuptools in /usr/local/lib/python3.8/dist-packages (62.1.0)
============================= test session starts ==============================
platform linux -- Python 3.8.10, pytest-7.1.2, pluggy-1.0.0
rootdir: /var/jenkins_home/workspace/merlin_core/core, configfile: pyproject.toml
plugins: xdist-2.5.0, forked-1.4.0, cov-3.0.0
collected 342 items / 1 skipped
The dataloader included its own copy of the shuffle code that is now hosted in merlin-core. This change switches to the merlin-core version rather than redefining it here. The shuffle_df code had an issue with cudf 22.04 that is fixed in merlin-core by NVIDIA-Merlin/core#82.
Co-authored-by: Julio Perez <37191411+jperez999@users.noreply.github.com>
cudf 22.04 changed the df.sample method to accept
an 'ignore_index' parameter instead of 'keep_index'.
Fix by choosing the parameter based on the cudf version.