fix(load): strip deprecated use_auth_token from config_kwargs #7654
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fixes #7504
This PR resolves a compatibility error when loading datasets via
load_dataset()
using outdated arguments likeuse_auth_token
.What was happening:
Users passing
use_auth_token
inload_dataset(..., use_auth_token=...)
encountered aValueError
: BuilderConfig ParquetConfig(...) doesn't have a 'use_auth_token' key.Why:
use_auth_token
has been deprecated and removed from config definitions (replaced bytoken
), but theload_dataset()
function still forwarded it via**config_kwargs
to BuilderConfigs, leading to unrecognized key errors.Fix:
We now intercept and strip
use_auth_token
fromconfig_kwargs
insideload_dataset
, replacing it with a warning:This ensures legacy compatibility while guiding users to switch to the token argument.
Let me know if you'd prefer a deprecation error instead of a warning. Thanks!