Fix Distiset after PushToHub and smaller fixes #452

Merged
alvarobartt merged 10 commits into core-refactor from fix-distiset-after-push-to-hub on Mar 20, 2024

Conversation

@alvarobartt (Member) commented Mar 20, 2024

Description

This PR fixes an issue with pipelines that end with a GlobalStep such as PushToHub. Such a step does not propagate its inputs, so the cache for the last step contains an empty file, and the _create_dataset method then fails because there is no data to load.
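
To illustrate the failure mode, here is a minimal sketch of one way to guard against an empty cache file when collecting step outputs. It is not the implementation in this PR: the JSONL cache layout and the names `load_step_rows` and `collect_step_outputs` are hypothetical and only stand in for whatever `_create_dataset` does internally.

```python
import json
import tempfile
from pathlib import Path


def load_step_rows(path):
    """Return the rows stored in a step's cache file, or None if the file is empty."""
    # A final step that does not propagate its inputs leaves a zero-byte file behind.
    if path.stat().st_size == 0:
        return None
    with path.open() as f:
        return [json.loads(line) for line in f if line.strip()]


def collect_step_outputs(cache_dir):
    """Map step name -> rows, skipping steps whose cache file holds no data."""
    outputs = {}
    for path in sorted(Path(cache_dir).glob("*.jsonl")):
        rows = load_step_rows(path)
        if rows is None:
            # Nothing to build a dataset from, so skip instead of failing.
            continue
        outputs[path.stem] = rows
    return outputs


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        # Hypothetical cache: one step with data, one final step with an empty file.
        Path(tmp, "generate.jsonl").write_text('{"text": "hello"}\n')
        Path(tmp, "push_to_hub.jsonl").write_text("")
        print(collect_step_outputs(tmp))
```

Skipping the empty file is only one option; having the final step propagate its inputs, so the file is never empty, would avoid the problem at its source.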

This PR also fixes the _create_dataset docstring and the return type hints of Pipeline.run and _get_filename. In addition, the _TYPE_MAP mapping to Parquet types now includes type(None): pa.null() (there may be a better way to handle this case, but I'm not sure what it would be).

@alvarobartt added this to the 1.0.0 milestone Mar 20, 2024
@alvarobartt self-assigned this Mar 20, 2024
@alvarobartt merged commit 59aada4 into core-refactor Mar 20, 2024
@alvarobartt deleted the fix-distiset-after-push-to-hub branch March 20, 2024 13:03