[Feature Store] Fix remote ingest parquet filters #5631

Merged

Conversation

tomerm-iguazio
Contributor

@tomerm-iguazio tomerm-iguazio commented May 26, 2024

JSON serialization alters the additional filters by converting tuples into lists.
This causes remote runs to fail.

  1. Fix the issue by converting the lists back to tuples.
  2. Add system and unit tests.
  3. Apply the fix to both ParquetTarget and ParquetSource.
  4. Follow-up idea: to test this properly, local ingestion should go through the same JSON encode/decode round trip.
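
To illustrate the problem described above, here is a minimal sketch of the JSON round trip and one way to undo it. The helper name `restore_filter_tuples` and the sample filters are hypothetical and are not taken from the mlrun code base; the sketch assumes filters follow the common parquet convention of a list of `(column, op, value)` tuples, or a list of such lists.

```python
import json


def restore_filter_tuples(filters):
    """Hypothetical helper: turn JSON-decoded filter lists back into tuples.

    Accepts either a flat list of filters or a list of filter groups
    (list of lists). Filter values such as the list used with "in" are
    left untouched.
    """
    if filters is None:
        return None
    restored = []
    for item in filters:
        if item and isinstance(item[0], (list, tuple)):
            # Nested form: each item is a group of filters.
            restored.append([tuple(f) for f in item])
        else:
            # Flat form: each item is a single (column, op, value) filter.
            restored.append(tuple(item))
    return restored


# Filters as a user would write them locally.
original = [("age", ">", 30), ("city", "in", ["NY", "LA"])]

# A remote run serializes the spec to JSON and decodes it on the other side;
# JSON has no tuple type, so every tuple comes back as a list.
round_tripped = json.loads(json.dumps(original))

# Converting back recovers the original structure.
restored = restore_filter_tuples(round_tripped)
```

Running the same encode/decode step in a local test, as item 4 suggests, would expose this mismatch without needing a remote run.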

ML-6563

@assaf758 assaf758 requested a review from gtopper May 28, 2024 11:38
Resolved review threads (outdated):
tests/datastore/test_targets.py (2 threads)
mlrun/datastore/targets.py
mlrun/datastore/utils.py (3 threads)
tests/datastore/test_sources.py
tests/datastore/test_base.py
tests/system/feature_store/test_feature_store.py
mlrun/datastore/sources.py
@gtopper gtopper changed the title [FeatureStore] Fix remote ingest parquet filters [Feature Store] Fix remote ingest parquet filters May 29, 2024
Resolved review threads (outdated):
mlrun/datastore/utils.py (6 threads)
mlrun/datastore/sources.py
@assaf758 assaf758 merged commit e452539 into mlrun:development May 30, 2024
20 checks passed
@tomerm-iguazio tomerm-iguazio deleted the fix_remote_ingest_parquet_filters branch June 3, 2024 12:52

3 participants