feat: DuckDB historical retrieval without entity dataframe #6108
Vperiodt wants to merge 9 commits into feast-dev:master
Conversation
Signed-off-by: Vanshika Vanshika <vvanshik@redhat.com> rh-pre-commit.version: 2.3.2 rh-pre-commit.check-secrets: ENABLED
```python
    full_feature_names: bool = False,
    **kwargs,
) -> RetrievalJob:
    start_date: Optional[datetime] = kwargs.get("start_date", None)
```
Instead, keep these as explicit optional parameters in `get_historical_features`.
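A minimal sketch of the reviewer's suggestion, with the dates promoted from `**kwargs` to explicit optional parameters (the body is a placeholder; only the signature shape is the point):

```python
from datetime import datetime
from typing import Optional


def get_historical_features(
    full_feature_names: bool = False,
    start_date: Optional[datetime] = None,  # explicit, instead of kwargs.get("start_date")
    end_date: Optional[datetime] = None,
) -> dict:
    # Placeholder body: the real method would build a RetrievalJob from these bounds.
    return {"start_date": start_date, "end_date": end_date}
```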
```python
DEFAULT_ENTITY_DF_EVENT_TIMESTAMP_COL = "event_timestamp"


def _build_entity_df_from_sources(
```
Import this instead: `from feast.infra.offline_stores.offline_utils import DEFAULT_ENTITY_DF_EVENT_TIMESTAMP_COL`.
@Vperiodt There is an integration test, `tests/integration/offline_store/test_non_entity_mode.py`; see if DuckDB coverage can be included there.
```python
    start_date = end_date - timedelta(seconds=max_ttl_seconds)
else:
    start_date = end_date - timedelta(days=30)
start_date = make_tzaware(start_date)
```
Lines 229-244 are common across the offline stores for computing `start_date` and `end_date`. If it's not too much work, is it possible to create a utility function that can be reused?
Yes, I'll refactor this into a reusable utility function and apply it to similar date-range calculations across the codebase in a follow-up PR.
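One possible shape for that shared utility (the name `resolve_date_range` and its defaults are hypothetical; `make_tzaware` is inlined here as a simplified stand-in for Feast's helper):

```python
from datetime import datetime, timedelta, timezone
from typing import Optional, Tuple


def make_tzaware(dt: datetime) -> datetime:
    # Simplified stand-in for feast.utils.make_tzaware.
    return dt if dt.tzinfo else dt.replace(tzinfo=timezone.utc)


def resolve_date_range(
    end_date: Optional[datetime] = None,
    max_ttl_seconds: Optional[int] = None,
    default_lookback_days: int = 30,
) -> Tuple[datetime, datetime]:
    # Consolidates the shared start/end date logic: fall back to "now" for the
    # end, then look back by the max feature-view TTL, or 30 days if no TTL.
    end_date = make_tzaware(end_date or datetime.now(timezone.utc))
    if max_ttl_seconds:
        start_date = end_date - timedelta(seconds=max_ttl_seconds)
    else:
        start_date = end_date - timedelta(days=default_lookback_days)
    return make_tzaware(start_date), end_date
```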
```python
"duckdb": (
    "local",
    importlib.import_module(
        "tests.universal.feature_repos.duckdb_repo_configuration"
    ).DuckDBDataSourceCreator,
),
```
🟡 Unconditional eager import of DuckDB config breaks test loading when ibis is not installed
The `duckdb` entry in `OFFLINE_STORE_TO_PROVIDER_CONFIG` uses `importlib.import_module("tests.universal.feature_repos.duckdb_repo_configuration")` at module level. This transitively imports `feast.infra.offline_stores.duckdb` (via `duckdb_repo_configuration.py:1`), which has `import ibis` at the top (`sdk/python/feast/infra/offline_stores/duckdb.py:6`). If ibis is not installed, loading `repo_configuration.py` will fail with `ModuleNotFoundError`, breaking all tests that import from this module, not just the DuckDB-specific ones. This breaks the established pattern where non-core offline stores (BigQuery, Redshift, Snowflake at `sdk/python/tests/universal/feature_repos/repo_configuration.py:116-137`) are imported conditionally behind environment variable checks.
Prompt for agents
In sdk/python/tests/universal/feature_repos/repo_configuration.py, the duckdb entry in OFFLINE_STORE_TO_PROVIDER_CONFIG (lines 94-99) should be moved out of the unconditional dict literal and added conditionally, similar to how BigQuery/Redshift/Snowflake are added at lines 135-137. One approach is to wrap it in a try/except ImportError block or check for ibis availability:
```python
try:
    from tests.universal.feature_repos.duckdb_repo_configuration import (
        DuckDBDataSourceCreator,
    )

    OFFLINE_STORE_TO_PROVIDER_CONFIG["duckdb"] = ("local", DuckDBDataSourceCreator)
except ImportError:
    pass
```
This should be placed after the initial OFFLINE_STORE_TO_PROVIDER_CONFIG dict definition (after line 94 in the original, which only has the "file" entry).
What this PR does / why we need it:
Adds date-range historical retrieval for the DuckDB offline store when `entity_df` is omitted.
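A sketch of how the new retrieval mode might be invoked. `FakeStore` below is a stub standing in for `feast.FeatureStore`, and the feature reference is a placeholder; only the call shape (no `entity_df`, dates passed instead) comes from this PR:

```python
from datetime import datetime, timezone


class FakeStore:
    # Stub standing in for feast.FeatureStore, to illustrate the call shape only.
    def get_historical_features(self, entity_df=None, features=(), **kwargs):
        assert entity_df is None  # date-range mode: no entity dataframe supplied
        return {"features": list(features), **kwargs}


store = FakeStore()
job = store.get_historical_features(
    entity_df=None,
    features=["driver_stats:conv_rate"],  # placeholder feature reference
    start_date=datetime(2024, 1, 1, tzinfo=timezone.utc),
    end_date=datetime(2024, 1, 31, tzinfo=timezone.utc),
)
```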
Which issue(s) this PR fixes:
Fixes #5832. Related to #1611.
Misc