Use ordinal encoding for dynamic categorical features in GluonTSAdapter #31

shchur · 2025-08-12T08:14:49Z

Issue #, if available:

Description of changes:

Previously, categorical dynamic features were kept as object dtype, which broke GluonTS models that accept feat_dynamic_real / past_feat_dynamic_real and attempt to convert them to float32 inside the transform. Now categorical features are encoded as integers (using ordinal encoding).

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

shchur · 2025-08-12T08:54:07Z

src/fev/adapters.py

+        df = df.astype(astype_dict)
+        if category_as_ordinal:
+            cat_cols = [col for col in df.select_dtypes(include="category").columns if col != id_column]
+            df = df.assign(**{col: df[col].cat.codes for col in cat_cols})


Possible alternatives:

automatically one-hot-encode categorical columns

drop categorical columns

Ideally, this should be a configurable option, but currently the fev.convert_input_data method does not allow routing kwargs to the individual adapters. @abdulfatir what do you think?

just discussed target encoding as a good option w/ @abdulfatir . why not also use it here?

My initial idea was that adapters perform the bare minimum preprocessing such that the data can be consumed by the respective frameworks, but I agree that we can also incorporate the best practices here.

If we go for target encoding, we should probably enable/disable it via an optional argument to the GluonTSAdapter. Currently these are not supported since fev.convert_input_data does not forward kwargs to the adapters.

How about we

Merge this (or some other simple strategy) as a simple default that unbreaks GluonTS models with covaraites

Add a better strategy after the Task refactor with an optional argument to the GluonTSAdapter?

I would vote for putting as little model-related stuff here as possible. If the user wants to do other types of encodings, they should do this on the model side.

Use ordinal encoding for dynamic categorical features in GluonTSAdapter

c92b4ad

shchur requested a review from abdulfatir August 12, 2025 08:15

shchur commented Aug 12, 2025

View reviewed changes

shchur merged commit 9a835d0 into main Aug 19, 2025

shchur deleted the ordinal-encode-cat-features-gluonts branch August 19, 2025 06:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use ordinal encoding for dynamic categorical features in GluonTSAdapter #31

Use ordinal encoding for dynamic categorical features in GluonTSAdapter #31

Uh oh!

shchur commented Aug 12, 2025 •

edited

Loading

Uh oh!

shchur Aug 12, 2025

Uh oh!

canerturkmen Aug 18, 2025

Uh oh!

shchur Aug 18, 2025

Uh oh!

abdulfatir Aug 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Use ordinal encoding for dynamic categorical features in GluonTSAdapter #31

Use ordinal encoding for dynamic categorical features in GluonTSAdapter #31

Uh oh!

Conversation

shchur commented Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shchur Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

canerturkmen Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

shchur Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

abdulfatir Aug 18, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shchur commented Aug 12, 2025 •

edited

Loading