You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When ingesting a parquet file using ingest_catalog.py, this line casts all columns' dtypes to float. Thus the downstream checks for different dtypes will not work as intended.
Perhaps using itertuples instead would help by preserving the dtypes (at the expense of some re-working of the loop)? Note this could introduce a new issue where columns beginning with an underscore are renamed (e.g. _id, which gets mapped to _1). This might be avoided by setting name=None within itertuples.
The text was updated successfully, but these errors were encountered:
When ingesting a parquet file using
ingest_catalog.py
, this line casts all columns' dtypes tofloat
. Thus the downstream checks for different dtypes will not work as intended.Perhaps using
itertuples
instead would help by preserving the dtypes (at the expense of some re-working of the loop)? Note this could introduce a new issue where columns beginning with an underscore are renamed (e.g._id
, which gets mapped to_1
). This might be avoided by settingname=None
withinitertuples
.The text was updated successfully, but these errors were encountered: