You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[Enter feedback here]
When saving parquet files as mltables, with one of the following ways, the file data is not viewable
# Assume we are going to save a pandas dataframe once split into train/test sets# 1 doesn't workdf.to_parquet(Path(output_dir, "data.parquet"))
# nor 2df.to_parquet(Path(output_dir, "data.parquet"), engine='pyarrow')
# nor 3pyarrow_table=pa.Table.from_pandas(df)
pq.write_table(pyarrow_table, Path(output_dir, "data.parquet"))
# and finally save the MLTable filetable=mltable.from_parquet_files(paths=[{"pattern": "./*parquet"}])
table.save(Path(output_dir), overwrite=True)
Error:
However, if I explicitly use the Parquet version:
# Note the `version` flag within the `pq.write_table` functionpyarrow_table=pa.Table.from_pandas(df)
pq.write_table(pyarrow_table, Path(output_dir, "data.parquet"), version="1.0")
# and finally save the MLTable filetable=mltable.from_parquet_files(paths=[{"pattern": "./*parquet"}])
table.save(Path(output_dir), overwrite=True)
We can view the data:
Document Details
⚠ Do not edit this section. It is required for learn.microsoft.com ➟ GitHub issue linking.
ID: 916be6d5-bb68-5606-d124-66d56878c9c0
Version Independent ID: c03b4af4-1a7e-00bd-46c3-658c86b8d95a
[Enter feedback here]
When saving parquet files as mltables, with one of the following ways, the file data is not viewable
Error:
However, if I explicitly use the Parquet version:
We can view the data:
Document Details
⚠ Do not edit this section. It is required for learn.microsoft.com ➟ GitHub issue linking.
The text was updated successfully, but these errors were encountered: