Closed
Description
Apache Iceberg version
1.1.0 (latest release)
Query engine
Other
Please describe the bug 🐞
I am trying to read Iceberg data from Azure Data Lake Storage Gen2.
PyIceberg can get the metadata from the Hive Metastore, but when load_table is executed and the table is scanned, the following error occurs:
File c:\Python\Python39\lib\site-packages\pyiceberg\table\__init__.py:354, in DataScan.to_pandas(self, **kwargs)
353 def to_pandas(self, **kwargs: Any) -> pd.DataFrame:
--> 354 return self.to_arrow().to_pandas(**kwargs)
File c:\Python\Python39\lib\site-packages\pyiceberg\table\__init__.py:350, in DataScan.to_arrow(self)
348 def to_arrow(self) -> pa.Table:
349 return project_table(
--> 350 self.plan_files(), self.table, self.row_filter, self.projection(), case_sensitive=self.case_sensitive
351 )
File c:\Python\Python39\lib\site-packages\pyiceberg\table\__init__.py:331, in DataScan.plan_files(self)
324 # step 1: filter manifests using partition summaries
325 # the filter depends on the partition spec used to write the manifest file, so create a cache of filters for each spec id
327 manifest_evaluators: Dict[int, Callable[[ManifestFile], bool]] = KeyDefaultDict(self._build_manifest_evaluator)
329 manifests = [
330 manifest_file
--> 331 for manifest_file in snapshot.manifests(io)
...
275 return LocalFileSystem()
276 else:
--> 277 raise ValueError(f"Unrecognized filesystem type in URI: {scheme}")
ValueError: Unrecognized filesystem type in URI: abfss
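The last frames of the traceback suggest that the file IO picks a filesystem by inspecting the URI scheme and raises when the scheme is not one it knows. Below is a minimal sketch of that kind of dispatch, written only to illustrate why `abfss` falls through to the `ValueError`. It is not PyIceberg's actual code: the function name `filesystem_kind` and the list of handled schemes are assumptions made for this example.

```python
from urllib.parse import urlparse


def filesystem_kind(location: str) -> str:
    """Return a label for the filesystem that would serve `location`.

    Hypothetical helper for illustration; the handled schemes are assumed.
    """
    scheme = urlparse(location).scheme
    if scheme in {"s3", "s3a", "s3n"}:
        return "s3"
    if scheme in {"file", ""}:
        # No scheme at all is treated as a local path.
        return "local"
    # Any scheme without a branch above, such as abfss, ends up here.
    raise ValueError(f"Unrecognized filesystem type in URI: {scheme}")


if __name__ == "__main__":
    # Hypothetical ADLS Gen2 location, used only to trigger the error path.
    uri = "abfss://container@account.dfs.core.windows.net/warehouse/tbl/metadata/m0.avro"
    try:
        filesystem_kind(uri)
    except ValueError as exc:
        print(exc)
```

Under these assumptions, metadata lookup through the Hive Metastore can succeed while reading the manifests fails, because only the second step has to open a file at an `abfss://` location.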