Polars scan_arrow works if you convert lance into pyarrow Table first.
but direct pl.scan_pyarrow_dataset does not work.
File ~/.venv/lance/lib/python3.10/site-packages/polars/io/pyarrow_dataset/anonymous_scan.py:36, in _scan_pyarrow_dataset(ds, allow_pyarrow_filter)
19 """
20 Pickle the partially applied function `_scan_pyarrow_dataset_impl`.
21
(...)
33
34 """
35 func = partial(_scan_pyarrow_dataset_impl, ds)
---> 36 func_serialized = pickle.dumps(func)
37 return pli.LazyFrame._scan_python_function(
38 ds.schema, func_serialized, allow_pyarrow_filter
39 )
File stringsource:2, in pyarrow._dataset.Dataset.__reduce_cython__()
TypeError: self.dataset,self.wrapped cannot be converted to a Python object for pickling
Polars scan_arrow works if you convert lance into pyarrow Table first.
but direct pl.scan_pyarrow_dataset does not work.
Symptom:
Cause:
Proposed fix:
Uncertainty: