Skip to content

bug: KeyError: JsonType(extension<arrow.json>) #11370

Open
@timoha

Description

@timoha

What happened?

When reading a DeltaTable with DataFusion backend, getting the following exception:

  File "/Users/andrey/src/tessera/.venv/lib/python3.13/site-packages/ibis/backends/datafusion/__init__.py", line 408, in get_schema
    return sch.schema(table.schema)
           ~~~~~~~~~~^^^^^^^^^^^^^^
  File "/Users/andrey/src/tessera/.venv/lib/python3.13/site-packages/ibis/common/dispatch.py", line 115, in call
    return impl(arg, *args, **kwargs)
  File "/Users/andrey/src/tessera/.venv/lib/python3.13/site-packages/ibis/expr/schema.py", line 357, in from_pyarrow_schema
    return PyArrowSchema.to_ibis(schema)
           ~~~~~~~~~~~~~~~~~~~~~^^^^^^^^
  File "/Users/andrey/src/tessera/.venv/lib/python3.13/site-packages/ibis/formats/pyarrow.py", line 265, in to_ibis
    fields = [(f.name, PyArrowType.to_ibis(f.type, f.nullable)) for f in schema]
                       ~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^
  File "/Users/andrey/src/tessera/.venv/lib/python3.13/site-packages/ibis/formats/pyarrow.py", line 160, in to_ibis
    return _from_pyarrow_types[typ](nullable=nullable)
           ~~~~~~~~~~~~~~~~~~~^^^^^
KeyError: JsonType(extension<arrow.json>)

The schema of the delta table is as follows, specifically pa.json_() type for column response is used:

_RESPONSE_SCHEMA = pa.schema({
    "archive": pa.string(),
    "document": pa.string(),
    "page": pa.int64(),
    "as_of": pa.timestamp("s"),
    "response": pa.json_(),
    "response_id": pa.string(),
    "run_id": pa.string(),
})

Upon inspection of ibis code looks like it doesn't support arrow.json and expects a string from JSON type.

What version of ibis are you using?

ibis-framework[datafusion] = 10.6.0

What backend(s) are you using, if any?

datafusion = 47.0.0

Relevant log output

Code of Conduct

  • I agree to follow this project's Code of Conduct

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugIncorrect behavior inside of ibis

    Type

    No type

    Projects

    Status

    backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions