
PDF file indexing not working #2

@PatJack01

Hi,

During indexing, I get the following message for each PDF file:

WARNING - [backend.semantic_search:run_indexing_task:307] - Could not load file "filename" with UnstructuredLoader.
Traceback (most recent call last):
File "backend\semantic_search.py", line 300, in run_indexing_task
File "langchain_core\document_loaders\base.py", line 43, in load
File "langchain_unstructured\document_loaders.py", line 178, in lazy_load
File "langchain_unstructured\document_loaders.py", line 212, in lazy_load
File "langchain_unstructured\document_loaders.py", line 231, in _elements_json
File "langchain_unstructured\document_loaders.py", line 249, in _elements_via_local
File "unstructured\partition\auto.py", line 211, in partition
File "unstructured\partition\auto.py", line 364, in get
File "unstructured\partition\auto.py", line 382, in load_partitioner
File "importlib\__init__.py", line 90, in import_module
File "<frozen importlib._bootstrap>", line 1387, in _gcd_import
File "<frozen importlib._bootstrap>", line 1360, in _find_and_load
File "<frozen importlib._bootstrap>", line 1331, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 935, in _load_unlocked
File "pyimod02_importers.py", line 457, in exec_module
File "unstructured\partition\pdf.py", line 19, in <module>
File "pyimod02_importers.py", line 457, in exec_module
File "unstructured_inference\inference\layout.py", line 18, in <module>
File "pyimod02_importers.py", line 457, in exec_module
File "unstructured_inference\models\base.py", line 8, in <module>
File "pyimod02_importers.py", line 457, in exec_module
File "unstructured_inference\models\detectron2onnx.py", line 9, in <module>
File "pyimod02_importers.py", line 457, in exec_module
File "onnxruntime\quantization\__init__.py", line 16, in <module>
File "pyimod02_importers.py", line 457, in exec_module
File "onnxruntime\quantization\shape_inference.py", line 18, in <module>
File "pyimod02_importers.py", line 457, in exec_module
File "onnxruntime\transformers\onnx_utils.py", line 5, in <module>
ModuleNotFoundError: No module named 'fusion_utils'

I'm using the out-of-the-box settings.
Thanks for your help.
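
In case it helps narrow this down: the `pyimod02_importers.py` frames in the traceback come from PyInstaller's import machinery, so this looks like a packaged build in which the top-level module `fusion_utils` cannot be resolved. Below is a minimal diagnostic sketch, not a fix. The helper name `missing_modules` is made up for illustration, and the list of names to check is an assumption taken from the traceback; it would need to run inside the same environment as the application to be meaningful.

```python
import importlib.util


def missing_modules(names):
    """Return the module names from `names` that cannot be resolved.

    Hypothetical helper for illustration only; it is not part of the
    application, unstructured, or onnxruntime.
    """
    missing = []
    for name in names:
        try:
            spec = importlib.util.find_spec(name)
        except (ImportError, ValueError):
            # Raised when a parent package of a dotted name is missing
            # or a module has no usable spec.
            spec = None
        if spec is None:
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Names taken from the traceback above (assumed relevant).
    candidates = ["onnxruntime", "fusion_utils"]
    print("Unresolvable modules:", missing_modules(candidates))
```

If `fusion_utils` shows up as unresolvable while `onnxruntime` does not, that would be consistent with the module not being collected into the bundle, though other causes are possible.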

Labels: bug (Something isn't working)