Empty max pages for confluence extractor fails #85

@a-klos

Description

If the Confluence extractor is given an empty string for max pages, it crashes with the following traceback:

Traceback (most recent call last):
  File "<string>", line 1, in <module>
    import sys; sys.path.insert(0, r'/app/services/document-extractor/.venv/lib/python3.13/site-packages/debugpy/_vendored/pydevd'); import pydevd; pydevd.config('http_json', 'debugpy-dap'); pydevd.settrace(host='127.0.0.1', port=59351, suspend=False, trace_only_current_thread=False, patch_multiprocessing=True, access_token='77bb69af34bdc57f971ea308acf51adda1dd9c5e7069e0783c0f1247d5bd0286', client_access_token=None, setup_holder={'access-token': '77bb69af34bdc57f971ea308acf51adda1dd9c5e7069e0783c0f1247d5bd0286', 'client': '127.0.0.1', 'debug-mode': 'debugpy-dap', 'json-dap-http': True, 'multiprocess': True, 'port': 59351, 'ppid': 1, 'preimport': '/app/services/document-extractor/.venv/lib/python3.13/site-packages;debugpy._vendored.force_pydevd', 'server': False, 'skip-notify-stdin': True}); from multiprocessing.spawn import spawn_main; spawn_main(tracker_fd=12, pipe_handle=14)
    ^^^^^^^^^^^^^^^^^^^^^^
  File "/app/services/document-extractor/.venv/lib/python3.13/site-packages/langchain_community/document_loaders/confluence.py", line 455, in load
    return list(self._lazy_load(**kwargs))
  File "/app/services/document-extractor/.venv/lib/python3.13/site-packages/langchain_community/document_loaders/confluence.py", line 369, in _lazy_load
    pages = self.paginate_request(
        self.confluence.get_all_pages_from_space,
        ...<4 lines>...
        expand=expand,
    )
  File "/app/services/document-extractor/.venv/lib/python3.13/site-packages/langchain_community/document_loaders/confluence.py", line 505, in paginate_request
    while len(docs) < max_pages:
          ^^^^^^^^^^^^^^^^^^^^^
TypeError: '<' not supported between instances of 'int' and 'str'
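
The last frame shows max_pages reaching the comparison in paginate_request as a str rather than an int. One possible fix on the extractor side is to normalize the value before it is handed to the loader. The sketch below is only an illustration: parse_max_pages is a hypothetical helper name, and treating an empty string as "no limit configured" (falling back to a default) is an assumption, not something the issue specifies.

```python
from typing import Optional


def parse_max_pages(raw: object, default: Optional[int] = None) -> Optional[int]:
    """Hypothetical helper: coerce a user-supplied max-pages value to an int.

    None and empty or whitespace-only strings fall back to `default`
    (assumed behavior). Anything else must parse as a positive integer.
    """
    if raw is None:
        return default
    if isinstance(raw, bool):
        # bool is a subclass of int; reject it explicitly to avoid surprises.
        raise ValueError(f"max pages must be a positive integer, got {raw!r}")
    if isinstance(raw, int):
        value = raw
    else:
        text = str(raw).strip()
        if not text:
            return default
        value = int(text)  # raises ValueError for non-numeric input
    if value <= 0:
        raise ValueError(f"max pages must be a positive integer, got {value}")
    return value


if __name__ == "__main__":
    for candidate in ("", "  ", "25", 10, None):
        print(repr(candidate), "->", parse_max_pages(candidate))
```

With a helper like this, the extractor could forward max_pages to the loader only when the result is not None, so the loader's own default applies otherwise. How the extractor currently passes the value along is not shown in the traceback, so that wiring is left open here.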

Labels: bug (Something isn't working)
