@julian-smith-artifex-com
Collaborator

No description provided.

@julian-smith-artifex-com changed the title from "Add concurrent get_text(), and some other minor commits." to "Add concurrent apply_pages() and get_text(), and some other minor commits." on May 22, 2024
setup.py
    Add new _apply_pages.py to wheels/installs.
src/__init__.py
    New top-level apply_pages(). Also a convenience function, get_text(), which
    uses apply_pages().
    Unlike Python's `multiprocessing` module, we also support passing keyword
    args to functions in the worker processes.
src/_apply_pages.py
    New, contains implementation of apply_pages().
tests/test_pylint.py
    Avoid pylint failure by disabling `R0801: Similar lines in 2 files`.
tests/test_textextract.py
    Test get_text() and show timings.
src/fitz___init__.py
tests/conftest.py
    Use functions to manipulate _g_log_items so that things work
    even when using `fitz` alias.

Speedups on an 8-core macOS arm64 machine, using the PDF specification as input:
    method='multiprocessing': 3.3x.
    method='fork': 3.6x.
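
The keyword-argument point in the src/__init__.py notes can be illustrated with a minimal sketch. This is not the code in src/_apply_pages.py; `apply_items()`, `count_words()`, `fn_kwargs` and the `"single"`/`"mp"` method names are hypothetical, chosen only for illustration. It assumes the per-item function is defined at module level so it can be pickled, and it binds the keyword arguments with `functools.partial` before handing the callable to `multiprocessing.Pool.map()`, which on its own passes only one positional argument per item.

```python
import functools
import multiprocessing


def apply_items(fn, items, *, fn_kwargs=None, method="single", concurrency=None):
    """Apply fn(item, **fn_kwargs) to each item and return results in order.

    Hypothetical helper for illustration only; not PyMuPDF's apply_pages().
    """
    # Bind the keyword args once. A functools.partial is picklable as long
    # as fn itself is a module-level function.
    bound = functools.partial(fn, **(fn_kwargs or {}))
    items = list(items)
    if method == "single":
        # Run everything in the current process (useful as a baseline).
        return [bound(item) for item in items]
    if method == "mp":
        # Pool.map() preserves input order; concurrency=None lets the
        # pool pick its default number of worker processes.
        with multiprocessing.Pool(concurrency) as pool:
            return pool.map(bound, items)
    raise ValueError(f"unknown method: {method!r}")


def count_words(text, *, min_length=1):
    """Stand-in for per-page work: count words with at least min_length chars."""
    return sum(1 for word in text.split() if len(word) >= min_length)
```

Calling `apply_items(count_words, texts, fn_kwargs={"min_length": 2}, method="mp")` from a script guarded by `if __name__ == "__main__":` would distribute the work across worker processes, while `method="single"` gives a baseline for comparing timings.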
@julian-smith-artifex-com deleted the jules branch on May 22, 2024 22:25
@github-actions (bot) locked and limited conversation to collaborators on May 22, 2024