Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TIKA-4012 -- improve extraction of embedded docs in PDFs by looking beyond names tree and annotations #1079

Merged
merged 7 commits into from Apr 13, 2023

Commits on Apr 11, 2023

  1. Configuration menu
    Copy the full SHA
    884b1c9 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    47f87ea View commit details
    Browse the repository at this point in the history

Commits on Apr 12, 2023

  1. Configuration menu
    Copy the full SHA
    51fa67e View commit details
    Browse the repository at this point in the history

Commits on Apr 13, 2023

  1. TIKA-4012 -- switch to a list instead of a name-based map for attachm…

    …ents in case there are name collisions in child nodes and we lose attachments.
    tballison committed Apr 13, 2023
    Configuration menu
    Copy the full SHA
    71ccb58 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    c1c3bc7 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    49cc0d4 View commit details
    Browse the repository at this point in the history
  4. Merge remote-tracking branch 'origin/main' into TIKA-4012

    # Conflicts:
    #	CHANGES.txt
    tballison committed Apr 13, 2023
    Configuration menu
    Copy the full SHA
    5deb4b3 View commit details
    Browse the repository at this point in the history