Commit

Fix line length
fiver-watson committed Apr 1, 2024
1 parent 85ab45c commit c3f2880
Showing 1 changed file with 6 additions and 1 deletion.
7 changes: 6 additions & 1 deletion user-manual/import-export/upload-digital-object.rst
@@ -633,7 +633,12 @@ that include a text layer (e.g., exported Word documents) will work. Search
results will refer users to the PDF that contains the search term(s), but will
not reveal the location of the term(s) within the PDF.

-Currently, AtoM 2.x truncates indexed PDF text after approximately 16,777,215 characters - which might roughly translate to between 1.5-2.8 million words, depending on the language and words used. This means that any additional text after this limit is reached would not be added to the search index (and therefore would not return any results during searches) - it does **not** mean that the PDF itself will be truncated or missing pages, etc.
+Currently, AtoM 2.x truncates indexed PDF text after approximately 16,777,215
+characters - which might roughly translate to between 1.5-2.8 million words,
+depending on the language and words used. This means that any additional text
+after this limit is reached would not be added to the search index (and
+therefore would not return any results during searches) - it does **not** mean
+that the PDF itself will be truncated or missing pages, etc.

As mentioned above, it is possible to upload multi-page TIFFs or PDF files to
be displayed with a page viewer and to upload each page as a child object of
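
For readers checking the figures in the rewrapped paragraph, the quoted word range can be roughly reproduced with simple arithmetic. The sketch below is illustrative only and is not part of AtoM: `estimate_words` is a hypothetical helper, and the average word lengths (6 and 11 characters, counting the trailing space) are assumptions chosen to bracket the 1.5-2.8 million word range in the text.

```python
# Character limit quoted in the documentation paragraph (equal to 2**24 - 1).
INDEX_CHAR_LIMIT = 16_777_215


def estimate_words(char_limit: int, avg_chars_per_word: float) -> int:
    """Rough count of words that fit within char_limit characters.

    avg_chars_per_word is an assumed average that includes the separating
    space; real values vary with the language and vocabulary of the text.
    """
    return int(char_limit / avg_chars_per_word)


if __name__ == "__main__":
    # Assumed averages, picked only to illustrate the quoted range.
    for avg in (6.0, 11.0):
        words = estimate_words(INDEX_CHAR_LIMIT, avg)
        print(f"{avg} chars/word -> about {words:,} words")
```

Under these assumptions the estimate falls between about 1.5 and 2.8 million words, which is consistent with the range stated in the paragraph.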
