From 37cdf390c160756fa3bca3a1e401630e023300da Mon Sep 17 00:00:00 2001 From: Paul-Cornell Date: Wed, 12 Mar 2025 10:17:59 -0700 Subject: [PATCH] Indicate true as default for split_pdf_page in speed-up-large-files-batches.mdx --- api-reference/partition/speed-up-large-files-batches.mdx | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/api-reference/partition/speed-up-large-files-batches.mdx b/api-reference/partition/speed-up-large-files-batches.mdx index 5d7aac89..fd04a736 100644 --- a/api-reference/partition/speed-up-large-files-batches.mdx +++ b/api-reference/partition/speed-up-large-files-batches.mdx @@ -8,7 +8,7 @@ Choose your partitioning strategy wisely. For example, if you have simple PDFs t To speed up PDF file processing, the [Unstructured SDK for Python](/api-reference/partition/sdk-python) and the [Unstructured SDK for JavaScript/TypeScript](/api-reference/partition/sdk-jsts) provide the following parameters to help speed up processing a large PDF file: -- `split_pdf_page` (Python) or `splitPdfPage` (JavaScript/TypeScript), when set to true, splits the PDF file on the client side before sending it as batches to Unstructured for processing. The number of pages in each batch is determined internally. Batches can contain between 2 and 20 pages. +- `split_pdf_page` (Python) or `splitPdfPage` (JavaScript/TypeScript), when set to true (the default), splits the PDF file on the client side before sending it as batches to Unstructured for processing. The number of pages in each batch is determined internally. Batches can contain between 2 and 20 pages. - `split_pdf_concurrency_level` (Python) or `splitPdfConcurrencyLevel` (JavaScript/TypeScript) is an integer that specifies the number of parallel requests. The default is 5. The maximum is 15. This behavior is ignored unless `split_pdf_page` (Python) or `splitPdfPage` (JavaScript/TypeScript) is also set to true. - `split_pdf_allow_failed` (Python) or splitPdfAllowFailed` (JavaScript/TypeScript), when set to true, allows partitioning to continue even if some pages fail. - `split_pdf_page_range` (Python only) is a list of two integers that specify the beginning and ending page numbers of the PDF file to be sent. A `ValueError` is raised if the specified range is not valid. This behavior is ignored unless `split_pdf_page` is also set to true.