From cdecab33129b39d302e7dcca885b38132d01fa12 Mon Sep 17 00:00:00 2001
From: Paul Cornell
Date: Tue, 2 Sep 2025 12:51:21 -0700
Subject: [PATCH 1/2] Update example notebook URLs to point to GitHub

---
 examplecode/notebooks.mdx | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/examplecode/notebooks.mdx b/examplecode/notebooks.mdx
index 648b4750..45060a31 100644
--- a/examplecode/notebooks.mdx
+++ b/examplecode/notebooks.mdx
@@ -13,7 +13,7 @@ description: "Notebooks contain complete working sample code for end-to-end solu
``Unstructured API`` ``Partition Endpoint`` ``Local file``
- +
This notebook explores using Unstructured API to process financial documents while preserving tabular structure in a way that's usable by downstream applications.
@@ -26,14 +26,14 @@ description: "Notebooks contain complete working sample code for end-to-end solu
``Unstructured API`` ``Workflows`` ``S3`` ``VLM`` ``NER`` ``Elasticsearch`` ``MLK`` ``National Archives``
- +
Learn how to build a RAG pipeline without any embedding models. Use Unstructured to preprocess documents, index them into Elasticsearch, and retrieve using classic BM25 scoring.
``Unstructured API`` ``Workflows`` ``Elasticsearch`` ``BM25``
- +
Learn how to build data processing workflows using the Unstructured API and Python SDK to preprocess unstructured files from S3 and store the structured outputs in Redis Cloud for retrieval.
@@ -47,7 +47,7 @@ description: "Notebooks contain complete working sample code for end-to-end solu
``Unstructured API`` ``Workflows`` ``S3`` ``Qdrant`` ``VLM`` ``Embeddings``
- +
Improve RAG precision with a two-stage retrieval pipeline: fast vector search followed by reranking using Cohere’s re-ranker models.
From 46db7038a7bc66a480a74db5ba35d618bd9e677d Mon Sep 17 00:00:00 2001
From: Paul-Cornell
Date: Wed, 3 Sep 2025 10:36:28 -0700
Subject: [PATCH 2/2] Update examplecode/notebooks.mdx

Co-authored-by: Maria Khalusova
---
 examplecode/notebooks.mdx | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/examplecode/notebooks.mdx b/examplecode/notebooks.mdx
index 45060a31..0b5601f9 100644
--- a/examplecode/notebooks.mdx
+++ b/examplecode/notebooks.mdx
@@ -26,7 +26,7 @@ description: "Notebooks contain complete working sample code for end-to-end solu
``Unstructured API`` ``Workflows`` ``S3`` ``VLM`` ``NER`` ``Elasticsearch`` ``MLK`` ``National Archives``
- +
Learn how to build a RAG pipeline without any embedding models. Use Unstructured to preprocess documents, index them into Elasticsearch, and retrieve using classic BM25 scoring.