Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
abaranovskis-redsamurai committed Apr 16, 2024
1 parent 293430a commit a8d670d
Showing 1 changed file with 13 additions and 13 deletions.
26 changes: 13 additions & 13 deletions sparrow-ml/llm/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,13 +6,13 @@ Sparrow Agents - with Sparrow you can build independent LLM agents, and use API

**List of available agents:**

- llamaindex - RAG pipeline with LlamaIndex for PDF processing
- vllamaindex - RAG pipeline with LLamaIndex multimodal for image processing
- vprocessor - RAG pipeline with OCR and LlamaIndex for image processing
- haystack - RAG pipeline with Haystack for PDF processing
- fcall - Function call pipeline
- unstructured-light - RAG pipeline with Unstructured and LangChain
- unstructured - RAG pipeline with Weaviate vector DB query, Unstructured and LangChain
- **llamaindex** - RAG pipeline with LlamaIndex for PDF processing
- **vllamaindex** - RAG pipeline with LLamaIndex multimodal for image processing
- **vprocessor** - RAG pipeline with OCR and LlamaIndex for image processing
- **haystack** - RAG pipeline with Haystack for PDF processing
- **fcall** - Function call pipeline
- **unstructured-light** - RAG pipeline with Unstructured and LangChain, supports PDF and image processing
- **unstructured** - RAG pipeline with Weaviate vector DB query, Unstructured and LangChain, supports PDF and image processing

### RAG runs offline on a local machine

Expand Down Expand Up @@ -221,22 +221,22 @@ Answer:
The stock price of the Exxon is 111.2699966430664. USD
```

Use `unstructured-light` agent to run RAG pipeline with Unstructured library. It helps to improve data pre-processing for LLM
Use `unstructured-light` agent to run RAG pipeline with Unstructured library. It helps to improve data pre-processing for LLM. This agent supports PDF, JPG and PNG files

```
./sparrow.sh "invoice_number, invoice_date, total_gross_worth" "int, str, str" --agent unstructured-light --file-path
/Users/andrejb/infra/shared/katana-git/sparrow/sparrow-ml/llm/data/invoice_1.pdf
```

With `unstructured-light` it is possible to specify option for table data processing only
With `unstructured-light` it is possible to specify option for table data processing only. This agent supports PDF, JPG and PNG files

```
./sparrow.sh "names_of_invoice_items, gross_worth_of_invoice_items, total_gross_worth" "List[str], List[str], str"
--agent unstructured-light --file-path /Users/andrejb/infra/shared/katana-git/sparrow/sparrow-ml/llm/data/invoice_1.pdf
--options tables
```

Use `unstructured` agent to run RAG pipeline with Weaviate query (no separate step to ingest data is required) and Unstructured library
Use `unstructured` agent to run RAG pipeline with Weaviate query (no separate step to ingest data is required) and Unstructured library. This agent supports PDF, JPG and PNG files

```
./sparrow.sh "invoice_number, invoice_date, total_gross_worth" "int, str, str" --agent unstructured --file-path /data/invoice_1.pdf
Expand Down Expand Up @@ -361,7 +361,7 @@ curl -X 'POST' \
-F 'file=@inout-20211211_001.jpg;type=image/jpeg'
```

Inference call with `unstructured-light` agent:
Inference call with `unstructured-light` agent, this agent supports also JPG and PNG files:

```
curl -X 'POST' \
Expand All @@ -376,7 +376,7 @@ curl -X 'POST' \
-F 'file=@invoice_1.pdf;type=application/pdf'
```

Inference call with `unstructured-light` agent, using only tables option:
Inference call with `unstructured-light` agent, using only tables option. This agent supports also JPG and PNG files:

```
curl -X 'POST' \
Expand All @@ -391,7 +391,7 @@ curl -X 'POST' \
-F 'file=@invoice_1.pdf;type=application/pdf'
```

Inference call with `unstructured` agent:
Inference call with `unstructured` agent, this agent supports also JPG and PNG files:

```
curl -X 'POST' \
Expand Down

0 comments on commit a8d670d

Please sign in to comment.