@@ -5,6 +5,14 @@ Retrieval-Augmented Generation system for edge solutions. It is designed to
5
5
curate the RAG pipeline to meet hardware requirements at edge with guaranteed
6
6
quality and performance.
7
7
8
+ ## What's New in this release?
9
+
10
+ - Support image/url data retrieval and display in EC-RAG
11
+ - Support display of LLM-used context sources in UI
12
+ - Support pipeline remove operation in RESTful API and UI
13
+ - Support RAG pipeline performance benchmark and display in UI
14
+ - Fixed known issues in EC-RAG UI and server
15
+
8
16
## Quick Start Guide
9
17
10
18
### (Optional) Build Docker Images for Mega Service, Server and UI by your own
@@ -43,6 +51,8 @@ export GRADIO_PATH="your gradio cache path for transferring files"
43
51
44
52
# Make sure all 3 folders have 1000:1000 permission, otherwise
45
53
# chown 1000:1000 ${MODEL_PATH} ${DOC_PATH} ${GRADIO_PATH}
54
+ # In addition, also make sure the .cache folder has 1000:1000 permission, otherwise
55
+ # chown 1000:1000 $HOME/.cache
46
56
47
57
# Use `ip a` to check your active ip
48
58
export HOST_IP=" your host ip"
@@ -192,7 +202,7 @@ curl -X POST http://${HOST_IP}:16010/v1/settings/pipelines -H "Content-Type: app
192
202
#### Update a pipeline
193
203
194
204
``` bash
195
- curl -X PATCH http://${HOST_IP} :16010/v1/settings/pipelines -H " Content-Type: application/json" -d @tests/test_pipeline_local_llm.json | jq ' .'
205
+ curl -X PATCH http://${HOST_IP} :16010/v1/settings/pipelines/rag_test_local_llm -H " Content-Type: application/json" -d @tests/test_pipeline_local_llm.json | jq ' .'
196
206
```
197
207
198
208
#### Check all pipelines
@@ -204,15 +214,34 @@ curl -X GET http://${HOST_IP}:16010/v1/settings/pipelines -H "Content-Type: appl
204
214
#### Activate a pipeline
205
215
206
216
``` bash
207
- curl -X PATCH http://${HOST_IP} :16010/v1/settings/pipelines/test1 -H " Content-Type: application/json" -d ' {"active": "true"}' | jq ' .'
217
+ curl -X PATCH http://${HOST_IP} :16010/v1/settings/pipelines/rag_test_local_llm -H " Content-Type: application/json" -d ' {"active": "true"}' | jq ' .'
218
+ ```
219
+
220
+ #### Remove a pipeline
221
+
222
+ ``` bash
223
+ # Firstly, deactivate the pipeline if the pipeline status is active
224
+ curl -X PATCH http://${HOST_IP} :16010/v1/settings/pipelines/rag_test_local_llm -H " Content-Type: application/json" -d ' {"active": "false"}' | jq ' .'
225
+ # Then delete the pipeline
226
+ curl -X DELETE http://${HOST_IP} :16010/v1/settings/pipelines/rag_test_local_llm -H " Content-Type: application/json" | jq ' .'
227
+ ```
228
+
229
+ #### Enable and check benchmark for pipelines
230
+
231
+ ``` bash
232
+ # Set ENABLE_BENCHMARK as true before launch services
233
+ export ENABLE_BENCHMARK=" true"
234
+
235
+ # check the benchmark data for pipeline {pipeline_name}
236
+ curl -X GET http://${HOST_IP} :16010/v1/settings/pipelines/{pipeline_name}/benchmark -H " Content-Type: application/json" | jq ' .'
208
237
```
209
238
210
239
### Model Management
211
240
212
241
#### Load a model
213
242
214
243
``` bash
215
- curl -X POST http://${HOST_IP} :16010/v1/settings/models -H " Content-Type: application/json" -d ' {"model_type": "reranker", "model_id": "BAAI/bge-reranker-large", "model_path": "./models/bge_ov_reranker", "device": "cpu"}' | jq ' .'
244
+ curl -X POST http://${HOST_IP} :16010/v1/settings/models -H " Content-Type: application/json" -d ' {"model_type": "reranker", "model_id": "BAAI/bge-reranker-large", "model_path": "./models/bge_ov_reranker", "device": "cpu", "weight": "INT4" }' | jq ' .'
216
245
```
217
246
218
247
It will take some time to load the model.
@@ -226,7 +255,7 @@ curl -X GET http://${HOST_IP}:16010/v1/settings/models -H "Content-Type: applica
226
255
#### Update a model
227
256
228
257
``` bash
229
- curl -X PATCH http://${HOST_IP} :16010/v1/settings/models/BAAI/bge-reranker-large -H " Content-Type: application/json" -d ' {"model_type": "reranker", "model_id": "BAAI/bge-reranker-large", "model_path": "./models/bge_ov_reranker", "device": "gpu"}' | jq ' .'
258
+ curl -X PATCH http://${HOST_IP} :16010/v1/settings/models/BAAI/bge-reranker-large -H " Content-Type: application/json" -d ' {"model_type": "reranker", "model_id": "BAAI/bge-reranker-large", "model_path": "./models/bge_ov_reranker", "device": "gpu", "weight": "INT4" }' | jq ' .'
230
259
```
231
260
232
261
#### Check a certain model
0 commit comments