-
Kalika Sales Dashboard visualization
-
Kalika Dify Agent 3, Kalika Sales and Marketing agent
-
Prompt Workflow
-
AWS Services
-
GoogleCloud and Agentspace
1.Data Extraction: Extract the PO order and proforma invoice with attached file by searching keyword to S3
-
Understand the details of various documents i) Proforma Invoice:A proforma invoice is a preliminary bill that a seller sends to a buyer before a sale is confirmed. It's a non-binding document that's used for planning, budgeting, and estimates. It's also used for customs clearance and financing ii) PO Dump : Pending and Processed data details
-
Create the Parsing process
-
Proforma Invoice
- Searching a with proforma invoice for smtp mail python utility download attached file
- Dump that on scheduling daily to s3
- Extract the content using ocr engine and validate try:
-
PO dump _ On Daily basis Dump is in mail extract excel store in s3
- Implement pg vector
1) Create PG vector on local 2) Upload po_dump excel in PG vector and create a vector 3) Test with Ollama local model and create a streamlit app 4) Test for last 10 days documents with query and response 5) Generate the report and share it with in team group and github -
-
Deploy on Ec2 for further building POC
Problem Statement : Uploading Bank statement, Trading Documents(Last trading transaction,orders session details)
give query response bot.
- User can upload documents such as bank statement, trading sheets with Upload button- UI
- create PG vector to store
- Ollama testing with local model
Solution: Give a isolated space
- Explore the techniques for giving data privacy to user
Here are some notable open-source frameworks and libraries for building Retrieval Augmented Generation (RAG) systems:
-
SWIRL: An infrastructure software that facilitates secure and fast searches across data sources without the need for data movement. It integrates with over 20 large language models (LLMs) and is designed for secure deployment within private clouds.
-
Cognita: A modular framework that organizes RAG components for easier testing and deployment. It supports various document retrievers and is fully API-driven, making it suitable for scalable RAG systems.
-
LLM-Ware: This framework focuses on enterprise-ready RAG pipelines, allowing the integration of small, specialized models. It supports a modular architecture and can operate without a GPU.
-
RAG Flow: An engine that emphasizes deep document understanding, enabling effective integration of structured and unstructured data for citation-grounded question-answering.
-
Graph RAG: A graph-based system that enhances LLM outputs by incorporating structured knowledge graphs, making it ideal for complex enterprise applications.
-
Haystack: An orchestration framework that connects models, vector databases, and file converters to create advanced RAG systems, supporting customizable pipelines for various tasks.
-
Storm: A knowledge curation system that generates comprehensive reports with citations, integrating advanced retrieval methods to support multi-perspective question-asking.
-png-2.png?width=2056&height=1334&name=rag-query-drawio%20(1)-png-2.png)
