Skip to content

Commit 7c9ed04

Browse files
lucasmelogithubchensuyuepre-commit-ci[bot]mkbhanda
authored
ChatQnA - Add Terraform and Ansible Modules information (#970)
Signed-off-by: chensuyue <suyue.chen@intel.com> Signed-off-by: lucasmelogithub <lucas.melo@intel.com> Co-authored-by: chen, suyue <suyue.chen@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>
1 parent 9ff7df9 commit 7c9ed04

File tree

1 file changed

+20
-1
lines changed

1 file changed

+20
-1
lines changed

ChatQnA/README.md

Lines changed: 20 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,26 @@ Chatbots are the most widely adopted use case for leveraging the powerful chat a
44

55
RAG bridges the knowledge gap by dynamically fetching relevant information from external sources, ensuring that responses generated remain factual and current. The core of this architecture are vector databases, which are instrumental in enabling efficient and semantic retrieval of information. These databases store data as vectors, allowing RAG to swiftly access the most pertinent documents or data points based on semantic similarity.
66

7-
## Deploy ChatQnA Service
7+
## 🤖 Automated Terraform Deployment using Intel® Optimized Cloud Modules for **Terraform**
8+
9+
| Cloud Provider | Intel Architecture | Intel Optimized Cloud Module for Terraform | Comments |
10+
| -------------------- | --------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------- |
11+
| AWS | 4th Gen Intel Xeon with Intel AMX | [AWS Module](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-xeon-opea-chatqna) | Uses Intel/neural-chat-7b-v3-3 by default |
12+
| AWS Falcon2-11B | 4th Gen Intel Xeon with Intel AMX | [AWS Module with Falcon11B](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-xeon-opea-chatqna-falcon11B) | Uses TII Falcon2-11B LLM Model |
13+
| GCP | 5th Gen Intel Xeon with Intel AMX | [GCP Module](https://github.com/intel/terraform-intel-gcp-vm/tree/main/examples/gen-ai-xeon-opea-chatqna) | Also supports Confidential AI by using Intel® TDX with 4th Gen Xeon |
14+
| Azure | 5th Gen Intel Xeon with Intel AMX | Work-in-progress | Work-in-progress |
15+
| Intel Tiber AI Cloud | 5th Gen Intel Xeon with Intel AMX | Work-in-progress | Work-in-progress |
16+
17+
## Automated Deployment to Ubuntu based system(if not using Terraform) using Intel® Optimized Cloud Modules for **Ansible**
18+
19+
To deploy to existing Xeon Ubuntu based system, use our Intel Optimized Cloud Modules for Ansible. This is the same Ansible playbook used by Terraform.
20+
Use this if you are not using Terraform and have provisioned your system with another tool or manually including bare metal.
21+
| Operating System | Intel Optimized Cloud Module for Ansible |
22+
|------------------|------------------------------------------|
23+
| Ubuntu 20.04 | [ChatQnA Ansible Module](https://github.com/intel/optimized-cloud-recipes/tree/main/recipes/ai-opea-chatqna-xeon) |
24+
| Ubuntu 22.04 | Work-in-progress |
25+
26+
## Manually Deploy ChatQnA Service
827

928
The ChatQnA service can be effortlessly deployed on Intel Gaudi2, Intel Xeon Scalable Processors and Nvidia GPU.
1029

0 commit comments

Comments
 (0)