chatbot-rag-app: adds Kubernetes manifest and instructions #396

codefromthecrypt · 2025-02-17T06:43:42Z

Decided to action this so that we have a coherent experience between docker compose and k8s. This is as close as I could get it. If folks have feedback or a different direction, do tell!

Fixes #366

codefromthecrypt · 2025-02-17T06:49:08Z

Note: everything we do runs back into this. It would be great to have a way to quickly initialize ELSER: not just installing it, but getting through first-time use without several minutes of timeouts. See #307

example-apps/chatbot-rag-app/k8s-manifest.yml

codefromthecrypt · 2025-02-19T09:52:10Z

I have the work almost done to make this "normal k8s" locally, but I wanted to solve the timeout first, so I'll push the commit after #397 is merged.

codefromthecrypt · 2025-02-20T07:44:10Z

will bump this tomorrow or when an approver looks at #397

codefromthecrypt · 2025-02-21T03:58:02Z

rebased and changed to non-host network k8s. will leave this in draft until #397 is merged as using not-yet-deployed images in k8s is a pain.

codefromthecrypt · 2025-02-25T08:27:31Z

waiting to get the docker image smaller before "ready for review", as I noticed my network lagging #407

codefromthecrypt · 2025-02-26T06:04:01Z

ok things work in general, but I'm not seeing traces in kibana. I have to put this down for a bit as I have other more urgent things to address.

anuraaga · 2025-02-28T01:27:30Z

k8s/README.md

+
+Note: If you haven't checked out this repository, all you need is one file:
+```bash
+wget https://raw.githubusercontent.com/elastic/elasticsearch-labs/refs/heads/main/docker/docker-compose-elastic.yml


I think this is the wrong file.

k8s/k8s-manifest-elastic.yml

example-apps/chatbot-rag-app/k8s-manifest.yml

codefromthecrypt · 2025-02-28T09:18:37Z

Due to ElasticON Singapore and Sydney, while I'm excited about this, I won't finish it this weekend. Maybe Tuesday.

example-apps/chatbot-rag-app/k8s-manifest.yml

example-apps/chatbot-rag-app/env.example

example-apps/chatbot-rag-app/k8s-manifest.yml

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt · 2025-03-01T22:46:55Z

Hmm, getting GCP auth errors. Will look into it.

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt · 2025-03-02T00:17:16Z

GCP Vertex now works. I will look into why traces aren't showing up.

@bshetti I can't hold this PR captive for all issues, as once this is in, it is easy to complete other topics. So let's leave the Elastic Cloud commentary for the next PR, #379. This one solves k8s as-is, and it has been dozens of hours just on that!

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt · 2025-03-02T03:20:29Z

Also verified the Kubernetes setup with pydantic-ai instead of chatbot-rag-app, and it works fine.

codefromthecrypt · 2025-03-02T03:25:24Z

create-index (doesn't use an LLM, just elastic)

chat (proves vertex works)

codefromthecrypt · 2025-03-02T03:26:22Z

In this case I followed the directions in the README with a completely blown-away k8s (`colima delete; colima start --cpu 8 --memory 16 --network-address --dns 8.8.8.8 --dns 8.8.4.4 --kubernetes --k3s-arg '--disable=local-storage,traefik,metrics-server@server:*'`), so I'm very confident the GCP stuff works, as nothing was dirty. Thanks for the tips, folks!

example-apps/chatbot-rag-app/k8s-manifest.yml

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt · 2025-03-02T06:09:38Z

OK, what I did was run with the normal instructions, but with Azure OpenAI (so no secret). It worked fine.

Then I deleted the configmap and edited in the Vertex settings to recreate it, then added the secret as the README said, then applied, and it worked fine.

Thanks for the eagle eyes, @anuraaga. I think this one is finally ready to merge!

codefromthecrypt · 2025-03-02T06:10:59Z

example-apps/chatbot-rag-app/k8s-manifest.yml

+        - name: gcloud-credentials
+          secret:
+            secretName: gcloud-credentials
+            optional: true  # only read when `LLM_TYPE=vertex`


This part allows the Vertex config to work without making the other configurations block on it. The `optional` flag applies indirectly to any mount that uses this volume, so no worries.
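For context, here is a minimal sketch of how an optional secret volume and a mount that uses it fit together in a Deployment. Only the `volumes` entry is taken from the diff above; the Deployment name, labels, container name, image, and `mountPath` are assumptions for illustration and are not the manifest's actual values.

```yaml
# Hypothetical Deployment excerpt: everything except the `volumes`
# entry is an illustrative assumption, not the merged manifest.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: chatbot-rag-app            # assumed name
spec:
  selector:
    matchLabels:
      app: chatbot-rag-app         # assumed label
  template:
    metadata:
      labels:
        app: chatbot-rag-app
    spec:
      containers:
        - name: api-frontend       # assumed container name
          image: example.invalid/chatbot-rag-app:latest  # placeholder image
          volumeMounts:
            # The mount itself has no `optional` field; it inherits the
            # behavior of the volume it references.
            - name: gcloud-credentials
              mountPath: /root/.config/gcloud   # assumed path
              readOnly: true
      volumes:
        - name: gcloud-credentials
          secret:
            secretName: gcloud-credentials
            optional: true  # only read when `LLM_TYPE=vertex`
```

With `optional: true`, Kubernetes starts the pod even when the `gcloud-credentials` Secret does not exist, so configurations that don't use Vertex are not left waiting on the volume.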

codefromthecrypt requested review from EvelienSchellekens, bshetti, estolfo, joemcelroy, stanek-michal, trentm and xrmx February 17, 2025 06:43

codefromthecrypt commented Feb 17, 2025

example-apps/chatbot-rag-app/k8s-manifest.yml (outdated, resolved)

codefromthecrypt force-pushed the k8s-chatbot-rag-app branch from 9f0bb96 to 3660c11 Compare February 21, 2025 03:56

codefromthecrypt changed the base branch from main to recover-from-timeout February 21, 2025 03:57

Base automatically changed from recover-from-timeout to main February 21, 2025 12:13

codefromthecrypt mentioned this pull request Feb 21, 2025

Add CODEOWNERS for projects integrated with OpenTelemetry #401

Merged

codefromthecrypt force-pushed the k8s-chatbot-rag-app branch from 3660c11 to 5145b72 Compare February 26, 2025 05:30

anuraaga reviewed Feb 28, 2025

example-apps/chatbot-rag-app/k8s-manifest.yml (outdated, resolved)

bshetti reviewed Feb 28, 2025

davidgeorgehope reviewed Feb 28, 2025

example-apps/chatbot-rag-app/k8s-manifest.yml (outdated, resolved)

codefromthecrypt added 2 commits March 2, 2025 06:15

chatbot-rag-app: adds Kubernetes manifest and instructions

ff7aa7c

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

polish

c56190f

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt force-pushed the k8s-chatbot-rag-app branch from 8b19999 to c56190f Compare March 1, 2025 22:16

vertex

6dd0735

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

fix vertex

39121c0

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt mentioned this pull request Mar 2, 2025

chatbot-rag-app: re-introduce ES cloud configuration #379

Open

codefromthecrypt added 2 commits March 2, 2025 10:20

think it is ok

14be3db

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

polish

a840478

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt marked this pull request as ready for review March 2, 2025 03:20

anuraaga approved these changes Mar 2, 2025

anuraaga reviewed Mar 2, 2025

example-apps/chatbot-rag-app/k8s-manifest.yml (resolved)

secret optional

2565e34

Signed-off-by: Adrian Cole <adrian.cole@elastic.co>

codefromthecrypt commented Mar 2, 2025

codefromthecrypt merged commit 72835b0 into main Mar 2, 2025
4 checks passed

codefromthecrypt deleted the k8s-chatbot-rag-app branch March 2, 2025 06:17
