- Verify the Hardware and Software Requirements for NeMo Microservices
- Follow the prerequisites instructions and create a `.env` file with the following fields:
```shell
NGC_API_KEY="<your_Nvidia_key>"
HF_Token="<your_HF_key>"
```
- Clone this git repository to your working directory.
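The `.env` file step above can be scripted with a heredoc; a minimal sketch (the two values are placeholders you must replace with your real keys):

```shell
# Create the .env file in the working directory.
# Both values are placeholders; substitute your real NGC and Hugging Face keys.
cat > .env <<'EOF'
NGC_API_KEY="<your_Nvidia_key>"
HF_Token="<your_HF_key>"
EOF
```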
- Connect to your OpenShift cluster.
- Run the commands:
```shell
chmod +x clear_namespace.sh
chmod +x nemo_prerequisites.sh
chmod +x deploy_microservices.sh
chmod +x run.sh
```
- Update the `NAMESPACE` variable in the `run.sh` script.
- Run:
```shell
bash run.sh
```
- Track the installation on your cluster.
- Enable the NIM operator on OpenShift AI.
- Verify the existence of GPUs with:
```shell
oc get nodes -o json | jq -r '.items[] | select(.spec.taints != null) | {name: .metadata.name, taints: .spec.taints}'
```
- Use `llama-nim.yaml` to deploy the LLM. This might take about 10-15 minutes to complete; track the pod's events to make sure that there are no errors (for example, an authentication error):
```shell
oc apply -f llama-nim.yaml
```
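To see what the GPU check above returns, the same `jq` filter can be tried locally on sample node JSON. This is only a sketch: `gpu-node-1`, `cpu-node-1`, and the taint shown are made-up values, and `jq` must be installed.

```shell
# Sample shaped like `oc get nodes -o json`, with one tainted (GPU) node and
# one untainted node; node names and the taint are illustrative only.
sample='{"items":[
  {"metadata":{"name":"gpu-node-1"},
   "spec":{"taints":[{"key":"nvidia.com/gpu","value":"true","effect":"NoSchedule"}]}},
  {"metadata":{"name":"cpu-node-1"},"spec":{}}]}'

# Same filter as the cluster command above: keep only nodes that carry taints.
out=$(printf '%s' "$sample" | jq -r \
  '.items[] | select(.spec.taints != null) | {name: .metadata.name, taints: .spec.taints}')
echo "$out"
```

Only `gpu-node-1` survives the `select`, since the untainted node has no `.spec.taints` field.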
- Expose the service:
```shell
oc expose svc jupyter-service
```
  or expose the pod:
```shell
oc expose pod jupyter-notebook-b7d5479dd-rx8v7 --port=8888 --name=jupyter-notebook-service
```
- Get the route (use http, not https):
```shell
oc get route
```
- Get the token via the pod:
```shell
oc exec jupyter-notebook-b7d5479dd-rx8v7 -- jupyter server list
```
  The output would look like:
```
Currently running servers:
http://jupyter-notebook-b7d5479dd-rx8v7:8888/?token=token :: /home/jovya
```
  so in my case the token is simply "token".
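The token can also be pulled out of the `jupyter server list` output automatically; a sketch using the sample output above (in a real run, pipe the output of the `oc exec` command instead of the hard-coded string):

```shell
# Sample output copied from the step above; in practice, replace with:
#   output=$(oc exec jupyter-notebook-b7d5479dd-rx8v7 -- jupyter server list)
output='Currently running servers:
http://jupyter-notebook-b7d5479dd-rx8v7:8888/?token=token :: /home/jovya'

# Grab everything after "?token=" up to the next space.
token=$(printf '%s\n' "$output" | sed -n 's/.*?token=\([^ ]*\).*/\1/p')
echo "$token"   # -> token
```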