This project runs local LLMs on your DGX Spark and exposes a nice web UI over the internet via Cloudflare Tunnel at https://nated.ai.
It includes:
- Ollama – runs local language models with GPU acceleration on port 11434.
- AnythingLLM – a full RAG/chat UI that talks to Ollama (and optionally OpenAI) on port 3001.
- Cloudflare Tunnel (cloudflared) – securely publishes AnythingLLM at nated.ai using your Cloudflare account.
- ollama
  - Docker image: ollama/ollama:latest
  - Host port: 11434
  - Data volume: ./ollama_data:/root/.ollama
  - Uses the NVIDIA runtime with NVIDIA_VISIBLE_DEVICES=all.
- anythingllm
  - Docker image: mintplexlabs/anythingllm:latest
  - Host port: 3001
  - Data volume: ./anythingllm_storage:/app/server/storage
  - Uses Ollama as the default LLM and embedding engine.
  - Can optionally use OpenAI via your OPENAI_API_KEY.
- tunnel-spark
  - Docker image: cloudflare/cloudflared:latest
  - Runs cloudflared tunnel run with your TUNNEL_TOKEN.
  - Exposes AnythingLLM at https://nated.ai through Cloudflare's network.
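
Once the stack is running (see the start-up steps below), a quick hedged check from the DGX itself confirms both host ports are actually serving. The sketch below assumes the default ports above and uses Ollama's /api/version endpoint plus a plain HTTP HEAD request.

```bash
# Run on the DGX after `docker compose up -d`.
curl http://localhost:11434/api/version   # Ollama should return a small JSON version payload
curl -I http://localhost:3001             # AnythingLLM should answer with HTTP response headers
```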
In the same folder as docker-compose.yml, create a .env file:
```
OPENAI_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
TUNNEL_TOKEN=eyJhIjoi...paste-from-cloudflare...
```

- OPENAI_API_KEY – your real OpenAI API key.
- TUNNEL_TOKEN – the token Cloudflare gives you when you create the tunnel for nated.ai.
Docker Compose will automatically substitute ${OPENAI_API_KEY} and ${TUNNEL_TOKEN} in docker-compose.yml.
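
If you want to confirm the substitution before starting anything, docker compose config renders the effective configuration with the .env values already interpolated. A minimal sketch, assuming both variables are referenced in docker-compose.yml as described above:

```bash
# Print the merged compose file with ${...} variables resolved,
# showing only the lines populated from .env.
docker compose config | grep -E "OPENAI_API_KEY|TUNNEL_TOKEN"
```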
From the DGX Spark, in the directory with docker-compose.yml:
```bash
docker compose pull
docker compose up -d
```

Check containers:

```bash
docker ps
```

You should see ollama, anythingllm, and tunnel-spark running.
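
Because the ollama service uses the NVIDIA runtime, it is also worth confirming the GPU is visible from inside the container. This assumes the NVIDIA Container Toolkit mounts nvidia-smi into the container, which is its usual behaviour:

```bash
# Should list the DGX GPU(s) as seen by the ollama container.
docker exec -it ollama nvidia-smi
```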
- On the DGX, get its IP address:

  ```bash
  hostname -I
  ```

- From your Mac (or any machine on the same network), open in a browser:

  http://<DGX_IP>:3001      # example: http://192.168.1.50:3001

- First-time AnythingLLM setup:
  - Create an admin account.
  - Open Settings → LLM Preference.
  - Choose Ollama as the LLM provider.
  - Confirm:
    - Base URL: http://ollama:11434
    - Default model: llama3.1:8b-instruct-q4_K_M (or any other installed model).
  - Save.
You now have a local, GPU-accelerated assistant powered by Ollama, managed through AnythingLLM.
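
For a quick sanity check outside the web UI, you can call Ollama's REST API directly on the DGX. A minimal sketch, assuming the llama3.1:8b-instruct-q4_K_M model is already pulled (see the model-management section below):

```bash
# One-off, non-streaming generation request against the local Ollama API.
curl http://localhost:11434/api/generate -d '{
  "model": "llama3.1:8b-instruct-q4_K_M",
  "prompt": "Say hello in one short sentence.",
  "stream": false
}'
```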
- Log into the Cloudflare dashboard for nated.ai.
- Go to Zero Trust → Networks → Tunnels.
- Create a new tunnel (for example, name it spark-tunnel).
- Choose the Cloudflared connector option.
- Cloudflare will show a command containing a --token value, e.g.:

  ```bash
  cloudflared tunnel run --token <TUNNEL_TOKEN>
  ```

- Copy the token portion (<TUNNEL_TOKEN>) into your .env file as TUNNEL_TOKEN.
- Restart the tunnel container:

  ```bash
  docker compose up -d tunnel-spark
  ```
The tunnel-spark service now connects your DGX to Cloudflare.
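
To confirm the connector actually established its outbound connections to Cloudflare, the container logs are the quickest place to look. The command is standard Docker; the exact log wording is cloudflared's own and may differ between versions:

```bash
# Tail the connector's recent output; a healthy tunnel reports its
# registered connections to Cloudflare shortly after start-up.
docker logs --tail 50 tunnel-spark
```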
In the Cloudflare dashboard:
- Edit your tunnel configuration.
- Add a Public Hostname:
  - Hostname: nated.ai
  - Type: HTTP
  - URL / Service: http://<DGX_LOCAL_IP>:3001 (use the same IP from hostname -I).
- Make sure nated.ai’s DNS record is proxied by Cloudflare (orange cloud icon).
Once the tunnel is healthy, you can reach AnythingLLM from anywhere via:
https://nated.ai
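
As a final end-to-end check, hit the public hostname from any machine (it does not need to be on your network):

```bash
# Expect HTTP response headers served through Cloudflare,
# not a connection error.
curl -I https://nated.ai
```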
Ollama runs as a server inside the ollama container. You add or manage models using the ollama CLI inside that container.
On the DGX:
```bash
docker exec -it ollama bash
```

Now you are inside the container.

```bash
ollama list
```

This shows all models currently downloaded and ready to use.
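
You do not have to keep an interactive shell open: any ollama subcommand can also be run one-off through docker exec, for example:

```bash
# Equivalent to running `ollama list` from inside the container.
docker exec ollama ollama list
```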
Use ollama pull with the model name you want. Examples:
```bash
# Smaller / faster models
ollama pull llama3.1:8b-instruct-q4_K_M
ollama pull mistral:7b-instruct

# Larger, more capable models (if you have the memory and patience)
ollama pull llama3.1:70b
ollama pull gemma2:27b
```

Ollama will download the model and store it in /root/.ollama (which is mapped to ./ollama_data on the host).
To test a model interactively:

```bash
ollama run llama3.1:8b-instruct-q4_K_M
```

Type a quick prompt to verify it works, then press Ctrl+C to exit.
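
Because models live under ./ollama_data on the host, it is easy to keep an eye on disk usage and drop models you no longer need. A small sketch, run from the project directory on the DGX; the model name is just an example:

```bash
# Disk space taken by downloaded models on the host
du -sh ./ollama_data

# Remove a model you no longer need
docker exec ollama ollama rm mistral:7b-instruct
```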
Once Ollama has a model, AnythingLLM can use it by referring to the model name.
You have two ways to set which model AnythingLLM uses:
In docker-compose.yml, update:
```yaml
environment:
  - OLLAMA_MODEL_PREF=llama3.1:8b-instruct-q4_K_M
```

to the new model name, e.g.:

```yaml
  - OLLAMA_MODEL_PREF=llama3.1:70b
```

Then restart AnythingLLM:

```bash
docker compose up -d anythingllm
```

AnythingLLM will now default to the new model for chats (assuming it exists in Ollama).
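
To double-check that the new preference actually reached the container after the restart, inspect its environment. This assumes the image ships the usual env utility, which most Linux-based images do:

```bash
# Should print the model name you set in docker-compose.yml.
docker exec anythingllm env | grep OLLAMA_MODEL_PREF
```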
Or, in the AnythingLLM UI:

- Go to the AnythingLLM web UI (http://<DGX_IP>:3001 or https://nated.ai).
- Open Settings → LLM Preference.
- Select Ollama as the provider if it isn’t already.
- Set the Model field to the new model name, e.g.:
  - llama3.1:8b-instruct-q4_K_M
  - llama3.1:70b
  - mistral:7b-instruct, etc.
- Save.
From now on, that workspace will send prompts to the selected Ollama model.
Because OPENAI_API_KEY is passed into the AnythingLLM container, you can configure OpenAI as an additional provider:
- In the AnythingLLM UI, open Settings → LLM Preference.
- Choose OpenAI (or the equivalent option).
- Set:
  - API key: it should read from the environment, or you can paste it manually.
  - Model: e.g. gpt-4.1, gpt-4.1-mini, etc.
- Save.
Now you can:
- Use Ollama for most local/private/general work.
- Switch specific workspaces or agents to OpenAI for more powerful or specialized tasks.
From the DGX project directory:
```bash
# Start or update the stack
docker compose pull
docker compose up -d

# View logs
docker logs -f ollama
docker logs -f anythingllm
docker logs -f tunnel-spark

# Shell into Ollama container
docker exec -it ollama bash

# Stop everything
docker compose down
```
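
A few extra commands that can be handy day to day; these assume the same service names as above:

```bash
# Restart a single service after editing docker-compose.yml or .env
docker compose restart anythingllm

# Follow logs for the whole stack at once
docker compose logs -f

# See how much disk Docker images and volumes are using
docker system df
```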