verified-proxy

A local forward proxy that verifies NEAR AI inference backends are running inside Intel TDX Trusted Execution Environments (TEEs) before forwarding your requests.

What it does

When you send a request through the proxy, it:

1. Resolves the model name to a *.completions.near.ai backend domain
2. Connects to the backend over TLS and extracts the certificate's SPKI hash
3. If the SPKI hash is new (first request or cert rotation), runs full TEE attestation:
   - Fetches an attestation report from the backend (single TLS connection)
   - Verifies the Intel TDX quote using dcap-qvl
   - Checks that the TDX report data cryptographically binds the signing key and TLS certificate to the TEE
   - Confirms the live certificate matches the attested fingerprint
4. Caches the verified SPKI hash, so subsequent requests skip attestation
5. Forwards the request and streams the response back

Trust comes from Intel TDX hardware attestation, not from Certificate Authority trust chains.

Quick start

pip install -r requirements.txt
python proxy.py

Then point any OpenAI-compatible client at http://localhost:8080:

curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $API_KEY" \
  -d '{
    "model": "zai-org/GLM-5-FP8",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 64
  }'

Python (OpenAI SDK)

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8080/v1",
    api_key="your-api-key",
)

response = client.chat.completions.create(
    model="zai-org/GLM-5-FP8",
    messages=[{"role": "user", "content": "Hello"}],
    max_tokens=64,
)
print(response.choices[0].message.content)

How routing works

The proxy determines which backend to forward to in two ways:

1. Model name (default): parses the model field from the JSON request body and looks up the corresponding *.completions.near.ai domain via the public endpoint discovery API (GET https://completions.near.ai/endpoints).

2. Explicit domain: set the X-Backend-Domain header to target a specific backend directly:

curl http://localhost:8080/v1/chat/completions \
  -H "X-Backend-Domain: glm-5.completions.near.ai" \
  -H "Content-Type: application/json" \
  -d '{"model": "zai-org/GLM-5-FP8", "messages": [{"role": "user", "content": "Hi"}]}'

All paths are forwarded transparently (/v1/chat/completions, /v1/models, etc.).
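
The two routing modes can be sketched as one lookup function. This is a minimal illustration, not the code in proxy.py: the function name `resolve_backend` and the `endpoints` mapping (model name to domain) are assumptions, because the response format of the discovery API is not shown in this README.

```python
import json

BACKEND_HEADER = "X-Backend-Domain"


def resolve_backend(headers: dict[str, str], body: bytes,
                    endpoints: dict[str, str]) -> str:
    """Return the backend domain for one request.

    `endpoints` is a HYPOTHETICAL model-name -> domain mapping, assumed to
    have been built from the discovery API response beforehand.
    """
    # HTTP header names are case-insensitive, so normalise before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    explicit = lowered.get(BACKEND_HEADER.lower())
    if explicit:
        # Explicit domain wins over the model name.
        return explicit.strip()

    try:
        model = json.loads(body)["model"]
    except (ValueError, KeyError, TypeError) as exc:
        raise ValueError("request body has no usable 'model' field") from exc
    if not isinstance(model, str):
        raise ValueError("'model' must be a string")

    try:
        return endpoints[model]
    except KeyError:
        raise ValueError(f"no backend known for model {model!r}") from None
```

With `endpoints = {"zai-org/GLM-5-FP8": "glm-5.completions.near.ai"}`, a request body naming that model resolves to `glm-5.completions.near.ai`, and a request carrying the header resolves to whatever the header says.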

How verification works

The TLS handshake, attestation, and request forwarding all happen on a single TCP connection to the backend. This guarantees that verification and forwarding target the exact same server (no DNS round-robin mismatch), and that no client data is sent before verification completes.

Client                     verified-proxy                  *.completions.near.ai (TEE)
  |                             |                                    |
  |-- POST /v1/chat/completions |                                    |
  |   model: GLM-5-FP8         |                                    |
  |                             |-- resolve model → domain           |
  |                             |                                    |
  |                             |== TLS handshake ==================>|
  |                             |   (no HTTP data sent yet)          |
  |                             |   extract SPKI from cert           |
  |                             |                                    |
  |                             |   SPKI in cache?                   |
  |                             |   ├─ yes → skip to forwarding      |
  |                             |   └─ no  → verify on same conn:    |
  |                             |                                    |
  |                             |-- GET /v1/attestation/report ----->|
  |                             |<-- attestation JSON ---------------|
  |                             |                                    |
  |                             |   Verify (local, no network):      |
  |                             |   ├─ Intel TDX quote (dcap-qvl)    |
  |                             |   ├─ report_data binds signing     |
  |                             |   │  address + TLS cert + nonce    |
  |                             |   └─ live SPKI == attested SPKI    |
  |                             |   Cache verified SPKI              |
  |                             |                                    |
  |                             |-- Forward client request --------->|
  |                             |   (only after verification)        |
  |<--- Stream response --------|<--- Stream response ---------------|

What gets verified

| Check | What it proves |
| --- | --- |
| Intel TDX quote | Attestation comes from genuine Intel TDX hardware |
| Report data binding | The signing key and TLS certificate are bound to this specific TEE |
| SPKI match | The live TLS connection terminates inside the TEE |
| Nonce | The attestation is fresh (not replayed) |
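
The report data binding and nonce checks can be illustrated with a short sketch. The byte layout below is an assumption made for illustration only: this README does not specify how the backend packs the signing address, TLS fingerprint, and nonce into the 64-byte TDX report data field. The function names are hypothetical as well. The check is only meaningful after the TDX quote itself has been verified.

```python
import hashlib
import hmac
import secrets

REPORT_DATA_LEN = 64  # the TDX REPORTDATA field is 64 bytes
NONCE_LEN = 32


def new_nonce() -> bytes:
    """Fresh random nonce, generated by the proxy for each attestation."""
    return secrets.token_bytes(NONCE_LEN)


def expected_report_data(signing_address: bytes, tls_fingerprint: bytes,
                         nonce: bytes) -> bytes:
    # ASSUMED layout (not taken from the source):
    #   bytes 0..31  = SHA-256(signing_address || tls_fingerprint)
    #   bytes 32..63 = nonce
    binding = hashlib.sha256(signing_address + tls_fingerprint).digest()
    return binding + nonce


def report_data_matches(report_data: bytes, signing_address: bytes,
                        tls_fingerprint: bytes, nonce: bytes) -> bool:
    """True if report_data binds exactly these values and this nonce."""
    if len(report_data) != REPORT_DATA_LEN or len(nonce) != NONCE_LEN:
        return False
    expected = expected_report_data(signing_address, tls_fingerprint, nonce)
    # Constant-time comparison avoids leaking how many bytes matched.
    return hmac.compare_digest(report_data, expected)
```

Because the nonce is chosen by the proxy and is part of the attested data, a report recorded earlier for a different nonce fails the comparison.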

Security guarantee

No client data is sent before verification. The proxy uses http.client.HTTPSConnection, which separates the TLS handshake (connect()) from sending the HTTP request (request()). The sequence is:

1. connect(): TLS handshake completes, certificate is available
2. Extract SPKI hash from the certificate
3. If uncached: GET /v1/attestation/report on the same connection → full TDX verification
4. Only after verification: request() sends the client's actual HTTP request

Steps 1-4 all happen on the same TCP connection, so there is no possibility of DNS round-robin routing you to a different (unverified) backend between verification and request.
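
The ordering can be sketched as follows. This is a simplified illustration, not the code in proxy.py. `conn` is expected to behave like an http.client.HTTPSConnection; `spki_hash` (DER certificate to SPKI hash) and `attest` (runs attestation on the open connection) are hypothetical callables supplied by the caller.

```python
from typing import Callable


def send_after_verification(conn, method: str, path: str, body: bytes,
                            headers: dict[str, str], verified: set[str],
                            spki_hash: Callable[[bytes], str],
                            attest: Callable[[object, str], bool]):
    """Send the client's request only once the backend is verified."""
    # Step 1: TLS handshake only. No HTTP bytes have been sent yet.
    conn.connect()

    # Step 2: read the peer certificate (DER) from the live TLS socket.
    der_cert = conn.sock.getpeercert(binary_form=True)
    fingerprint = spki_hash(der_cert)

    # Step 3: attest on this same connection if the key is not yet trusted.
    if fingerprint not in verified:
        if not attest(conn, fingerprint):
            conn.close()
            raise ConnectionError("backend failed TEE attestation")
        verified.add(fingerprint)

    # Step 4: only now does client data leave the proxy.
    conn.request(method, path, body=body, headers=headers)
    return conn.getresponse()
```

An `attest` implementation would need to read the attestation response completely before returning, since http.client does not allow a second request on a connection while an earlier response is still unread.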

When re-verification happens

- First request to a backend: full attestation (~5-10 seconds)
- Certificate rotation: new SPKI detected, triggers re-attestation on the same connection
- DNS round-robin: each unique backend SPKI is verified and cached independently
- Cached SPKI match: no attestation needed, request forwarded immediately
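
A cache with these properties can be sketched as a set of verified SPKI hashes per backend domain. The class name `SpkiCache` and the choice to key by domain are assumptions for illustration; the README only states that each unique SPKI is verified and cached independently.

```python
import threading
from collections import defaultdict


class SpkiCache:
    """Remembers which SPKI hashes have passed attestation, per domain."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._verified: dict[str, set[str]] = defaultdict(set)

    def is_verified(self, domain: str, spki_hash: str) -> bool:
        with self._lock:
            # .get() avoids creating an empty entry for unknown domains.
            return spki_hash in self._verified.get(domain, ())

    def mark_verified(self, domain: str, spki_hash: str) -> None:
        with self._lock:
            self._verified[domain].add(spki_hash)


cache = SpkiCache()
domain = "glm-5.completions.near.ai"

# First request: nothing cached, so attestation is required.
needs_attestation = not cache.is_verified(domain, "spki-a")
cache.mark_verified(domain, "spki-a")

# Rotation or a second round-robin backend: a new hash is a cache miss,
# while the previously verified hash stays trusted.
rotated_is_cached = cache.is_verified(domain, "spki-b")
original_is_cached = cache.is_verified(domain, "spki-a")
```

Here `needs_attestation` and `original_is_cached` are true and `rotated_is_cached` is false, which mirrors the four cases listed above.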

CLI options

python proxy.py [--port PORT] [--host HOST]

| Flag | Default | Description |
| --- | --- | --- |
| `--port` | `8080` | Port to listen on |
| `--host` | `127.0.0.1` | Address to bind to |
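
For reference, the two flags could be declared with argparse roughly as below. This is a sketch of an equivalent parser built from the table above, and the helper name `build_parser` is hypothetical; the actual argument handling in proxy.py may differ.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Parser mirroring the documented flags and defaults."""
    parser = argparse.ArgumentParser(prog="proxy.py")
    parser.add_argument("--port", type=int, default=8080,
                        help="Port to listen on")
    parser.add_argument("--host", default="127.0.0.1",
                        help="Address to bind to")
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["--port", "9000"])
```

The default host of 127.0.0.1 keeps the proxy reachable only from the local machine.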

Requirements

- Python 3.11+
- aiohttp: HTTP server and client
- dcap-qvl: Intel TDX quote verification
- cryptography: X.509 certificate parsing and SPKI hash computation
