A Swift-based web API that exposes Apple's on-device Foundation Models through an OpenAI-compatible HTTP interface. Built with Vapor and designed to run on Apple Intelligence-enabled devices.
- OpenAI-compatible: Works with existing OpenAI/OpenRouter client libraries
- Non-chat completions: Single-prompt responses
- Chat completions: Multi-turn conversations with context
- Streaming responses: Real-time token streaming via Server-Sent Events
- Multiple models: Base and permissive content guardrails
Not yet implemented:
- Authentication
- Structured outputs
- Tool/function calling
- Tests
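Streaming responses use the standard OpenAI chunk format over Server-Sent Events. As a rough sketch of what a client does with the raw stream body (the `data:` framing, `[DONE]` sentinel, and `choices[0].delta.content` fields are the usual OpenAI conventions, assumed here rather than taken from this project's code):

```python
import json

def iter_stream_content(raw_stream: str):
    """Yield the text deltas from an OpenAI-style SSE stream body."""
    for line in raw_stream.splitlines():
        line = line.strip()
        if not line.startswith("data: "):
            continue  # skip blank keep-alive lines between events
        payload = line[len("data: "):]
        if payload == "[DONE]":  # OpenAI's end-of-stream sentinel
            break
        chunk = json.loads(payload)
        delta = chunk["choices"][0].get("delta", {})
        if delta.get("content"):  # role-only chunks carry no text
            yield delta["content"]

# Example with a captured stream body:
raw = (
    'data: {"choices": [{"delta": {"role": "assistant"}}]}\n'
    'data: {"choices": [{"delta": {"content": "Hel"}}]}\n'
    'data: {"choices": [{"delta": {"content": "lo"}}]}\n'
    'data: [DONE]\n'
)
print("".join(iter_stream_content(raw)))  # -> Hello
```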
Requirements:
- Apple Intelligence-enabled device
- Swift 6.0+
Build the project:
```shell
swift build
```

Run the server:

```shell
swift run AppleIntelligenceApi serve [--hostname, -H] [--port, -p] [--bind, -b] [--unix-socket]
```

The API will be available at http://localhost:8080 by default.
Port already in use:

```shell
lsof -i :8080                                 # Find out what's using the port
swift run AppleIntelligenceApi serve -p 9000  # Use a different port if needed
```

Apple Intelligence not available:
- Make sure it's enabled: Settings > Apple Intelligence & Siri
- Check your device is supported
This API follows the same standard as OpenAI and OpenRouter, so it should be straightforward to adopt.
For completeness, here are some examples...
Chat completion:
```shell
curl http://localhost:8080/api/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "base",
    "messages": [
      {"role": "user", "content": "What is the capital of France?"}
    ]
  }'
```

Streaming response:

```shell
curl http://localhost:8080/api/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "base",
    "messages": [
      {"role": "user", "content": "Tell me a story"}
    ],
    "stream": true
  }'
```

Python (OpenAI SDK):

```python
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8080/api/v1",
    api_key="not-needed"
)

response = client.chat.completions.create(
    model="base",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

print(response.choices[0].message.content)
```

JavaScript (OpenAI SDK):

```javascript
import OpenAI from 'openai';

const client = new OpenAI({
  baseURL: 'http://localhost:8080/api/v1',
  apiKey: 'not-needed'
});

const response = await client.chat.completions.create({
  model: 'base',
  messages: [{ role: 'user', content: 'Hello!' }],
  stream: true
});

for await (const chunk of response) {
  process.stdout.write(chunk.choices[0]?.delta?.content || '');
}
```

For a complete breakdown of how to use the API, see the OpenAI or OpenRouter documentation.
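Because responses follow the OpenAI standard, you can also skip the SDKs and work with the JSON directly. A sketch of pulling the reply text out of a hypothetical non-streaming response body (the field names come from the OpenAI chat-completion schema, and the example values are made up):

```python
import json

# A hypothetical response body in the standard OpenAI shape; actual
# values (id, model, content) will of course differ per request.
body = json.loads("""
{
  "id": "chatcmpl-123",
  "object": "chat.completion",
  "model": "base",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "The capital of France is Paris."},
      "finish_reason": "stop"
    }
  ]
}
""")

# The assistant's reply lives at choices[0].message.content
reply = body["choices"][0]["message"]["content"]
print(reply)  # -> The capital of France is Paris.
```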
Our API differs in a few key places:
- Available models: `base` (default guardrails) and `permissive` (relaxed content filtering)
- The server runs on-device, so no API key is needed
- Not all features are available!
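Switching between the two models is just a matter of the `model` field; a request body for the relaxed guardrails differs from the earlier examples only there (sketched as a plain Python dict):

```python
# Same request shape as the curl examples above; only "model" changes.
payload = {
    "model": "permissive",  # relaxed filtering instead of the "base" defaults
    "messages": [
        {"role": "user", "content": "Hello!"},
    ],
}
print(payload["model"])  # -> permissive
```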
We currently do not have any tests! If you would like to implement some, please make a PR.
```shell
swift test
```

- `./Sources/routes.swift`: API route definitions
- `./Sources/Utils/AbortErrors.swift`: Error type definitions
- `./Sources/Utils/RequestContent.swift`: Parses incoming requests
- `./Sources/Utils/ResponseSession.swift`: Interface to the Foundation Models framework
- `./Sources/Utils/ResponseGenerator.swift`: Generates responses
Contributions are welcome! Please feel free to submit issues or pull requests.
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'feat: Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a pull request
I currently do not plan to offer cloud inference for this API.
If you want to use the server on your local network, you can run the server from any Apple Intelligence-enabled device.
If you want cloud inference, you will probably want a VPS. Of course, the VPS must run on Apple Intelligence-capable hardware; providers such as HostMyApple and MacInCloud seem reasonable.
- Built with Vapor web framework
- Uses Apple's Foundation Models Framework
- OpenAI-compatible API design based on OpenRouter
This is an unofficial API wrapper for Apple Intelligence. It is not affiliated with or endorsed by Apple Inc. Use responsibly and in accordance with Apple's terms of service.