Omnix Studio

Omnix is a local multi-modal AI studio for orchestrating vision, speech, and text models entirely on your own machine. It also exposes a local HTTP API so that other applications can use Omnix as an inference engine.

Features

Multi-Modal: Support for Text, Vision, STT, TTS, Image Generation, and Music Generation.
Local First: All models run locally using WebGPU or WASM.
Theme Support: Polished Light and Dark modes.
Live Mode: Real-time screen and voice analysis.
Sandbox: Built-in environment for generating and running code.

Local API Guide

Omnix serves its local API at http://localhost:3000/api. Every endpoint below accepts POST requests.

Endpoints

1. Text Generation (`POST /api/text`)

Body: {"prompt": "string", "systemPrompt": "string"}
Response: {"response": "string"}

2. Vision Analysis (`POST /api/vision`)

Body: multipart/form-data
- image: File (Binary)
- prompt: string (Optional)
Response: {"caption": "string", "response": "string"}
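
A multipart upload is easy to get wrong by hand, so here is a minimal sketch of how a client might build the vision request using only Python's standard library. The field names `image` and `prompt` come from the spec above; the helper name `build_vision_request` and the default `image/png` MIME type are assumptions made for illustration.

```python
import urllib.request
import uuid
from typing import Optional

VISION_URL = "http://localhost:3000/api/vision"  # base URL from this guide


def build_vision_request(
    image_bytes: bytes,
    filename: str,
    prompt: Optional[str] = None,
    mime_type: str = "image/png",  # assumption: set this to the real image type
) -> urllib.request.Request:
    """Build a multipart/form-data POST with an `image` file and optional `prompt`."""
    boundary = uuid.uuid4().hex
    parts = []
    if prompt is not None:
        # Plain text field: boundary line, headers, blank line, value.
        parts += [
            f"--{boundary}".encode(),
            b'Content-Disposition: form-data; name="prompt"',
            b"",
            prompt.encode("utf-8"),
        ]
    # File field, followed by the closing boundary.
    parts += [
        f"--{boundary}".encode(),
        f'Content-Disposition: form-data; name="image"; filename="{filename}"'.encode(),
        f"Content-Type: {mime_type}".encode(),
        b"",
        image_bytes,
        f"--{boundary}--".encode(),
        b"",
    ]
    return urllib.request.Request(
        VISION_URL,
        data=b"\r\n".join(parts),
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )
```

With Omnix running, passing the returned request to `urllib.request.urlopen` and decoding the body with `json.loads` should yield the `caption` and `response` fields described above. The same approach would likely work for the speech-to-text endpoint by swapping the URL and using an `audio` field.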

3. Director Routing (`POST /api/director`)

Body: {"prompt": "string"}
Response: {"intent": "string", "prompt": "string"}
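
One plausible way to use the director is to let its `intent` pick which endpoint receives the returned `prompt`. The sketch below illustrates that idea only. This README does not list the intent values the director emits, so the keys in `INTENT_TO_ENDPOINT` are hypothetical placeholders, and `plan_followup` is an illustrative helper rather than part of Omnix.

```python
# Hypothetical mapping from director intents to endpoint names.
# The real intent strings are not documented here; adjust these keys
# after inspecting actual responses from /api/director.
INTENT_TO_ENDPOINT = {
    "text": "text",
    "image": "image",
    "music": "music",
}


def plan_followup(director_reply: dict, default_endpoint: str = "text") -> tuple:
    """Turn a director response into (endpoint, JSON payload) for the next call.

    Unknown or missing intents fall back to `default_endpoint`.
    """
    intent = director_reply.get("intent", "")
    endpoint = INTENT_TO_ENDPOINT.get(intent, default_endpoint)
    return endpoint, {"prompt": director_reply.get("prompt", "")}
```

Falling back to a default endpoint keeps the client usable even when the director returns an intent the mapping does not cover.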

4. Image Generation (`POST /api/image`)

Body: {"prompt": "string"}
Response: {"status": "success", "url": "string"}

5. Music Generation (`POST /api/music`)

Body: {"prompt": "string"}
Response: {"status": "success", "audioUrl": "string"}

6. Speech-to-Text (`POST /api/stt`)

Body: multipart/form-data
- audio: File (WAV/MP3)
Response: {"text": "string"}

7. Text-to-Speech (`POST /api/tts`)

Body: {"text": "string", "voice": "string"}
Response: {"status": "success", "audioUrl": "string"}

Example Usage (CURL)

```
curl -X POST http://localhost:3000/api/text \
     -H "Content-Type: application/json" \
     -d '{"prompt": "Hello Omnix!"}'
```
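
For callers that prefer a script over the shell, the curl call above translates to Python's standard library roughly as follows. This is a minimal sketch: the helper names `build_json_request` and `post_json` and the 120-second timeout are assumptions, while the URL and payload shape come from this guide.

```python
import json
import urllib.request

BASE_URL = "http://localhost:3000/api"  # base URL from this guide


def build_json_request(endpoint: str, payload: dict) -> urllib.request.Request:
    """Build a POST request with a JSON body for one of the JSON endpoints."""
    return urllib.request.Request(
        url=f"{BASE_URL}/{endpoint}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_json(endpoint: str, payload: dict, timeout: float = 120.0) -> dict:
    """Send the request and decode the JSON reply (requires Omnix to be running)."""
    request = build_json_request(endpoint, payload)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call, equivalent to the curl command:
#   reply = post_json("text", {"prompt": "Hello Omnix!"})
#   print(reply["response"])
```

The same helper should cover the other JSON endpoints (`director`, `image`, `music`, `tts`) by changing the endpoint name and payload.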

Electron Setup Guide

The desktop version of Omnix provides a raised memory limit (up to 16 GB of heap), WebGPU acceleration, and native filesystem integration.

Precompiled

Prerequisites

Node.js: v18 or higher recommended.
npm: The package manager bundled with Node.js.

Installation

Clone the repository (if you haven't already):

```
git clone https://github.com/LoanLemon/Omnix
cd Omnix
```

Install dependencies:
```
npm install
```

Running the Application

Development Mode

To run the app in development mode with hot reloading, start the Vite development server:
```
npm run start
```

Production Build

To package the application for production, build the app:
```
# Note: You may need to install electron-builder or electron-packager for full distribution
npm run electron:build
```

Desktop Features

Unrestricted RAM: Up to 16GB of heap memory for large models.
WebGPU Acceleration: Hardware acceleration enabled by default.
Minimize to Tray: Moves to system tray on close/minimize.
Local Filesystem: Direct interaction with local files.

Troubleshooting

WebGPU Errors: Ensure your graphics drivers are up to date. Some older GPUs may not support WebGPU.
Port Conflicts: If port 3000 is already in use, the Electron app may fail to connect in development mode. Stop the process occupying the port, then restart Omnix.
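
To confirm a port conflict before launching, a quick check along these lines may help. It is a sketch using Python's standard `socket` module; the helper name `port_in_use` is illustrative, and it only detects listeners on the loopback address.

```python
import socket


def port_in_use(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(1.0)
        # connect_ex returns 0 on success instead of raising an exception.
        return sock.connect_ex((host, port)) == 0


# Example: check the port the Omnix API uses.
#   if port_in_use(3000):
#       print("Port 3000 is occupied; stop the other process first.")
```

A `True` result while Omnix is not running suggests that another application holds the port.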

Developed by Dustin Lee at LemOne Labs.

