InferHub

Unified multimodal AI inference platform exposing LLM, ASR, TTS and Vision through low-latency APIs with streaming, observability and rollout controls.

Phase 1 creates the production-shaped foundation:

- FastAPI API gateway bootstrap
- typed configuration
- PostgreSQL, Redis, ClickHouse and Kafka local dependencies
- health and readiness endpoints
- Groq integration scaffold
- Docker Compose and floci local cloud notes
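
To illustrate how typed configuration and the health and readiness checks might fit together, here is a minimal standard-library sketch. It is not the repository's implementation: the `Settings` fields, the environment variable names, the `tcp_probe` helper and the response shapes are all assumptions made for illustration. Liveness only reports that the process is up, while readiness also probes each dependency.

```python
import asyncio
import os
from dataclasses import dataclass
from typing import Awaitable, Callable, Dict, Mapping, Optional


@dataclass(frozen=True)
class Settings:
    """Typed settings. Field names and defaults are hypothetical."""

    postgres_host: str = "localhost"
    postgres_port: int = 5432
    redis_host: str = "localhost"
    redis_port: int = 6379
    probe_timeout_s: float = 1.0

    @classmethod
    def from_env(cls, env: Optional[Mapping[str, str]] = None) -> "Settings":
        # Read from the given mapping, or from the process environment.
        env = os.environ if env is None else env
        return cls(
            postgres_host=env.get("POSTGRES_HOST", "localhost"),
            postgres_port=int(env.get("POSTGRES_PORT", "5432")),
            redis_host=env.get("REDIS_HOST", "localhost"),
            redis_port=int(env.get("REDIS_PORT", "6379")),
            probe_timeout_s=float(env.get("PROBE_TIMEOUT_S", "1.0")),
        )


async def tcp_probe(host: str, port: int, timeout_s: float) -> bool:
    """Return True if a TCP connection opens within the timeout."""
    try:
        _, writer = await asyncio.wait_for(
            asyncio.open_connection(host, port), timeout_s
        )
    except (OSError, asyncio.TimeoutError):
        return False
    writer.close()
    try:
        await writer.wait_closed()
    except OSError:
        pass  # The connection opened, so the probe still counts as a success.
    return True


Check = Callable[[], Awaitable[bool]]


def liveness() -> dict:
    """Liveness: the process is running; no dependencies are consulted."""
    return {"status": "ok"}


async def readiness(checks: Dict[str, Check]) -> dict:
    """Readiness: run all dependency checks concurrently and aggregate them."""
    names = list(checks)
    results = await asyncio.gather(
        *(checks[name]() for name in names), return_exceptions=True
    )
    # A check that raised, or returned anything other than True, counts as down.
    deps = {name: result is True for name, result in zip(names, results)}
    status = "ready" if all(deps.values()) else "not_ready"
    return {"status": status, "dependencies": deps}


def default_checks(settings: Settings) -> Dict[str, Check]:
    """Build TCP-level probes for PostgreSQL and Redis from typed settings."""
    t = settings.probe_timeout_s
    return {
        "postgres": lambda: tcp_probe(settings.postgres_host, settings.postgres_port, t),
        "redis": lambda: tcp_probe(settings.redis_host, settings.redis_port, t),
    }
```

In a FastAPI gateway, `liveness` and `readiness(default_checks(Settings.from_env()))` could back two GET routes, with the readiness route returning HTTP 503 when the status is `not_ready`. A TCP connect is only a coarse signal, so a real probe would likely issue a protocol-level check such as a trivial SQL query or a Redis `PING`.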

See docs/phase-01.md for architecture, commands and Sarvam alignment. See docs/phase-02.md for authentication, authorization, rate limiting and model registry. See docs/phase-03.md for gRPC worker services and Groq integration. See docs/phase-04.md for client-facing inference APIs and WebSockets.

