Stars
A curated list with resources about node-based UIs
LLM abstractions that aren't obstructions
Example of a monorepo setup for python projects using uv
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and…
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Asterinas is a secure, fast, and general-purpose OS kernel, written in Rust and providing Linux-compatible ABI.
🔬 A Ruby library for carefully refactoring critical paths.
A GitHub Action for uploading files to a Google Cloud Storage (GCS) bucket.
This demo showcases different approaches to handling the delay during RAG (Retrieval-Augmented Generation) lookups in a voice-enabled AI assistant
OpenAI Realtime API Relay Server / CLI in Python. You can talk to it and it responds
This benchmark tests how well LLMs incorporate a set of 10 mandatory story elements (characters, objects, core concepts, attributes, motivations, etc.) in a short creative story
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
🦋 A way to manage your versioning and changelogs with a focus on monorepos
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Helw150 / levanter
Forked from stanford-crfm/levanterLegible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
Agents that build knowledge graphs and explore textual worlds by asking questions
Official implementation of the TTS model Lina-Speech
Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.
A minimal library for writing text adventure games in Python 3
OpenAPI / Swagger, AsyncAPI & Semoasa definitions to (re)Slate compatible markdown