
Starred repositories
LLM plugin to access Google's Gemini family of models
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
IE's cross platform midi map editor for arbitrary controllers
Official PyTorch implementation for "Large Language Diffusion Models"
A simple tool to extract data from PDFs and images using Google's gemini-2.0-flash-001 model.
Prompt, run, edit, and deploy full-stack web applications
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
A web UI for the `llm` command line tool
Build LangGraph agents with large numbers of tools
Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems
an experimental new programming language based on interaction nets
Implementing the 4 agentic patterns from scratch
Official Repo for "TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding"
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1Wan 2.1 for the GPU Poor
A music frequency visualizer using D3.js
Exploration of automated dataset selection approaches at large scales.
ASLP-lab / C2SER
Forked from zxzhao0/C2SERWe propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through contextual perception and chain of Thought (CoT).
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion