-
Spyral AI
- London, UK
-
12:48
(UTC -12:00) - https://www.spyral.ai
- @rob_clucas
LLM
A modular graph-based Retrieval-Augmented Generation (RAG) system
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
A blazing fast inference solution for text embeddings models
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
A lightweight data processing framework built on DuckDB and 3FS.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
FlashMLA: Efficient Multi-head Latent Attention Kernels





