- Taiwan
Stars
This is the reading list of Large Language Model-Based Data Science Agent
A flexible distributed key-value datastore that is optimized for caching and other realtime workloads.
Limit Order Book for high-frequency trading (HFT), as described by WK Selph, implemented in Python3 and C
An up-to-date list of time-series related papers in AI venues.
A lightweight data processing framework built on DuckDB and 3FS.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Merlion: A Machine Learning Framework for Time Series Intelligence
Toolkit for linearizing PDFs for LLM datasets/training
The native Rust implementation for Apache Hudi, with Python API bindings.
Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
verl: Volcano Engine Reinforcement Learning for LLMs
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl
Rust implementation of Apache Iceberg with integration for Datafusion
Avellaneda-Stoikov HFT market making algorithm implementation
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Open source Loom alternative. Beautiful, shareable screen recordings.
Comfortably monitor your Internet traffic 🕵️♂️
Penpot: The open-source design tool for design and code collaboration
Everything you need to build state-of-the-art foundation models, end-to-end.
A cli for spinning up and managing Ray clusters for the Daft Query Engine.