thomas-yanxin

Regular bencher

thomas-yanxin thomas-yanxin

Regular bencher

不是逢人苦眷君，亦狂亦侠亦温文。

283 followers · 182 following

Achievements

x2 x3

Achievements

x2 x3

Highlights

Developer Program Member

Organizations

Lists (13)

Sort

Starred repositories

Roblox / cube

Roblox Foundation Model for 3D Intelligence

Jupyter Notebook 278 15 Updated Mar 21, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 24,471 2,135 Updated Mar 21, 2025

pzhren / InfiniteWorld

Python 44 9 Updated Feb 19, 2025

UMass-Embodied-AGI / 3D-Mem

[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"

Python 87 4 Updated Mar 16, 2025

Psi-Robot / DexGraspVLA

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Python 158 10 Updated Mar 20, 2025

fuse-model / FuSe

Python 41 1 Updated Jan 13, 2025

valeoai / VideoActionModel

VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).

Jupyter Notebook 74 5 Updated Mar 7, 2025

freddyaboulton / fastrtc

The python library for real-time communication

JavaScript 3,172 270 Updated Mar 21, 2025

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,068 77 Updated Mar 2, 2025

Tencent / Hunyuan3D-1

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 3,347 255 Updated Jan 21, 2025

abi / screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 69,168 8,519 Updated Mar 20, 2025

fishaudio / audio-preprocess

Preprocess Audio for training

Python 318 58 Updated Mar 3, 2025

THUDM / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 2,778 230 Updated Dec 5, 2024

opendilab / CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体！

Python 375 36 Updated Mar 4, 2025

g-battaglia / kerykeion

Data-Driven Astrology 💫 Kerykeion is a Python library for astrology. It generates SVG charts and extracts detailed data for birth charts, synastry, transits, and composite charts.

Python 373 128 Updated Mar 6, 2025

theriftlab / immanuel-python

Quickly produce both human-readable and JSON-formatted astrology chart data based on the Swiss Ephemeris and astro.com.

Python 67 14 Updated Mar 17, 2025

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,786 8,907 Updated Aug 14, 2024

chenfei-wu / TaskMatrix

Python 34,512 3,298 Updated Jan 6, 2024

TEN-framework / TEN-Agent

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaki…

Python 5,156 584 Updated Mar 21, 2025

thomas-yanxin / LLM-Inference

5 Updated Sep 24, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,860 196 Updated Nov 14, 2024

arcee-ai / EvolKit

EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).

Jupyter Notebook 207 24 Updated Oct 30, 2024