Skip to content
View upbit's full-sized avatar
  • China

Block or report upbit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A Comprehensive Solution for Identifying and Managing Duplicate Photos in Immich

Python 319 25 Updated Oct 18, 2024

An alternative to the immich-CLI command that doesn't depend on nodejs installation. It tries its best for importing google photos takeout archives.

Go 2,726 82 Updated Mar 26, 2025

GGUF Quantization support for native ComfyUI models

Python 1,729 109 Updated Mar 23, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,207 997 Updated Mar 26, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,364 807 Updated Mar 27, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,379 810 Updated Mar 1, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,097 531 Updated Mar 26, 2025

Development repository for the Triton language and compiler

MLIR 15,006 1,889 Updated Mar 27, 2025

https://wavespeed.ai/ [WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.

Python 908 36 Updated Mar 27, 2025

Get your Pixiv token easily (for running upbit/pixivpy)

Python 131 10 Updated Mar 4, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 5,377 526 Updated Mar 24, 2025

An open-source, cross-platform terminal for seamless workflows

Go 9,804 321 Updated Mar 25, 2025

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero

Python 19,430 1,612 Updated Mar 24, 2025
Go 2,584 81 Updated Mar 19, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…

C++ 5,395 609 Updated Mar 27, 2025

Call Between Golang and Rust Asynchronously

Rust 271 20 Updated Mar 17, 2025

A Loki hook for Logrus

Go 15 6 Updated Jan 21, 2023

A Windows GUI toolkit for the Go Programming Language

Go 6,947 897 Updated Jan 21, 2024

Pure Go implementation of the WebRTC API

Go 14,479 1,702 Updated Mar 27, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,937 651 Updated Mar 27, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,329 639 Updated Feb 10, 2025

Community fork of PlayCover

Swift 9,241 790 Updated Feb 3, 2025

Application Kernel for Containers

Go 16,271 1,350 Updated Mar 27, 2025

Manage apps of iOS devices

C 1,283 266 Updated Dec 2, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,081 1,374 Updated Mar 3, 2025

real time face swap and one-click video deepfake with only a single image

Python 48,457 7,118 Updated Mar 26, 2025

MLX: An array framework for Apple silicon

C++ 19,952 1,141 Updated Mar 27, 2025

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 27,183 1,661 Updated Mar 21, 2025

my blog

C# 23 2 Updated Oct 6, 2024

Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

Python 5,962 426 Updated Mar 27, 2025
Next
Showing results