Skip to content
View HYPJUDY's full-sized avatar
🌌
(๑>◡<๑)
🌌
(๑>◡<๑)

Highlights

  • Pro

Organizations

@researchmm

Block or report HYPJUDY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,379 1,440 Updated Mar 10, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 25,857 2,488 Updated Mar 27, 2025

A fork to add multimodal model training to open-r1

Python 1,130 58 Updated Feb 8, 2025

Fully open reproduction of DeepSeek-R1

Python 23,398 2,126 Updated Mar 27, 2025

Investigating CoT Reasoning in Autoregressive Image Generation

Python 567 20 Updated Mar 26, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,208 245 Updated Mar 26, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,879 6,505 Updated Mar 27, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 82,294 61,042 Updated Mar 24, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 24,568 2,148 Updated Mar 27, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,452 146 Updated Mar 27, 2025

Playing Pokemon Red with Reinforcement Learning

Jupyter Notebook 7,247 689 Updated Mar 6, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,586 74 Updated Feb 11, 2025

Get your documents ready for gen AI

Python 25,491 1,521 Updated Mar 26, 2025

This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.

304 9 Updated Mar 13, 2025

Official inference framework for 1-bit LLMs

C++ 12,847 907 Updated Feb 18, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,956 112 Updated Jul 29, 2024

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Python 104 11 Updated Sep 17, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 618 400 Updated Jul 4, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,937 651 Updated Mar 27, 2025

Deezer source separation library including pretrained models.

Python 26,596 2,911 Updated Jan 24, 2025

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Jupyter Notebook 406 51 Updated Jan 10, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,329 639 Updated Feb 10, 2025

Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)

Python 79 4 Updated Sep 21, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,302 395 Updated Mar 12, 2025

Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.

Python 114 19 Updated Mar 18, 2023

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,683 355 Updated Dec 7, 2024

Tools to download and cleanup Common Crawl data

Python 993 147 Updated Apr 25, 2023

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,428 145 Updated Jun 10, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,633 74 Updated Aug 15, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,384 77 Updated Sep 27, 2024
Next
Showing results