Skip to content
View Hytn's full-sized avatar

Block or report Hytn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Sky-T1: Train your own O1 preview model within $450

Python 3,072 312 Updated Mar 2, 2025

LLM for Scientific Research Survey

64 1 Updated Jan 22, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,369 82 Updated Feb 19, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 2,673 437 Updated Jan 24, 2025

Fantastic Data Engineering for Large Language Models

79 4 Updated Dec 29, 2024

Recipes to scale inference-time compute of open models

Python 1,033 104 Updated Feb 25, 2025

Ongoing research training transformer models at scale

Python 11,670 2,618 Updated Mar 7, 2025

Build resilient language agents as graphs.

Python 9,794 1,627 Updated Mar 6, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 102,551 16,611 Updated Mar 7, 2025

Simple, unified interface to multiple Generative AI providers

Python 11,585 1,122 Updated Mar 6, 2025

A curated, but incomplete, list of data-centric AI resources.

1,084 78 Updated Jun 26, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,370 531 Updated Mar 7, 2025

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 17,756 4,218 Updated Jan 8, 2025

🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Ollama, etc s…

Python 1,240 60 Updated Feb 3, 2025

150+ quantitative finance Python programs to help you gather, manipulate, and analyze stock market data

Python 2,498 133 Updated Aug 20, 2024

A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)

Python 19,488 2,742 Updated Mar 7, 2025

Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

Python 436 27 Updated Jan 15, 2025

DSBench: How Far are Data Science Agents from Becoming Data Science Experts?

Jupyter Notebook 43 3 Updated Feb 19, 2025

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 4,052 1,073 Updated Jan 1, 2025

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 1,757 156 Updated Mar 7, 2025

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,293 140 Updated Mar 7, 2025

AIDE: AI-Driven Exploration in the Space of Code. State of the Art machine Learning engineering agents that automates AI R&D.

Python 775 97 Updated Feb 26, 2025

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 630 78 Updated Jan 14, 2025

O1 Replication Journey

1,969 65 Updated Jan 14, 2025
Python 1,341 51 Updated Nov 21, 2024

A platform for developers to simulate collaborative research activities

Python 140 20 Updated Mar 6, 2025
Jupyter Notebook 27 4 Updated Mar 3, 2025

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Python 2,176 526 Updated Mar 7, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,592 480 Updated Jan 8, 2024
Next
Showing results