Skip to content
View melnimr's full-sized avatar
🎯
Exploring vs. Exploiting...
🎯
Exploring vs. Exploiting...
  • LA, California, USA
  • 02:31 (UTC -07:00)

Block or report melnimr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. ViZDoom ViZDoom Public

    Forked from Farama-Foundation/ViZDoom

    Reinforcement Learning environments based on the 1993 game Doom :godmode:

    C++

  2. unslothai/unsloth unslothai/unsloth Public

    Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

    Python 44.4k 3.6k

  3. Farama-Foundation/ViZDoom Farama-Foundation/ViZDoom Public

    Reinforcement Learning environments based on the 1993 game Doom :godmode:

    C++ 1.9k 420

  4. aae-train-donkeycar aae-train-donkeycar Public

    Forked from araffin/aae-train-donkeycar

    Code used to train an augmented auto-encoder (aka denoising auto-encoder with more augmentations) for the DonkeyCar simulator.

    Python

  5. amazon-sagemaker-examples amazon-sagemaker-examples Public

    Forked from aws/amazon-sagemaker-examples

    Example notebooks that show how to apply machine learning, deep learning and reinforcement learning in Amazon SageMaker

    Jupyter Notebook

  6. awesome-rl awesome-rl Public

    Forked from aikorea/awesome-rl

    Reinforcement learning resources curated