feat: Add unsloth Dockerfile and gptoss 20b finetuning notebook for strix halo #126

0xrushi · 2025-11-13T12:20:32Z

No description provided.

gemini-code-assist · 2025-11-13T12:20:46Z

Summary of Changes

Hello @0xrushi, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly expands the platform's capabilities by introducing dedicated support for fine-tuning large language models on AMD Strix Halo hardware. It provides a complete environment setup via a new Dockerfile and a practical, step-by-step guide in a Jupyter notebook, enabling users to leverage Unsloth for efficient LLM training on this specific AMD architecture.

Highlights

New Dockerfile for AMD Strix Halo: A new Dockerfile (Dockerfile_Strix_Halo) has been added to set up a ROCm-enabled environment specifically for AMD Strix Halo, including necessary libraries like bitsandbytes, xformers, flash-attention, and Unsloth.
GPT-OSS 20B Fine-tuning Notebook: A Jupyter notebook (original_template/gpt-oss-(20B)_StrixHalo-Fine-tuning.ipynb) has been introduced, providing a comprehensive tutorial for fine-tuning the gpt-oss-20b large language model on AMD Strix Halo using the Unsloth library.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request adds a Dockerfile and a Jupyter notebook to support fine-tuning the gpt-oss-20b model on AMD Strix Halo hardware. The additions are a solid foundation for this new capability. My review provides feedback to improve the Dockerfile's security, efficiency, and image size, as well as to enhance the notebook's correctness, maintainability, and reliability by addressing issues like incorrect cell dependencies and redundant code. I've included specific code suggestions to help implement these improvements.

gemini-code-assist · 2025-11-13T12:23:13Z

Dockerfile_Strix_Halo

+RUN apt-get update && apt-get install -y sudo && \
+    useradd -m -s /bin/bash user && \
+    usermod -aG sudo user && \
+    echo "user ALL=(ALL) NOPASSWD:ALL" >> /etc/sudoers && \
+    apt-get clean && rm -rf /var/lib/apt/lists/*


Granting the user passwordless sudo for all commands (NOPASSWD:ALL) poses a significant security risk, as it provides unrestricted root access without any authentication. While this may be convenient for a development environment, it is a dangerous practice. Consider running as the root user and only switching to a non-root user at the end of the Dockerfile, or grant more specific, limited sudo permissions if root access is required for certain operations.

we should definitely prevent passwordless sudo for all commands.

gemini-code-assist · 2025-11-13T12:23:13Z

original_template/gpt-oss-(20B)_StrixHalo-Fine-tuning.ipynb

+   "source": [
+    "## Dataset Preparation\n",
+    "\n",
+    "Loads a small quotes dataset, converts to chat format, and compiles Harmony-style text with the tokenizer's chat template."
+   ]


The 'Quick Model Smoke Test' cell is marked as optional, but the 'Dataset Preparation' cell that follows depends on the tokenizer object created in the smoke test. If a user skips the optional cell, the notebook will fail with a NameError. The notebook's cell execution flow should be corrected to handle this dependency, for instance by moving the model and tokenizer loading to before the dataset preparation cell or by making the smoke test mandatory.

gemini-code-assist · 2025-11-13T12:23:13Z

Dockerfile_Strix_Halo

+WORKDIR /opt/src
+
+# bitsandbytes (ROCm)
+RUN git clone -b rocm_enabled_multi_backend https://github.com/ROCm/bitsandbytes.git


To reduce the Docker image size, it's a good practice to use git clone --depth 1 to perform a shallow clone. This avoids downloading the entire commit history for repositories that are only used for building dependencies. This recommendation applies to all git clone operations in this file (lines 30, 38, 43).

RUN git clone --depth 1 -b rocm_enabled_multi_backend https://github.com/ROCm/bitsandbytes.git

gemini-code-assist · 2025-11-13T12:23:13Z

Dockerfile_Strix_Halo

+RUN git clone https://github.com/ROCm/flash-attention.git
+RUN cd flash-attention && git checkout v2.7.4-cktile && python setup.py install


To optimize image size and reduce the number of layers, it's best to combine cloning, building, and cleanup into a single RUN layer. This prevents the cloned source code from being persisted in an intermediate layer. Using git clone --depth 1 is also recommended.

RUN git clone --depth 1 https://github.com/ROCm/flash-attention.git && \ cd flash-attention && git checkout v2.7.4-cktile && python setup.py install && \ cd .. && rm -rf flash-attention

gemini-code-assist · 2025-11-13T12:23:14Z

Dockerfile_Strix_Halo

+RUN python -m pip install --no-cache-dir .
+RUN python -m pip install --no-cache-dir jupyterlab ipywidgets ipykernel tqdm
+RUN python -m pip install --no-cache-dir 'unsloth_zoo>=2025.5.7'


Each RUN command creates a new layer in the Docker image. To optimize image size and reduce the number of layers, you should combine these consecutive pip install commands into a single RUN instruction.

RUN python -m pip install --no-cache-dir . jupyterlab ipywidgets ipykernel tqdm 'unsloth_zoo>=2025.5.7'

gemini-code-assist · 2025-11-13T12:23:14Z

original_template/gpt-oss-(20B)_StrixHalo-Fine-tuning.ipynb

+    "Option B — Local Docker build from this repo:\n",
+    "\n",
+    "```bash\n",
+    "docker build -f Dockerfile -t unsloth-strix-halo .\n",


The docker build command refers to Dockerfile, but the Dockerfile added in this pull request is named Dockerfile_Strix_Halo. The filename in the command should be updated to match the new file to avoid errors for users following these instructions.

Suggested change

"docker build -f Dockerfile -t unsloth-strix-halo .\n",

"docker build -f Dockerfile_Strix_Halo -t unsloth-strix-halo .\n",

gemini-code-assist · 2025-11-13T12:23:14Z

original_template/gpt-oss-(20B)_StrixHalo-Fine-tuning.ipynb

+    "dtype = None\n",
+    "\n",
+    "model, tokenizer = FastLanguageModel.from_pretrained(\n",
+    "    model_name       = \"unsloth/gpt-oss-20b\",\n",


The model name is hardcoded here. It's better to use the MODEL_NAME variable defined in the 'Configuration and Hyperparameters' cell to ensure consistency and make it easier to change the model for different runs.

Suggested change

" model_name = \"unsloth/gpt-oss-20b\",\n",

" model_name = MODEL_NAME,\n",

gemini-code-assist · 2025-11-13T12:23:14Z

original_template/gpt-oss-(20B)_StrixHalo-Fine-tuning.ipynb

+    "LR      = 2e-4\n",
+    "EPOCHS  = 1\n",
+    "BATCH_SIZE = 1  # keep small for safety\n",


The hyperparameters LR, EPOCHS, and BATCH_SIZE are redefined here, but they have already been defined in the 'Configuration and Hyperparameters' cell. To improve maintainability and avoid potential inconsistencies, you should remove these redundant definitions and use the variables set in the configuration cell.

danielhanchen · 2025-11-18T04:37:23Z

Oh thank you! We'll review this

strixhalo/gptossnotebok

457994b

gemini-code-assist bot reviewed Nov 13, 2025

View reviewed changes

0xrushi mentioned this pull request Nov 13, 2025

fix: unsloth fixes for gfx1151 unslothai/unsloth#3588

Open

0xrushi and others added 3 commits November 13, 2025 07:36

cleanup

5d03dcf

Remove depth option from git clone commands

5b7ab8d

Restore old dockerfile

be60c00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add unsloth Dockerfile and gptoss 20b finetuning notebook for strix halo #126

feat: Add unsloth Dockerfile and gptoss 20b finetuning notebook for strix halo #126

0xrushi commented Nov 13, 2025

Uh oh!

gemini-code-assist bot commented Nov 13, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Nov 13, 2025

Uh oh!

rolandtannous Nov 20, 2025

Uh oh!

gemini-code-assist bot Nov 13, 2025

Uh oh!

gemini-code-assist bot Nov 13, 2025

Uh oh!

gemini-code-assist bot Nov 13, 2025

Uh oh!

gemini-code-assist bot Nov 13, 2025

Uh oh!

gemini-code-assist bot Nov 13, 2025

Uh oh!

gemini-code-assist bot Nov 13, 2025

Uh oh!

gemini-code-assist bot Nov 13, 2025

Uh oh!

danielhanchen commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		RUN git clone https://github.com/ROCm/flash-attention.git
		RUN cd flash-attention && git checkout v2.7.4-cktile && python setup.py install

	"docker build -f Dockerfile -t unsloth-strix-halo .\n",
	"docker build -f Dockerfile_Strix_Halo -t unsloth-strix-halo .\n",

	" model_name = \"unsloth/gpt-oss-20b\",\n",
	" model_name = MODEL_NAME,\n",

feat: Add unsloth Dockerfile and gptoss 20b finetuning notebook for strix halo #126

Are you sure you want to change the base?

feat: Add unsloth Dockerfile and gptoss 20b finetuning notebook for strix halo #126

Conversation

0xrushi commented Nov 13, 2025

Uh oh!

gemini-code-assist bot commented Nov 13, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

rolandtannous Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

danielhanchen commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants