Build 2025 Lab329

Fine-Tune End-to-End Distillation Models with Azure AI Foundry Models and Foundry Local

This workshop provides an in-depth journey into fine-tuning an end-to-end distillation process utilizing DeepSeek V3 as a teacher model and Phi4-mini as a student model. Participants will explore the theoretical underpinnings, practical applications, and engage in hands-on exercises to perfect the art of implementing and optimizing distillation techniques in AI projects.

Key topics include the concept of model distillation and its significance in modern AI, an overview of DeepSeek V3 and Phi4-mini, step-by-step demonstrations of the fine-tuning process, real-world case studies, and discussions of best practices and optimization strategies. Attendees will gain insights into leveraging the Azure AI Foundry, and the Azure AI Foundry Models to streamline model selection, enhance fine-tuning efficiency, and optimize deployment strategies and consumption of Local model with ONNX and Foundry Local.

Tailored for data scientists, machine learning engineers, and AI enthusiasts, this session equips attendees with critical skills to elevate their AI solutions through advanced distillation techniques and Azure-powered tooling.

This workshop provides hands-on experience with model distillation using Microsoft Azure AI Foundry. Learn how to extract knowledge from Large Language Models (LLMs) and transfer it to Smaller Language Models (SLMs) while maintaining good performance and validate the model with the ONNX GenAI Runtime and Foundry Local.

Workshop Overview

Through a series of notebooks, this workshop demonstrates the complete workflow of model distillation, fine-tuning, and deployment using Azure Machine Learning (AzureML) platform, with a particular focus on optimizing models and deploying them to production environments.

Folder Structure

Lab329/: Main workshop content
- Notebooks/: Jupyter notebooks implementing the entire distillation process
- LocalFoundryEnv/: Configuration files for local ONNX inference on edge devices
lab_manual/: Detailed lab manual with step-by-step instructions

Workshop Flow

The workshop follows these key steps:

Knowledge Distillation (01.AzureML_Distillation.ipynb):
- Load a commonsense QA dataset from Hugging Face
- Prepare data for knowledge distillation
- Use a "teacher" model to generate high-quality answers for training the "student" model
Model Fine-tuning and Conversion (02.AzureML_FineTuningAndConvertByMSOlive.ipynb):
- Fine-tune the Phi-4-mini model using the LoRA (Low-Rank Adaptation) method
- Use Microsoft Olive tools to optimize and convert the model to ONNX format
- Apply quantization techniques (int4 precision) to decrease model size
Model Inference Using ONNX Runtime GenAI (03.AzureML_RuningByORTGenAI.ipynb):
- Load the optimized model in ONNX format
- Configure adapters and tokenizers
- Perform inference and generate responses
Model Registration to AzureML (04.AzureML_RegisterToAzureML.ipynb):
- Register the optimized model to the Azure Machine Learning workspace
- Set appropriate model metadata for deployment
Local Model Download (05.Local_Download.ipynb):
- Download registered models for local development or deployment
Local Inference (06.Local_Inference.ipynb):
- Run the optimized ONNX model locally for inference using ONNX GenAI Runtime
- Test model performance and capabilities
Local Inference with Foundry Local (07.Local_inference_AIFoundry.ipynb):
- Use Azure AI Foundry for local model inference
- Explore integration with the Microsoft Foundry Local platform

Session Resources

Resources	Links	Description
Build session page	https://build.microsoft.com/sessions/LAB329	Event session page with downloadable recording, slides, resources, and speaker bio
Microsoft Learn	https://aka.ms/build25/plan/CreateAgenticAISolutions	Official Collection or Plan with skilling resources to learn at your own pace

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Name		Name	Last commit message	Last commit date
Latest commit History 208 Commits
.devcontainer		.devcontainer
.github		.github
Lab329		Lab329
img		img
infra		infra
lab_manual		lab_manual
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
azure.yaml		azure.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Build 2025 Lab329

Fine-Tune End-to-End Distillation Models with Azure AI Foundry Models and Foundry Local

Workshop Overview

Folder Structure

Workshop Flow

Session Resources

Contributing

Trademarks

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 6

Languages

License

microsoft/Build25-LAB329

Folders and files

Latest commit

History

Repository files navigation

Build 2025 Lab329

Fine-Tune End-to-End Distillation Models with Azure AI Foundry Models and Foundry Local

Workshop Overview

Folder Structure

Workshop Flow

Session Resources

Contributing

Trademarks

About

Topics

Resources

License

Code of conduct

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 6

Languages

Packages