Github Actions Workflow for Tinker and Azure #327
Pull request overview
This PR adds GitHub Actions workflows to automate CI testing for Tinker and Azure examples. Key improvements include parameterizable test configurations with --ci flags for faster verification, fixed dependency versions for reproducibility, enhanced timeout configurations, and improved error handling.
- Adds automated CI workflows for Tinker and Azure examples
- Introduces a `--ci` mode for faster test execution with smaller datasets
- Pins the crewai version and removes the git commit hash from the tinker-cookbook dependency
- Improves the LiteLLM configuration with longer timeouts and better error handling
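As a rough illustration of how such a `--ci` mode can be wired in, here is a minimal, hypothetical sketch using `argparse`; the function name and the specific dataset/batch values are illustrative, not taken from the PR's actual code.

```python
import argparse

def build_config(argv=None):
    """Return run parameters, shrunk when --ci is passed (illustrative values)."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--ci", action="store_true",
                        help="Run with a smaller dataset and batch size for CI")
    args = parser.parse_args(argv)
    if args.ci:
        # Small dataset and batch size for fast CI verification
        return {"n_samples": 32, "batch_size": 4}
    # Full-size run for real training
    return {"n_samples": 2000, "batch_size": 64}
```

The same pattern applies to any of the example scripts: the flag only selects smaller parameters, so the code path under test stays identical to the full run.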
Reviewed changes
Copilot reviewed 12 out of 13 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| uv.lock | Pins crewai to 1.2.0, unpins tinker-cookbook commit, adds new dependencies (ipython, pyvis, etc.) |
| scripts/litellm_run.sh | Makes port configurable via CLI argument with proper variable substitution |
| pyproject.toml | Pins crewai to ==1.2.0, removes git rev from tinker-cookbook |
| examples/tinker/tests/test_tinker_llm.py | Adds threaded store, launch mode config, and trajectory assertions |
| examples/tinker/q20_train.py | Adds CI mode support with smaller datasets and batch sizes |
| examples/tinker/q20_evaluate.py | Adds CI mode with sample limits, timeout increases, and result validation |
| examples/tinker/hello.py | Adds CI mode with reduced dataset sizes |
| examples/tinker/agl_tinker/llm.py | Switches to tinker tokenizer utility, adds timeout config, fixes callback setting |
| examples/azure/train_capital_agent.py | Adds CLI args for iterations and cleanup, adds try-finally for cleanup |
| examples/azure/aoai_finetune.py | Updates to new Azure API format, adds cleanup_deployments method |
| .github/workflows/examples-tinker.yml | New workflow for Tinker CI with multiple test scenarios |
| .github/workflows/examples-azure.yml | New workflow for Azure OpenAI examples |
| .github/workflows/benchmark.yml | Placeholder benchmark workflow |
```python
# If not adding return token ids, we need to add the opentelemetry callback.
# Otherwise, we set it to default.
callbacks=["opentelemetry"] if not add_return_token_ids else None,
```
Copilot AI (Nov 21, 2025):
The comment logic is inverted or unclear. The code sets callbacks to ['opentelemetry'] when add_return_token_ids is False, but the comment says 'If not adding return token ids, we need to add the opentelemetry callback. Otherwise, we set it to default.' This suggests the callback should be added when NOT adding token ids, which matches the code. However, the original code at line 309 was callbacks=['opentelemetry'] if add_return_token_ids else None, which is the opposite. Either clarify the comment to explain why the logic was inverted, or verify this is the correct behavior.
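To make the disputed conditional concrete, here is a minimal sketch of the PR's current behavior in isolation; the helper name `select_callbacks` is hypothetical, while `add_return_token_ids` and the `"opentelemetry"` callback name come from the snippet above.

```python
def select_callbacks(add_return_token_ids):
    """Current PR behavior: attach the OpenTelemetry callback only when
    token IDs are NOT being returned; otherwise leave callbacks unset."""
    return ["opentelemetry"] if not add_return_token_ids else None
```

The pre-PR code had the condition without the `not`, i.e. the callback was attached when token IDs *were* returned, which is exactly the inversion the review comment flags.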
```python
method={
    "type": "supervised",
    "supervised": {
        "hyperparameters": {
            "batch_size": self.finetune_batch_size,
            "learning_rate_multiplier": self.finetune_learning_rate,
            "n_epochs": self.finetune_epochs,
        }
    },
},
```
Copilot AI (Nov 21, 2025):
The API structure change from hyperparameters as a direct parameter to nested under method.supervised.hyperparameters appears to be a breaking change to the Azure OpenAI API. Verify this is compatible with the API version specified in the workflow (AZURE_OPENAI_API_VERSION: 2025-04-01-preview) and consider documenting the minimum required API version.
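For readers comparing the two request shapes, here is a minimal sketch of a payload built with the nested `method.supervised.hyperparameters` structure from the diff above; the helper name and the argument values are hypothetical, and the actual client call is omitted.

```python
def build_finetune_request(model, training_file, batch_size, learning_rate, epochs):
    """Build a fine-tuning request body with hyperparameters nested under
    method.supervised, rather than passed as a top-level parameter."""
    return {
        "model": model,
        "training_file": training_file,
        "method": {
            "type": "supervised",
            "supervised": {
                "hyperparameters": {
                    "batch_size": batch_size,
                    "learning_rate_multiplier": learning_rate,
                    "n_epochs": epochs,
                },
            },
        },
    }
```

Under the older shape, the `hyperparameters` dict sat at the top level of the request; whether a given Azure API version accepts the nested form is exactly what the comment asks the author to verify.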
```shell
sleep 5
python q20_train.py runner --n-runners 4 &
sleep 5
python q20_train.py algo --model qwen4b --ci
sleep 5

pkill -f agl && echo "SIGTERM sent to agl" || echo "No agl process found"
while pgrep -f agl; do
  echo "Waiting for agl to finish..."
  sleep 5
done
pkill -f q20_train.py && echo "SIGTERM sent to q20_train.py" || echo "No q20_train.py process found"
while pgrep -f q20_train.py; do
  echo "Waiting for q20_train.py to finish..."
  sleep 5
done
echo "q20_train.py has finished."
```
Copilot AI (Nov 21, 2025):
The process cleanup logic uses pkill -f with broad pattern matching (agl and q20_train.py) which could inadvertently kill unrelated processes on a shared runner. Consider using more specific process identifiers, storing PIDs from the background jobs, or adding a unique identifier to the commands to ensure only the correct processes are terminated.
Suggested change (replace the `pkill`-based cleanup above with PID tracking of the background jobs):

```shell
AGL_PID=$!
sleep 5
python q20_train.py runner --n-runners 4 &
RUNNER_PID=$!
sleep 5
python q20_train.py algo --model qwen4b --ci
sleep 5
# Cleanup: terminate background processes by PID
if kill -0 "$AGL_PID" 2>/dev/null; then
  kill "$AGL_PID" && echo "SIGTERM sent to agl (PID $AGL_PID)" || echo "Failed to send SIGTERM to agl"
else
  echo "No agl process found (PID $AGL_PID not running)"
fi
while kill -0 "$AGL_PID" 2>/dev/null; do
  echo "Waiting for agl (PID $AGL_PID) to finish..."
  sleep 5
done
if kill -0 "$RUNNER_PID" 2>/dev/null; then
  kill "$RUNNER_PID" && echo "SIGTERM sent to q20_train.py runner (PID $RUNNER_PID)" || echo "Failed to send SIGTERM to q20_train.py runner"
else
  echo "No q20_train.py runner process found (PID $RUNNER_PID not running)"
fi
while kill -0 "$RUNNER_PID" 2>/dev/null; do
  echo "Waiting for q20_train.py runner (PID $RUNNER_PID) to finish..."
  sleep 5
done
echo "q20_train.py runner has finished."
```
- Supports Tinker using a personal Tinker API key ($150).
- Supports Azure via Managed Identity.