
add How_to_Use_Pipeline_to_Evaluate_AI_Modes.md#67

Merged
davidwtf merged 6 commits into main from add/yolov5-evaluating
Nov 7, 2025

Conversation


@davidwtf davidwtf commented Sep 29, 2025

Summary by CodeRabbit

  • Documentation
    • Added a comprehensive user guide for evaluating AI models via a DevOps pipeline, using YOLOv5 as an example.
    • Covers prerequisites, preparing models and COCO-format validation data, and handling large files.
    • Details RBAC and pipeline setup, multi-stage evaluation job orchestration, pipeline parameters, triggers, and lifecycle behavior.
    • Explains integration with Evidently for metrics and dashboards and how to monitor results.


coderabbitai Bot commented Sep 29, 2025

Walkthrough

Adds a new English documentation page describing an end-to-end Alauda DevOps pipeline to evaluate AI models (example: YOLOv5) using Volcano jobs, COCO-style validation, and Evidently integration; includes prerequisites, RBAC/helpers, YAML and VolcanoJob examples, parameters, triggers, and monitoring steps.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| **Documentation: AI model evaluation pipeline guide**<br>`docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md` | New comprehensive guide covering prerequisites (Alauda DevOps, Volcano, Evidently, repos, GPUs), data/model preparation (COCO format, Git LFS), RBAC helper scripts, YAML Pipeline and VolcanoJob examples, parameter catalog (repos, evaluation, Evidently, resources), trigger/monitoring instructions, lifecycle and dashboard notes. |

Sequence Diagram(s)

```mermaid
sequenceDiagram
  autonumber
  actor User
  participant Pipeline as Alauda DevOps Pipeline
  participant Volcano as Volcano Scheduler
  participant Job as Eval Container
  participant YOLO as YOLOv5 val.py
  participant Evidently as Evidently Service

  rect rgb(235,245,255)
    User->>Pipeline: create PipelineRun (repos, params)
    Pipeline->>Volcano: submit VolcanoJob (YAML, env, volumes)
  end

  rect rgb(245,255,235)
    Volcano->>Job: schedule GPU task
    Job->>Job: clone repos, prepare COCO data (LFS handling)
    Job->>YOLO: run validation -> produce COCO metrics & artifacts
    YOLO-->>Job: metrics + artifacts
  end

  rect rgb(255,245,235)
    Job->>Evidently: upload report / create project (API key)
    Evidently-->>Job: report URL / status
    Job-->>Pipeline: attach artifacts & completion status
    Pipeline-->>User: PipelineRun status + report link
  end

  alt Failure
    Job-->>Pipeline: failure + logs
    Pipeline-->>User: failure notification
  end
```

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

  • Focus review on the new documentation file for accuracy and clarity.
  • Check RBAC and YAML snippets for correctness and copy-paste readiness.
  • Verify commands referencing Git LFS, Evidently endpoint/API details, and VolcanoJob parameters.

Poem

I hop through YAML, nibble on each line,
Volcano rumbles — GPUs hum fine.
YOLO counts boxes, metrics take flight,
Evidently glows with dashboard light.
Carrots and checkpoints — a rabbit's delight. 🥕

Pre-merge checks and finishing touches

❌ Failed checks (2 warnings)
  • **Title Check** (⚠️ Warning): The pull request title reads "add How_to_Use_Pipeline_to_Evaluate_AI_Modes.md", but the file actually added is "How_to_Use_Pipeline_to_Evaluate_AI_Models.md"; "Modes" is a typo for "Models". The title does convey that a documentation file is being added, but the wrong filename is misleading: anyone scanning PR history by this title would search for a file that does not exist. *Resolution:* retitle the PR "add How_to_Use_Pipeline_to_Evaluate_AI_Models.md", or better, something more descriptive such as "Add documentation for evaluating AI models with Alauda DevOps pipeline".
  • **Docstring Coverage** (⚠️ Warning): Docstring coverage is 0.00%, below the required 80.00% threshold. *Resolution:* run `@coderabbitai generate docstrings` to improve coverage.
✅ Passed checks (1 passed)
  • **Description Check** (✅ Passed): Check skipped; CodeRabbit's high-level summary is enabled.

📜 Recent review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 0ff0096 and 4f073d0.

📒 Files selected for processing (1)
  • docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md (1 hunks)
🧰 Additional context used
🧠 Learnings (2)
📚 Learning: 2025-09-29T08:32:26.877Z
Learnt from: davidwtf
PR: alauda/knowledge#67
File: docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md:298-304
Timestamp: 2025-09-29T08:32:26.877Z
Learning: In YOLOv5 model evaluation, when specifying data configuration files like "coco.yaml", the code automatically looks for the file in the data/ subdirectory, so referencing "coco.yaml" as default will resolve to "data/coco.yaml" automatically.

Applied to files:

  • docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md
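
The lookup this learning records can be sketched in shell. This is illustrative only: the `resolve_data_config` helper is not an actual YOLOv5 function (YOLOv5 performs this resolution internally in Python); it just mirrors the described behavior that a bare name is resolved under `data/` while an explicit path is used as-is.

```shell
# Illustrative sketch: a bare config name like "coco.yaml" resolves
# under the data/ subdirectory; an explicit path is kept unchanged.
resolve_data_config() {
  case "$1" in
    */*) printf '%s\n' "$1" ;;        # already a path: keep unchanged
    *)   printf 'data/%s\n' "$1" ;;   # bare name: prefix with data/
  esac
}

resolve_data_config "coco.yaml"            # -> data/coco.yaml
resolve_data_config "configs/custom.yaml"  # -> configs/custom.yaml
```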
📚 Learning: 2025-09-29T08:33:21.808Z
Learnt from: davidwtf
PR: alauda/knowledge#67
File: docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md:541-552
Timestamp: 2025-09-29T08:33:21.808Z
Learning: The Evidently UI API POST /api/projects endpoint returns the project ID directly as the response body, not as a JSON object with an .id field.

Applied to files:

  • docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md
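
A hedged sketch of what this learning implies for callers: the response body itself is the project ID, so stdout is captured directly with no `jq -r .id` step. The base URL variable, request payload, and bearer-token header format below are assumptions for illustration, not confirmed Evidently API details.

```shell
# Sketch only: the POST /api/projects response body IS the project ID,
# so capture curl's stdout as-is instead of parsing JSON for an .id field.
# EVIDENTLY_URL and EVIDENTLY_API_KEY are assumed to be set beforehand.
create_project() {
  curl -sf -X POST \
    -H "Authorization: Bearer ${EVIDENTLY_API_KEY}" \
    -H "Content-Type: application/json" \
    -d "{\"name\": \"$1\"}" \
    "${EVIDENTLY_URL}/api/projects"
}

# PROJECT_ID=$(create_project "yolov5-eval")   # body == project ID
```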
🪛 LanguageTool
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md

[style] ~25-~25: This adverb was used twice in the sentence. Consider removing one of them or replacing them with a synonym.
Context: ...**: Install Evidently UI following the [Evidently installation documentation](How_to_Inst...

(ADVERB_REPETITION_PREMIUM)

🔇 Additional comments (1)
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md (1)

1-1146: Comprehensive documentation well-executed; all prior issues resolved.

The documentation is thorough and well-structured, covering prerequisites, setup (RBAC, Dockerfile), pipeline configuration, parameters, trigger instructions, and monitoring. All previously flagged critical issues have been addressed:

  • Secret redaction (line 524): ✓ using [REDACTED]
  • Metric parsing (line 977): ✓ using correct awk field
  • Description formatting (lines 993, 998): ✓ balanced punctuation
  • Code fence language hints (line 50): ✓ declared with ```text
  • Filename alignment (Models vs Modes): ✓ consistent
  • Developer clarifications incorporated: YOLOv5 auto-resolution, Evidently API response format, pod naming strategy



@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 1

🧹 Nitpick comments (1)
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md (1)

50-56: Add a language hint to this code fence
Our docs tooling (markdownlint MD040) requires every fenced block to declare a language. Tag this one as text (or another appropriate lexer) to unblock the lint step.

````diff
-```
+```text
 images/
    val2017/          # val2017.zip extracted content
 annotations/         # annotations_trainval2017.zip extracted content
 val2017.txt          # ...
````
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between dd53201 and 47131e0.

📒 Files selected for processing (1)
  • docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md (1 hunks)
🧰 Additional context used
🪛 markdownlint-cli2 (0.18.1)
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md

50-50: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🪛 GitHub Actions: Build and Update
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md

[error] 1-1: MDX compile error: Dead link found. The link ".." points to a non-existent HTML page (How_to_Use_Pipeline_to_Train_AI_Models.html).


@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 47131e0 and a321b23.

📒 Files selected for processing (1)
  • docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md (1 hunks)
🧰 Additional context used
🪛 markdownlint-cli2 (0.18.1)
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md

50-50: Fenced code blocks should have a language specified

(MD040, fenced-code-language)


@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 4

🧹 Nitpick comments (2)
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md (2)

90-100: Clean up Dockerfile deps (duplicate numpy; apt cleanup).

  • Remove duplicate/conflicting numpy pins.
  • Prefer apt-get clean.
```diff
-RUN apt-get update && \
-    export DEBIAN_FRONTEND=noninteractive && \
-    apt-get install -yq --no-install-recommends git git-lfs unzip curl ffmpeg libfreetype6-dev && \
-    apt clean && rm -rf /var/lib/apt/lists/*
+RUN apt-get update && \
+    export DEBIAN_FRONTEND=noninteractive && \
+    apt-get install -yq --no-install-recommends git git-lfs unzip curl ffmpeg libfreetype6-dev && \
+    apt-get clean && rm -rf /var/lib/apt/lists/*
...
-      "numpy<2.0.0" \
+      "numpy<2.0.0" \
       "opencv-python<4.12.0" \
-      "numpy>=1.18.5" \
       "PyYAML>=5.3.1" \
```

Also applies to: 95-96


1145-1147: Wording nit: “Check” instead of “Checkout”.

Minor grammar polish for the section header.

```diff
-### Checkout PipelineRun status and logs
+### Check PipelineRun status and logs
```
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a321b23 and 3f3d231.

📒 Files selected for processing (1)
  • docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md (1 hunks)
🔇 Additional comments (4)
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Modes.md (4)

50-56: Good fix: code fence language added.
The directory layout fence now declares text, satisfying markdownlint MD040.


302-305: Weights default may be incompatible with YOLOv5 val.py.

Default models/model.torchscript may not be supported by val.py (expects .pt). Please verify or clarify in the doc.

If needed, adjust defaults and copy step:

```diff
-      default: "models/model.torchscript"
+      default: "models/model.pt"
...
-                        cp -f "/mnt/workspace/output/${OUTPUT_MODEL_PATH}" "/mnt/workspace/model/${EVALUATE_ARG_WEIGHTS}"
+                        cp -f "/mnt/workspace/output/${OUTPUT_MODEL_PATH}" "/mnt/workspace/model/${EVALUATE_ARG_WEIGHTS}"
+                        # Ensure weights format matches YOLOv5 val.py expectations (.pt by default)
```

Also applies to: 471-476


42-43: Add a valid cross-link to the training guide (fix docs build and navigation).

The text references the training doc but doesn’t link it. Please link to the actual page and ensure the path exists to avoid link-check failures.

Apply this diff:

```diff
-The model to be evaluated comes from the output of the `yolov5-training` pipeline. Refer to the **How to Use Pipeline to Train AI Models** for details on how to train models and obtain the trained model files.
+The model to be evaluated comes from the output of the `yolov5-training` pipeline. Refer to **[How to Use Pipeline to Train AI Models](How_to_Use_Pipeline_to_Train_AI_Models.md)** for details on training and obtaining model files.
```

To verify the target exists, run:

```shell
#!/bin/bash
fd -a 'How_to_Use_Pipeline_to_Train_AI_Models.md' docs | nl -ba
fd -a 'How_to_Install_and_use_Evidently.md' docs | nl -ba
```

609-621: Avoid disabling TLS verification for Git (security posture).

Cloning with -c http.sslVerify=false undermines TLS. Use default verification and make any bypass opt‑in if absolutely required.

```diff
-                            git -c http.sslVerify=false -c lfs.activitytimeout=36000 lfs pull
+                            git -c lfs.activitytimeout=36000 lfs pull
...
-                              GIT_LFS_SKIP_SMUDGE=1 git -c http.sslVerify=false -c lfs.activitytimeout=36000 clone -b $branch "$clone_url" .
+                              GIT_LFS_SKIP_SMUDGE=1 git -c lfs.activitytimeout=36000 clone -b $branch "$clone_url" .
...
-                              GIT_LFS_SKIP_SMUDGE=1 git -c http.sslVerify=false -c lfs.activitytimeout=36000 clone "$clone_url" .
+                              GIT_LFS_SKIP_SMUDGE=1 git -c lfs.activitytimeout=36000 clone "$clone_url" .
...
-                              git -c http.sslVerify=false -c lfs.activitytimeout=36000 lfs pull
+                              git -c lfs.activitytimeout=36000 lfs pull
```

Also applies to: 614-617, 621-621

⛔ Skipped due to learnings
Learnt from: davidwtf
PR: alauda/knowledge#60
File: docs/en/solutions/How_to_Use_Pipeline_to_Train_AI_Models.md:565-583
Timestamp: 2025-09-23T02:29:55.305Z
Learning: In enterprise environments using Alauda DevOps pipelines for AI model training, self-signed certificates are commonly used for internal Git repositories, requiring the `-c http.sslVerify=false` option to be supported rather than completely removed.


@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 5

🧹 Nitpick comments (3)
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md (3)

574-586: Avoid embedding credentials in clone URLs; prefer helpers.

Current approach URL‑encodes user/token and places them in the URL, which can leak via process lists and error messages. Use a credential helper or HTTP extra headers.

Example (no URL mutation):

```shell
# Prior to clone:
AUTH="$(printf '%s:%s' "$GIT_USER" "$GIT_TOKEN" | base64 -w0)"
git -c http.extraheader="AUTHORIZATION: Basic ${AUTH}" clone -b "$branch" "$url" .
git -c lfs.activitytimeout=36000 lfs pull
```

Or use GIT_ASKPASS to supply credentials on demand.

Also applies to: 592-627
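
The GIT_ASKPASS route mentioned above can be sketched as follows (helper path and env var names are illustrative): git invokes the helper for each credential prompt, so the token never appears in the clone URL or in process listings.

```shell
# Write a one-off askpass helper; the quoted heredoc keeps $GIT_USER and
# $GIT_TOKEN unexpanded so the helper reads them from the environment at
# call time rather than baking them into the file.
cat > /tmp/git-askpass.sh <<'EOF'
#!/bin/sh
case "$1" in
  Username*) printf '%s\n' "$GIT_USER" ;;
  Password*) printf '%s\n' "$GIT_TOKEN" ;;
esac
EOF
chmod +x /tmp/git-askpass.sh
export GIT_ASKPASS=/tmp/git-askpass.sh

# git clone -b "$branch" "$clone_url" .   # credentials supplied on demand
# git -c lfs.activitytimeout=36000 lfs pull
```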


90-106: Consolidate duplicate numpy constraints and pin compatible versions.

You list numpy<2.0.0 and numpy>=1.18.5 separately; combine to a single spec. Also ensure versions are compatible with the base PyTorch image.

Apply:

```diff
-      "numpy<2.0.0" \
-      "opencv-python<4.12.0" \
-      "numpy>=1.18.5" \
+      "numpy>=1.18.5,<2.0.0" \
+      "opencv-python<4.12.0" \
```

46-47: Prefer HTTPS for dataset downloads.

Switch to https to avoid MITM risks.

Apply:

```diff
-Download the validation images from [val2017.zip](http://images.cocodataset.org/zips/val2017.zip) and the annotation information from [annotations_trainval2017.zip](http://images.cocodataset.org/annotations/annotations_trainval2017.zip).
+Download the validation images from [val2017.zip](https://images.cocodataset.org/zips/val2017.zip) and the annotation information from [annotations_trainval2017.zip](https://images.cocodataset.org/annotations/annotations_trainval2017.zip).
```
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3f3d231 and afe667e.

📒 Files selected for processing (1)
  • docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md (1 hunks)
🔇 Additional comments (5)
docs/en/solutions/How_to_Use_Pipeline_to_Evaluate_AI_Models.md (5)

524-524: Do not print secrets to logs (EVIDENTLY_API_KEY leakage).

Remove/redact the API key echo to avoid credential exposure in PipelineRun/Pod logs.

Apply:

```diff
-            echo "Evidently api key: $EVIDENTLY_API_KEY"
+            # Do not log secrets
+            echo "Evidently api key: [REDACTED]"
```

977-977: Fix mAP_0_75 parsing (wrong awk field).

Use field 2 like the others; field 3 yields empty.

Apply:

```diff
-              mAP_0_75=$(grep -E '^mAP_0_75:' /tmp/report.log | awk '{print $3}')
+              mAP_0_75=$(grep -E '^mAP_0_75:' /tmp/report.log | awk '{print $2}')
```
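
The field numbering can be checked in isolation (the sample log line below is fabricated; real values come from the pipeline's report step): awk splits on whitespace, so for a `label: value` line, `$1` is the label, `$2` is the value, and `$3` is empty.

```shell
# Reproduce the parsing fix against a fabricated report line.
printf 'mAP_0_5: 0.712\nmAP_0_75: 0.523\n' > /tmp/report.log
mAP_0_75=$(grep -E '^mAP_0_75:' /tmp/report.log | awk '{print $2}')
echo "$mAP_0_75"   # 0.523 ('{print $3}' would print an empty string)
```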

993-1000: Tidy metric descriptions (remove stray “)”).

Unbalanced punctuation in two descriptions.

Apply:

```diff
-                    "description": "Average Precision, IoU=0.50:0.95 area=all maxDets=100)",
+                    "description": "Average Precision, IoU=0.50:0.95 area=all maxDets=100",
@@
-                    "description": "Average Precision, IoU=0.50 area=all maxDets=100)",
+                    "description": "Average Precision, IoU=0.50 area=all maxDets=100",
```

609-617: Avoid disabling TLS verification in git commands.

-c http.sslVerify=false weakens security and risks MITM. Remove it or document trusted CA configuration instead.

Apply:

```diff
-                            git -c http.sslVerify=false -c lfs.activitytimeout=36000 lfs pull
+                            git -c lfs.activitytimeout=36000 lfs pull
@@
-                              GIT_LFS_SKIP_SMUDGE=1 git -c http.sslVerify=false -c lfs.activitytimeout=36000 clone -b \$branch \"\$clone_url\" .
+                              GIT_LFS_SKIP_SMUDGE=1 git -c lfs.activitytimeout=36000 clone -b \$branch \"\$clone_url\" .
@@
-                              GIT_LFS_SKIP_SMUDGE=1 git -c http.sslVerify=false -c lfs.activitytimeout=36000 clone \"\$clone_url\" .
+                              GIT_LFS_SKIP_SMUDGE=1 git -c lfs.activitytimeout=36000 clone \"\$clone_url\" .
@@
-                              git -c http.sslVerify=false -c lfs.activitytimeout=36000 lfs pull
+                              git -c lfs.activitytimeout=36000 lfs pull
```

Also applies to: 621-621

⛔ Skipped due to learnings
Learnt from: davidwtf
PR: alauda/knowledge#60
File: docs/en/solutions/How_to_Use_Pipeline_to_Train_AI_Models.md:565-583
Timestamp: 2025-09-23T02:29:55.305Z
Learning: In enterprise environments using Alauda DevOps pipelines for AI model training, self-signed certificates are commonly used for internal Git repositories, requiring the `-c http.sslVerify=false` option to be supported rather than completely removed.

629-639: Incorrect prediction filename assumption
YOLOv5 v7.x saves the JSON as <weights_stem>_predictions.json in runs/val/exp (e.g. best_predictions.json), not a bare predictions.json. Using ${MODEL_NAME}_predictions.json is correct if MODEL_NAME matches the weight filename stem.

Likely an incorrect or invalid review comment.
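
Under that convention, the expected path can be derived from the weights filename. The values below are illustrative, and `exp` is only the run directory name when no earlier runs exist (YOLOv5 otherwise increments to `exp2`, `exp3`, and so on).

```shell
# Derive the predictions path YOLOv5 v7.x would use for weights "best.pt":
# it names the COCO-format JSON after the weights file stem.
WEIGHTS="best.pt"
MODEL_NAME="${WEIGHTS%.pt}"   # strip the extension -> "best"
PRED_JSON="runs/val/exp/${MODEL_NAME}_predictions.json"
echo "$PRED_JSON"             # runs/val/exp/best_predictions.json
```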

@davidwtf davidwtf force-pushed the add/yolov5-evaluating branch from 0ff0096 to 4f073d0 on October 30, 2025 13:27
@davidwtf davidwtf enabled auto-merge (squash) November 7, 2025 06:45
@davidwtf davidwtf merged commit 90cf1ee into main Nov 7, 2025
2 checks passed
@davidwtf davidwtf deleted the add/yolov5-evaluating branch November 7, 2025 07:40

@zhaomingkun1030 zhaomingkun1030 left a comment


LGTM

changluyi pushed a commit to changluyi/knowledge that referenced this pull request Apr 23, 2026
* add How_to_Use_Pipeline_to_Evaluate_AI_Modes.md

* update

* update

* update

* update

* fix typo HAMi