diff --git a/README.md b/README.md index f3e4b9bc1..6567b263a 100644 --- a/README.md +++ b/README.md @@ -73,7 +73,7 @@ git checkout -b triton_v3.3.x origin/triton_v3.3.x export FLAGTREE_BACKEND=tsingmicro python3 -m pip install . --no-build-isolation -v ``` -[ascend](https://github.com/FlagTree/flagtree/tree/triton_v3.2.x/third_party/ascend/) +[ascend](https://github.com/FlagTree/flagtree/blob/triton_v3.2.x/python/setup_tools/setup_helper.py) ```shell # Recommended: Use the Dockerfile flagtree/dockerfiles/Dockerfile-ubuntu20.04-python3.9-ascend # After registering an account at https://www.hiascend.com/developer/download/community/result?module=cann, diff --git a/README_cn.md b/README_cn.md index 84ce16d81..047e1e861 100644 --- a/README_cn.md +++ b/README_cn.md @@ -73,7 +73,7 @@ git checkout -b triton_v3.3.x origin/triton_v3.3.x export FLAGTREE_BACKEND=tsingmicro python3 -m pip install . --no-build-isolation -v ``` -[ascend](https://github.com/FlagTree/flagtree/tree/triton_v3.2.x/third_party/ascend/) +[ascend](https://github.com/FlagTree/flagtree/blob/triton_v3.2.x/python/setup_tools/setup_helper.py) ```shell # 推荐使用镜像 flagtree/dockerfiles/Dockerfile-ubuntu20.04-python3.9-ascend # 在 https://www.hiascend.com/developer/download/community/result?module=cann diff --git a/reports/v0.2.0/release_notes_v0.2.0_cn.md b/reports/v0.2.0/release_notes_v0.2.0_cn.md new file mode 100644 index 000000000..001570fb4 --- /dev/null +++ b/reports/v0.2.0/release_notes_v0.2.0_cn.md @@ -0,0 +1,36 @@ +[English](./release_notes_v0.2.0.md) + +## FlagTree 0.2.0 Release + +### Highlights + +FlagTree 继承前一版本的能力,持续集成新的后端,拓展对 Triton 版本的支持,提供硬件感知优化能力。项目当前处于初期,目标是兼容各芯片后端现有适配方案,统一代码仓库,打造代码共建平台,快速实现单仓库多后端支持。 + +### New features + +* 新增多后端支持 + +目前支持的后端包括 iluvatar、xpu (klx)、mthreads、__metax__、__aipu__(arm npu)、__ascend__ npu & cpu、__tsingmicro__、cambricon,其中 __加粗__ 为本次新增。
+各新增后端保持前一版本的能力:跨平台编译与快速验证、高差异度模块插件化、CI/CD、质量管理能力。
+ +* 两种编译路径支持 + +支持 TritonGPU、Linalg 两种编译路径。对非 GPGPU 后端提供多种接入范式,新增 FLIR 仓库支持基于 Linalg Dialect 扩展的后端编译。 + +* 新增 Triton 版本支持 + +目前支持的 Triton 版本包括 3.0.x、3.1.x、__3.2.x__、__3.3.x__,其中 __加粗__ 为本次新增。 + +* 硬件感知优化支持 + +支持为后端通用或特有的硬件特性提供指导编程接口。通过注解等兼容式地扩展,在语言层添加指导信息以提高具体后端的性能。当前已实现对 Async DMA 的硬件感知优化支持。 + +* 与 FlagGems 算子库联合建设 + +在版本适配、后端适配、推理芯片特性适配等方面,与 [FlagGems](https://github.com/FlagOpen/FlagGems) 算子库联合支持相关特性。 + +### Looking ahead + +GPGPU 后端代码将作整合,将后端差异化改动与 TritonGPU 解耦;非 GPGPU 后端将在 FLIR 基础上横向整合,对通用 Pass 进行统一设计。
+为后端厂商提供 Triton 适配版本升级指南:3.0 -> 3.1 -> 3.2 -> 3.3。
+CI/CD 将添加 FlagGems 算子库功能测试。
diff --git a/reports/v0.2.0/report_tests.md b/reports/v0.2.0/report_tests.md new file mode 100644 index 000000000..e11e82be2 --- /dev/null +++ b/reports/v0.2.0/report_tests.md @@ -0,0 +1,14 @@ + + +## FlagTree Test-Report + +FlagTree tests are validated on different backends, but currently the tests consist of only unit tests, which we will refine in the future for smaller or larger scale tests. + +### 1. Python unit test: + +|              | default | xpu (klx) | iluvatar | mthreads | metax | +|----------------------|---------------------------|-------------------------------------------|------------------------------------------------|------------------------------------------------|---------------------------------------------| +| Number of unit tests | 11353 items | 12623 items | 14808 items | 10392 items | 13728 items | +| Script location | flagtree/python/test/unit | flagtree/third_party/xpu/python/test/unit | flagtree/third_party/iluvatar/python/test/unit | flagtree/third_party/mthreads/python/test/unit | flagtree/third_party/metax/python/test/unit | +| Test command | python3 -m pytest -s | python3 -m pytest -s | python3 -m pytest -s | python3 -m pytest -s | python3 -m pytest -s | +| Passing rate | 100% | 100% | 100% | 100% | 100% |