Docs - Upgrade version and release note (#209)
15f22e2
__Description__ Upgrade version and release note. Closes #95 and #170.
__Major Revisions__
* Upgrade package versions
* Add release note for v0.3.0
b875c44
TobeyQin
Release Manager
@TobeyQin
Endgame
Code freeze: 9/1/2021
Bug Bash date: 9/2/2021
Release date: 9/17/2021
Main Features
SuperBench Framework
SB Runner -- @abuccts
PR: Runner - Support mpi mode #146
SB Benchmarks -- @guoshzhao
PR: Benchmarks: Code Revision - revise the DockerBenchmark base class #179 and Benchmarks: Docker Benchmarks - Setup Docker-in-Docker environment #180
Single-node Validation
Micro-benchmarks -- @guoshzhao @yukirora
PR: Benchmarks: Add Benchmark - Add memory bandwidth benchmark for cuda. #114
Device P2P Bandwidth (Tool: Nvidia p2pBandwidthLatencyTest Tool) -- Delayed
PR: Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. #112 and Benchmarks: Build Pipeline - Add perftest as a submodule and add build logic #129
PR: Benchmarks: Add Benchmark - Add NCCL performance benchmark. #113 and Benchmarks: Build Pipeline - Add nccl-tests as a submodule and add build logic. #128
PR: Benchmarks: Build Pipeline - Add FIO benchmark tool #127 and Benchmarks: Add Benchmark - Add disk performance benchmark #132 and Benchmarks: Revise Benchmark - Add readwrite I/O pattern #161
PR: Benchmarks: Add Benchmark - Add GPU SM copy benchmark #162 and Benchmarks: Add Benchmark - Add GPU SM copy benchmark #169
Support AMD
Docker Image Support -- @guoshzhao ETA: 7/16/2021
Micro Benchmarks
PR: Benchmarks: Build Pipeline - Support rocm cmake build #137 and Benchmarks: Add Benchmark - Add the source code of rocm kernel launch overhead benchmark. #136
PR: Benchmarks: Build Pipeline - add rccl-tests as a submodule with building logic #139 and Benchmarks: Add Benchmark - Revise and add rccl microbenchmark for rocm. #143
PR: Benchmarks: Build Pipeline - Add rocBLAS building logic in third_party #144 and Benchmarks: Code Revision - Extract base class for gemm flops microbenchmark #165
PR: Benchmarks: Code Revision - Extract base class for memory bandwidth microbenchmark #159 and Benchmarks: Add Benchmark - Add memory bus bandwidth performance microbenchmark for amd #153
E2E Benchmarks -- @guoshzhao ETA: 7/16/2021
Result Summary -- @cp5555
PR: Benchmarks: Add Feature - Add reduce function support for output summary. #147, Benchmarks: Add Feature - Set reduce type for current benchmarks' metrics. #149, and Runner: Add Feature - Generate summarized output files. #157
Bug Fix
PR: Benchmarks: Fix Bug - Fix bug of VGG models failed on A100 GPU with batch_size=128 #134
Other Improvement
Contribution related -- @lynex
Document -- @TobeyQin
Add metric reasoning doc -- @cp5555 @guoshzhao
Process monitor
Coding style -- @abuccts
Backlogs
Multi-Node Benchmarks
UI Design