Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

V0.3.0 Release Plan #95

Closed
29 of 33 tasks
TobeyQin opened this issue Jun 9, 2021 · 0 comments · Fixed by #209
Closed
29 of 33 tasks

V0.3.0 Release Plan #95

TobeyQin opened this issue Jun 9, 2021 · 0 comments · Fixed by #209
Assignees

Comments

@TobeyQin
Copy link
Contributor

TobeyQin commented Jun 9, 2021

Release Manager

@TobeyQin

Endgame

Code freeze: 9/1/2021
Bug Bash date: 9/2/2021
Release date: 9/17/2021

Main Features

SuperBench Framework

SB Runner -- @abuccts

SB Benchmarks -- @guoshzhao

Single-node Validation

Micro-benchmarks -- @guoshzhao @yukirora

    • Device P2P Bandwidth (Tool: Nvidia p2pBandwidthLatencyTest Tool) -- Delayed

      Metrics Unit Description
      P2P_BW_Max GB/s The maximum bandwidth in Bidirectional P2P=Enabled Bandwidth Matrix for all GPUs
      P2P_BW_Min GB/s The minimum bandwidth
      P2P_BW_Avg GB/s The average bandwidth

Support AMD

Docker Image Support -- @guoshzhao ETA: 7/16/2021

Micro Benchmarks

E2E Benchmarks -- @guoshzhao ETA: 7/16/2021

    • CNN models -- User PyTorch TORCHVISION.MODELS sub-package
      • ResNet: ResNet-50, ResNet-101, ResNet-152
      • DenseNet: DenseNet-169, DenseNet-201 ​
      • VGG: VGG-11, VGG-13, VGG-16, VGG-19​
    • BERT -- Use huggingface Transformers
      • BERT
      • BERT LARGE
    • LSTM -- Use PyTorch TORCH.NN sub-package
    • GPT-2 -- Use huggingface Transformers

Result Summary -- @cp5555

Bug Fix

Other Improvement

  1. Contribution related -- @lynex

  2. Document -- @TobeyQin

  3. Process monitor

    • Add Heart beat to monitor process health
    • Auto kill all processes on all nodes
  4. Coding style -- @abuccts

    • Add vscode online

Backlogs

Multi-Node Benchmarks

  • Mellanox ClusterKit
  • GPCNeT

UI Design

@TobeyQin TobeyQin self-assigned this Jun 9, 2021
@TobeyQin TobeyQin pinned this issue Jun 9, 2021
@abuccts abuccts linked a pull request Sep 22, 2021 that will close this issue
abuccts added a commit that referenced this issue Sep 24, 2021
__Description__

Upgrade version and release note. Closes #95 and #170.

__Major Revisions__

* Upgrade package versions
* Add release note for v0.3.0
abuccts added a commit that referenced this issue Sep 24, 2021
__Description__

Upgrade version and release note. Closes #95 and #170.

__Major Revisions__

* Upgrade package versions
* Add release note for v0.3.0
@cp5555 cp5555 closed this as completed Sep 26, 2021
@cp5555 cp5555 unpinned this issue Aug 8, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants