Skip to content

Commit

Permalink
Introduce reproducible ResNet-101 benchmark in README
Browse files Browse the repository at this point in the history
  • Loading branch information
sublee committed Jun 24, 2019
1 parent 718e02b commit 984003c
Show file tree
Hide file tree
Showing 2 changed files with 18 additions and 12 deletions.
15 changes: 9 additions & 6 deletions README.ko.md
Original file line number Diff line number Diff line change
Expand Up @@ -89,12 +89,12 @@ API 문서를 비롯한 자세한 문서는 [torchgpipe.readthedocs.io][rtd]에
### ResNet-101 속도 벤치마크

실험 | torchgpipe | GPipe (논문)
---------- | ----: | ----:
naive-1 | 1 | 1
pipeline-1 | 0.74 | 0.8
pipeline-2 | 1.352 | 1.418
pipeline-4 | 2.181 | 2.182
pipeline-8 | 2.808 | 2.891
---------- | -----: | -----:
naive-1 | 1x | 1x
pipeline-1 | 0.736x | 0.8x
pipeline-2 | 1.350x | 1.418x
pipeline-4 | 2.291x | 2.182x
pipeline-8 | 3.114x | 2.891x

GPipe 논문의 그림3 (b)에 보고된 ResNet-101 학습 속도 벤치마크를
재현했습니다.
Expand All @@ -105,6 +105,9 @@ GPipe 없이 한 장치에서 ResNet-101을 학습 시켰을 때 상대속도를
Parallelism이 적용되지 않고 Checkpointing 오버헤드만 있어서 naive-1에 비해
오히려 더 느립니다.

[examples/resnet101_performance_benchmark](examples/resnet101_performance_benchmark)에서
실험 코드를 확인할 수 있습니다.

### AmoebaNet-D 메모리 벤치마크

<table>
Expand Down
15 changes: 9 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -97,12 +97,12 @@ references.
### ResNet-101 Performance Benchmark

Experiment | torchgpipe | GPipe (original)
---------- | ----: | ----:
naive-1 | 1 | 1
pipeline-1 | 0.74 | 0.8
pipeline-2 | 1.352 | 1.418
pipeline-4 | 2.181 | 2.182
pipeline-8 | 2.808 | 2.891
---------- | -----: | -----:
naive-1 | 1x | 1x
pipeline-1 | 0.736x | 0.8x
pipeline-2 | 1.350x | 1.418x
pipeline-4 | 2.291x | 2.182x
pipeline-8 | 3.114x | 2.891x

The table shows the reproduced performance benchmark on ResNet-101, as stated
by reported in Figure 3(b) of the paper.
Expand All @@ -114,6 +114,9 @@ k partitions with GPipe using k devices. Pipeline-1 is slower than naive-1
since it does not benefit from pipeline parallelism but has checkpointing
overhead.

The reproducible code can be found in
[examples/resnet101_performance_benchmark](examples/resnet101_performance_benchmark).

### AmoebaNet-D Memory Benchmark

<table>
Expand Down

0 comments on commit 984003c

Please sign in to comment.