Commit

Merge pull request #673 from nicolas-chaulet/pytorch-profiler
Pytorch profiler
nicolas-chaulet committed Nov 9, 2021
2 parents 2495ec4 + 1fbf090 commit 69a06f7
Showing 5 changed files with 219 additions and 158 deletions.
2 changes: 1 addition & 1 deletion README.md
@@ -16,7 +16,7 @@ See our [paper](https://arxiv.org/pdf/2010.04642.pdf) at 3DV for an overview of

 - CUDA 10 or higher (if you want GPU version)
 - Python 3.7 or higher + headers (python-dev)
-- PyTorch 1.8 or higher (PyTorch >= 1.9 is recommended)
+- PyTorch 1.8.1 or higher (PyTorch >= 1.9 is recommended)
 - A Sparse convolution backend (optional) see [here](https://github.com/nicolas-chaulet/torch-points3d#3d-sparse-convolution-support) for installation instructions

 Install with
14 changes: 13 additions & 1 deletion conf/training/default.yaml
@@ -38,6 +38,18 @@ wandb:
     config:
         model_name: ${model_name}

-# parameters for TensorBoard Visualization
+# parameters for TensorBoard Visualization
 tensorboard:
     log: True
+    pytorch_profiler:
+        log: True # activate PyTorch Profiler in TensorBoard
+        nb_epoch: 3 # number of epochs to profile (0 -> all).
+        skip_first: 10 # number of initial iterations to skip.
+        wait: 5 # number of iterations during which the profiler is disabled.
+        warmup: 3 # number of iterations during which the profiler traces but discards the results. This reduces the skew caused by profiling overhead, which is high at the start of profiling.
+        active: 5 # number of iterations during which the profiler is active and records events.
+        repeat: 0 # number of wait/warmup/active cycles to run before profiling stops (0 -> all).
+        record_shapes: True # save information about operators' input shapes.
+        profile_memory: True # track tensor memory allocation/deallocation.
+        with_stack: True # record source information (file and line number) for the ops.
+        with_flops: True # use formulas to estimate the FLOPs of specific operators (matrix multiplication and 2D convolution).
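
To make the interaction of `skip_first`, `wait`, `warmup`, `active` and `repeat` concrete, here is a minimal pure-Python sketch of the phase each iteration would fall into. It assumes these settings are passed through to `torch.profiler.schedule` with the same meaning as the comments above; the function name `profiler_phase` and the phase labels are hypothetical and not part of this commit or of PyTorch.

```python
def profiler_phase(step, *, skip_first=10, wait=5, warmup=3, active=5, repeat=0):
    """Return 'idle', 'warmup' or 'record' for a 0-based iteration index.

    Hypothetical helper: defaults copy the values in conf/training/default.yaml,
    and the logic follows the comments in that file (an assumption, not the
    project's actual code).
    """
    if step < skip_first:
        return "idle"  # the first iterations are never profiled
    step -= skip_first
    cycle = wait + warmup + active  # length of one wait/warmup/active cycle
    if repeat > 0 and step // cycle >= repeat:
        return "idle"  # all requested cycles are done
    pos = step % cycle  # position inside the current cycle
    if pos < wait:
        return "idle"
    if pos < wait + warmup:
        return "warmup"  # tracing, but results are discarded
    return "record"


if __name__ == "__main__":
    # Print the phase of the first 40 iterations with the default values.
    for step in range(40):
        print(step, profiler_phase(step))
```

With the default values, one cycle spans 13 iterations (5 + 3 + 5), so only 5 of every 13 iterations after the skipped ones are recorded. Raising `active` relative to `wait` gives denser traces at the cost of more profiling overhead.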
