Very slow speed in complex_norm() function. #740

lbjcom · 2020-06-23T02:15:22Z

🐛 Bug

When I call spectrogram() in torchaudio 0.5.0 my training script runs very slow. I downgraded to pytorch 1.4 and torchaudio 0.4.0 then the script runs fast. So I compared the difference betweeen 0.4.0 and 0.5.0 and found that the complex_norm() is the root cause of the slow speed:

audio/torchaudio/functional.py

Line 171 in bc1df48

spec_f = complex_norm(spec_f, power=power)

If I install torchaudio 0.5.0 and change the line 171 to the 0.4.0's one:

- spec_f = complex_norm(spec_f, power=power)
+ spec_f = spec_f.pow(power).sum(-1)  # get power of "complex" tensor

and the speed is same as 0.4.0's.

For example in my training script with torachaudio 0.5.0:

--------- TRAINING - Epoch:     1/ 1000 ------------
| Batch:  2461/ 2461, 100.00%                      |
| Loss:   3.253154@lr=1.000000e-03                 |
| Speed:   69.64 files/sec, Elapsed Time: 00:58:53 |
----------------------------------------------------

The speed is about 70 files/sec. But with the 0.4.0's code:

--------- TRAINING - Epoch:     1/ 1000 ------------
| Batch:  2461/ 2461, 100.00%                      |
| Loss:   3.275024@lr=1.000000e-03                 |
| Speed:  408.14 files/sec, Elapsed Time: 00:10:02 |
----------------------------------------------------

As you can see the speed of 0.4.0's is about 5.7x faster than 0.5.0's. I can not share the script because the code is from my company. But I think that the speed difference comes from the process that call's the complex_norm() function. For example voxceleb_trainer calls spectrogram() in main process and there's no speed reduction while my script calls spectrogram() in a forked process which is forked by pytorch's DataLoader.

To Reproduce

Steps to reproduce the behavior:

Install torchaudio 0.5.0
run the script.
get slow speed
change the spec_f = complex_norm(spec_f, power=power) to spec_f = spec_f.pow(power).sum(-1)
run the script again.
get normal speed

Expected behavior

the speed of my script becomes normal when I use either torchaudio 0.4.0 or 0.5.0.

Environment

What commands did you used to install torchaudio (conda/pip/build from source)?
- pip
If you are building from source, which commit is it?
What does torchaudio.__version__ print? (If applicable)
- 0.5.0

Please copy and paste the output from our
environment collection script
(or fill out the checklist below manually).

Collecting environment information...
PyTorch version: 1.5.0+cu101
Is debug build: No
CUDA used to build PyTorch: 10.1

OS: Ubuntu 18.04.3 LTS
GCC version: (Ubuntu 7.4.0-1ubuntu1~18.04.1) 7.4.0
CMake version: version 3.10.2

Python version: 3.6
Is CUDA available: Yes
CUDA runtime version: 10.1.243
GPU models and configuration:
GPU 0: Tesla P40
GPU 1: Tesla P40
GPU 2: Tesla P40
GPU 3: Tesla P40
GPU 4: Tesla P40
GPU 5: Tesla P40
GPU 6: Tesla P40
GPU 7: Tesla P40

Nvidia driver version: 418.87.00
cuDNN version: Could not collect

Versions of relevant libraries:
[pip3] numpy==1.18.4
[pip3] torch==1.5.0+cu101
[pip3] torchaudio==0.5.0
[pip3] torchvision==0.6.0+cu101
[conda] Could not collect

The text was updated successfully, but these errors were encountered:

vincentqb · 2020-06-23T21:55:30Z

Thanks for helping debug this! Can you try with either of the following in spectrogram?

spec_f.pow(2.).sum(-1).pow(0.5 * power)  # This should be equivalent

torch.norm(spec_f, 2, -1).pow(power)  # This is what complex_norm does

power = int(power)
torch.norm(spec_f, 2, -1).pow(power)  # This is what complex_norm does, but cast to int first

lbjcom · 2020-06-24T00:00:20Z

Thanks for helping debug this! Can you try with either of the following in spectrogram?
spec_f.pow(2.).sum(-1).pow(0.5 * power)  # This should be equivalent

=> OK.

torch.norm(spec_f, 2, -1).pow(power)  # This is what complex_norm does

=> SLOW

power = int(power)
torch.norm(spec_f, 2, -1).pow(power)  # This is what complex_norm does, but cast to int first

=> SLOW

Looks like torch.norm() has some problems. And as I mentioned in the body when spectrogram() is called in the main process there's no problem. This only happens when spectrogram() is called in a forked process.

PetrochukM · 2020-06-24T05:17:15Z

I think this is an issue I posted about in torch/audio #455 already.

vincentqb · 2020-06-24T15:41:39Z

Thanks to both of you for debugging :) This does look like the same problem. While this is being fixed, let's replace complex_norm here by

# Replace by torch.norm once issue is fixed
# https://github.com/pytorch/pytorch/issues/34279
complex_tensor.pow(2.).sum(-1).pow(0.5 * power)

Do you want to open a pull request for this?

lbjcom · 2020-06-24T20:36:41Z

I think this is an issue I posted about in torch/audio #455 already.

Looks like you checked the speed in CPU. When I tested my script I ran the script on CPU because cuda() can not be used in a forked process. When torch.norm() is called in GPU tensors there's no problem.

vincentqb · 2020-06-26T14:56:17Z

Closed by #747

Co-authored-by: holly1238 <77758406+holly1238@users.noreply.github.com>

lbjcom mentioned this issue Jun 24, 2020

rollback torch.norm() in spectrogram() #747

Merged

vincentqb closed this as completed Jun 26, 2020

mthrok pushed a commit to mthrok/audio that referenced this issue Dec 13, 2022

Fix usage of correct ScriptModule (pytorch#740)

82dbee9

Co-authored-by: holly1238 <77758406+holly1238@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Very slow speed in complex_norm() function. #740

Very slow speed in complex_norm() function. #740

lbjcom commented Jun 23, 2020

vincentqb commented Jun 23, 2020 •

edited

lbjcom commented Jun 24, 2020

PetrochukM commented Jun 24, 2020 •

edited

vincentqb commented Jun 24, 2020 •

edited

lbjcom commented Jun 24, 2020

vincentqb commented Jun 26, 2020

Very slow speed in complex_norm() function. #740

Very slow speed in complex_norm() function. #740

Comments

lbjcom commented Jun 23, 2020

🐛 Bug

To Reproduce

Expected behavior

Environment

vincentqb commented Jun 23, 2020 • edited

lbjcom commented Jun 24, 2020

PetrochukM commented Jun 24, 2020 • edited

vincentqb commented Jun 24, 2020 • edited

lbjcom commented Jun 24, 2020

vincentqb commented Jun 26, 2020

vincentqb commented Jun 23, 2020 •

edited

PetrochukM commented Jun 24, 2020 •

edited

vincentqb commented Jun 24, 2020 •

edited