Conversation

@xuhdev (Collaborator) commented Oct 23, 2019

Stack from ghstack:

Benchmark (Debian Buster, CUDA 9.2, Quadro P400, turbo off, Release, gcc 7.4):

```python
import timeit

for n, t in [(10_000, 20000),
             (100_000, 20000)]:
    for dtype in ('torch.half', 'torch.float', 'torch.double'):
        print(f'torch.sinh(a) a.numel() == {n} for {t} times {dtype}')
        print(timeit.timeit('torch.sinh(a); torch.cuda.synchronize()',
                            setup=f'import torch; a=torch.arange({n}, dtype={dtype}, device="cuda")',
                            number=t))
```
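
Because CUDA kernels launch asynchronously, the `torch.cuda.synchronize()` inside the timed statement is what makes these wall-clock numbers meaningful; without it, `timeit` would largely measure kernel launch overhead. As a cross-check (not part of this PR), roughly the same measurement can be made with CUDA events, which record timestamps on the GPU stream itself. A minimal sketch, with `time_op` a hypothetical helper:

```python
import torch

def time_op(fn, warmup=100, iters=20000):
    # CUDA events record timestamps on the GPU stream, so a single
    # host-side synchronize at the end suffices.
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    for _ in range(warmup):
        fn()
    torch.cuda.synchronize()
    start.record()
    for _ in range(iters):
        fn()
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / 1000  # elapsed_time() returns milliseconds

a = torch.arange(100_000, dtype=torch.float, device='cuda')
print(time_op(lambda: torch.sinh(a)))
```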

Before:

```
torch.sinh(a) a.numel() == 10000 for 20000 times torch.half
0.3807680979998622
torch.sinh(a) a.numel() == 10000 for 20000 times torch.float
0.37430476099962107
torch.sinh(a) a.numel() == 10000 for 20000 times torch.double
1.0580407639999976
torch.sinh(a) a.numel() == 100000 for 20000 times torch.half
0.7996397469996737
torch.sinh(a) a.numel() == 100000 for 20000 times torch.float
1.010930432999885
torch.sinh(a) a.numel() == 100000 for 20000 times torch.double
7.310400856999877
```

After:

```
torch.sinh(a) a.numel() == 10000 for 20000 times torch.half
0.3720399889998589
torch.sinh(a) a.numel() == 10000 for 20000 times torch.float
0.3694016069994177
torch.sinh(a) a.numel() == 10000 for 20000 times torch.double
1.0551542660004998
torch.sinh(a) a.numel() == 100000 for 20000 times torch.half
0.7431191599998783
torch.sinh(a) a.numel() == 100000 for 20000 times torch.float
0.9953043630002867
torch.sinh(a) a.numel() == 100000 for 20000 times torch.double
7.3146168890007175
```
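
Read together, the numbers show no regression anywhere, a roughly 7% win for half and 2% win for float at 100k elements, and double essentially unchanged. One caveat about the inputs: `torch.arange(n)` quickly drives `sinh` into overflow (`sinh(90)` already exceeds the float32 maximum), which is harmless for a throughput benchmark but means a correctness spot check should use a bounded range. A minimal sketch of such a check against the float64 CPU reference (an illustration, not this PR's test plan):

```python
import torch

# Compare the CUDA kernel against the float64 CPU result on inputs where
# sinh stays finite for every dtype, including half.
for dtype in (torch.half, torch.float, torch.double):
    a = torch.linspace(-5, 5, 10_000, device='cuda').to(dtype)
    ref = torch.sinh(a.cpu().double())
    # Loose tolerances so the half-precision case passes with few-ulp error.
    torch.testing.assert_allclose(torch.sinh(a).cpu().double(), ref,
                                  rtol=1e-2, atol=1e-3)
```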

Close #24628

Differential Revision: D18124732

xuhdev added a commit that referenced this pull request Oct 23, 2019

ghstack-source-id: 501674a
Pull Request resolved: #28527
@xuhdev requested a review from @VitalyFedyunin on October 23, 2019 18:22
zdevito pushed a commit to zdevito/ATen that referenced this pull request Oct 30, 2019
Summary:
Pull Request resolved: pytorch/pytorch#28527

Test Plan: Imported from OSS

Differential Revision: D18124732

Pulled By: VitalyFedyunin

fbshipit-source-id: 054b0c0884ac12de2dd1a92c5de916aaf047f9e9
@facebook-github-bot (Contributor)

@VitalyFedyunin merged this pull request in e0009fd.

@facebook-github-bot deleted the gh/xuhdev/45/head branch on November 3, 2019 15:15