
Add 3D+ input support for fp8 rowwise GEMM #2845

Closed

Conversation

@jianyuh (Member) commented on Jul 15, 2024

Summary:
The input activation can be 3D+, with the first dimension treated as the batch dimension.

1. If the input tensor is {M, K}, the output tensor is {M, N}.
2. If the input tensor is {b, M, K}, the output tensor is {b, M, N}.

This Diff adds that support, matching what nn.Linear / matmul accept; a sketch of the approach follows.
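
A minimal sketch of the reshape approach, assuming the batched case reduces to the existing 2D kernel by flattening the leading dimensions into M; the function name here is hypothetical, and a plain matmul stands in for the fp8 rowwise kernel:

```
import torch


def fp8_rowwise_gemm_nd(x: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
    # x: {..., M, K} activation; w: {N, K} weight (nn.Linear layout).
    leading_dims = x.shape[:-1]        # e.g. {b, M} for a 3D input
    x_2d = x.reshape(-1, x.shape[-1])  # {b, M, K} -> {b * M, K}
    out_2d = x_2d @ w.t()              # stand-in for the 2D fp8 rowwise GEMM
    return out_2d.reshape(*leading_dims, w.shape[0])  # {b * M, N} -> {b, M, N}
```

Since rowwise activation scales are computed per row along K, flattening the leading dimensions leaves the scaling unchanged, which is why the batched case can reuse the 2D kernel.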

Differential Revision: D59671644

netlify bot commented on Jul 15, 2024

Deploy Preview for pytorch-fbgemm-docs ready!

🔨 Latest commit: 7f8ea83
🔍 Latest deploy log: https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/6696271974f6cd0008e92c91
😎 Deploy Preview: https://deploy-preview-2845--pytorch-fbgemm-docs.netlify.app

@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D59671644

jianyuh added a commit to jianyuh/FBGEMM that referenced this pull request on Jul 15, 2024


jianyuh added a commit to jianyuh/FBGEMM that referenced this pull request on Jul 16, 2024

Summary:
Pull Request resolved: pytorch#2845

The input activation can be 3D+, with the first dimension treated as the batch dimension.

1. If the input tensor is {M, K}, the output tensor is {M, N}.
2. If the input tensor is {b, M, K}, the output tensor is {b, M, N}.

This Diff adds that support, matching what nn.Linear / matmul accept.

Additionally, this Diff adds an FP8 architecture check:

```
import functools

import torch


@functools.lru_cache
def check_fp8_arch() -> None:
    # Query the compute capability of the current device; lru_cache
    # ensures the property lookup runs only once per process.
    props = torch.cuda.get_device_properties(torch.cuda.current_device())
    arch_major, arch_minor = props.major, props.minor
    # CUDA: FP8 requires sm90+ (H100 and newer).
    if torch.version.cuda and arch_major < 9:
        raise Exception("FP8 can only work on Nvidia H100+ GPUs with sm90+ support!")
    # ROCm: FP8 requires gfx942+ (MI300x and newer).
    if torch.version.hip and (arch_major < 9 or arch_minor < 4):
        raise Exception(
            "FP8 can only work on AMD MI300x+ GPUs with gfx942+ support!"
        )
```

Reviewed By: sijiac, jiawenliu64, jwfromm

Differential Revision: D59671644
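
For context, a minimal usage sketch (illustrative, not part of this Diff): a cached check like this is cheap enough to call at the top of every FP8 op, so unsupported hardware fails fast with a clear error rather than a kernel launch failure. The op name and body below are assumptions:

```
def fp8_rowwise_op(x: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
    # Raises on pre-sm90 CUDA or pre-gfx942 ROCm devices; thanks to
    # lru_cache, repeated calls on the hot path are essentially free.
    check_fp8_arch()
    # ... quantize inputs to fp8, compute rowwise scales, launch the GEMM ...
    return x @ w.t()  # placeholder for the real kernel dispatch
```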

@facebook-github-bot (Contributor)

This pull request has been merged in 903e928.
