quantized w2l #1623

Closed

wants to merge 6 commits

Conversation

mcr229 (Contributor) commented on Jan 18, 2024

Summary: With the features added prior to this diff, Quantized W2L should now work successfully.

Differential Revision: D52809537
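
For illustration only (not part of the original PR description): a minimal sketch of the quantize-and-lower flow this stack enables, assuming the PT2E quantization entry points and ExecuTorch XNNPACK lowering APIs as documented around this time; exact module paths and signatures may have moved since.

```
import torch
from torchaudio.models import Wav2Letter
from torch.ao.quantization.quantize_pt2e import prepare_pt2e, convert_pt2e
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)
from executorch.exir import to_edge
from executorch.backends.xnnpack.partition.xnnpack_partitioner import XnnpackPartitioner

model = Wav2Letter(num_classes=40).eval()
example_inputs = (torch.randn(1, 1, 16000),)

# Capture and quantize with the PT2E flow.
captured = torch._export.capture_pre_autograd_graph(model, example_inputs)
quantizer = XNNPACKQuantizer().set_global(get_symmetric_quantization_config())
prepared = prepare_pt2e(captured, quantizer)
prepared(*example_inputs)          # calibrate on sample data
quantized = convert_pt2e(prepared)

# Export, lower to the XNNPACK delegate, and serialize an ExecuTorch program.
exported = torch.export.export(quantized, example_inputs)
edge = to_edge(exported).to_backend(XnnpackPartitioner())
with open("w2l_xnnpack_q8.pte", "wb") as f:
    f.write(edge.to_executorch().buffer)
```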

pytorch-bot (bot) commented on Jan 18, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/1623

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 5c3cc32 with merge base 6a1c7a2:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following job failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Jan 18, 2024
facebook-github-bot (Contributor) commented:

This pull request was exported from Phabricator. Differential Revision: D52809537

Between Jan 18 and Jan 23, 2024, mcr229 added a series of commits to mcr229/executorch that referenced this pull request, each carrying the same summary under Differential Revision D52809537 (later with "Reviewed By: digantdesai"); after each export, facebook-github-bot repeated: "This pull request was exported from Phabricator. Differential Revision: D52809537."
The six commits in this pull request are summarized as follows:
Summary:

Creating new Buck targets for serialization and the schema. We want to split out targets that explicitly use the schema (passes and node visitors use the dataclasses instantiated in the schema) from targets that actually serialize, such as xnnpack_preprocess.

Reviewed By: digantdesai

Differential Revision: D52809539
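
Purely as an illustration of the split described above, a hypothetical TARGETS layout; the rule macro, names, and files here are assumptions, not the repo's actual Buck targets.

```
# Hypothetical Buck targets: one library owning the schema dataclasses,
# one owning the serializer that depends on it (used by xnnpack_preprocess).
python_library(
    name = "xnnpack_schema",
    srcs = ["xnnpack_graph_schema.py"],
)

python_library(
    name = "xnnpack_serializer",
    srcs = ["xnnpack_graph_serialize.py"],
    deps = [":xnnpack_schema"],
)
```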
Summary:

Some funky behavior can happen when we see fused activations under quantization. Specifically, we see:
```
dequant --> op --> act --> quant
```

In the serialization logic, this can sometimes create issues. As a result, if the activation can be fused with the previous op, we delete the activation and embed the activation's min/max into the op's node metadata. In serialization, we check for this metadata and properly apply the activation if it is found.

Reviewed By: digantdesai

Differential Revision: D52809538
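
As an illustration of the idea (not the actual ExecuTorch pass), a minimal FX sketch that drops a trailing ReLU and records its implied output range on the producing op's metadata; the metadata key name is made up.

```
import torch

def fuse_activation_into_metadata(gm: torch.fx.GraphModule) -> torch.fx.GraphModule:
    for node in list(gm.graph.nodes):
        # Look for: op --> relu, where the relu is the op's only user.
        if node.op == "call_function" and node.target is torch.ops.aten.relu.default:
            producer = node.args[0]
            if not isinstance(producer, torch.fx.Node) or len(producer.users) != 1:
                continue
            # Record the clamp range the activation implied (relu => [0, +inf)).
            producer.meta["output_min_max"] = (0.0, float("inf"))
            # Re-wire the relu's users to the producer and delete the relu node.
            node.replace_all_uses_with(producer)
            gm.graph.erase_node(node)
    gm.graph.eliminate_dead_code()
    gm.recompile()
    return gm
```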
Summary:

Continuing from the previous diff, we now apply the pass to the XNNPACK delegate and modify the serialization logic to use the pass's helper to get activation constraints when a node has been fused with an activation.

Reviewed By: digantdesai

Differential Revision: D52852651
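
For completeness, an equally hypothetical look at the consumer side: serialization asking for the range recorded by the sketch above (the "output_min_max" key is an assumption).

```
def get_fused_activation_range(node, default=(float("-inf"), float("inf"))):
    # Returns (output_min, output_max) for the delegate to apply as a clamp
    # when the node was fused with an activation; otherwise the open range.
    return node.meta.get("output_min_max", default)
```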
Summary:

Adding a test to guard FP32 W2L delegation and inference

Reviewed By: digantdesai

Differential Revision: D52219632
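
Roughly what such a guard test checks (illustrative only; the pybindings module path and forward signature are assumptions based on the ExecuTorch Python runtime bindings of the time).

```
import torch
from torchaudio.models import Wav2Letter

def check_w2l_delegation(pte_path: str, atol: float = 1e-3) -> bool:
    # Eager FP32 reference.
    model = Wav2Letter(num_classes=40).eval()
    sample = (torch.randn(1, 1, 16000),)
    reference = model(*sample)

    # Run the lowered program through the ExecuTorch Python bindings.
    from executorch.extension.pybindings.portable_lib import _load_for_executorch
    delegated = _load_for_executorch(pte_path).forward(sample)[0]

    return torch.allclose(reference, delegated, atol=atol)
```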
Summary:

Quantization was not previously supported by our conv1d pass; we now add support so that we can delegate and run quantized conv1d graphs.

The logic added to the pass primarily inserts q/dq nodes around the squeeze/unsqueeze nodes so that when XNNPACK reshapes intermediate tensors to perform a 2-D conv, the squeeze and unsqueeze are both quantized.

Reviewed By: digantdesai

Differential Revision: D52809536
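
Illustrative only (not the pass itself): the shape of the rewrite, with a quantize/dequantize pair kept around the unsqueeze/squeeze so those reshapes also stay in the quantized domain; scale and zero-point values are placeholders.

```
import torch
import torch.nn.functional as F

def conv1d_as_quantized_conv2d(x, weight, bias, scale=0.1, zero_point=0):
    # x: (N, C_in, L) float standing in for a dequantized activation;
    # weight: (C_out, C_in, K) conv1d weight.
    q = lambda t: torch.quantize_per_tensor(t, scale, zero_point, torch.qint8).dequantize()
    x4d = q(x.unsqueeze(2))            # (N, C_in, 1, L): quantized unsqueeze
    w4d = weight.unsqueeze(2)          # (C_out, C_in, 1, K)
    y = q(F.conv2d(x4d, w4d, bias))    # 2-D conv on the lifted tensors
    return y.squeeze(2)                # back to (N, C_out, L_out)
```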
Summary:

With the features added prior to this diff, Quantized W2L should now work successfully.

Reviewed By: digantdesai

Differential Revision: D52809537

facebook-github-bot (Contributor) commented:

This pull request has been merged in c0c2fab.

Labels
CLA Signed · fb-exported · Merged
Projects
None yet
2 participants